zzw922cn/awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

/ 100

Emerging

This collection helps researchers and practitioners explore cutting-edge developments in creating and understanding spoken and musical audio. It compiles research papers across various aspects of speech and audio technology, including converting text to speech or music, recognizing spoken words, identifying speakers, and transforming voices. It's a resource for anyone involved in developing or researching advanced audio generation and analysis systems.

3,119 stars. No commits in the last 6 months.

Use this if you are a researcher or engineer looking for academic papers on topics like automatic speech recognition, speech synthesis, or generating music from text.

Not ideal if you are looking for ready-to-use software, code libraries, or practical tutorials for implementing speech or audio applications.

speech-technology-research audio-generation voice-recognition text-to-speech music-synthesis

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 23 / 25

How are scores calculated?

Stars

3,119

Forks

513

Language

—

License

MIT

Related tools

ivcylc/OpenMusic

OpenMusic: SOTA Text-to-music (TTM) Generation

aidayang/LatentSync-OneClick

免费视频对口型软件LatentSync一键启动整合包

iron-mukakin/Emoji-TTS

Irodori-TTSのフォーク、echo-TTSのwebuiになります。

guan-yuan/Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion

A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights