zzw922cn/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
This collection helps researchers and practitioners explore cutting-edge developments in creating and understanding spoken and musical audio. It compiles research papers across various aspects of speech and audio technology, including converting text to speech or music, recognizing spoken words, identifying speakers, and transforming voices. It's a resource for anyone involved in developing or researching advanced audio generation and analysis systems.
3,119 stars. No commits in the last 6 months.
Use this if you are a researcher or engineer looking for academic papers on topics like automatic speech recognition, speech synthesis, or generating music from text.
Not ideal if you are looking for ready-to-use software, code libraries, or practical tutorials for implementing speech or audio applications.
Stars
3,119
Forks
513
Language
—
License
MIT
Category
Last pushed
Oct 19, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/zzw922cn/awesome-speech-recognition-speech-synthesis-papers"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
ivcylc/OpenMusic
OpenMusic: SOTA Text-to-music (TTM) Generation
aidayang/LatentSync-OneClick
免费视频对口型软件LatentSync一键启动整合包
iron-mukakin/Emoji-TTS
Irodori-TTSのフォーク、echo-TTSのwebuiになります。
guan-yuan/Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion
A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing...