shivammehta25/Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Score: 59 / 100 (Established)

Matcha-TTS is for anyone who needs to convert written text into high-quality, natural-sounding spoken audio quickly. You provide the text you want spoken, and it generates the corresponding speech audio. This is ideal for content creators, educators, or businesses looking to automate narration or voiceovers.

1,259 stars.

Use this if you need realistic speech generated from text and both synthesis speed and naturalness are critical.

Not ideal if you need highly customized voice modulation beyond speaking rate and temperature, or if you require fine-grained control over individual phonemes.

text-to-speech audio-generation voiceover narration content-creation
No Package No Dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 23 / 25


Stars: 1,259
Forks: 189
Language: Jupyter Notebook
License: MIT
Last pushed: Mar 09, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/shivammehta25/Matcha-TTS"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
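For scripted use, the curl request above can be sketched in Python with only the standard library. This is a minimal sketch under stated assumptions: the path layout (category, then owner, then repository) is inferred from the single URL shown above, the response is assumed to be JSON, and the helper names `quality_url` and `fetch_quality` are hypothetical. Keyed access is omitted because this page does not say how a key is passed.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumes the path is <category>/<owner>/<repo>, inferred from the one
    example URL on this page.
    """
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record without an API key (hypothetical helper).

    Assumes the endpoint returns a JSON body; the schema is not documented
    on this page, so the parsed value is returned as-is.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "shivammehta25", "Matcha-TTS")` issues the same request as the curl command. Unauthenticated calls count against the 100 requests/day limit, so caching the result locally is sensible for repeated runs.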