fishaudio/fish-speech

SOTA Open Source TTS

62
/ 100
Established

This project helps creators, marketers, and content producers generate incredibly natural and expressive voices from text. You input written text, optionally with emotion or prosody tags like "[whisper]" or "[excited]", and it outputs realistic spoken audio in over 80 languages. This is for anyone who needs high-quality, emotionally rich synthetic speech for voiceovers, virtual assistants, or educational content.

26,613 stars. Actively maintained with 18 commits in the last 30 days.

Use this if you need to transform text into highly realistic, emotionally nuanced speech in multiple languages with fine-grained control over vocal delivery.

Not ideal if you need a simple, quick text-to-speech solution without needing advanced emotional control or multilingual support.

content-creation voiceovers localization audio-production digital-assistants
No Package No Dependents
Maintenance 17 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

26,613

Forks

2,237

Language

Python

License

Last pushed

Mar 13, 2026

Commits (30d)

18

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/fishaudio/fish-speech"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.