fishaudio/fish-speech

SOTA Open Source TTS

/ 100

Established

This project helps creators, marketers, and content producers generate incredibly natural and expressive voices from text. You input written text, optionally with emotion or prosody tags like "[whisper]" or "[excited]", and it outputs realistic spoken audio in over 80 languages. This is for anyone who needs high-quality, emotionally rich synthetic speech for voiceovers, virtual assistants, or educational content.

26,613 stars. Actively maintained with 18 commits in the last 30 days.

Use this if you need to transform text into highly realistic, emotionally nuanced speech in multiple languages with fine-grained control over vocal delivery.

Not ideal if you need a simple, quick text-to-speech solution without needing advanced emotional control or multilingual support.

content-creation voiceovers localization audio-production digital-assistants

No Package No Dependents

Maintenance 17 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

26,613

Forks

2,237

Language

Python

License

—

Recent Releases

v2.0.0-beta 10 Mar 2026 v1.5.1 31 May 2025 v1.5.0 25 Dec 2024 v1.4.3 29 Nov 2024 v1.4.2 25 Oct 2024

Related tools

Blaizzy/mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's...

lenML/Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server...

sidharthrajaram/StyleTTS2

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

mlalma/kokoro-ios

Kokoro TTS for iOS and macOSX

mlalma/KokoroTestApp

Test application for Kokoro TTS model

Explore Voice AI Tools

All categories Trending Voice AI directory Insights