FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

/ 100

Emerging

This project helps you generate natural-sounding speech from text using a small audio sample of a voice. You provide the text you want spoken and a short audio clip of the desired voice (3-10 seconds long), and it produces an audio file with that text read aloud in the cloned voice. This is ideal for content creators, educators, or anyone needing to generate customized voiceovers or audio content.

905 stars. No commits in the last 6 months.

Use this if you need to quickly generate high-quality, natural-sounding speech in a specific voice from a written script for tasks like narration or creating personalized audio messages.

Not ideal if you need a solution for real-time, ultra low-latency applications or if you don't have a clear, short audio reference for the voice you want to clone.

voice-cloning audio-content-creation narration synthetic-media content-localization

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

905

Forks

Language

Python

License

MPL-2.0

Higher-rated alternatives

index-tts/index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

stepfun-ai/Step-Audio-EditX

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...

lucasnewman/f5-tts-mlx

Implementation of F5-TTS in MLX

unilight/seq2seq-vc

A sequence-to-sequence voice conversion toolkit.

RaduBolbo/F5-TTS-Emotional-CFG

Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS

Explore Voice AI Tools

All categories Trending Voice AI directory Insights