adelacvg/detail_tts

All generative models in one for a better TTS model

Emerging

This project helps audio engineers and content creators quickly generate high-quality, natural-sounding speech from text. You provide written text prompts, and it produces lifelike spoken audio files. It's designed for professionals who need to scale up audio production using extensive, even imperfect, datasets.

No commits in the last 6 months.

Use this if you need to generate realistic speech from text, especially when working with large volumes of audio data that might be messy or of varying quality.

Not ideal if you want a simple, off-the-shelf text-to-speech solution that requires no model training or fine-tuning.

text-to-speech audio-generation content-creation voice-synthesis media-production

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 13 / 25

Language

Python

License

—

Higher-rated alternatives

index-tts/index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

stepfun-ai/Step-Audio-EditX

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...

lucasnewman/f5-tts-mlx

Implementation of F5-TTS in MLX

unilight/seq2seq-vc

A sequence-to-sequence voice conversion toolkit.

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System
