adelacvg/detail_tts
All generative model in one for better TTS model
This project helps audio engineers and content creators quickly generate high-quality, natural-sounding speech from text. You provide written text prompts, and it produces lifelike spoken audio files. It's designed for professionals who need to scale up audio production using extensive, even imperfect, datasets.
No commits in the last 6 months.
Use this if you need to generate realistic speech from text, especially when working with large volumes of audio data that might be messy or of varying quality.
Not ideal if you are looking for a simple, off-the-shelf text-to-speech solution without needing to train or fine-tune models.
Stars
74
Forks
9
Language
Python
License
—
Category
Last pushed
Sep 08, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/adelacvg/detail_tts"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
stepfun-ai/Step-Audio-EditX
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...
lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System