BoltzmannEntropy/xtts2-ui
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
This tool helps content creators, educators, or marketers generate spoken audio in a cloned voice. You provide a text script and a 10-second audio sample of the voice you want to mimic, and it outputs the script spoken in that cloned voice. This is ideal for anyone needing custom voiceovers or personalized audio messages in multiple languages.
391 stars. No commits in the last 6 months.
Use this if you need to quickly generate speech in a specific voice from text, using only a short audio sample of that voice.
Not ideal if you need cloning of such high fidelity that the output is indistinguishable from a professional voice actor's recording.
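The workflow described above (a text script plus a short reference clip in, cloned speech out) can be sketched in a few lines of Python. This is a minimal sketch, not code taken from xtts2-ui: it assumes the Coqui TTS Python package is installed and that its XTTS-v2 model is used directly, and the function name `clone_voice` and the file paths are hypothetical.

```python
def clone_voice(text, speaker_wav, out_path, language="en"):
    """Synthesize `text` in the voice heard in `speaker_wav` (hypothetical helper)."""
    if not text.strip():
        raise ValueError("text must not be empty")

    # Imported lazily so the input check above works without the package installed.
    # Assumption: the Coqui TTS package provides the XTTS-v2 model used here.
    from TTS.api import TTS

    # The model weights are downloaded on first use.
    tts = TTS("tts_models/multilingual/multi-dataset/xtts_v2")

    # speaker_wav is the short reference clip of the voice to mimic.
    tts.tts_to_file(
        text=text,
        speaker_wav=speaker_wav,
        language=language,
        file_path=out_path,
    )
    return out_path
```

A call such as `clone_voice("Welcome to the course.", "reference.wav", "welcome.wav")` would write the synthesized audio to `welcome.wav`, where `reference.wav` stands in for your own sample of roughly 10 seconds.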
Stars: 391
Forks: 67
Language: Python
License: MIT
Category:
Last pushed: Dec 06, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/BoltzmannEntropy/xtts2-ui"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
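The same request can be made from Python using only the standard library. This is a sketch under assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, the URL layout is inferred from the single curl example above, and the response is assumed to be JSON, since its schema is not documented here.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL in the same shape as the curl example."""
    # Percent-encode each path segment so unusual characters cannot break the URL.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the record for one repository.

    Assumption: the endpoint returns a JSON body; the fields are not specified
    in this listing, so the parsed object is returned unchanged.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "BoltzmannEntropy", "xtts2-ui")` issues the same GET request as the curl command and counts toward the daily request limit.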
Higher-rated alternatives
herimor/voxtream
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control
EveryVoiceTTS/EveryVoice
The EveryVoice TTS Toolkit - Text To Speech for your language
thorstenMueller/Thorsten-Voice
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be...
daswer123/xtts-webui
Webui for using XTTS and for finetuning it
kadirnar/VoiceHub
VoiceHub: A Unified Inference Interface for TTS Models