dunky11/voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
This tool helps you create custom text-to-speech (TTS) voices without needing to write any code. You provide audio recordings and their corresponding text, and it generates a unique voice model. This is ideal for content creators, educators, or businesses who need branded or specialized voices for their applications.
229 stars. No commits in the last 6 months.
Use this if you need to generate high-quality, custom voiceovers from text using your own unique voice datasets.
Not ideal if you don't have access to an NVIDIA GPU, as training on a CPU will be exceptionally slow.
Stars: 229
Forks: 33
Language: Python
License: Apache-2.0
Category: (not specified)
Last pushed: Oct 10, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/dunky11/voicesmith"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
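The same request can be made from a script. The following is a minimal Python sketch, not an official client. It assumes that the URL path follows a category/owner/repo pattern (inferred only from the single curl example above) and that the endpoint returns JSON; neither is confirmed by this page. The names `build_url` and `fetch_quality` are hypothetical helpers introduced for illustration, and no API key is sent because the page does not say how a key is supplied.

```python
import json
import urllib.request

# Base path taken from the curl example; treating it as a reusable prefix is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Build an endpoint URL shaped like the one in the curl example.

    The category/owner/repo layout is inferred from that one example.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a keyless GET request and decode the body as JSON.

    Assumes the response is JSON; raises urllib.error.URLError on network
    failures and json.JSONDecodeError if the body is not valid JSON.
    """
    url = build_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "dunky11", "voicesmith")` would issue the same request as the curl command. At 100 keyless requests per day, a script that polls many repositories should cache results rather than refetch them.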
Higher-rated alternatives
herimor/voxtream
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control
EveryVoiceTTS/EveryVoice
The EveryVoice TTS Toolkit - Text To Speech for your language
thorstenMueller/Thorsten-Voice
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be...
daswer123/xtts-webui
Webui for using XTTS and for finetuning it
kadirnar/VoiceHub
VoiceHub: A Unified Inference Interface for TTS Models