ayutaz/uCosyVoice
CosyVoice3 text-to-speech for Unity using ONNX inference. Supports zero-shot voice cloning.
This project helps game developers, animators, and virtual reality creators generate realistic human speech directly within their Unity projects. You provide written text and, optionally, a short audio sample of a voice you want to clone. The system then outputs audio of that text spoken in a natural voice, which can mimic the voice in your sample.
Use this if you need to add dynamic, lifelike voiceovers or character dialogue to your Unity game or interactive experience without relying on external services or pre-recorded audio for every line.
Not ideal if you need to synthesize speech in languages other than English, as the system is currently optimized only for English text.
Stars: 16
Forks: 2
Language: C#
License: Apache-2.0
Category:
Last pushed: Jan 15, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ayutaz/uCosyVoice"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
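The same request can be made from a script. The sketch below is a minimal Python equivalent of the curl command, using only the standard library. It rests on two assumptions that this page does not confirm: that the path follows a category/owner/repo pattern (inferred from the single URL shown above), and that the endpoint returns JSON. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    one example URL shown on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the data for one repository.

    Assumption: the response body is JSON; the page does not document
    the response format.
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "ayutaz", "uCosyVoice")` issues the same GET request as the curl command. Unauthenticated calls count against the 100 requests/day limit, so caching the result locally is a reasonable choice for repeated use.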
Higher-rated alternatives
herimor/voxtream
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control
EveryVoiceTTS/EveryVoice
The EveryVoice TTS Toolkit - Text To Speech for your language
thorstenMueller/Thorsten-Voice
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be...
daswer123/xtts-webui
Webui for using XTTS and for finetuning it
kadirnar/VoiceHub
VoiceHub: A Unified Inference Interface for TTS Models