Kyubyong/cross_vc
Cross-lingual Voice Conversion
This project helps anyone who needs to take a voice from one speaker and make it speak in multiple different languages, while retaining the original speaker's unique vocal characteristics. You provide existing audio recordings of a voice and desired text in different languages, and it generates new audio files where the original voice speaks the new languages. This could be used by content creators, language educators, or virtual assistant developers.
No commits in the last 6 months.
Use this if you need to generate speech in multiple languages using a specific speaker's voice, rather than a generic text-to-speech voice.
Not ideal if you require production-ready quality speech conversion, as the current results are acknowledged to be imperfect.
Stars
97
Forks
25
Language
Python
License
Apache-2.0
Category
Last pushed
Feb 05, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Kyubyong/cross_vc"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
stepfun-ai/Step-Audio-EditX
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...
lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System