ORI-Muchim/PolyLangVITS
Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)
This system helps creators, educators, and content producers generate speech from text in multiple languages, including Korean, Japanese, English, and Chinese. You provide audio samples from different speakers in various languages, along with the text you want spoken. The system then outputs natural-sounding speech tailored to the distinct voices and languages of your input.
No commits in the last 6 months.
Use this if you need to create custom, multi-speaker, multilingual voiceovers or audio content for projects like e-learning modules, video narration, or podcasts.
Not ideal if you lack access to powerful computing resources, specifically a GPU with at least 12GB VRAM and a system with 16GB RAM.
Stars
75
Forks
8
Language
Python
License
MIT
Category
Last pushed
Feb 28, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ORI-Muchim/PolyLangVITS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
High-Logic/Genie-TTS
GPT-SoVITS ONNX Inference Engine & Model Converter
chinokikiss/GSV-TTS-Lite
GSV-TTS-Lite A high-performance inference engine specifically designed for the GPT-SoVITS...
FENRlR/MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
AlexandaJerry/vits-mandarin-biaobei
application of vits on mandarin tts