ORI-Muchim/PolyLangVITS

Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)

/ 100

Emerging

This system helps creators, educators, and content producers generate speech from text in multiple languages, including Korean, Japanese, English, and Chinese. You provide audio samples from different speakers in various languages, along with the text you want spoken. The system then outputs natural-sounding speech tailored to the distinct voices and languages of your input.

No commits in the last 6 months.

Use this if you need to create custom, multi-speaker, multilingual voiceovers or audio content for projects like e-learning modules, video narration, or podcasts.

Not ideal if you lack access to powerful computing resources, specifically a GPU with at least 12GB VRAM and a system with 16GB RAM.

multilingual-voiceover audiobook-production e-learning-content media-localization text-to-speech-synthesis

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

High-Logic/Genie-TTS

GPT-SoVITS ONNX Inference Engine & Model Converter

chinokikiss/GSV-TTS-Lite

GSV-TTS-Lite A high-performance inference engine specifically designed for the GPT-SoVITS...

FENRlR/MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

AlexandaJerry/vits-mandarin-biaobei

application of vits on mandarin tts

Explore Voice AI Tools

All categories Trending Voice AI directory Insights