ORI-Muchim/PolyLangVITS

Multi-speaker Speech Synthesis Using VITS (KO, JA, EN, ZH)

37 / 100 (Emerging)

This system helps creators, educators, and content producers generate speech from text in Korean, Japanese, English, and Chinese. You provide audio samples from different speakers in these languages, along with the text you want spoken. The system then outputs natural-sounding speech that matches the voices and languages of your samples.

No commits in the last 6 months.

Use this if you need to create custom, multi-speaker, multilingual voiceovers or audio content for projects like e-learning modules, video narration, or podcasts.

Not ideal if you lack a GPU with at least 12 GB of VRAM and a system with 16 GB of RAM.

Tags: multilingual-voiceover, audiobook-production, e-learning-content, media-localization, text-to-speech-synthesis
Status: Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 12 / 25

Stars: 75
Forks: 8
Language: Python
License: MIT
Last pushed: Feb 28, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ORI-Muchim/PolyLangVITS"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
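
The same request can be made from a script. Below is a minimal Python sketch using only the standard library. It assumes the endpoint returns a JSON body and that the URL path follows the pattern in the curl command above (category, then owner, then repository). The helper names `quality_url` and `fetch_quality` are hypothetical, and the response fields are not documented on this page, so the sketch just returns the parsed data as-is.

```python
import json
import urllib.request

# Base path taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    Assumption: the path is always <category>/<owner>/<repo>.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and parse the body as JSON.

    Assumption: the response is JSON; its schema is not described here,
    so the parsed object is returned unchanged.
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request and counts against the daily limit):
# data = fetch_quality("voice-ai", "ORI-Muchim", "PolyLangVITS")
# print(json.dumps(data, indent=2))
```

With the keyless limit of 100 requests/day, it is worth caching the result locally rather than calling the endpoint on every run.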