IDEA-Emdoor-Lab/UniTTS

A TTS Trained on Universal Audio.

24 / 100 (Experimental)

This project helps generate natural-sounding speech from text, particularly for Chinese. You provide the system with text you want to convert to speech, and optionally, a reference audio clip and its corresponding text to guide the voice's style. The output is high-quality, expressive audio that captures subtle emotional nuances. This is ideal for content creators, educators, or anyone needing realistic spoken audio for Chinese applications.

No commits in the last 6 months.

Use this if you need to generate highly natural and emotionally expressive Chinese speech from text, especially when you want to match a specific voice style or handle conversational and complex narratives.

Not ideal if your primary need is for speech synthesis in languages other than Chinese and English, or if you require an extremely lightweight, low-resource system for basic text-to-speech without advanced emotional modeling.

audio-content-creation speech-synthesis language-education voice-acting multimedia-localization
No License, Stale 6m, No Package, No Dependents
Maintenance 2 / 25
Adoption 7 / 25
Maturity 7 / 25
Community 8 / 25

Stars: 41
Forks: 3
Language: Python
License: none
Category: text-to-speech
Last pushed: Jun 06, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/IDEA-Emdoor-Lab/UniTTS"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
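
The same request can be made from Python. The sketch below is illustrative only: it assumes the endpoint returns JSON and that the path follows the category/owner/repo pattern seen in the curl example, neither of which this page confirms. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and no API-key handling is shown because the key mechanism is not described here.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality endpoint URL (hypothetical helper).

    Assumes the category/owner/repo path pattern from the curl example
    also applies to other projects.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the quality data (hypothetical helper).

    Assumes the response body is JSON; the schema is not documented here,
    so the decoded object is returned as-is.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "IDEA-Emdoor-Lab", "UniTTS")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, caching the result locally is likely worthwhile for repeated lookups.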