xuan3986/UDDETTS
The first LLM that unifies discrete and dimensional emotions for controllable emotional TTS
This project helps create speech that conveys specific emotions by taking text and desired emotional parameters (like how happy or sad the voice should sound). It produces natural-sounding audio that can express a wide range of feelings. This tool is for researchers, content creators, or anyone needing to generate expressive voiceovers or dialogue with precise emotional control.
No commits in the last 6 months.
Use this if you need to generate high-quality, synthetic speech where you can precisely control the emotional tone and intensity of the voice.
Not ideal if you are looking for an out-of-the-box solution without any setup, as it requires downloading datasets and running training scripts.
Stars
8
Forks
1
Language
Python
License
Apache-2.0
Category
Last pushed
Sep 28, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/xuan3986/UDDETTS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
stepfun-ai/Step-Audio-EditX
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...
lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System