xuan3986/UDDETTS

The first LLM that unifies discrete and dimensional emotions for controllable emotional TTS

Experimental

This project generates speech that conveys specific emotions from input text and emotional parameters, such as how happy or sad the voice should sound. It produces natural-sounding audio that can express a wide range of feelings. It is aimed at researchers, content creators, and anyone who needs expressive voiceovers or dialogue with precise emotional control.

No commits in the last 6 months.

Use this if you need to generate high-quality, synthetic speech where you can precisely control the emotional tone and intensity of the voice.

Not ideal if you are looking for an out-of-the-box solution without any setup, as it requires downloading datasets and running training scripts.

Tags: emotional-speech-synthesis, voiceover-generation, AI-audio-production, human-computer-interaction, digital-voice-creation

Status: Stale (6 months), No Package, No Dependents

Maintenance 2 / 25

Adoption 4 / 25

Maturity 15 / 25

Community 8 / 25

Language: Python

License: Apache-2.0

Higher-rated alternatives

index-tts/index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

stepfun-ai/Step-Audio-EditX

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...

lucasnewman/f5-tts-mlx

Implementation of F5-TTS in MLX

unilight/seq2seq-vc

A sequence-to-sequence voice conversion toolkit.

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System
