WelkinYang/EMPHASIS-pytorch

EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System

/ 100

Experimental

EMPHASIS helps you create spoken audio from text with specific emotional tones. You provide the text you want spoken and indicate the desired emotions, and it generates speech that conveys those feelings. This is for anyone like content creators, voiceover artists, or educators who need to produce expressive audio.

No commits in the last 6 months.

Use this if you need to transform written content into natural-sounding speech that carries distinct emotional nuances.

Not ideal if you need a simple text-to-speech system without emotion control, or if you require highly specific, custom voice characteristics.

speech-synthesis voiceover content-creation audio-production e-learning

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

index-tts/index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

stepfun-ai/Step-Audio-EditX

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...

lucasnewman/f5-tts-mlx

Implementation of F5-TTS in MLX

unilight/seq2seq-vc

A sequence-to-sequence voice conversion toolkit.

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Explore Voice AI Tools

All categories Trending Voice AI directory Insights