WelkinYang/EMPHASIS-pytorch
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
EMPHASIS helps you create spoken audio from text with specific emotional tones. You provide the text you want spoken and indicate the desired emotions, and it generates speech that conveys those feelings. This is for anyone like content creators, voiceover artists, or educators who need to produce expressive audio.
No commits in the last 6 months.
Use this if you need to transform written content into natural-sounding speech that carries distinct emotional nuances.
Not ideal if you need a simple text-to-speech system without emotion control, or if you require highly specific, custom voice characteristics.
Stars
15
Forks
3
Language
Python
License
—
Category
Last pushed
Mar 31, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/WelkinYang/EMPHASIS-pytorch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
stepfun-ai/Step-Audio-EditX
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...
lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System