Wonbin-Jung/e3-vits
Official GitHub page of E3-VITS
E3-VITS helps voice actors, content creators, and educators generate natural-sounding speech with specific emotions. You provide text and, optionally, a reference audio clip that demonstrates the desired emotion, and it produces an audio file of that text spoken in the specified emotional tone. It's especially useful for creating voiceovers or characters with consistent emotional styles.
No commits in the last 6 months.
Use this if you need to synthesize speech that conveys specific emotions, whether from a text description or by mimicking an emotion from another speaker's voice.
Not ideal if you're looking for a simple text-to-speech converter without any emotional nuance or style transfer capabilities.
Stars: 9
Forks: 1
Language: HTML
License: none listed
Category: none listed
Last pushed: Jun 28, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Wonbin-Jung/e3-vits"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
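For scripted access, the same request can be sketched in Python using only the standard library. This is a minimal sketch, not a documented client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the path layout (category, then owner, then repository) is inferred from the single curl example above, and the response is assumed to be JSON, which the page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    one curl example on the page rather than from API documentation.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and parse the body.

    Assumption: the endpoint returns JSON; the page does not show a
    sample response, so no particular fields are relied on here.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(build_quality_url("voice-ai", "Wonbin-Jung", "e3-vits"))
```

Keeping URL construction separate from the network call makes it easy to check the request target without spending any of the 100 unauthenticated requests per day.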
Higher-rated alternatives
index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
stepfun-ai/Step-Audio-EditX
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...
lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System