netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
EmotiVoice helps content creators, marketers, educators, and storytellers convert written text into natural-sounding speech with expressive emotions. You provide text in English or Chinese, specify the desired emotion (such as happy, excited, or sad), and choose from over 2,000 unique voices. The output is high-quality audio suitable for videos, audiobooks, presentations, or customer service.
8,455 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to generate realistic, emotionally rich voiceovers for your content in English or Chinese, and want a wide variety of voices to choose from.
Not ideal if you need to generate speech in languages other than English or Chinese, or require extremely precise control over every nuance of speech beyond emotional prompts.
Stars: 8,455
Forks: 746
Language: Python
License: Apache-2.0
Category:
Last pushed: Aug 13, 2024
Commits (30d): 0
Dependencies: 18
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/netease-youdao/EmotiVoice"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
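The same request can be made from Python. The sketch below uses only the standard library and mirrors the curl command above. The names `build_quality_url` and `fetch_quality` are hypothetical helpers introduced for illustration. Reading the path as category, owner, and repository is an interpretation of the example URL, and the JSON response format is an assumption, since this page does not document the response.

```python
import json
import urllib.parse
import urllib.request

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repo given as 'owner/name'.

    Assumption: the path is <category>/<owner>/<name>, inferred from the
    single example URL on this page.
    """
    owner, name = repo.split("/", 1)
    # Percent-encode each path segment so unusual characters stay valid.
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, name)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository without an API key.

    Assumption: the response body is JSON; adjust the parsing if it is not.
    """
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "netease-youdao/EmotiVoice")` issues the same GET request as the curl command. Without a key, the 100 requests/day limit applies, so it is worth caching results when polling several repositories.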
Related tools
yeyupiaoling/MASR
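A PyTorch implementation of an automatic speech recognition framework with both streaming and non-streaming modes, compatible with online and offline recognition. It currently supports the Conformer, Squeezeformer, and DeepSpeech2 models, along with a variety of data augmentation methods.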
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
DigitalPhonetics/IMS-Toucan
Controllable and fast Text-to-Speech for over 7000 languages!
gabrielmittag/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment