Executedone/Chinese-FastSpeech2
基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏
This project helps content creators and developers generate natural-sounding Mandarin Chinese speech from text. You provide Chinese text, and it produces an audio file with lively and rhythmic pronunciation. It's ideal for anyone needing realistic Chinese voiceovers for educational content, announcements, or interactive applications.
278 stars. No commits in the last 6 months.
Use this if you need to convert written Chinese text into expressive, high-quality spoken audio with accurate rhythm and intonation.
Not ideal if you require voices in languages other than Mandarin Chinese or need to generate speech with highly specific, custom voice characteristics beyond enhanced rhythm.
Stars
278
Forks
49
Language
Python
License
—
Category
Last pushed
Sep 10, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Executedone/Chinese-FastSpeech2"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...
lucasnewman/nanospeech
A simple, hackable text-to-speech system in PyTorch and MLX
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...
keonlee9420/STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...
jxzhanggg/nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC