Executedone/Chinese-FastSpeech2

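Continues training on the Biaobei (标贝) dataset and improves the original FastSpeech2 model by adding a prosody representation and a prosody prediction module, making the Chinese pronunciation more vivid and rhythmic.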

/ 100

Emerging

This project helps content creators and developers generate natural-sounding Mandarin Chinese speech from text. You provide Chinese text, and it produces an audio file with lively and rhythmic pronunciation. It's ideal for anyone needing realistic Chinese voiceovers for educational content, announcements, or interactive applications.

278 stars. No commits in the last 6 months.

Use this if you need to convert written Chinese text into expressive, high-quality spoken audio with accurate rhythm and intonation.

Not ideal if you require voices in languages other than Mandarin Chinese or need to generate speech with highly specific, custom voice characteristics beyond enhanced rhythm.

voice-synthesis Mandarin-speech audio-generation text-to-speech content-creation

No License · Stale 6m · No Package · No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 21 / 25

Stars 278

Forks

Language Python

License None

Higher-rated alternatives

TensorSpeech/TensorFlowTTS

TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...

lucasnewman/nanospeech

A simple, hackable text-to-speech system in PyTorch and MLX

Tomiinek/Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...

keonlee9420/STYLER

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...

jxzhanggg/nonparaSeq2seqVC_code

Implementation code of non-parallel sequence-to-sequence VC
