lars76/fastspeech2-clean
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
This project helps generate natural-sounding speech from text for Chinese. You provide Chinese text (characters, Pinyin, or IPA), and it outputs an audio file that sounds like a human speaking that text. It's designed for speech synthesis researchers or developers creating applications that need high-quality, reproducible Chinese text-to-speech.
No commits in the last 6 months.
Use this if you need to generate high-quality, synthetic Chinese speech from text and are looking for a reliable, up-to-date implementation of FastSpeech2 or LightSpeech models.
Not ideal if you need a plug-and-play solution for speech generation in languages other than Chinese without any code adaptation.
Stars
18
Forks
7
Language
Python
License
MIT
Category
Last pushed
Aug 16, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/lars76/fastspeech2-clean"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...
lucasnewman/nanospeech
A simple, hackable text-to-speech system in PyTorch and MLX
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...
keonlee9420/STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...
jxzhanggg/nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC