zsl24/Tacotron2-Mandarin-HiFiGAN

Implementation of TTS with combination of Tacotron2 and HiFi-GAN

/ 100

Emerging

This project helps create natural-sounding Mandarin Chinese speech from written text. You provide Mandarin text, and it generates an audio file of that text being spoken. This is useful for anyone who needs to convert written Mandarin into high-quality spoken audio, such as content creators, educators, or accessibility specialists.

No commits in the last 6 months.

Use this if you need to generate high-quality, natural-sounding Mandarin speech from text for applications like audiobooks, voiceovers, or learning materials.

Not ideal if you need to generate speech in languages other than Mandarin Chinese, or if you require real-time, low-latency speech generation for interactive applications.

Mandarin-speech-synthesis audio-content-creation e-learning-audio voiceover-production accessibility-tools

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 16 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

fatchord/WaveRNN

WaveRNN Vocoder + TTS

shangeth/wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...

rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...

seungwonpark/melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights