vliu15/adversarial-tts

End-to-end Text-to-Speech with Generative Adversarial Networks

Emerging

This project helps speech synthesis researchers and engineers create high-quality, natural-sounding synthetic speech from text. It takes written text and produces spoken audio, allowing you to build and experiment with speech generation models. It's ideal for those working on voice AI and automated narration.

No commits in the last 6 months.

Use this if you need to generate realistic human-like speech directly from written text for applications like virtual assistants or audio content creation.

Not ideal if you want a pre-trained, ready-to-use text-to-speech service that requires no model training or customization.

speech-synthesis voice-generation audio-engineering AI-narration virtual-assistants

Stale (6 months), no package, no dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 12 / 25

Language: Python

License: MIT

Higher-rated alternatives

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

fatchord/WaveRNN

WaveRNN Vocoder + TTS

shangeth/wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...

rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...

seungwonpark/melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)
