BogiHsu/WG-WaveNet

Real-Time High-Fidelity Speech Synthesis without GPU

/ 100

Emerging

This project helps create high-quality, natural-sounding speech from text, even on standard computers without powerful graphics cards. You provide written text, and it generates an audio file of someone speaking that text aloud. It's ideal for content creators, audiobook producers, or anyone needing to generate realistic spoken audio quickly and efficiently.

No commits in the last 6 months.

Use this if you need to generate high-fidelity spoken audio from text in real-time without requiring specialized, high-end GPU hardware.

Not ideal if you're looking for a complete text-to-speech system with pretrained models readily available for immediate use, as some components are still under development.

speech-synthesis audio-production voice-over content-creation digital-audio

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

fatchord/WaveRNN

WaveRNN Vocoder + TTS

shangeth/wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...

rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...

seungwonpark/melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights