BogiHsu/WG-WaveNet
Real-Time High-Fidelity Speech Synthesis without GPU
This project helps create high-quality, natural-sounding speech from text, even on standard computers without powerful graphics cards. You provide written text, and it generates an audio file of someone speaking that text aloud. It's ideal for content creators, audiobook producers, or anyone needing to generate realistic spoken audio quickly and efficiently.
No commits in the last 6 months.
Use this if you need to generate high-fidelity spoken audio from text in real-time without requiring specialized, high-end GPU hardware.
Not ideal if you're looking for a complete text-to-speech system with pretrained models readily available for immediate use, as some components are still under development.
Stars
73
Forks
13
Language
Python
License
MIT
Category
Last pushed
Jul 29, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/BogiHsu/WG-WaveNet"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
fatchord/WaveRNN
WaveRNN Vocoder + TTS
shangeth/wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...
rishikksh20/iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...
seungwonpark/melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2)