Rongjiehuang/Multiband-WaveRNN

An unofficial implement of autoregressive vocoder Multiband-WaveRNN. Audio samples in https://rongjiehuang.github.io/Multiband-WaveRNN/

/ 100

Emerging

This project helps developers and researchers working on speech synthesis to generate high-quality audio from speech models. You provide an existing dataset of spoken audio, and it generates synthetic speech samples. It's intended for those who need to experiment with and implement autoregressive vocoder models.

No commits in the last 6 months.

Use this if you are a developer or researcher looking to implement and train an autoregressive vocoder model for speech generation.

Not ideal if you need a user-friendly, out-of-the-box solution for text-to-speech without coding or model training.

speech-synthesis audio-generation vocoder-development text-to-speech-research machine-learning-engineering

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

fatchord/WaveRNN

WaveRNN Vocoder + TTS

shangeth/wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...

rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...

seungwonpark/melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights