will-rice/diffwave
TensorFlow 2.0 Implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis. (WIP)
This project is a work-in-progress TensorFlow 2.0 implementation of DiffWave, a diffusion model that generates audio such as speech, music, or other sounds. It is aimed at audio researchers, sound designers, and AI practitioners exploring diffusion-based audio generation.
No commits in the last 6 months.
Use this if you are an audio researcher or sound designer who wants to experiment with a diffusion model for generating high-quality synthetic audio from scratch.
Not ideal if you need a production-ready, highly optimized audio synthesis tool: this is a work in progress.
Stars: 8
Forks: not listed
Language: Python
License: Apache-2.0
Category: not listed
Last pushed: Sep 06, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/will-rice/diffwave"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
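The same request can be made from Python. The sketch below is a minimal, unofficial example using only the standard library. It assumes the URL path follows a category/owner/repository pattern (inferred from the single curl example above) and that the endpoint returns JSON; neither is confirmed here. The helper names quality_url and fetch_quality are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from
    the one documented example and not from an API reference.
    """
    parts = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(p, safe="") for p in parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. No API key is sent, so the
    keyless daily request limit mentioned above applies.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(quality_url("voice-ai", "will-rice", "diffwave"))
```

Building the URL in a separate function keeps the network call isolated, so the path logic can be checked without spending any of the daily request quota.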
Higher-rated alternatives
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
fatchord/WaveRNN
WaveRNN Vocoder + TTS
shangeth/wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...
rishikksh20/iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...
seungwonpark/melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2)