will-rice/diffwave

TensorFlow 2.0 Implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis. (WIP)

/ 100

Experimental

This project offers a versatile diffusion model for generating audio. It takes in various parameters to synthesize new sounds, music, or speech. Audio researchers, sound designers, or AI practitioners exploring novel audio generation techniques would find this useful.

No commits in the last 6 months.

Use this if you are an audio researcher or sound designer looking to experiment with a diffusion model for creating high-quality, synthetic audio from scratch.

Not ideal if you need a production-ready, highly optimized audio synthesis tool, as this is a work-in-progress.

audio-synthesis sound-design music-generation speech-synthesis AI-audio-research

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

Apache-2.0

Higher-rated alternatives

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

fatchord/WaveRNN

WaveRNN Vocoder + TTS

shangeth/wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...

rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...

seungwonpark/melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights