drethage/speech-denoising-wavenet
A neural network for end-to-end speech denoising
This tool helps improve the clarity of spoken audio by removing unwanted background noise. You provide it with noisy speech recordings, and it outputs cleaner versions of those recordings. This is useful for anyone working with audio that needs to be easily understood, like transcribers, audio engineers, or researchers analyzing speech data.
708 stars. No commits in the last 6 months.
Use this if you have audio recordings with significant background noise that makes the speech difficult to hear or process.
Not ideal if you need a solution compatible with the latest TensorFlow versions, as this project requires specific older software versions.
Stars
708
Forks
163
Language
Python
License
MIT
Category
Last pushed
Jul 06, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/drethage/speech-denoising-wavenet"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz...
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
iver56/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
lmnt-com/wavegrad
A fast, high-quality neural vocoder.
madhavmk/Noise2Noise-audio_denoising_without_clean_training_data
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise...