minyoungpark1/Speech-Enhancement
Unofficial implementation of SCP-GAN
This project helps audio engineers and researchers improve the clarity of speech recordings. It takes noisy speech audio as input and produces enhanced audio with reduced background noise. Its primary users are professionals working with audio processing, speech recognition, or acoustic research who need to clean up audio data.
No commits in the last 6 months.
Use this if you need to significantly reduce background noise from recorded speech to make it clearer for human listening or further processing.
Not ideal if you're looking for a plug-and-play application; this requires some technical setup and familiarity with machine learning environments.
Stars
18
Forks
1
Language
Python
License
—
Category
Last pushed
Jul 04, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/minyoungpark1/Speech-Enhancement"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PrunaAI/pruna
Pruna is a model optimization framework built for developers, enabling you to deliver faster,...
bytedance/LatentSync
Taming Stable Diffusion for Lip Sync!
haoheliu/AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead...