seorim0/SE-using-SRL-Model
Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings
This project helps audio engineers and speech scientists improve the clarity and naturalness of spoken audio by removing background noise in real-time. It takes noisy speech recordings as input and produces enhanced speech that is easier to understand and sounds more natural. The primary users are researchers and practitioners working with audio data where clean speech is critical.
No commits in the last 6 months.
Use this if you need to enhance speech quality from noisy environments in live applications, ensuring low latency and improved intelligibility.
Not ideal if your primary goal is offline noise reduction without real-time constraints, or if you are working with non-speech audio.
Stars
19
Forks
2
Language
Python
License
MIT
Category
Last pushed
Jun 06, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/seorim0/SE-using-SRL-Model"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
iver56/audiomentations
A Python library for audio data augmentation. Useful for making audio ML models work well in the...
Rikorose/DeepFilterNet
Noise supression using deep filtering
torchsynth/torchsynth
A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.
marl/openl3
OpenL3: Open-source deep audio and image embeddings
archinetai/audio-data-pytorch
A collection of useful audio datasets and transforms for PyTorch.