AkojimaSLP/Beamforming-for-speech-enhancement
simple delaysum, MVDR and CGMM-MVDR
This project helps you clean up noisy speech recordings, making voices clearer and easier to understand. You input a speech recording that has unwanted background noise, and it outputs a version of that recording where the speech is enhanced and the noise is reduced. It's designed for anyone working with audio who needs to improve the clarity of spoken words, such as researchers, audio engineers, or transcribers.
279 stars. No commits in the last 6 months.
Use this if you have speech recordings corrupted by noise and need a straightforward way to enhance the spoken content.
Not ideal if you need advanced deep learning-based speech enhancement or real-time processing for live audio streams.
Stars
279
Forks
82
Language
Python
License
—
Category
Last pushed
Jan 19, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/AkojimaSLP/Beamforming-for-speech-enhancement"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models