BingYang-20/SRP-DNN
A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
This project helps researchers and engineers analyze audio recordings to pinpoint the exact location (azimuth and elevation) of multiple moving sound sources. By taking raw multi-channel audio data from microphone arrays, it generates a spatial map showing where each sound originates. This is ideal for acousticians, robotics engineers, or surveillance analysts.
No commits in the last 6 months.
Use this if you need to accurately track the 3D position of one or more moving sound sources, even in noisy or reverberant environments.
Not ideal if your primary goal is speech recognition or speaker identification, rather than source localization.
Stars
60
Forks
13
Language
Python
License
MIT
Category
Last pushed
Sep 28, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/BingYang-20/SRP-DNN"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
deezer/spleeter
Deezer source separation library including pretrained models.
audeering/opensmile
The Munich Open-Source Large-Scale Multimedia Feature Extractor
audeering/opensmile-python
Python package for openSMILE