bootphon/learnable-strf
Learnable STRF, from Riad et al. 2021 JASA
This project provides learnable spectro-temporal receptive field (STRF) filters for analyzing complex audio signals. It takes raw audio data or existing sound representations and extracts spectro-temporal features from them. It is aimed at acoustic scientists and machine learning engineers who want better sound representations for audio tasks.
No commits in the last 6 months.
Use this if you need to extract advanced spectro-temporal features from audio data for tasks like speech activity detection, speaker verification, or urban sound classification.
Not ideal if you want a complete, out-of-the-box solution for general audio processing that does not require implementing custom neural network architectures.
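To make "spectro-temporal features" more concrete, here is a minimal pure-Python sketch of filtering a spectrogram with a single 2D kernel over (frequency, time) bins. It is an illustration only and not the repository's API: the Gabor-shaped kernel, the function names `gabor_kernel` and `filter_spectrogram`, and all parameter values are assumptions chosen for the example. In a learnable setting, parameters such as the modulation rates would be trained rather than fixed.

```python
import math


def gabor_kernel(n_freq, n_time, freq_mod, time_mod, sigma_f, sigma_t):
    """Real part of a 2D Gabor filter over (frequency, time) bins.

    Hypothetical helper: freq_mod and time_mod are modulation rates in
    cycles per bin, sigma_f and sigma_t are Gaussian envelope widths in bins.
    """
    kernel = []
    for i in range(n_freq):
        f = i - (n_freq - 1) / 2  # frequency offset from the kernel centre
        row = []
        for j in range(n_time):
            t = j - (n_time - 1) / 2  # time offset from the kernel centre
            envelope = math.exp(-0.5 * ((f / sigma_f) ** 2 + (t / sigma_t) ** 2))
            carrier = math.cos(2 * math.pi * (freq_mod * f + time_mod * t))
            row.append(envelope * carrier)
        kernel.append(row)
    return kernel


def filter_spectrogram(spec, kernel):
    """'Valid' 2D cross-correlation of a spectrogram with a kernel.

    spec is a list of frequency rows, each a list of time frames.
    """
    kf, kt = len(kernel), len(kernel[0])
    out_f = len(spec) - kf + 1
    out_t = len(spec[0]) - kt + 1
    return [
        [
            sum(spec[i + a][j + b] * kernel[a][b]
                for a in range(kf) for b in range(kt))
            for j in range(out_t)
        ]
        for i in range(out_f)
    ]


# Toy spectrogram: 5 frequency bins by 6 time frames (made-up values).
toy_spec = [[float(i + j) for j in range(6)] for i in range(5)]
toy_kernel = gabor_kernel(3, 3, freq_mod=0.0, time_mod=0.25, sigma_f=1.0, sigma_t=1.0)
features = filter_spectrogram(toy_spec, toy_kernel)
```

Each output value is the response of one spectro-temporal filter at one (frequency, time) position. A bank of such kernels with different modulation rates would yield one feature map per kernel.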
Stars: 13
Forks: 1
Language: Python
License: MIT
Category:
Last pushed: Aug 21, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/bootphon/learnable-strf"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
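The same request can be made from Python with the standard library. This is a minimal sketch under two assumptions that the listing does not confirm: that the endpoint returns JSON, and that other repositories follow the same `category/owner/repo` path pattern as the URL above. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example; the path pattern is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL for one repository (hypothetical helper)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch and decode the response, assuming the body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("ml-frameworks", "bootphon", "learnable-strf")` issues the same GET request as the curl command. Since the unauthenticated limit is 100 requests/day, caching the result locally is advisable for repeated use.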
Higher-rated alternatives
iver56/audiomentations
A Python library for audio data augmentation. Useful for making audio ML models work well in the...
Rikorose/DeepFilterNet
Noise suppression using deep filtering
torchsynth/torchsynth
A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.
marl/openl3
OpenL3: Open-source deep audio and image embeddings
archinetai/audio-data-pytorch
A collection of useful audio datasets and transforms for PyTorch.