Lallapallooza/fast-audiomentations
⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
This tool helps machine learning engineers and audio researchers quickly prepare large batches of audio data for training models. You feed in raw audio recordings and it outputs the same recordings with various modifications like added noise or filtered sounds. This is for professionals building, training, or evaluating AI models that process sound, such as speech recognition or sound event detection systems.
No commits in the last 6 months.
Use this if you need to rapidly augment massive datasets of audio files for machine learning training and have access to a powerful GPU.
Not ideal if you're only working with small batches of audio or don't have GPU hardware available.
Stars
35
Forks
1
Language
Python
License
MIT
Category
Last pushed
Jan 19, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Lallapallooza/fast-audiomentations"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
iver56/audiomentations
A Python library for audio data augmentation. Useful for making audio ML models work well in the...
Rikorose/DeepFilterNet
Noise supression using deep filtering
torchsynth/torchsynth
A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.
marl/openl3
OpenL3: Open-source deep audio and image embeddings
archinetai/audio-data-pytorch
A collection of useful audio datasets and transforms for PyTorch.