archinetai/audio-data-pytorch
A collection of useful audio datasets and transforms for PyTorch.
This tool helps machine learning engineers and researchers efficiently manage and preprocess various types of audio data for training machine learning models. It takes raw audio files from local folders, web datasets, or online sources like YouTube, and outputs pre-processed audio waveforms and associated metadata ready for model training. It's designed for anyone building speech recognition, audio classification, or other audio-centric AI applications.
144 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to quickly load, transform, and prepare diverse audio datasets for training PyTorch-based machine learning models.
Not ideal if you are looking for a GUI-based audio editing tool or a comprehensive data visualization platform for audio.
Stars
144
Forks
23
Language
Python
License
MIT
Category
Last pushed
Feb 11, 2023
Commits (30d)
0
Dependencies
13
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/archinetai/audio-data-pytorch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
iver56/audiomentations
A Python library for audio data augmentation. Useful for making audio ML models work well in the...
Rikorose/DeepFilterNet
Noise supression using deep filtering
torchsynth/torchsynth
A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.
marl/openl3
OpenL3: Open-source deep audio and image embeddings
ductho-le/WaveDL
A Scalable Deep Learning Framework for Wave-Based Inverse Problems