archinetai/audio-data-pytorch

A collection of useful audio datasets and transforms for PyTorch.

53
/ 100
Established

This tool helps machine learning engineers and researchers efficiently manage and preprocess various types of audio data for training machine learning models. It takes raw audio files from local folders, web datasets, or online sources like YouTube, and outputs pre-processed audio waveforms and associated metadata ready for model training. It's designed for anyone building speech recognition, audio classification, or other audio-centric AI applications.

144 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to quickly load, transform, and prepare diverse audio datasets for training PyTorch-based machine learning models.

Not ideal if you are looking for a GUI-based audio editing tool or a comprehensive data visualization platform for audio.

audio-machine-learning speech-technology sound-analysis AI-audio-datasets
Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 18 / 25

How are scores calculated?

Stars

144

Forks

23

Language

Python

License

MIT

Last pushed

Feb 11, 2023

Commits (30d)

0

Dependencies

13

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/archinetai/audio-data-pytorch"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.