pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
This tool helps machine learning engineers and researchers prepare audio data for training AI models. It takes raw audio files and converts them into numerical representations like spectrograms or Mel-frequency cepstral coefficients (MFCCs), which are essential for deep learning tasks. The output is data structured in PyTorch tensors, ready for model training and experimentation.
2,838 stars. Actively maintained with 1 commit in the last 30 days.
Use this if you are building machine learning models for audio or speech applications and need to efficiently process and transform audio data using PyTorch.
Not ideal if you need a general-purpose audio editing or signal processing suite that doesn't focus on machine learning preparation.
Stars
2,838
Forks
764
Language
Python
License
BSD-2-Clause
Category
Last pushed
Mar 13, 2026
Commits (30d)
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/pytorch/audio"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
deezer/spleeter
Deezer source separation library including pretrained models.
audeering/opensmile
The Munich Open-Source Large-Scale Multimedia Feature Extractor
audeering/opensmile-python
Python package for openSMILE
markovka17/dla
Deep learning for audio processing