pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

64
/ 100
Established

This tool helps machine learning engineers and researchers prepare audio data for training AI models. It takes raw audio files and converts them into numerical representations like spectrograms or Mel-frequency cepstral coefficients (MFCCs), which are essential for deep learning tasks. The output is data structured in PyTorch tensors, ready for model training and experimentation.

2,838 stars. Actively maintained with 1 commit in the last 30 days.

Use this if you are building machine learning models for audio or speech applications and need to efficiently process and transform audio data using PyTorch.

Not ideal if you need a general-purpose audio editing or signal processing suite that doesn't focus on machine learning preparation.

audio-analysis speech-recognition sound-classification machine-learning-engineering audio-feature-extraction
No Package No Dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

How are scores calculated?

Stars

2,838

Forks

764

Language

Python

License

BSD-2-Clause

Last pushed

Mar 13, 2026

Commits (30d)

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/pytorch/audio"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.