pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

/ 100

Established

This tool helps machine learning engineers and researchers prepare audio data for model training. It converts raw audio into numerical representations such as spectrograms or Mel-frequency cepstral coefficients (MFCCs), which are common input features for deep learning models. The output is a set of PyTorch tensors, ready for training and experimentation.

2,838 stars. Actively maintained with 1 commit in the last 30 days.

Use this if you are building machine learning models for audio or speech applications and need to efficiently process and transform audio data using PyTorch.

Not ideal if you need a general-purpose audio editing or signal processing suite that doesn't focus on machine learning preparation.

audio-analysis speech-recognition sound-classification machine-learning-engineering audio-feature-extraction

No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25


Stars: 2,838

Forks: 764

Language: Python

License: BSD-2-Clause

Related frameworks

asteroid-team/asteroid

The PyTorch-based audio source separation toolkit for researchers

deezer/spleeter

Deezer source separation library including pretrained models.

audeering/opensmile

The Munich Open-Source Large-Scale Multimedia Feature Extractor

audeering/opensmile-python

Python package for openSMILE

markovka17/dla

Deep learning for audio processing
