pooya-mohammadi/audio-classification-pytorch
In this project, several approaches for training/finetuning an audio gender recognition is provided. The code can simply be used for any other audio classification task by simply changing the number of classes and the input dataset.
This project helps you automatically categorize audio recordings. You provide a list of audio files and their correct labels (e.g., "male" or "female"), and it generates a trained model that can predict the category of new audio. This is useful for researchers, data scientists, or anyone needing to sort or identify audio clips based on distinct features.
No commits in the last 6 months.
Use this if you have a collection of audio files that need to be automatically classified into predefined categories, such as identifying gender in speech.
Not ideal if you need to transcribe speech into text, identify specific words, or perform real-time audio processing on a live stream.
Stars
43
Forks
4
Language
Jupyter Notebook
License
—
Category
Last pushed
Jan 11, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/pooya-mohammadi/audio-classification-pytorch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
CouncilDataProject/speakerbox
Speakerbox: Fine-tune Audio Transformers for speaker identification.
CVxTz/music_genre_classification
music genre classification : LSTM vs Transformer
HHousen/speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer
palonso/MAEST
Pre-training, fine-tuning, and inference code with the MAEST models for music analysis applications.
icon-lab/HST
Official implementation of Hierarchical Spectrogram Transformers (HST)