pooya-mohammadi/audio-classification-pytorch

This project provides several approaches for training and fine-tuning an audio gender recognition model. The code can be adapted to any other audio classification task by changing the number of classes and the input dataset.

Experimental

This project helps you automatically categorize audio recordings. You provide a list of audio files and their correct labels (e.g., "male" or "female"), and it generates a trained model that can predict the category of new audio. This is useful for researchers, data scientists, or anyone needing to sort or identify audio clips based on distinct features.
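The input described above (audio files paired with labels) can be sketched as a small manifest from which the number of classes is derived. This is a minimal illustration only: the CSV column names `path` and `label`, the file paths, and the helper names `load_manifest` and `build_label_index` are hypothetical and are not taken from the repository.

```python
import csv
import io


def load_manifest(csv_file):
    """Read (path, label) pairs from a CSV with 'path' and 'label' columns.

    The column names are an assumption made for this sketch.
    """
    reader = csv.DictReader(csv_file)
    return [(row["path"], row["label"]) for row in reader]


def build_label_index(samples):
    """Map each distinct label to an integer id, in sorted label order."""
    labels = sorted({label for _, label in samples})
    return {label: i for i, label in enumerate(labels)}


# Hypothetical manifest; in practice this would be a file on disk.
manifest = io.StringIO(
    "path,label\n"
    "clips/a.wav,female\n"
    "clips/b.wav,male\n"
    "clips/c.wav,female\n"
)

samples = load_manifest(manifest)
label_to_id = build_label_index(samples)

# The classifier's output size follows from the labels in the dataset.
num_classes = len(label_to_id)

# Replace string labels with integer ids, as a training loop would expect.
encoded = [(path, label_to_id[label]) for path, label in samples]
```

Switching to a different task, for example from gender recognition to another set of categories, would then amount to supplying a manifest with different labels, with `num_classes` changing accordingly.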

No commits in the last 6 months.

Use this if you have a collection of audio files that need to be automatically classified into predefined categories, such as identifying gender in speech.

Not ideal if you need to transcribe speech into text, identify specific words, or perform real-time audio processing on a live stream.

audio-analysis speech-recognition sound-classification data-labeling machine-learning-research

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 10 / 25

Language: Jupyter Notebook

License: None

Higher-rated alternatives

CouncilDataProject/speakerbox

Speakerbox: Fine-tune Audio Transformers for speaker identification.

CVxTz/music_genre_classification

Music genre classification: LSTM vs Transformer

HHousen/speaker-change-detection

Speaker change detection using SincNet and an LSTM/Transformer

palonso/MAEST

Pre-training, fine-tuning, and inference code with the MAEST models for music analysis applications.

icon-lab/HST

Official implementation of Hierarchical Spectrogram Transformers (HST)
