Audio Classification Transformers Transformer Models

Tools for classifying, detecting, and identifying audio events, speech, and sound types using transformer models. Includes speaker identification, sound event detection, environmental sound classification, and biomedical audio analysis. Does NOT include music generation, speech synthesis, or general audio processing without classification objectives.

There are 24 audio classification transformers models tracked. The highest-rated is CouncilDataProject/speakerbox at 44/100 with 60 stars.

Get all 24 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=audio-classification-transformers&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 CouncilDataProject/speakerbox

Speakerbox: Fine-tune Audio Transformers for speaker identification.

44
Emerging
2 CVxTz/music_genre_classification

music genre classification : LSTM vs Transformer

42
Emerging
3 HHousen/speaker-change-detection

Speaker change detection using SincNet and an LSTM/Transformer

40
Emerging
4 palonso/MAEST

Pre-training, fine-tuning, and inference code with the MAEST models for...

35
Emerging
5 icon-lab/HST

Official implementation of Hierarchical Spectrogram Transformers (HST)

30
Emerging
6 aaronstevenwhite/spectrans

Modular spectral transformer implementations in PyTorch with Fourier,...

29
Experimental
7 pooya-mohammadi/audio-classification-pytorch

In this project, several approaches for training/finetuning an audio gender...

26
Experimental
8 GiovanniIacuzzo/Classification-instruments

Automatic classification of musical instruments from audio spectrograms...

23
Experimental
9 8asic/mlpc2025-sound-event-detection

Competition-winning SED (Sound Event Detection) system that identifies audio...

23
Experimental
10 Rana-yamach/Music-Genre-Classification

Comparing SVM and Transformer (AST) models for classifying music genres...

21
Experimental
11 Walt-1091/Signal-to-Sequence-Transformer

🔍 Classify 1D signal data using a CNN + Transformer model, enabling advanced...

21
Experimental
12 nbathreya/Signal-to-Sequence-Transformer

Deep learning classifier for 1D signal data with transformer architecture.

20
Experimental
13 Vaioskn/song-identification-fingerprints-and-embeddings

Song identification combining landmark audio fingerprinting with...

20
Experimental
14 JaspreetSingh-exe/Music-Genre-Classification

This project builds a Music Genre Classification System using SVM, CNN,...

17
Experimental
15 Saiful185/AudioFuse

AudioFuse: Unified Spectral-Temporal Learning via a Hybrid ViT-1D CNN...

17
Experimental
16 sborquez/volcano-seismic

Automatic classification of seismic signals from Llaima volcano (Chile)

17
Experimental
17 omer-gulsoy/ML-ClassicalMusicEra

🎻 AI project classifying Classical Music eras (Baroque, Classical, Romantic,...

13
Experimental
18 qthuy2k1/audio-instrument-classification

A program for training audio classification model

13
Experimental
19 Ashu708907/Music-Genre-Classification-using-Spectrogram-images

🎵 Classify music genres by analyzing spectrogram images with machine...

13
Experimental
20 sayandeepmaity/luminator

Microphone Array-Based Direction of Arrival of Gunshot Detection .Gun...

13
Experimental
21 M4sum/save-forest-elephants

Detect elephant rumbles and gunshots on recordings made in the forests of...

11
Experimental
22 sayhitosandy/Transformer-Speech-Classifier-LM

Implementation and exploration of transformer models for speech segment...

11
Experimental
23 SagharShafaati/PD-Detection-Transformer-GMM

This repository contains the Python implementation for the article...

11
Experimental
24 pamudu123/HTDemucs

Hybrid Transformers for Audio Source Separation

10
Experimental