GiovanniIacuzzo/Classification-instruments
Automatic classification of musical instruments from audio spectrograms using CNN, Vision Transformer (ViT) embeddings, XGBoost, and Random Forest. Includes feature extraction, model comparison, and visualization of results.
Automatically identify which musical instruments are present in an audio recording. You provide raw audio files, and the system processes them into visual representations (spectrograms) to determine if a guitar, piano, violin, viola, or flute is playing. This is for researchers or enthusiasts interested in music information retrieval and audio analysis.
No commits in the last 6 months.
Use this if you need to classify specific musical instruments from audio files and want to compare deep learning and traditional machine learning approaches.
Not ideal if you need to classify a wide range of instruments beyond the five specified (guitar, piano, violin, viola, flute), or if you are looking for real-time audio processing.
Stars
11
Forks
—
Language
Jupyter Notebook
License
—
Category
Last pushed
Sep 29, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/GiovanniIacuzzo/Classification-instruments"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
CouncilDataProject/speakerbox
Speakerbox: Fine-tune Audio Transformers for speaker identification.
CVxTz/music_genre_classification
music genre classification : LSTM vs Transformer
HHousen/speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer
palonso/MAEST
Pre-training, fine-tuning, and inference code with the MAEST models for music analysis applications.
icon-lab/HST
Official implementation of Hierarchical Spectrogram Transformers (HST)