MTG/mtg-jamendo-dataset
Metadata, scripts and baselines for the MTG-Jamendo dataset
This project provides metadata, scripts, and baselines for the MTG-Jamendo dataset, a collection built for music classification and analysis. It contains over 55,000 full audio tracks from Jamendo, each labeled by genre, instrument, and mood/theme. Music researchers and data scientists building automatic music tagging systems can use it to train and evaluate classification models.
Use this if you need a large, pre-tagged music dataset to develop and test algorithms for automatic music classification.
Not ideal if you're looking for a dataset of very specific, niche music genres or a small, curated set for manual review.
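As a rough illustration of what working with per-track labels by genre, instrument, and mood/theme can look like, here is a minimal sketch. It assumes a tab-separated metadata file with five fixed columns followed by one or more tags written as "category---tag". That layout, the sample rows, and the helper name parse_tracks are assumptions made for this example, not something stated on this page; check the repository for the actual file format.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample in the ASSUMED layout: five fixed columns, then one
# or more tab-separated tags written as "category---tag".
SAMPLE_TSV = (
    "TRACK_ID\tARTIST_ID\tALBUM_ID\tPATH\tDURATION\tTAGS\n"
    "track_0000001\tartist_000001\talbum_000001\t01/1.mp3\t210.5"
    "\tgenre---rock\tinstrument---guitar\n"
    "track_0000002\tartist_000002\talbum_000002\t02/2.mp3\t95.0"
    "\tmood/theme---happy\n"
)


def parse_tracks(handle):
    """Return {track_id: {"path", "duration", "tags"}} from a TSV handle."""
    reader = csv.reader(handle, delimiter="\t")
    next(reader)  # skip the header row
    tracks = {}
    for row in reader:
        if not row:
            continue  # ignore blank lines
        tags = defaultdict(list)
        # Every column after the fifth is treated as one tag.
        for raw in row[5:]:
            category, _, tag = raw.partition("---")
            tags[category].append(tag)
        tracks[row[0]] = {
            "path": row[3],
            "duration": float(row[4]),
            "tags": dict(tags),
        }
    return tracks


tracks = parse_tracks(io.StringIO(SAMPLE_TSV))
for track_id, info in tracks.items():
    print(track_id, info["tags"])
```

Grouping tags by category keeps genre, instrument, and mood/theme labels separate, which is convenient when training one classifier per category.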
Stars: 371
Forks: 48
Language: Python
License: Apache-2.0
Category:
Last pushed: Mar 18, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/MTG/mtg-jamendo-dataset"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
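The same request can be made from Python. The sketch below uses only the standard library. It assumes the endpoint returns a JSON body and that the URL path follows the pattern category/owner/repo seen in the curl example. Neither is documented on this page, and the function names build_quality_url and fetch_quality are made up for illustration.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, repo):
    """Build the request URL; the path pattern is inferred from one example."""
    return f"{BASE_URL}/{category}/{repo}"


def fetch_quality(category, repo, timeout=10.0):
    """Fetch the record for a repository.

    ASSUMPTION: the response body is JSON. Adjust the decoding step if
    the service returns something else.
    """
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(...) to perform the request.
    print(build_quality_url("ml-frameworks", "MTG/mtg-jamendo-dataset"))
```

With the keyless limit of 100 requests/day, it is worth caching responses locally rather than re-fetching the same repository.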
Related frameworks
tyiannak/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
libAudioFlux/audioFlux
A library for audio and music analysis, feature extraction.
CPJKU/madmom
Python audio and music signal processing library
spotify/pedalboard
🎛 🔊 A Python library for audio.
spotify/basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection