loubbrad/aria-midi
Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.
This dataset offers over a million MIDI files of transcribed solo piano recordings, complete with genre, composer, performer, and compositional details. It provides a rich resource for music researchers and developers who are building new generative music models or analyzing music patterns. You can take this collection of MIDI files and use it to train AI to compose music or understand musical structures.
No commits in the last 6 months.
Use this if you are a music researcher or an AI developer looking for a vast, high-quality dataset of solo piano MIDI files to train generative music models, conduct music information retrieval studies, or perform data analysis on musical compositions.
Not ideal if you are looking for audio files or a dataset of instrumental music beyond solo piano.
Stars
78
Forks
2
Language
—
License
—
Category
Last pushed
Jun 19, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/loubbrad/aria-midi"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Natooz/MidiTok
MIDI / symbolic music tokenizers for Deep Learning models 🎶
salu133445/muspy
A toolkit for symbolic music generation
jacbz/Lofi
ML-supported lo-fi music generator
jisungk/deepjazz
Deep learning driven jazz generation using Keras & Theano!
mdeff/fma
FMA: A Dataset For Music Analysis