loubbrad/aria-midi

Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.

31
/ 100
Emerging

This dataset offers over a million MIDI files of transcribed solo piano recordings, complete with genre, composer, performer, and compositional details. It provides a rich resource for music researchers and developers who are building new generative music models or analyzing music patterns. You can take this collection of MIDI files and use it to train AI to compose music or understand musical structures.

No commits in the last 6 months.

Use this if you are a music researcher or an AI developer looking for a vast, high-quality dataset of solo piano MIDI files to train generative music models, conduct music information retrieval studies, or perform data analysis on musical compositions.

Not ideal if you are looking for audio files or a dataset of instrumental music beyond solo piano.

music-composition music-analysis generative-art music-education computational-musicology
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 4 / 25

How are scores calculated?

Stars

78

Forks

2

Language

License

Last pushed

Jun 19, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/loubbrad/aria-midi"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.