Natooz/MidiTok
MIDI / symbolic music tokenizers for Deep Learning models 🎶
This tool helps music researchers and AI developers prepare symbolic music data for machine learning models. It takes MIDI or ABC music files as input and converts them into sequences of tokens, which are the numerical representations that AI models can understand. The output is a structured dataset ready for tasks like music generation, transcription, or analysis. It's designed for researchers and practitioners working on AI applications in music.
855 stars. Actively maintained with 4 commits in the last 30 days. Available on PyPI.
Use this if you need to transform MIDI or ABC music files into a tokenized format suitable for training deep learning models for music AI tasks.
Not ideal if you are a musician looking for a digital audio workstation or a tool for direct music composition and production.
Stars
855
Forks
98
Language
Python
License
MIT
Category
Last pushed
Mar 02, 2026
Commits (30d)
4
Dependencies
5
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Natooz/MidiTok"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
salu133445/muspy
A toolkit for symbolic music generation
jacbz/Lofi
ML-supported lo-fi music generator
jisungk/deepjazz
Deep learning driven jazz generation using Keras & Theano!
mdeff/fma
FMA: A Dataset For Music Analysis
cjbayron/autochord
Automatic Chord Recognition tools - ISMIR2021 Late-Breaking Demo presentation