sony/DiffRoll
PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model
This project helps musicians, music producers, and researchers automatically convert audio recordings of piano music into a 'piano roll' format, which visually represents notes over time. You input an audio file of piano music, and it outputs a detailed piano roll that shows which notes were played, when, and for how long. This is ideal for anyone who needs to quickly get a structured, editable representation of a piano performance from an audio source.
No commits in the last 6 months.
Use this if you need to accurately convert piano audio recordings into a digital piano roll for editing, analysis, or further music production.
Not ideal if you need to transcribe music from instruments other than piano or require real-time transcription, as this is for batch processing pre-recorded audio.
Stars
80
Forks
11
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Dec 06, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/sony/DiffRoll"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
bghira/SimpleTuner
A general fine-tuning kit geared toward image/video/audio diffusion models.
mcmonkeyprojects/SwarmUI
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an...
nateraw/stable-diffusion-videos
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
TheDesignFounder/DreamLayer
Benchmark diffusion models faster. Automate evals, seeds, and metrics for reproducible results.