sony/DiffRoll

PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model

Emerging

This project helps musicians, music producers, and researchers automatically convert audio recordings of piano music into a 'piano roll' format, which visually represents notes over time. You input an audio file of piano music, and it outputs a detailed piano roll that shows which notes were played, when, and for how long. This is ideal for anyone who needs to quickly get a structured, editable representation of a piano performance from an audio source.
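To make the piano roll output format concrete, here is a minimal sketch in plain Python. It is not DiffRoll's code or API: the frame rate, the binary 88-key matrix layout, and the helper name `roll_to_notes` are assumptions chosen for illustration. It shows how a roll encodes which notes were played, when, and for how long.

```python
# Illustrative only: a piano roll as a binary matrix (88 keys x time frames).
# The frame rate and helper name are assumptions, not DiffRoll's actual API.

FRAMES_PER_SECOND = 100  # hypothetical frame rate
LOWEST_MIDI_PITCH = 21   # A0, the lowest key on an 88-key piano


def roll_to_notes(roll, fps=FRAMES_PER_SECOND):
    """Convert a binary piano roll into (midi_pitch, onset_sec, duration_sec) tuples."""
    notes = []
    for key, row in enumerate(roll):
        start = None
        # Append a trailing 0 so a note still active at the end is closed.
        for frame, active in enumerate(list(row) + [0]):
            if active and start is None:
                start = frame
            elif not active and start is not None:
                notes.append((key + LOWEST_MIDI_PITCH, start / fps, (frame - start) / fps))
                start = None
    # Order notes by onset time, then by pitch.
    return sorted(notes, key=lambda n: (n[1], n[0]))


# Build an empty 88-key roll covering 2 seconds, then mark middle C (MIDI 60)
# as held from 0.5 s to 1.5 s.
roll = [[0] * (2 * FRAMES_PER_SECOND) for _ in range(88)]
for frame in range(50, 150):
    roll[60 - LOWEST_MIDI_PITCH][frame] = 1

notes = roll_to_notes(roll)
print(notes)  # [(60, 0.5, 1.0)]
```

Each row is one piano key and each column is one time frame, so a run of consecutive ones in a row corresponds to a single held note. A real transcription model would typically output per-frame probabilities that are thresholded before a step like this.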

No commits in the last 6 months.

Use this if you need to accurately convert piano audio recordings into a digital piano roll for editing, analysis, or further music production.

Not ideal if you need to transcribe instruments other than piano or require real-time transcription; it is designed for batch processing of pre-recorded audio.

music-transcription piano-performance audio-analysis music-production score-creation

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 14 / 25

Language: Jupyter Notebook

License: MIT

Higher-rated alternatives

huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

bghira/SimpleTuner

A general fine-tuning kit geared toward image/video/audio diffusion models.

mcmonkeyprojects/SwarmUI

SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an...

nateraw/stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

TheDesignFounder/DreamLayer

Benchmark diffusion models faster. Automate evals, seeds, and metrics for reproducible results.
