keshik6/grafting
[NeurIPS 2025 Oral] Official Code for Exploring Diffusion Transformer Designs via Grafting
This project implements 'grafting', a method for efficiently exploring new diffusion transformer designs. Starting from existing, pretrained diffusion transformers, it lets you modify internal components, such as attention mechanisms or MLPs, without the extensive computational cost of training from scratch. It targets machine learning researchers and engineers who want to quickly experiment with and evaluate novel generative architectures.
Use this if you are a researcher or engineer looking to rapidly prototype and test new Diffusion Transformer architectures and evaluate their impact on image generation quality and speed.
Not ideal if you are looking for a plug-and-play image generation tool and do not want to delve into model architecture modifications.
Stars
72
Forks
2
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Jan 09, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/keshik6/grafting"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
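If you prefer to call the endpoint from Python instead of curl, a minimal sketch follows. It only builds the request URL for a given owner/repo pair and fetches the JSON with the standard library; the response schema is not documented above, so the code makes no assumptions about its fields.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example above.
BASE = "https://pt-edge.onrender.com/api/v1/quality/diffusion"

def quality_url(owner: str, repo: str) -> str:
    """Build the quality-API endpoint URL for a GitHub owner/repo pair."""
    return f"{BASE}/{quote(owner)}/{quote(repo)}"

def fetch_quality(owner: str, repo: str, timeout: float = 10.0) -> dict:
    """Fetch and decode the JSON response (schema not specified here)."""
    with urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.load(resp)

if __name__ == "__main__":
    # Network call; equivalent to the curl command above.
    print(fetch_quality("keshik6", "grafting"))
```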
Higher-rated alternatives
FlorianFuerrutter/genQC
Generative Quantum Circuits
horseee/DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
Gen-Verse/MMaDA
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion,...
kuleshov-group/mdlm
[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model
Shark-NLP/DiffuSeq
[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models