Compositional T2I Generation Diffusion Models

Tools for enhancing spatial reasoning, multi-concept composition, and fine-grained control in text-to-image diffusion models through architectural improvements and guidance techniques. Does NOT include general T2I generation, LoRA training, or personalization fine-tuning methods.

There are 144 compositional t2i generation models tracked. 4 score above 50 (established tier). The highest-rated is UCSC-VLAA/story-iter at 65/100 with 949 stars. 1 of the top 10 are actively maintained.

Get all 144 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=diffusion&subcategory=compositional-t2i-generation&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 UCSC-VLAA/story-iter

[ICLR 2026] A Training-free Iterative Framework for Long Story Visualization

65
Established
2 PaddlePaddle/PaddleMIX

Paddle Multimodal Integration and eXploration, supporting mainstream...

61
Established
3 keivalya/mini-vla

a minimal, beginner-friendly VLA to show how robot policies can fuse images,...

57
Established
4 adobe-research/custom-diffusion

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

51
Established
5 byliutao/1Prompt1Story

🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent...

49
Emerging
6 zai-org/ImageReward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for...

48
Emerging
7 JyChen9811/FaithDiff

[CVPR 2025] FaithDiff for Classic Film Rejuvenation, Old Photo Revival,...

47
Emerging
8 lmxyy/sige

[NeurIPS 2022, T-PAMI 2023] Efficient Spatially Sparse Inference for...

45
Emerging
9 haoyangzheng-ai/didi-instruct

[ICLR 2026] Discrete Diffusion Divergence Instruct (DiDi-Instruct)

45
Emerging
10 HorizonWind2004/reconstruction-alignment

[ICLR 2026] Official repo of paper "Reconstruction Alignment Improves...

45
Emerging
11 mit-han-lab/lpd

[ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient...

44
Emerging
12 bytedance/UNO

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and...

44
Emerging
13 ankanbhunia/PIDM

Person Image Synthesis via Denoising Diffusion Model (CVPR 2023)

44
Emerging
14 grigorisg9gr/polynomial_nets

Official Implementation of the CVPR'20 paper 'Π-nets: Deep Polynomial Neural...

44
Emerging
15 YixunLiang/UniTEX

Official implementation of "UniTEX: Universal High Fidelity Generative...

43
Emerging
16 energy-based-model/Compositional-Visual-Generation-with-Composable-Diffusion-Models-PyTorch

[ECCV 2022] Compositional Generation using Diffusion Models

43
Emerging
17 yuval-alaluf/Attend-and-Excite

Official Implementation for "Attend-and-Excite: Attention-Based Semantic...

43
Emerging
18 OpenDriveLab/Nexus

[ICCV 2025] Nexus: Decoupled Diffusion Sparks Adaptive Scene Generation

43
Emerging
19 ziqihuangg/Collaborative-Diffusion

[CVPR 2023] Collaborative Diffusion

43
Emerging
20 open-mmlab/PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by...

43
Emerging
21 foivospar/Arc2Face

[ECCV 2024 Oral 🔥] Arc2Face: A Foundation Model for ID-Consistent Human...

43
Emerging
22 H-EmbodVis/MERGE

[NeurIPS 2025] More Than Generation: Unifying Generation and Depth...

42
Emerging
23 youngwanLEE/sdxl-koala

[NeurIPS 2024] Empirical Lessons Toward Memory-Efficient and Fast Diffusion...

42
Emerging
24 sihyun-yu/REPA

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion...

42
Emerging
25 WindVChen/Diff-Harmonization

A novel zero-shot image harmonization method based on Diffusion Model Prior.

42
Emerging
26 yandex-research/swd

[ICLR'2026] Scale-wise Distillation of Diffusion Models

41
Emerging
27 AlaaLab/InstructCV

[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned...

41
Emerging
28 ExplainableML/ReNO

[NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through...

41
Emerging
29 nupurkmr9/concept-ablation

Ablating Concepts in Text-to-Image Diffusion Models (ICCV 2023)

41
Emerging
30 limuloo/MIGC

[CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)

41
Emerging
31 gojasper/flash-diffusion

⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few...

40
Emerging
32 blurgyy/CoMPaSS

[ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models

38
Emerging
33 junkunyuan/NexusAlign

A unified and extensible framework for aligning foundation models.

38
Emerging
34 NeuralTextualInversion/NeTI

Official Implementation for "A Neural Space-Time Representation for...

38
Emerging
35 M-E-AGI-Lab/PSAlign

Official Implementation of "PSAlign: Personalized Safety Alignment for...

38
Emerging
36 aminK8/KnobGen

CVPR 2025 Workshop on CVEU.

37
Emerging
37 muzishen/IMAGPose

[NeurIPS 2024] 🕺IMAGPose🕺: A Unified Conditional Framework for Pose-Guided...

37
Emerging
38 VinAIResearch/DiMSUM

DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method...

37
Emerging
39 huanngzh/Parts2Whole

[TIP 2025] From Parts to Whole: A Unified Reference Framework for...

37
Emerging
40 ashutosh1919/mdp-diffusion

Text-guided image editing by manipulating diffusion path without any training.

37
Emerging
41 sled-group/CycleNet

[NeurIPS 2023] Official Code for CycleNet: Rethinking Cycle Consistent in...

37
Emerging
42 kfirgoldberg/ConceptLab

Official Implementation for "ConceptLab: Creative Generation using Diffusion...

37
Emerging
43 CVL-UESTC/Internal-Guidance

CVPR 2026-Guiding a Diffusion Transformer with the Internal Dynamics of Itself (IG)

36
Emerging
44 customdiffusion360/custom-diffusion360

CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control

36
Emerging
45 gudaochangsheng/RefAlign

Official PyTorch implementation of RefAlign: Representation Alignment for...

36
Emerging
46 JIA-Lab-research/RIVAL

[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion...

36
Emerging
47 universome/alis

[ICCV 2021] Aligning Latent and Image Spaces to Connect the Unconnectable

36
Emerging
48 baojudezeze/RMP-Adapter

The implementation of RMP-Adapter: A region-based Multiple Prompt Adapter...

36
Emerging
49 YangLing0818/IterComp

[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from...

36
Emerging
50 tgxs002/align_sd

Better Aligning Text-to-Image Models with Human Preference. ICCV 2023

35
Emerging
51 ChenDarYen/Key-Locked-Rank-One-Editing-for-Text-to-Image-Personalization

An Pytorch implementation of the paper Key-Locked Rank One Editing for...

35
Emerging
52 AIDC-AI/TeEFusion

TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance (ICCV 2025)

35
Emerging
53 RockeyCoss/SPO

[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic...

35
Emerging
54 lzyhha/VisualCloze

[ICCV 2025] VisualCloze: A universal image generation framework that can...

35
Emerging
55 boschresearch/Divide-and-Bind

Official implementation of "Divide & Bind Your Attention for Improved...

35
Emerging
56 Nikolai10/PerCo

PyTorch implementation of PerCo (Towards Image Compression with Perfect...

34
Emerging
57 HKUST-LongGroup/Coarse-guided-Gen

[arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual...

34
Emerging
58 guillaumejs2403/TIME

Text-to-Image Models for Counterfactual Explanations: a black-box approach...

34
Emerging
59 ChenWu98/generative-visual-prompt

[NeurIPS 2022] (Amortized) distributional control for pre-trained generative models

34
Emerging
60 mapooon/Face2Diffusion

[CVPR 2024] Face2Diffusion for Fast and Editable Face Personalization...

34
Emerging
61 joanrod/figure-diffusion

Generating figures from research papers, using textual captions from the paper.

34
Emerging
62 quickgrid/text-to-image-diffusion

Experimental (working!) custom implementation of conditional and...

33
Emerging
63 zhiyichin/P4D

[ICML 2024] Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models...

33
Emerging
64 ewrfcas/LeftRefill

LeftRefill: Filling Right Canvas based on Left Reference through Generalized...

33
Emerging
65 Ammmob/PixelSmile

PixelSmile: Fine-grained facial expression editing with continuous control,...

33
Emerging
66 ChenWu98/unified-generative-zoo

[ICCV 2023] https://arxiv.org/abs/2210.05559

33
Emerging
67 kongzhecn/OMG

[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In...

33
Emerging
68 TsingZ0/FedKTL

CVPR 2024 accepted paper, An Upload-Efficient Scheme for Transferring...

33
Emerging
69 thecrazymage/CasTex

[WACV 2026] CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture...

33
Emerging
70 xie-lab-ml/CoRe2

[TPAMI] The official implementation of our paper "CoRe^2: Collect, Reflect...

33
Emerging
71 HVision-NKU/ImageCritic

Official implementation of ImageCritic (CVPR 2026)

33
Emerging
72 SPRIGHT-T2I/SPRIGHT

[ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving...

32
Emerging
73 yuxin-jiang/Anomagic

[AAAI 2026] The Official Implementation for "Anomagic: Crossmodal...

32
Emerging
74 IBM/DiffuseKronA

DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized...

32
Emerging
75 opendilab/PRG

[ICCV 2025] Pretrained Reversible Generation as Unsupervised Visual...

32
Emerging
76 bytedance-fanqie-ai/MOSAIC

[ICLR 2026]🔥🔥🔥MOSAIC: Multi-Subject Personalized Generation via...

31
Emerging
77 Nithin-GK/UniteandConquer

[CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using...

31
Emerging
78 VAST-AI-Research/SeqTex

[SIGGRAPH Asia 2025] Official github repo of SeqTex, an end-to-end 3D...

31
Emerging
79 haoningwu3639/MegaFusion

[WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution...

31
Emerging
80 DeepakSridhar/fgdm

[NeurIPS 2024] Factor Graph Diffusion Models for Improved Prompt Alignment,...

31
Emerging
81 Raghuram-Veeramallu/DiffTransBEV

BEV Representation of an Autonomous car using 6 RGB cameras by making use of...

31
Emerging
82 p-lambda/composed_finetuning

Code for the ICML 2021 paper "Composed Fine-Tuning: Freezing Pre-Trained...

31
Emerging
83 pyladiesams/personalization-with-text-to-image-diffusion-models-feb2024

Get familiar with different fine-tuning techniques for text-to-image models,...

31
Emerging
84 mofayezi/RobuText

[CVPRW 2023] Official implementation of "Benchmarking Robustness to...

31
Emerging
85 dsshim0125/s2p

"S2P: State-conditioned Image Synthesis for Data Augmentation in Offline...

31
Emerging
86 LiyaoJiang1998/RAISE

"RAISE: Requirement-Adaptive Evolutionary Refinement for Training-Free...

31
Emerging
87 rabiulcste/vismin

[NeurIPS24] VisMin: Visual Minimal-Change Understanding

30
Emerging
88 AIDC-AI/CHATS

CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for...

30
Emerging
89 Nithin-GK/MaxFusion

[ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image...

30
Emerging
90 showlab/BoxDiff

[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free...

30
Emerging
91 zelaki/ReDi

[NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint...

30
Emerging
92 koi953215/NaRCan

[NeurIPS 2024] NaRCan: Natural Refined Canonical Image with Integration of...

29
Experimental
93 youweiliang/RichHF

Code for CVPR'24 best paper: Rich Human Feedback for Text-to-Image...

29
Experimental
94 hu-zijing/AsynDM

[ICLR 26] Asynchronous diffusion models allocate individual pixels with...

28
Experimental
95 bytedance/ID-Patch

Official implementation of CVPR 2025 paper "ID-Patch: Robust ID Association...

27
Experimental
96 alibaba/mm-diff

MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration

27
Experimental
97 yandex-research/adaptive-diffusion

[CVPR'2024] Adaptive Teacher-Student Collaboration for Text-Conditional...

27
Experimental
98 ConceptBed/evaluations

[AAAI 2024] ConceptBed Evaluations for Personalized Text-to-Image Diffusion Models

27
Experimental
99 lxa9867/ControlVAR

This is the official implementation for ControlVAR.

27
Experimental
100 johndpope/Emote-hack

Emote Portrait Alive - using ai to reverse engineer code from white paper....

27
Experimental
101 byliutao/Cradle2Cane

(NeurIPS 2025) From Cradle to Cane: A Two-Pass Framework for High-Fidelity...

26
Experimental
102 muzishen/RCDMs

[AAAI 2025] 🎬RCDMs🎬: Boosting Consistency in Story Visualization with...

25
Experimental
103 hohonu-vicml/DirectedDiffusion

Directed Diffusion: Direct Control of Object Placement through Attention...

25
Experimental
104 sooyeon-go/eye_for_an_eye

Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models

25
Experimental
105 yugwangyeol/Facial-caricature-profile-GIF

[Project] Facial-caricature-profile GIF

25
Experimental
106 CFGpp-diffusion/CFGpp

Official repository for "CFG++: manifold-constrained classifier free...

25
Experimental
107 Viresh-R/ml-CCA

Implementation of Fast ml-CCA from the ICCV-2015 work "Multi-Label...

25
Experimental
108 YangLing0818/ContextDiff

[ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video...

25
Experimental
109 Ka1b0/Foresight-Guidance

NeurIPS25 Spotlight | Classifier-free guidance (CFG) can be viewed as...

24
Experimental
110 sungnyun/diffblender

DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models

24
Experimental
111 nanlliu/Unsupervised-Compositional-Concepts-Discovery

[ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image...

23
Experimental
112 basiclab/MAD

MAD: Makeup All-in-One with Cross-Domain Diffusion Model

23
Experimental
113 PeterHUistyping/M3ashy

M^3ashy: Multi-Modal Material Synthesis via Hyperdiffusion, AAAI'26 (former...

23
Experimental
114 diaoenmao/Multimodal-Controller-for-Generative-Models

[CVMI 2022] Multimodal Controller for Generative Models

23
Experimental
115 YangLing0818/RealCompo

[NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves...

23
Experimental
116 wateasca/DiffusionVL

🌟 Translate autoregressive models into cutting-edge diffusion vision...

22
Experimental
117 tuananhbui89/Embedding-Adjustment

Mitigating Semantic Collapse in Generative Personalization with Test-Time...

22
Experimental
118 ZiyiZhang27/MVC-ZigAL

[CVPR 2026] Code for the paper "Refining Few-Step Text-to-Multiview...

22
Experimental
119 RuiqingYoung/EAR

Learning to Expand Images for Efficient Visual Autoregressive Modeling

22
Experimental
120 wfanyue/DPG-T2I-Personalization

[ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via...

21
Experimental
121 james-oldfield/PoS-subspaces

[NeurIPS'23] Parts of Speech–Grounded Subspaces in Vision-Language Models

21
Experimental
122 SHI-Labs/Diffusion-Driven-Test-Time-Adaptation-via-Synthetic-Domain-Alignment

Everything to the Synthetic: Diffusion-driven Test-time Adaptation via...

21
Experimental
123 agneet42/revision

[ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in...

21
Experimental
124 abyildirim/md-projtex

Text-guided 3D texture generation using training-free multi-diffusion in UV space.

21
Experimental
125 play-with-HOI-generation/HOIG

[NeurIPS 2022 Spotlight] Hand-Object Interaction Image Generation

21
Experimental
126 quickgrid/paper-implementations

Attempts to implement various deep learning, computer vision papers.

19
Experimental
127 TsinghuaC3I/Efficient-Diffusion-Models

TPAMI 2025 Survey Paper

19
Experimental
128 JortVincenti/DMoE-VAR

Research code for the Dynamic Mixture-of-Experts in Visual Autoregressive...

19
Experimental
129 X-GenGroup/PaCo-RL

Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for...

18
Experimental
130 rese1f/pose2img

pose-driven human natural image generation based on latent diffusion model

17
Experimental
131 JD-GenX/Reliable_AD

[ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback

16
Experimental
132 dt-3t/LSRS

Official PyTorch implementation of "LSRS: Latent Scale Rejection Sampling...

15
Experimental
133 anhquanpham/iterative-comp-rl-generation

Iterative Compositional Data Generation for Robot Control

15
Experimental
134 xuyang-liu16/VGDiffZero

[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot...

14
Experimental
135 whq-xxh/FFA-Synthesis

FFA Synthesis from CFP (ACM MM 2024 Workshop Best Paper Award)

14
Experimental
136 gmum/beta-CFG

This paper presents β-CFG, a dynamic guidance method for text-to-image...

13
Experimental
137 CMLab-Korea/CHIMERA-arxiv

🦁 CHIMERA: Adaptive CacHe Injection and SeMantic Anchor Prompting for...

13
Experimental
138 eunso999/SSDV

[ICCV 2025] Translation of Text Embedding via Delta Vector to Suppress...

13
Experimental
139 wendell0218/FocusDiff

Official repository of the paper "FocusDiff: Advancing Fine-Grained...

13
Experimental
140 LiyaoJiang1998/PixelMan

"PixelMan: Consistent Object Editing with Diffusion Models via Pixel...

13
Experimental
141 divyakraman/ImPosterDiffusion2024

Codebase for the paper ImPoster: Text and Frequency Guidance for Subject...

12
Experimental
142 wangkai930418/mc_ti

Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic Classifier...

12
Experimental
143 Pengchengpcx/TextVDB

[AAAI2025] Textualize Visual Prompt for Image Editing via Diffusion Bridge

11
Experimental
144 jiuntian/OneHOI

[CVPR2026] Official repo for "OneHOI: Unifying Human-Object Interaction...

11
Experimental