Compositional T2I Generation Diffusion Models

Tools for enhancing spatial reasoning, multi-concept composition, and fine-grained control in text-to-image diffusion models through architectural improvements and guidance techniques. Does NOT include general T2I generation, LoRA training, or personalization fine-tuning methods.

There are 144 compositional T2I generation models tracked. 4 score above 50 (Established tier). The highest-rated is UCSC-VLAA/story-iter at 65/100 with 949 stars. 1 of the top 10 is actively maintained.

Get all 144 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=diffusion&subcategory=compositional-t2i-generation&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
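The same request can be made from a script. The sketch below uses only the Python standard library and the endpoint and query parameters shown in the curl command above. The helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the returned JSON is not documented here, so the code returns the parsed payload as-is. The curl example passes `limit=20`; whether a larger value such as 144 returns every project in one call is an assumption to verify against the API.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Build the query URL for the compositional T2I subcategory."""
    params = {
        "domain": "diffusion",
        "subcategory": "compositional-t2i-generation",
        "limit": limit,
    }
    return f"{BASE_URL}?{urlencode(params)}"


def fetch_projects(limit=20, timeout=30):
    """Fetch and parse the JSON response (hypothetical helper).

    The response schema is not specified on this page, so the parsed
    object is returned without any assumptions about its fields.
    """
    with urlopen(build_url(limit), timeout=timeout) as resp:
        return json.load(resp)


# Example usage (performs a network request and counts toward the daily quota):
#   data = fetch_projects(limit=20)
#   print(json.dumps(data, indent=2)[:500])
```

Keeping URL construction separate from the network call makes it easy to check the query string without spending any of the 100 keyless requests per day.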

#	Model	Score	Tier	Stars	Language
1	UCSC-VLAA/story-iter [ICLR 2026] A Training-free Iterative Framework for Long Story Visualization	65	Established	949	Python
2	PaddlePaddle/PaddleMIX Paddle Multimodal Integration and eXploration, supporting mainstream...	61	Established	718	Python
3	keivalya/mini-vla a minimal, beginner-friendly VLA to show how robot policies can fuse images,...	57	Established	204	Python
4	adobe-research/custom-diffusion Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)	51	Established	1,971	Python
5	byliutao/1Prompt1Story 🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent...	49	Emerging	313	Python
6	zai-org/ImageReward [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for...	48	Emerging	1,649	Python
7	JyChen9811/FaithDiff [CVPR 2025] FaithDiff for Classic Film Rejuvenation, Old Photo Revival,...	47	Emerging	240	Python
8	lmxyy/sige [NeurIPS 2022, T-PAMI 2023] Efficient Spatially Sparse Inference for...	45	Emerging	268	Python
9	haoyangzheng-ai/didi-instruct [ICLR 2026] Discrete Diffusion Divergence Instruct (DiDi-Instruct)	45	Emerging	153	Python
10	HorizonWind2004/reconstruction-alignment [ICLR 2026] Official repo of paper "Reconstruction Alignment Improves...	45	Emerging	378	Python
11	mit-han-lab/lpd [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient...	44	Emerging	91	Python
12	bytedance/UNO [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and...	44	Emerging	1,353	Python
13	ankanbhunia/PIDM Person Image Synthesis via Denoising Diffusion Model (CVPR 2023)	44	Emerging	500	Jupyter Notebook
14	grigorisg9gr/polynomial_nets Official Implementation of the CVPR'20 paper 'Π-nets: Deep Polynomial Neural...	44	Emerging	176	Python
15	YixunLiang/UniTEX Official implementation of "UniTEX: Universal High Fidelity Generative...	43	Emerging	172	Python
16	energy-based-model/Compositional-Visual-Generation-with-Composable-Diffusion-Models-PyTorch [ECCV 2022] Compositional Generation using Diffusion Models	43	Emerging	485	Jupyter Notebook
17	yuval-alaluf/Attend-and-Excite Official Implementation for "Attend-and-Excite: Attention-Based Semantic...	43	Emerging	767	Jupyter Notebook
18	OpenDriveLab/Nexus [ICCV 2025] Nexus: Decoupled Diffusion Sparks Adaptive Scene Generation	43	Emerging	109	Python
19	ziqihuangg/Collaborative-Diffusion [CVPR 2023] Collaborative Diffusion	43	Emerging	438	Python
20	open-mmlab/PIA [CVPR 2024] PIA, your Personalized Image Animator. Animate your images by...	43	Emerging	978	Python
21	foivospar/Arc2Face [ECCV 2024 Oral 🔥] Arc2Face: A Foundation Model for ID-Consistent Human...	43	Emerging	789	Python
22	H-EmbodVis/MERGE [NeurIPS 2025] More Than Generation: Unifying Generation and Depth...	42	Emerging	215	Python
23	youngwanLEE/sdxl-koala [NeurIPS 2024] Empirical Lessons Toward Memory-Efficient and Fast Diffusion...	42	Emerging	147	Python
24	sihyun-yu/REPA [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion...	42	Emerging	1,582	Python
25	WindVChen/Diff-Harmonization A novel zero-shot image harmonization method based on Diffusion Model Prior.	42	Emerging	147	Python
26	yandex-research/swd [ICLR'2026] Scale-wise Distillation of Diffusion Models	41	Emerging	117	Python
27	AlaaLab/InstructCV [ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned...	41	Emerging	461	Python
28	ExplainableML/ReNO [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through...	41	Emerging	166	Python
29	nupurkmr9/concept-ablation Ablating Concepts in Text-to-Image Diffusion Models (ICCV 2023)	41	Emerging	168	Python
30	limuloo/MIGC [CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)	41	Emerging	615	Python
31	gojasper/flash-diffusion ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few...	40	Emerging	657	Python
32	blurgyy/CoMPaSS [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models	38	Emerging	92	Python
33	junkunyuan/NexusAlign A unified and extensible framework for aligning foundation models.	38	Emerging	2	Python
34	NeuralTextualInversion/NeTI Official Implementation for "A Neural Space-Time Representation for...	38	Emerging	181	Python
35	M-E-AGI-Lab/PSAlign Official Implementation of "PSAlign: Personalized Safety Alignment for...	38	Emerging	7	Python
36	aminK8/KnobGen CVPR 2025 Workshop on CVEU.	37	Emerging	42	Python
37	muzishen/IMAGPose [NeurIPS 2024] 🕺IMAGPose🕺: A Unified Conditional Framework for Pose-Guided...	37	Emerging	349	Python
38	VinAIResearch/DiMSUM DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method...	37	Emerging	43	Python
39	huanngzh/Parts2Whole [TIP 2025] From Parts to Whole: A Unified Reference Framework for...	37	Emerging	196	Python
40	ashutosh1919/mdp-diffusion Text-guided image editing by manipulating diffusion path without any training.	37	Emerging	16	Python
41	sled-group/CycleNet [NeurIPS 2023] Official Code for CycleNet: Rethinking Cycle Consistent in...	37	Emerging	96	Python
42	kfirgoldberg/ConceptLab Official Implementation for "ConceptLab: Creative Generation using Diffusion...	37	Emerging	255	Python
43	CVL-UESTC/Internal-Guidance CVPR 2026-Guiding a Diffusion Transformer with the Internal Dynamics of Itself (IG)	36	Emerging	60	Python
44	customdiffusion360/custom-diffusion360 CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control	36	Emerging	171	Python
45	gudaochangsheng/RefAlign Official PyTorch implementation of RefAlign: Representation Alignment for...	36	Emerging	6	Python
46	JIA-Lab-research/RIVAL [NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion...	36	Emerging	153	Python
47	universome/alis [ICCV 2021] Aligning Latent and Image Spaces to Connect the Unconnectable	36	Emerging	262	Jupyter Notebook
48	baojudezeze/RMP-Adapter The implementation of RMP-Adapter: A region-based Multiple Prompt Adapter...	36	Emerging	20	Python
49	YangLing0818/IterComp [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from...	36	Emerging	204	Python
50	tgxs002/align_sd Better Aligning Text-to-Image Models with Human Preference. ICCV 2023	35	Emerging	294	Python
51	ChenDarYen/Key-Locked-Rank-One-Editing-for-Text-to-Image-Personalization A PyTorch implementation of the paper Key-Locked Rank One Editing for...	35	Emerging	85	Python
52	AIDC-AI/TeEFusion TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance (ICCV 2025)	35	Emerging	9	Python
53	RockeyCoss/SPO [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic...	35	Emerging	265	Python
54	lzyhha/VisualCloze [ICCV 2025] VisualCloze: A universal image generation framework that can...	35	Emerging	279	Python
55	boschresearch/Divide-and-Bind Official implementation of "Divide & Bind Your Attention for Improved...	35	Emerging	37	Jupyter Notebook
56	Nikolai10/PerCo PyTorch implementation of PerCo (Towards Image Compression with Perfect...	34	Emerging	103	Python
57	HKUST-LongGroup/Coarse-guided-Gen [arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual...	34	Emerging	35	Python
58	guillaumejs2403/TIME Text-to-Image Models for Counterfactual Explanations: a black-box approach...	34	Emerging	9	Python
59	ChenWu98/generative-visual-prompt [NeurIPS 2022] (Amortized) distributional control for pre-trained generative models	34	Emerging	121	Python
60	mapooon/Face2Diffusion [CVPR 2024] Face2Diffusion for Fast and Editable Face Personalization...	34	Emerging	97	Jupyter Notebook
61	joanrod/figure-diffusion Generating figures from research papers, using textual captions from the paper.	34	Emerging	42	Python
62	quickgrid/text-to-image-diffusion Experimental (working!) custom implementation of conditional and...	33	Emerging	5	Python
63	zhiyichin/P4D [ICML 2024] Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models...	33	Emerging	52	Python
64	ewrfcas/LeftRefill LeftRefill: Filling Right Canvas based on Left Reference through Generalized...	33	Emerging	82	Python
65	Ammmob/PixelSmile PixelSmile: Fine-grained facial expression editing with continuous control,...	33	Emerging	63	Python
66	ChenWu98/unified-generative-zoo [ICCV 2023] https://arxiv.org/abs/2210.05559	33	Emerging	122	Python
67	kongzhecn/OMG [ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In...	33	Emerging	701	Python
68	TsingZ0/FedKTL CVPR 2024 accepted paper, An Upload-Efficient Scheme for Transferring...	33	Emerging	66	Python
69	thecrazymage/CasTex [WACV 2026] CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture...	33	Emerging	33	Python
70	xie-lab-ml/CoRe2 [TPAMI] The official implementation of our paper "CoRe^2: Collect, Reflect...	33	Emerging	31	Python
71	HVision-NKU/ImageCritic Official implementation of ImageCritic (CVPR 2026)	33	Emerging	156	Python
72	SPRIGHT-T2I/SPRIGHT [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving...	32	Emerging	103	Python
73	yuxin-jiang/Anomagic [AAAI 2026] The Official Implementation for "Anomagic: Crossmodal...	32	Emerging	129	Python
74	IBM/DiffuseKronA DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized...	32	Emerging	132	Python
75	opendilab/PRG [ICCV 2025] Pretrained Reversible Generation as Unsupervised Visual...	32	Emerging	28	Python
76	bytedance-fanqie-ai/MOSAIC [ICLR 2026]🔥🔥🔥MOSAIC: Multi-Subject Personalized Generation via...	31	Emerging	396	Python
77	Nithin-GK/UniteandConquer [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using...	31	Emerging	36	Python
78	VAST-AI-Research/SeqTex [SIGGRAPH Asia 2025] Official github repo of SeqTex, an end-to-end 3D...	31	Emerging	41	Python
79	haoningwu3639/MegaFusion [WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution...	31	Emerging	99	Python
80	DeepakSridhar/fgdm [NeurIPS 2024] Factor Graph Diffusion Models for Improved Prompt Alignment,...	31	Emerging	2	Python
81	Raghuram-Veeramallu/DiffTransBEV BEV Representation of an Autonomous car using 6 RGB cameras by making use of...	31	Emerging	4	Python
82	p-lambda/composed_finetuning Code for the ICML 2021 paper "Composed Fine-Tuning: Freezing Pre-Trained...	31	Emerging	4	Python
83	pyladiesams/personalization-with-text-to-image-diffusion-models-feb2024 Get familiar with different fine-tuning techniques for text-to-image models,...	31	Emerging	16	Jupyter Notebook
84	mofayezi/RobuText [CVPRW 2023] Official implementation of "Benchmarking Robustness to...	31	Emerging	3	Python
85	dsshim0125/s2p "S2P: State-conditioned Image Synthesis for Data Augmentation in Offline...	31	Emerging	4	Python
86	LiyaoJiang1998/RAISE "RAISE: Requirement-Adaptive Evolutionary Refinement for Training-Free...	31	Emerging	9	Python
87	rabiulcste/vismin [NeurIPS24] VisMin: Visual Minimal-Change Understanding	30	Emerging	19	Python
88	AIDC-AI/CHATS CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for...	30	Emerging	114	Python
89	Nithin-GK/MaxFusion [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image...	30	Emerging	27	Jupyter Notebook
90	showlab/BoxDiff [ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free...	30	Emerging	275	Python
91	zelaki/ReDi [NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint...	30	Emerging	115	Python
92	koi953215/NaRCan [NeurIPS 2024] NaRCan: Natural Refined Canonical Image with Integration of...	29	Experimental	169	Python
93	youweiliang/RichHF Code for CVPR'24 best paper: Rich Human Feedback for Text-to-Image...	29	Experimental	31	Python
94	hu-zijing/AsynDM [ICLR 26] Asynchronous diffusion models allocate individual pixels with...	28	Experimental	18	Python
95	bytedance/ID-Patch Official implementation of CVPR 2025 paper "ID-Patch: Robust ID Association...	27	Experimental	75	Python
96	alibaba/mm-diff MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration	27	Experimental	27	Python
97	yandex-research/adaptive-diffusion [CVPR'2024] Adaptive Teacher-Student Collaboration for Text-Conditional...	27	Experimental	33	Python
98	ConceptBed/evaluations [AAAI 2024] ConceptBed Evaluations for Personalized Text-to-Image Diffusion Models	27	Experimental	25	Python
99	lxa9867/ControlVAR This is the official implementation for ControlVAR.	27	Experimental	126	Python
100	johndpope/Emote-hack Emote Portrait Alive - using ai to reverse engineer code from white paper....	27	Experimental	184	Python
101	byliutao/Cradle2Cane (NeurIPS 2025) From Cradle to Cane: A Two-Pass Framework for High-Fidelity...	26	Experimental	7	Python
102	muzishen/RCDMs [AAAI 2025] 🎬RCDMs🎬: Boosting Consistency in Story Visualization with...	25	Experimental	120	Python
103	hohonu-vicml/DirectedDiffusion Directed Diffusion: Direct Control of Object Placement through Attention...	25	Experimental	81	Python
104	sooyeon-go/eye_for_an_eye Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models	25	Experimental	32	Jupyter Notebook
105	yugwangyeol/Facial-caricature-profile-GIF [Project] Facial-caricature-profile GIF	25	Experimental	4	Python
106	CFGpp-diffusion/CFGpp Official repository for "CFG++: manifold-constrained classifier free...	25	Experimental	238	Python
107	Viresh-R/ml-CCA Implementation of Fast ml-CCA from the ICCV-2015 work "Multi-Label...	25	Experimental	22	Matlab
108	YangLing0818/ContextDiff [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video...	25	Experimental	73	Python
109	Ka1b0/Foresight-Guidance NeurIPS25 Spotlight | Classifier-free guidance (CFG) can be viewed as...	24	Experimental	9	Python
110	sungnyun/diffblender DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models	24	Experimental	46	Python
111	nanlliu/Unsupervised-Compositional-Concepts-Discovery [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image...	23	Experimental	85	Python
112	basiclab/MAD MAD: Makeup All-in-One with Cross-Domain Diffusion Model	23	Experimental	31	Python
113	PeterHUistyping/M3ashy M^3ashy: Multi-Modal Material Synthesis via Hyperdiffusion, AAAI'26 (former...	23	Experimental	1	Python
114	diaoenmao/Multimodal-Controller-for-Generative-Models [CVMI 2022] Multimodal Controller for Generative Models	23	Experimental	3	Python
115	YangLing0818/RealCompo [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves...	23	Experimental	121	Python
116	wateasca/DiffusionVL 🌟 Translate autoregressive models into cutting-edge diffusion vision...	22	Experimental	—	Python
117	tuananhbui89/Embedding-Adjustment Mitigating Semantic Collapse in Generative Personalization with Test-Time...	22	Experimental	10	Jupyter Notebook
118	ZiyiZhang27/MVC-ZigAL [CVPR 2026] Code for the paper "Refining Few-Step Text-to-Multiview...	22	Experimental	9	Python
119	RuiqingYoung/EAR Learning to Expand Images for Efficient Visual Autoregressive Modeling	22	Experimental	4	Python
120	wfanyue/DPG-T2I-Personalization [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via...	21	Experimental	51	Python
121	james-oldfield/PoS-subspaces [NeurIPS'23] Parts of Speech–Grounded Subspaces in Vision-Language Models	21	Experimental	29	Jupyter Notebook
122	SHI-Labs/Diffusion-Driven-Test-Time-Adaptation-via-Synthetic-Domain-Alignment Everything to the Synthetic: Diffusion-driven Test-time Adaptation via...	21	Experimental	40	Python
123	agneet42/revision [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in...	21	Experimental	13	Python
124	abyildirim/md-projtex Text-guided 3D texture generation using training-free multi-diffusion in UV space.	21	Experimental	14	—
125	play-with-HOI-generation/HOIG [NeurIPS 2022 Spotlight] Hand-Object Interaction Image Generation	21	Experimental	33	Python
126	quickgrid/paper-implementations Attempts to implement various deep learning, computer vision papers.	19	Experimental	4	Jupyter Notebook
127	TsinghuaC3I/Efficient-Diffusion-Models TPAMI 2025 Survey Paper	19	Experimental	26	Python
128	JortVincenti/DMoE-VAR Research code for the Dynamic Mixture-of-Experts in Visual Autoregressive...	19	Experimental	—	Python
129	X-GenGroup/PaCo-RL Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for...	18	Experimental	32	Python
130	rese1f/pose2img pose-driven human natural image generation based on latent diffusion model	17	Experimental	1	Jupyter Notebook
131	JD-GenX/Reliable_AD [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback	16	Experimental	59	Python
132	dt-3t/LSRS Official PyTorch implementation of "LSRS: Latent Scale Rejection Sampling...	15	Experimental	—	Python
133	anhquanpham/iterative-comp-rl-generation Iterative Compositional Data Generation for Robot Control	15	Experimental	5	Python
134	xuyang-liu16/VGDiffZero [ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot...	14	Experimental	17	Python
135	whq-xxh/FFA-Synthesis FFA Synthesis from CFP （ACM MM 2024 Workshop Best Paper Award）	14	Experimental	24	—
136	gmum/beta-CFG This paper presents β-CFG, a dynamic guidance method for text-to-image...	13	Experimental	10	Python
137	CMLab-Korea/CHIMERA-arxiv 🦁 CHIMERA: Adaptive CacHe Injection and SeMantic Anchor Prompting for...	13	Experimental	2	—
138	eunso999/SSDV [ICCV 2025] Translation of Text Embedding via Delta Vector to Suppress...	13	Experimental	5	Jupyter Notebook
139	wendell0218/FocusDiff Official repository of the paper "FocusDiff: Advancing Fine-Grained...	13	Experimental	5	Python
140	LiyaoJiang1998/PixelMan "PixelMan: Consistent Object Editing with Diffusion Models via Pixel...	13	Experimental	10	Python
141	divyakraman/ImPosterDiffusion2024 Codebase for the paper ImPoster: Text and Frequency Guidance for Subject...	12	Experimental	5	Python
142	wangkai930418/mc_ti Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic Classifier...	12	Experimental	5	Python
143	Pengchengpcx/TextVDB [AAAI2025] Textualize Visual Prompt for Image Editing via Diffusion Bridge	11	Experimental	4	—
144	jiuntian/OneHOI [CVPR2026] Official repo for "OneHOI: Unifying Human-Object Interaction...	11	Experimental	—	—
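
The Tier column in the table is consistent with simple score cutoffs. The sketch below encodes cutoffs inferred from the rows above, not a documented rule: scores above 50 are Established (as stated in the summary), scores from 30 up to 50 are Emerging, and scores of 29 or lower are Experimental. No project in the table scores exactly 50, so placing 50 in Emerging is an assumption. The function name `tier_for` is hypothetical.

```python
def tier_for(score):
    """Map a 0-100 quality score to a tier label.

    Cutoffs are inferred from the table on this page and are an
    assumption, not an official definition.
    """
    if score > 50:
        return "Established"
    if score >= 30:
        return "Emerging"
    return "Experimental"


# A few rows from the table: (project, score)
examples = [
    ("UCSC-VLAA/story-iter", 65),
    ("byliutao/1Prompt1Story", 49),
    ("zelaki/ReDi", 30),
    ("koi953215/NaRCan", 29),
]
for name, score in examples:
    print(f"{name}: {score} -> {tier_for(score)}")
```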

Comparisons in this category

custom-diffusion and Collaborative-Diffusion (51 vs 43)
story-iter and 1Prompt1Story (65 vs 49)