Text-to-Image Generation Diffusion Models

Tools and implementations for generating images from text prompts using diffusion models, GANs, or CLIP-guided approaches. Does NOT include image editing tools, inpainting, video generation, or evaluation benchmarks.

62 text-to-image generation models are tracked. 2 score 50 or higher (Established tier). The highest-rated is NVlabs/Sana at 57/100 with 5,000 stars. Only 1 of the top 10 is actively maintained.

Get all 62 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=diffusion&subcategory=text-to-image-generation&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
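
For scripted access, the curl request above can be sketched in Python with only the standard library. This is a minimal sketch, not an official client: the function names `build_url` and `fetch_projects` are hypothetical, the response is returned as parsed JSON without assuming its shape, and whether the endpoint accepts a `limit` larger than the 20 shown in the sample command is an assumption.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit: int = 20) -> str:
    """Build the query URL with the same parameters as the curl example."""
    params = {
        "domain": "diffusion",
        "subcategory": "text-to-image-generation",
        "limit": limit,  # assumption: values above 20 may or may not be accepted
    }
    return f"{BASE_URL}?{urlencode(params)}"


def fetch_projects(limit: int = 20, timeout: float = 30.0):
    """Send the GET request and return the decoded JSON body as-is."""
    with urlopen(build_url(limit), timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Printing the URL performs no network request.
    print(build_url())
```

Calling `fetch_projects()` counts against the daily request quota, so caching the result locally is a reasonable choice when experimenting.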

#	Model	Score	Tier	Stars	Language
1	NVlabs/Sana SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer	57	Established	5,000	Python
2	FoundationVision/VAR [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in...	50	Established	8,641	Jupyter Notebook
3	nerdyrodent/VQGAN-CLIP Just playing with getting VQGAN+CLIP running locally, rather than having to...	49	Emerging	2,653	Python
4	huggingface/finetrainers Scalable and memory-optimized training of diffusion models	48	Emerging	1,343	Python
5	AssemblyAI-Community/MinImagen MinImagen: A minimal implementation of the Imagen text-to-image model	47	Emerging	313	Python
6	eps696/aphantasia CLIP + FFT/DWT/RGB = text to image/video	47	Emerging	789	Python
7	AlonzoLeeeooo/awesome-text-to-image-studies A collection of awesome text-to-image generation studies.	46	Emerging	750	TeX
8	nerdyrodent/CLIP-Guided-Diffusion Just playing with getting CLIP Guided Diffusion running locally, rather than...	44	Emerging	385	Python
9	songweige/rich-text-to-image Rich-Text-to-Image Generation	43	Emerging	801	Python
10	kyegomez/LUMIERE Implementation of the text to video model LUMIERE from the paper: "A...	43	Emerging	52	Python
11	kamalkraj/stable-diffusion-tritonserver Deploy stable diffusion model with onnx/tensorrt + tritonserver	43	Emerging	126	Jupyter Notebook
12	parlance-zz/dualdiffusion Dual Diffusion is a generative diffusion model for music trained on video...	42	Emerging	90	Python
13	mehdidc/feed_forward_vqgan_clip Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for...	41	Emerging	140	Python
14	AIDC-AI/Ovis-Image Ovis-Image is a 7B text-to-image model specifically optimized for...	41	Emerging	307	Python
15	woctezuma/stable-diffusion-safety-checker Python package to apply the Safety Checker from Stable Diffusion.	40	Emerging	9	Python
16	huggingface/instruction-tuned-sd Code for instruction-tuning Stable Diffusion.	39	Emerging	249	Python
17	OutofAi/StableFace Build your own Face App with Stable Diffusion 2.1	39	Emerging	153	Jupyter Notebook
18	HFAiLab/clip-gen CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP	39	Emerging	146	Python
19	slowy07/luna text to image generation with stable diffusion	39	Emerging	65	Python
20	WZDTHU/NiT [NeurIPS 2025] Native-resolution diffusion Transformer	39	Emerging	283	Python
21	DiT-3D/DiT-3D 🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for...	38	Emerging	315	Python
22	amaralibey/nanoCLIP A lightweight Text-to-Image Retrieval model [Web App]	37	Emerging	29	Python
23	py-img-gen/python-image-generation 🎨 Repository hosting the code for the book "Pythonで学ぶ画像生成" (Learning Image Generation with Python)	37	Emerging	20	Jupyter Notebook
24	rockerBOO/sd-ext Scripts and extensions for Stable Diffusion	36	Emerging	9	Python
25	gmongaras/Stable-Diffusion-3-From-Scratch A repo that attempts to train stable diffusion 3 from scratch	35	Emerging	37	Python
26	hila-chefer/TargetCLIP [ECCV 2022] Official PyTorch implementation of the paper Image-Based...	34	Emerging	231	Jupyter Notebook
27	saharmor/anima Turn text into video using Stable Diffusion and Google FILM	34	Emerging	42	Jupyter Notebook
28	Qiyuan-Ge/PaintMind Fast and controllable text-to-image model.	34	Emerging	41	Python
29	nahyeonkaty/textboost TextBoost: Towards One-Shot Personalization of Text-to-Image Models via...	33	Emerging	57	Python
30	The-Swarm-Corporation/DART DART (Diffusion-Autoregressive Recursive Transformer) is a novel hybrid...	31	Emerging	6	Python
31	ouhenio/StyleGAN3-CLIP-notebooks A collection of Jupyter notebooks to play with NVIDIA's StyleGAN3 and...	31	Emerging	215	Jupyter Notebook
32	ji-code25/Point-Transformer-Diffusion Point Transformer Diffusion is a novel generative model for 3D point cloud...	31	Emerging	24	Python
33	contrebande-labs/charred CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for...	27	Experimental	14	Python
34	SaiBalaji-PSS/Stable-Diffusion-Catalyst A macOS Catalyst app which uses Apple's CoreML Stable Diffusion package to...	27	Experimental	14	Swift
35	defgsus/clipig OpenAI CLIP based image generator with complex config file controlled...	27	Experimental	19	Python
36	nhtlongcs/live-novel Self-host application can generate illustration from a novel by highlighting...	26	Experimental	13	Python
37	ZicoDiegoRR/stable-diffusion-xl-colab-ui An interactive Jupyter notebook leveraging IPython widgets for the UI and...	26	Experimental	5	Python
38	KogaiIrina/masterpiece-creator Lightning App that generates beautiful art using the Disco Diffusion model	25	Experimental	6	Python
39	jdh-algo/JoyType JoyType: A Robust Design for Multilingual Visual Text Creation	25	Experimental	39	Python
40	ShivamDuggal4/UNITE-tokenization-generation Single-stage End-to-End Training for Tokenization and Generation	25	Experimental	62	Python
41	narahir/RetroDiffusionApp 🎨 Create and pixelate images effortlessly with the RetroDiffusion iOS app,...	25	Experimental	2	Swift
42	monk1337/OpenAI-CLIP-Image-search OpenAI's CLIP neural network	23	Experimental	4	Python
43	VachanVY/diffusion-transformer Pytorch and JAX Implementation of Scalable Diffusion Models with...	22	Experimental	8	Python
44	tripletclip/TripletCLIP [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional...	21	Experimental	46	Python
45	ArchitAnant/stroxapi Text to Handwriting generation using Diffusion	21	Experimental	—	Python
46	EngineeringAI-LAB/MIS-DiT-AST This is a training-free sketch to scene generation.	20	Experimental	4	Python
47	kyegomez/Gen2 Implementation of "Text driven video generation" in pytorch	20	Experimental	7	Python
48	johnsutor/emoji-painter Paint with emojis.	19	Experimental	3	Python
49	jiaowoguanren0615/DiT-Pytorch A repository for a DiT PyTorch model that can be used to generate your image dataset	18	Experimental	2	Python
50	PRITHIVSAKTHIUR/Flux-Krea-multi-GPU-Pool A Python-based multi-GPU image generation pipeline using Huggingface...	18	Experimental	1	Python
51	TrieuPhi/Image-Caption This project collects models related to image captioning, using...	17	Experimental	1	Jupyter Notebook
52	linsun449/cliper.code This repo is the official pytorch implementation of the paper: CLIPer:...	17	Experimental	40	Python
53	aimagelab/safe-clip Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024	16	Experimental	67	Python
54	johnamit/sit-faf-generate-edit A deep learning project for Fundus Autofluorescence (FAF) image generation,...	14	Experimental	1	Python
55	dheeren-tejani/mini-sd A lightweight, end-to-end implementation of Stable Diffusion built from...	13	Experimental	—	TypeScript
56	edcalderin/textual-diffuser TextualDiffuser is a text-to-image generation tool powered by Stable...	13	Experimental	—	Python
57	PhyoMyanmarKyaw/AI-Art AI Text-to-Image with Stable Diffusion using CoreML	12	Experimental	6	Swift
58	MadJokkerr/Text-2-Image This CNN model will convert the given text to images using stable diffusion	11	Experimental	—	—
59	Gaurav-Jaiswal-1/Text-to-Image-Generation-Using-HuggingFace A simple project that generates images based on text descriptions using...	11	Experimental	—	Python
60	Neha-Shrestha/Text-to-Image-Generator Final year project of BSc. CSIT on Text-to-Image generator system using a...	11	Experimental	4	Jupyter Notebook
61	pky1987/Verilume-True-Light-Image-Generator Verilume is a high-fidelity image generation and editing framework built...	11	Experimental	—	Python
62	hitthecodelabs/StableDiffusion-GenerativeImage Simple Python application to generate images using the Stable Diffusion model	11	Experimental	—	Jupyter Notebook
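
If the table above is copied as text rather than fetched as JSON, each row can be split into typed fields. The sketch below assumes the columns are tab-separated, that the repository name is the first whitespace-delimited token of the Model column, and that a dash marks a missing value; the `Row` class and `parse_row` function are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from typing import Optional

MISSING = "\u2014"  # the dash used in the table for unknown stars or language


@dataclass
class Row:
    rank: int
    repo: str
    description: str
    score: int
    tier: str
    stars: Optional[int]
    language: Optional[str]


def parse_row(line: str) -> Row:
    """Parse one tab-separated table row into a Row record."""
    rank, model, score, tier, stars, language = line.rstrip("\n").split("\t")
    # The Model column fuses "owner/repo" with its description.
    repo, _, description = model.partition(" ")
    return Row(
        rank=int(rank),
        repo=repo,
        description=description,
        score=int(score),
        tier=tier,
        # Star counts use thousands separators, e.g. "5,000".
        stars=None if stars == MISSING else int(stars.replace(",", "")),
        language=None if language == MISSING else language,
    )


first = parse_row(
    "1\tNVlabs/Sana SANA: Efficient High-Resolution Image Synthesis "
    "with Linear Diffusion Transformer\t57\tEstablished\t5,000\tPython"
)
sparse = parse_row(
    "58\tMadJokkerr/Text-2-Image This CNN model will convert the given text "
    "to images using stable diffusion\t11\tExperimental\t\u2014\t\u2014"
)
```

Here `first.stars` is the integer 5000, while `sparse.stars` and `sparse.language` are both `None`, which keeps missing values distinct from zero.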

Comparisons in this category

VQGAN-CLIP and CLIP-Guided-Diffusion (49 vs 44)