Text-to-Image Generation Diffusion Models

Tools and implementations for generating images from text prompts using diffusion models, GANs, or CLIP-guided approaches. Does NOT include image editing tools, inpainting, video generation, or evaluation benchmarks.

There are 62 text-to-image generation models tracked. 2 score above 50 (established tier). The highest-rated is NVlabs/Sana at 57/100 with 5,000 stars. 1 of the top 10 are actively maintained.

Get all 62 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=diffusion&subcategory=text-to-image-generation&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

57
Established
2 FoundationVision/VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in...

50
Established
3 nerdyrodent/VQGAN-CLIP

Just playing with getting VQGAN+CLIP running locally, rather than having to...

49
Emerging
4 huggingface/finetrainers

Scalable and memory-optimized training of diffusion models

48
Emerging
5 AssemblyAI-Community/MinImagen

MinImagen: A minimal implementation of the Imagen text-to-image model

47
Emerging
6 eps696/aphantasia

CLIP + FFT/DWT/RGB = text to image/video

47
Emerging
7 AlonzoLeeeooo/awesome-text-to-image-studies

A collection of awesome text-to-image generation studies.

46
Emerging
8 nerdyrodent/CLIP-Guided-Diffusion

Just playing with getting CLIP Guided Diffusion running locally, rather than...

44
Emerging
9 songweige/rich-text-to-image

Rich-Text-to-Image Generation

43
Emerging
10 kyegomez/LUMIERE

Implementation of the text to video model LUMIERE from the paper: "A...

43
Emerging
11 kamalkraj/stable-diffusion-tritonserver

Deploy stable diffusion model with onnx/tenorrt + tritonserver

43
Emerging
12 parlance-zz/dualdiffusion

Dual Diffusion is a generative diffusion model for music trained on video...

42
Emerging
13 mehdidc/feed_forward_vqgan_clip

Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for...

41
Emerging
14 AIDC-AI/Ovis-Image

Ovis-Image is a 7B text-to-image model specifically optimized for...

41
Emerging
15 woctezuma/stable-diffusion-safety-checker

Python package to apply the Safety Checker from Stable Diffusion.

40
Emerging
16 huggingface/instruction-tuned-sd

Code for instruction-tuning Stable Diffusion.

39
Emerging
17 OutofAi/StableFace

Build your own Face App with Stable Diffusion 2.1

39
Emerging
18 HFAiLab/clip-gen

CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP

39
Emerging
19 slowy07/luna

text to image generation with stable diffusion

39
Emerging
20 WZDTHU/NiT

[NeurIPS 2025] Native-resolution diffusion Transformer

39
Emerging
21 DiT-3D/DiT-3D

🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for...

38
Emerging
22 amaralibey/nanoCLIP

A lightweight Text-to-Image Retrieval model [Web App]

37
Emerging
23 py-img-gen/python-image-generation

🎨 書籍「Pythonで学ぶ画像生成」のコードを配置したリポジトリです

37
Emerging
24 rockerBOO/sd-ext

Scripts and extensions for Stable Diffusion

36
Emerging
25 gmongaras/Stable-Diffusion-3-From-Scratch

A repo that attempts to train stable diffusion 3 from scratch

35
Emerging
26 hila-chefer/TargetCLIP

[ECCV 2022] Official PyTorch implementation of the paper Image-Based...

34
Emerging
27 saharmor/anima

Turn text into video using Stable Diffusion and Google FILM

34
Emerging
28 Qiyuan-Ge/PaintMind

Fast and controllable text-to-image model.

34
Emerging
29 nahyeonkaty/textboost

TextBoost: Towards One-Shot Personalization of Text-to-Image Models via...

33
Emerging
30 The-Swarm-Corporation/DART

DART (Diffusion-Autoregressive Recursive Transformer) is a novel hybrid...

31
Emerging
31 ouhenio/StyleGAN3-CLIP-notebooks

A collection of Jupyter notebooks to play with NVIDIA's StyleGAN3 and...

31
Emerging
32 ji-code25/Point-Transformer-Diffusion

Point Transformer Diffusion is a novel generative model for 3D point cloud...

31
Emerging
33 contrebande-labs/charred

CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for...

27
Experimental
34 SaiBalaji-PSS/Stable-Diffusion-Catalyst

A macOS Catalyst app which uses Apple's CoreML Stable Diffusion package to...

27
Experimental
35 defgsus/clipig

OpenAI CLIP based image generator with complex config file controlled...

27
Experimental
36 nhtlongcs/live-novel

Self-host application can generate illustration from a novel by highlighting...

26
Experimental
37 ZicoDiegoRR/stable-diffusion-xl-colab-ui

An interactive Jupyter notebook leveraging IPython widgets for the UI and...

26
Experimental
38 KogaiIrina/masterpiece-creator

Lightning App that generates beautiful art using the Disco Diffusion model

25
Experimental
39 jdh-algo/JoyType

JoyType: A Robust Design for Multilingual Visual Text Creation

25
Experimental
40 ShivamDuggal4/UNITE-tokenization-generation

Single-stage End-to-End Training for Tokenization and Generation

25
Experimental
41 narahir/RetroDiffusionApp

🎨 Create and pixelate images effortlessly with the RetroDiffusion iOS app,...

25
Experimental
42 monk1337/OpenAI-CLIP-Image-search

OpenAI's CLIP neural network

23
Experimental
43 VachanVY/diffusion-transformer

Pytorch and JAX Implementation of Scalable Diffusion Models with...

22
Experimental
44 tripletclip/TripletCLIP

[NeurIPS 2024] Official PyTorch implementation of "Improving Compositional...

21
Experimental
45 ArchitAnant/stroxapi

Text to Handwriting generation using Diffusion

21
Experimental
46 EngineeringAI-LAB/MIS-DiT-AST

This is a training-free sketch to scene generation.

20
Experimental
47 kyegomez/Gen2

Implementation of "Text driven video generation" in pytorch

20
Experimental
48 johnsutor/emoji-painter

Paint with emojis.

19
Experimental
49 jiaowoguanren0615/DiT-Pytorch

This is a warehouse for DiT-pytorch-model, can be used to generate your image dataset

18
Experimental
50 PRITHIVSAKTHIUR/Flux-Krea-multi-GPU-Pool

A Python-based multi-GPU image generation pipeline using Huggingface...

18
Experimental
51 TrieuPhi/Image-Caption

Project sẽ tổng hợp những model liên quan đến image caption, sử dụng các...

17
Experimental
52 linsun449/cliper.code

This repo is the official pytorch implementation of the paper: CLIPer:...

17
Experimental
53 aimagelab/safe-clip

Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024

16
Experimental
54 johnamit/sit-faf-generate-edit

A deep learning project for Fundus Autofluorescence (FAF) image generation,...

14
Experimental
55 dheeren-tejani/mini-sd

A lightweight, end-to-end implementation of Stable Diffusion built from...

13
Experimental
56 edcalderin/textual-diffuser

TextualDiffuser is a text-to-image generation tool powered by Stable...

13
Experimental
57 PhyoMyanmarKyaw/AI-Art

AI Text-to-Image with Stable Diffusion using CoreML

12
Experimental
58 MadJokkerr/Text-2-Image

This CNN model will convert the given text to images using stable diffusion

11
Experimental
59 Gaurav-Jaiswal-1/Text-to-Image-Generation-Using-HuggingFace

A simple project that generates images based on text descriptions using...

11
Experimental
60 Neha-Shrestha/Text-to-Image-Generator

Final year project of BSc. CSIT on Text-to-Image generator system using a...

11
Experimental
61 pky1987/Verilume-True-Light-Image-Generator

Verilume is a high-fidelity image generation and editing framework built...

11
Experimental
62 hitthecodelabs/StableDiffusion-GenerativeImage

Simple Python application to generate images using the Stable Diffusion model

11
Experimental