Text-to-Image Generation Diffusion Models
Tools and implementations for generating images from text prompts using diffusion models, GANs, or CLIP-guided approaches. Does NOT include image editing tools, inpainting, video generation, or evaluation benchmarks.
There are 62 text-to-image generation models tracked. 2 score above 50 (established tier). The highest-rated is NVlabs/Sana at 57/100 with 5,000 stars. 1 of the top 10 are actively maintained.
Get all 62 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=diffusion&subcategory=text-to-image-generation&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer |
|
Established |
| 2 |
FoundationVision/VAR
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in... |
|
Established |
| 3 |
nerdyrodent/VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to... |
|
Emerging |
| 4 |
huggingface/finetrainers
Scalable and memory-optimized training of diffusion models |
|
Emerging |
| 5 |
AssemblyAI-Community/MinImagen
MinImagen: A minimal implementation of the Imagen text-to-image model |
|
Emerging |
| 6 |
eps696/aphantasia
CLIP + FFT/DWT/RGB = text to image/video |
|
Emerging |
| 7 |
AlonzoLeeeooo/awesome-text-to-image-studies
A collection of awesome text-to-image generation studies. |
|
Emerging |
| 8 |
nerdyrodent/CLIP-Guided-Diffusion
Just playing with getting CLIP Guided Diffusion running locally, rather than... |
|
Emerging |
| 9 |
songweige/rich-text-to-image
Rich-Text-to-Image Generation |
|
Emerging |
| 10 |
kyegomez/LUMIERE
Implementation of the text to video model LUMIERE from the paper: "A... |
|
Emerging |
| 11 |
kamalkraj/stable-diffusion-tritonserver
Deploy stable diffusion model with onnx/tenorrt + tritonserver |
|
Emerging |
| 12 |
parlance-zz/dualdiffusion
Dual Diffusion is a generative diffusion model for music trained on video... |
|
Emerging |
| 13 |
mehdidc/feed_forward_vqgan_clip
Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for... |
|
Emerging |
| 14 |
AIDC-AI/Ovis-Image
Ovis-Image is a 7B text-to-image model specifically optimized for... |
|
Emerging |
| 15 |
woctezuma/stable-diffusion-safety-checker
Python package to apply the Safety Checker from Stable Diffusion. |
|
Emerging |
| 16 |
huggingface/instruction-tuned-sd
Code for instruction-tuning Stable Diffusion. |
|
Emerging |
| 17 |
OutofAi/StableFace
Build your own Face App with Stable Diffusion 2.1 |
|
Emerging |
| 18 |
HFAiLab/clip-gen
CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP |
|
Emerging |
| 19 |
slowy07/luna
text to image generation with stable diffusion |
|
Emerging |
| 20 |
WZDTHU/NiT
[NeurIPS 2025] Native-resolution diffusion Transformer |
|
Emerging |
| 21 |
DiT-3D/DiT-3D
🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for... |
|
Emerging |
| 22 |
amaralibey/nanoCLIP
A lightweight Text-to-Image Retrieval model [Web App] |
|
Emerging |
| 23 |
py-img-gen/python-image-generation
🎨 書籍「Pythonで学ぶ画像生成」のコードを配置したリポジトリです |
|
Emerging |
| 24 |
rockerBOO/sd-ext
Scripts and extensions for Stable Diffusion |
|
Emerging |
| 25 |
gmongaras/Stable-Diffusion-3-From-Scratch
A repo that attempts to train stable diffusion 3 from scratch |
|
Emerging |
| 26 |
hila-chefer/TargetCLIP
[ECCV 2022] Official PyTorch implementation of the paper Image-Based... |
|
Emerging |
| 27 |
saharmor/anima
Turn text into video using Stable Diffusion and Google FILM |
|
Emerging |
| 28 |
Qiyuan-Ge/PaintMind
Fast and controllable text-to-image model. |
|
Emerging |
| 29 |
nahyeonkaty/textboost
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via... |
|
Emerging |
| 30 |
The-Swarm-Corporation/DART
DART (Diffusion-Autoregressive Recursive Transformer) is a novel hybrid... |
|
Emerging |
| 31 |
ouhenio/StyleGAN3-CLIP-notebooks
A collection of Jupyter notebooks to play with NVIDIA's StyleGAN3 and... |
|
Emerging |
| 32 |
ji-code25/Point-Transformer-Diffusion
Point Transformer Diffusion is a novel generative model for 3D point cloud... |
|
Emerging |
| 33 |
contrebande-labs/charred
CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for... |
|
Experimental |
| 34 |
SaiBalaji-PSS/Stable-Diffusion-Catalyst
A macOS Catalyst app which uses Apple's CoreML Stable Diffusion package to... |
|
Experimental |
| 35 |
defgsus/clipig
OpenAI CLIP based image generator with complex config file controlled... |
|
Experimental |
| 36 |
nhtlongcs/live-novel
Self-host application can generate illustration from a novel by highlighting... |
|
Experimental |
| 37 |
ZicoDiegoRR/stable-diffusion-xl-colab-ui
An interactive Jupyter notebook leveraging IPython widgets for the UI and... |
|
Experimental |
| 38 |
KogaiIrina/masterpiece-creator
Lightning App that generates beautiful art using the Disco Diffusion model |
|
Experimental |
| 39 |
jdh-algo/JoyType
JoyType: A Robust Design for Multilingual Visual Text Creation |
|
Experimental |
| 40 |
ShivamDuggal4/UNITE-tokenization-generation
Single-stage End-to-End Training for Tokenization and Generation |
|
Experimental |
| 41 |
narahir/RetroDiffusionApp
🎨 Create and pixelate images effortlessly with the RetroDiffusion iOS app,... |
|
Experimental |
| 42 |
monk1337/OpenAI-CLIP-Image-search
OpenAI's CLIP neural network |
|
Experimental |
| 43 |
VachanVY/diffusion-transformer
Pytorch and JAX Implementation of Scalable Diffusion Models with... |
|
Experimental |
| 44 |
tripletclip/TripletCLIP
[NeurIPS 2024] Official PyTorch implementation of "Improving Compositional... |
|
Experimental |
| 45 |
ArchitAnant/stroxapi
Text to Handwriting generation using Diffusion |
|
Experimental |
| 46 |
EngineeringAI-LAB/MIS-DiT-AST
This is a training-free sketch to scene generation. |
|
Experimental |
| 47 |
kyegomez/Gen2
Implementation of "Text driven video generation" in pytorch |
|
Experimental |
| 48 |
johnsutor/emoji-painter
Paint with emojis. |
|
Experimental |
| 49 |
jiaowoguanren0615/DiT-Pytorch
This is a warehouse for DiT-pytorch-model, can be used to generate your image dataset |
|
Experimental |
| 50 |
PRITHIVSAKTHIUR/Flux-Krea-multi-GPU-Pool
A Python-based multi-GPU image generation pipeline using Huggingface... |
|
Experimental |
| 51 |
TrieuPhi/Image-Caption
Project sẽ tổng hợp những model liên quan đến image caption, sử dụng các... |
|
Experimental |
| 52 |
linsun449/cliper.code
This repo is the official pytorch implementation of the paper: CLIPer:... |
|
Experimental |
| 53 |
aimagelab/safe-clip
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024 |
|
Experimental |
| 54 |
johnamit/sit-faf-generate-edit
A deep learning project for Fundus Autofluorescence (FAF) image generation,... |
|
Experimental |
| 55 |
dheeren-tejani/mini-sd
A lightweight, end-to-end implementation of Stable Diffusion built from... |
|
Experimental |
| 56 |
edcalderin/textual-diffuser
TextualDiffuser is a text-to-image generation tool powered by Stable... |
|
Experimental |
| 57 |
PhyoMyanmarKyaw/AI-Art
AI Text-to-Image with Stable Diffusion using CoreML |
|
Experimental |
| 58 |
MadJokkerr/Text-2-Image
This CNN model will convert the given text to images using stable diffusion |
|
Experimental |
| 59 |
Gaurav-Jaiswal-1/Text-to-Image-Generation-Using-HuggingFace
A simple project that generates images based on text descriptions using... |
|
Experimental |
| 60 |
Neha-Shrestha/Text-to-Image-Generator
Final year project of BSc. CSIT on Text-to-Image generator system using a... |
|
Experimental |
| 61 |
pky1987/Verilume-True-Light-Image-Generator
Verilume is a high-fidelity image generation and editing framework built... |
|
Experimental |
| 62 |
hitthecodelabs/StableDiffusion-GenerativeImage
Simple Python application to generate images using the Stable Diffusion model |
|
Experimental |