Text to Image Generation Transformer Models
Tools for generating, manipulating, and editing images from text prompts using diffusion models and related generative techniques. Does NOT include general image classification, detection, or non-generative image processing tasks.
There are 37 text to image generation models tracked. 1 score above 70 (verified tier). The highest-rated is filipstrand/mflux at 73/100 with 1,882 stars. 1 of the top 10 are actively maintained.
Get all 37 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=text-to-image-generation&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
filipstrand/mflux
MLX native implementations of state-of-the-art generative image models |
|
Verified |
| 2 |
potamides/DeTikZify
Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ. |
|
Established |
| 3 |
FoundationVision/Infinity
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for... |
|
Emerging |
| 4 |
zai-org/CogView
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView:... |
|
Emerging |
| 5 |
EleutherAI/DALLE-mtf
Open-AI's DALL-E for large scale training in mesh-tensorflow. |
|
Emerging |
| 6 |
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation |
|
Emerging |
| 7 |
TextGeneratorio/text-generator.io
Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io |
|
Emerging |
| 8 |
kyegomez/Fusion3D
An extremely experimental model that intakes images and generates 3D scenes... |
|
Emerging |
| 9 |
amazon-science/text_generation_diffusion_llm_topic
Topic Embedding, Text Generation and Modeling using diffusion |
|
Emerging |
| 10 |
ivonajdenkoska/tulip
[ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP" |
|
Emerging |
| 11 |
RishabSA/Sketch2Graphviz
Sketch2Graphviz allows you to convert sketches or images of graphs and... |
|
Emerging |
| 12 |
allenai/x-lxmert
PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer... |
|
Emerging |
| 13 |
aimagelab/Emuru-autoregressive-text-img
Official PyTorch implementation for "Zero-Shot Styled Text Image Generation,... |
|
Emerging |
| 14 |
EchoSingh/GitHub_Profile_Picture
A guide code to generate your ai profile picture |
|
Emerging |
| 15 |
renan-siqueira/image-to-text-tool
This tool processes images and generates textual descriptions using advanced... |
|
Emerging |
| 16 |
affjljoo3581/Inverse-DALL-E-for-Optical-Character-Recognition
Inverse DALL-E for Optical Character Recognition |
|
Experimental |
| 17 |
robjsliwa/mlx-sd-single-file-models
Single safetensors file support Apple MLX Stable Diffusion |
|
Experimental |
| 18 |
andyngdz/exogen_backend
ExoGen Backend |
|
Experimental |
| 19 |
FredyRivera-dev/Flux2-from-scratch
This repo proposes to implement the Flux2 model from scratch |
|
Experimental |
| 20 |
avijit-jana/huggingface-nlp-image-tool
An end‑to‑end application leveraging Hugging Face pretrained models for... |
|
Experimental |
| 21 |
inuwamobarak/stable-diffusion
Implementing a diffusion framework with Hugging Face. Stable diffusion... |
|
Experimental |
| 22 |
PRITHIVSAKTHIUR/Kontext-Photo-Mate-v2
Kontext-Photo-Mate-v2 is an advanced image manipulation application built on... |
|
Experimental |
| 23 |
PRITHIVSAKTHIUR/GALLO-3XL
High Quality Image Generation Model - Powered with NVIDIA A100 |
|
Experimental |
| 24 |
PRITHIVSAKTHIUR/Sub-Memory-Efficient-Merging-FluxKreaDev
black-forest-labs/FLUX.1-dev and black-forest-labs/FLUX.1-Krea-dev. This... |
|
Experimental |
| 25 |
PRITHIVSAKTHIUR/Flux.1-dev-4bit
FLUX.1-dev model with 4-bit quantization, quantized model maintains image... |
|
Experimental |
| 26 |
damianoimola/diffit
DiffiT: Diffusion Vision Transformers for Image Generation and DiffiP a... |
|
Experimental |
| 27 |
SaiLikith14/text_to_image_generation
text to image generation |
|
Experimental |
| 28 |
Eyelor/text-to-image-item-generator
A Python workflow for generating random item images using models from Hugging Face. |
|
Experimental |
| 29 |
SARIT42/Image-InPainting-SAM
A combination of Image segmentation, Image editing and in-place Image... |
|
Experimental |
| 30 |
krishnakoushik225/CLAP-Optimized-Text-to-Audio-Generation-AudioLDM-
Inference-time optimization for diffusion-based text-to-audio generation... |
|
Experimental |
| 31 |
andrewtyw/MathImg2LaTeX
Convert pictures of mathematical formulas to LaTeX expressions |
|
Experimental |
| 32 |
PRITHIVSAKTHIUR/StableDiffusion
Continuous progress in AI research leads to the development of more robust... |
|
Experimental |
| 33 |
mwasifanwar/NeuralCanvas
AI-powered digital art studio that transforms sketches into photorealistic... |
|
Experimental |
| 34 |
jayeshbhandarkar/Text-to-Image-Generator-using-Stable-Diffusion-Model
Text to Image Generator and Multilingual Text to Image Generator using... |
|
Experimental |
| 35 |
motartin/PictureFrame
A configurable picture viewer |
|
Experimental |
| 36 |
AmrMKayid/fanan
Art & Creativity in JAX 🎨 💗 |
|
Experimental |
| 37 |
apiraccini/formulae
Simple tool to extract markdown text from images of scientific formulae. |
|
Experimental |