Text to Image Generation Transformer Models

Tools for generating, manipulating, and editing images from text prompts using diffusion models and related generative techniques. Does NOT include general image classification, detection, or non-generative image processing tasks.

There are 37 text to image generation models tracked. 1 score above 70 (verified tier). The highest-rated is filipstrand/mflux at 73/100 with 1,882 stars. 1 of the top 10 are actively maintained.

Get all 37 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=text-to-image-generation&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 filipstrand/mflux

MLX native implementations of state-of-the-art generative image models

73
Verified
2 potamides/DeTikZify

Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ.

53
Established
3 FoundationVision/Infinity

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for...

48
Emerging
4 zai-org/CogView

Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView:...

46
Emerging
5 EleutherAI/DALLE-mtf

Open-AI's DALL-E for large scale training in mesh-tensorflow.

42
Emerging
6 Alpha-VLLM/Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

42
Emerging
7 TextGeneratorio/text-generator.io

Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io

42
Emerging
8 kyegomez/Fusion3D

An extremely experimental model that intakes images and generates 3D scenes...

39
Emerging
9 amazon-science/text_generation_diffusion_llm_topic

Topic Embedding, Text Generation and Modeling using diffusion

39
Emerging
10 ivonajdenkoska/tulip

[ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP"

37
Emerging
11 RishabSA/Sketch2Graphviz

Sketch2Graphviz allows you to convert sketches or images of graphs and...

36
Emerging
12 allenai/x-lxmert

PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer...

33
Emerging
13 aimagelab/Emuru-autoregressive-text-img

Official PyTorch implementation for "Zero-Shot Styled Text Image Generation,...

30
Emerging
14 EchoSingh/GitHub_Profile_Picture

A guide code to generate your ai profile picture

30
Emerging
15 renan-siqueira/image-to-text-tool

This tool processes images and generates textual descriptions using advanced...

30
Emerging
16 affjljoo3581/Inverse-DALL-E-for-Optical-Character-Recognition

Inverse DALL-E for Optical Character Recognition

29
Experimental
17 robjsliwa/mlx-sd-single-file-models

Single safetensors file support Apple MLX Stable Diffusion

28
Experimental
18 andyngdz/exogen_backend

ExoGen Backend

27
Experimental
19 FredyRivera-dev/Flux2-from-scratch

This repo proposes to implement the Flux2 model from scratch

25
Experimental
20 avijit-jana/huggingface-nlp-image-tool

An end‑to‑end application leveraging Hugging Face pretrained models for...

24
Experimental
21 inuwamobarak/stable-diffusion

Implementing a diffusion framework with Hugging Face. Stable diffusion...

23
Experimental
22 PRITHIVSAKTHIUR/Kontext-Photo-Mate-v2

Kontext-Photo-Mate-v2 is an advanced image manipulation application built on...

21
Experimental
23 PRITHIVSAKTHIUR/GALLO-3XL

High Quality Image Generation Model - Powered with NVIDIA A100

19
Experimental
24 PRITHIVSAKTHIUR/Sub-Memory-Efficient-Merging-FluxKreaDev

black-forest-labs/FLUX.1-dev and black-forest-labs/FLUX.1-Krea-dev. This...

18
Experimental
25 PRITHIVSAKTHIUR/Flux.1-dev-4bit

FLUX.1-dev model with 4-bit quantization, quantized model maintains image...

18
Experimental
26 damianoimola/diffit

DiffiT: Diffusion Vision Transformers for Image Generation and DiffiP a...

18
Experimental
27 SaiLikith14/text_to_image_generation

text to image generation

17
Experimental
28 Eyelor/text-to-image-item-generator

A Python workflow for generating random item images using models from Hugging Face.

17
Experimental
29 SARIT42/Image-InPainting-SAM

A combination of Image segmentation, Image editing and in-place Image...

17
Experimental
30 krishnakoushik225/CLAP-Optimized-Text-to-Audio-Generation-AudioLDM-

Inference-time optimization for diffusion-based text-to-audio generation...

13
Experimental
31 andrewtyw/MathImg2LaTeX

Convert pictures of mathematical formulas to LaTeX expressions

12
Experimental
32 PRITHIVSAKTHIUR/StableDiffusion

Continuous progress in AI research leads to the development of more robust...

12
Experimental
33 mwasifanwar/NeuralCanvas

AI-powered digital art studio that transforms sketches into photorealistic...

12
Experimental
34 jayeshbhandarkar/Text-to-Image-Generator-using-Stable-Diffusion-Model

Text to Image Generator and Multilingual Text to Image Generator using...

11
Experimental
35 motartin/PictureFrame

A configurable picture viewer

11
Experimental
36 AmrMKayid/fanan

Art & Creativity in JAX 🎨 💗

11
Experimental
37 apiraccini/formulae

Simple tool to extract markdown text from images of scientific formulae.

10
Experimental