AssemblyAI-Community/MinImagen

MinImagen: A minimal implementation of the Imagen text-to-image model

Emerging

MinImagen generates images from text captions: you supply a caption, and it produces a corresponding image. It is intended for machine learning students and researchers who want to understand how text-to-image models like Imagen work under the hood.

313 stars. No commits in the last 6 months.

Use this if you are studying generative AI and want to see a simplified, working example of a text-to-image diffusion model.

Not ideal if you need a production-ready tool for generating high-quality images, as this version is stripped down for educational clarity.

generative-ai-education diffusion-models text-to-image machine-learning-research computer-vision-learning

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 21 / 25

Stars: 313

Language: Python

License: MIT

Higher-rated alternatives

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

FoundationVision/VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈]...

nerdyrodent/VQGAN-CLIP

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

huggingface/finetrainers

Scalable and memory-optimized training of diffusion models

eps696/aphantasia

CLIP + FFT/DWT/RGB = text to image/video
