mehdidc/feed_forward_vqgan_clip
Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt
This tool generates images directly from text descriptions in a single forward pass, removing the per-prompt latent optimization that standard VQGAN-CLIP pipelines require. You provide a text prompt describing the image you want, and it returns an RGB image. It's designed for digital artists, content creators, or anyone who needs to visualize concepts quickly without extensive manual image manipulation.
140 stars. No commits in the last 6 months.
Use this if you need to rapidly create unique images or visual concepts from text descriptions, or generate multiple diverse images from a single prompt.
Not ideal if you require precise control over every pixel of the output, or if your primary need is photo-realistic images of real-world scenes.
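For intuition, here is a minimal sketch of the feed-forward idea in PyTorch. It is illustrative only, not this repo's actual code: the TextToLatent class, its layer sizes, and the vqgan.decode call are hypothetical stand-ins. The point is that a small network maps a CLIP text embedding straight to a VQGAN latent grid, so generating an image is one forward pass rather than an optimization loop over latents.

import torch
import torch.nn as nn

class TextToLatent(nn.Module):
    # Hypothetical mapper: CLIP text embedding -> VQGAN latent grid.
    def __init__(self, clip_dim=512, latent_channels=256, grid=16):
        super().__init__()
        self.latent_channels = latent_channels
        self.grid = grid
        self.net = nn.Sequential(
            nn.Linear(clip_dim, 1024),
            nn.GELU(),
            nn.Linear(1024, latent_channels * grid * grid),
        )

    def forward(self, text_emb):
        z = self.net(text_emb)
        return z.view(-1, self.latent_channels, self.grid, self.grid)

mapper = TextToLatent()
dummy_emb = torch.randn(1, 512)  # stands in for a CLIP text embedding
z = mapper(dummy_emb)            # (1, 256, 16, 16) latent grid, one forward pass
# image = vqgan.decode(z)        # a real VQGAN decoder would turn z into RGB pixels

Training such a mapper would maximize CLIP similarity between the decoded image and the prompt, which is what lets inference skip per-prompt optimization entirely.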
Stars: 140
Forks: 18
Language: Python
License: MIT
Category: Diffusion
Last pushed: Jan 03, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/mehdidc/feed_forward_vqgan_clip"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
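If you prefer Python over curl, here is a minimal sketch using the requests library. The endpoint is the one shown above; how to attach an API key is not documented on this page, so the sketch uses the keyless free tier, and the JSON response format is assumed.

import requests

url = ("https://pt-edge.onrender.com/api/v1/quality/"
       "diffusion/mehdidc/feed_forward_vqgan_clip")
resp = requests.get(url, timeout=10)  # keyless tier: 100 requests/day
resp.raise_for_status()
print(resp.json())  # repo quality metrics as JSON (assumed response format)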
Higher-rated alternatives
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
FoundationVision/VAR
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈]...
nerdyrodent/VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
huggingface/finetrainers
Scalable and memory-optimized training of diffusion models
AssemblyAI-Community/MinImagen
MinImagen: A minimal implementation of the Imagen text-to-image model