VachanVY/diffusion-transformer

Pytorch and JAX Implementation of Scalable Diffusion Models with Transformers | Diffusion Transformers in Pytorch and JAX

/ 100

Experimental

This project helps designers, artists, and researchers generate realistic images from scratch, specifically celebrity-like faces and handwritten digits. You input text prompts or conditions (like 'female' or 'male'), and the system outputs new, unique images matching those descriptions. This is for anyone needing to create diverse visual content without manual design.

No commits in the last 6 months.

Use this if you need to generate a large volume of diverse and realistic images based on simple textual descriptions or conditions.

Not ideal if you need to edit existing images or require extremely precise control over fine-grained details in the generated output.

image-generation creative-design synthetic-media digital-art research-prototyping

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

MIT

Higher-rated alternatives

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

FoundationVision/VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈]...

nerdyrodent/VQGAN-CLIP

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

huggingface/finetrainers

Scalable and memory-optimized training of diffusion models

AssemblyAI-Community/MinImagen

MinImagen: A minimal implementation of the Imagen text-to-image model

Explore Diffusion Models

All categories Trending Diffusion directory Insights