VachanVY/diffusion-transformer
PyTorch and JAX implementation of Scalable Diffusion Models with Transformers (Diffusion Transformers)
This project generates realistic images from scratch, specifically celebrity-like faces and handwritten digits. You supply a text prompt or a condition (such as 'female' or 'male'), and the system outputs new, unique images that match it. It is aimed at designers, artists, and researchers who need diverse visual content without manual design.
No commits in the last 6 months.
Use this if you need to generate a large volume of diverse and realistic images based on simple textual descriptions or conditions.
Not ideal if you need to edit existing images or require extremely precise control over fine-grained details in the generated output.
Stars: 8
Forks: —
Language: Python
License: MIT
Category:
Last pushed: Jun 19, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/VachanVY/diffusion-transformer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
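The same request can be sketched in Python using only the standard library. This is a minimal sketch, not a documented client: the path layout `/quality/<category>/<owner>/<repo>` is inferred from the single curl example above, the helper names `build_quality_url` and `fetch_quality` are hypothetical, and treating the response as JSON is an assumption.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumption: the path is <category>/<owner>/<repo>, as in the curl example.
    """
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository (hypothetical helper).

    Assumption: the endpoint returns a JSON body. No API key is sent,
    so the request counts against the keyless daily limit.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("diffusion", "VachanVY", "diffusion-transformer")` issues the same GET request as the curl command. If the service turns out to return something other than JSON, `json.loads` will raise an error, and the body should be read as text instead.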
Higher-rated alternatives
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
FoundationVision/VAR
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈]...
nerdyrodent/VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
huggingface/finetrainers
Scalable and memory-optimized training of diffusion models
AssemblyAI-Community/MinImagen
MinImagen: A minimal implementation of the Imagen text-to-image model