WZDTHU/NiT
[NeurIPS 2025] Native-resolution diffusion Transformer
This tool helps creative professionals and researchers generate high-quality images from class descriptions, producing visually rich content for various applications. It takes categorical class labels as input and generates images at arbitrary resolutions and aspect ratios, from standard sizes up through 1536x1536 pixels and higher. Visual artists, designers, and AI researchers who need to create realistic or stylized imagery would find this useful.
283 stars. No commits in the last 6 months.
Use this if you need to generate images from categorical inputs with precise control over output resolution and aspect ratio, including producing very high-resolution or uniquely shaped images.
Not ideal if you need to generate images from complex text prompts or require fine-grained control over specific visual elements within the generated image beyond broad categories.
Stars: 283
Forks: 18
Language: Python
License: Apache-2.0
Category:
Last pushed: Oct 14, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/WZDTHU/NiT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
FoundationVision/VAR
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈]...
nerdyrodent/VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
huggingface/finetrainers
Scalable and memory-optimized training of diffusion models
AssemblyAI-Community/MinImagen
MinImagen: A minimal implementation of the Imagen text-to-image model