j-min/VPGen
Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
VPGen helps creators and designers generate images from text descriptions with more control. You provide a text prompt, and it first breaks down the scene into objects and their arrangement, then generates an image that precisely matches that layout. This is ideal for anyone needing to visualize specific object placements or compositions from a text idea.
No commits in the last 6 months.
Use this if you need fine-grained control over the composition and object placement in AI-generated images, rather than just a general aesthetic.
Not ideal if you're looking for a simple, one-step text-to-image tool without needing to inspect or influence intermediate layout steps.
Stars: 57
Forks: 3
Language: Jupyter Notebook
License: —
Category: diffusion
Last pushed: Jul 25, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/j-min/VPGen"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
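The curl command above can also be scripted. This is a minimal Python sketch using only the standard library; the URL layout (`/quality/<category>/<owner>/<repo>`) is taken from the curl example, but the shape of the JSON response is not documented here, so `fetch_stats` simply returns the parsed body for you to inspect:

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

BASE = "https://pt-edge.onrender.com/api/v1/quality"

def build_url(category: str, owner: str, repo: str) -> str:
    # Mirrors the curl example: /quality/<category>/<owner>/<repo>
    return f"{BASE}/{quote(category)}/{quote(owner)}/{quote(repo)}"

def fetch_stats(category: str, owner: str, repo: str) -> dict:
    # Free tier: 100 requests/day without a key.
    with urlopen(build_url(category, owner, repo)) as resp:
        return json.load(resp)

if __name__ == "__main__":
    print(build_url("diffusion", "j-min", "VPGen"))
```

Note that `urlopen` performs a live request; in production code you would add a timeout and error handling around it.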
Higher-rated alternatives
Vchitect/VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
EndlessSora/focal-frequency-loss
[ICCV 2021] Focal Frequency Loss for Image Reconstruction and Synthesis
JIA-Lab-research/DreamOmni2
This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing...
SkyworkAI/UniPic
Open-source SOTA multi-image editing model