YangLing0818/SGDiff
Official implementation for "Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training" https://arxiv.org/abs/2211.11138
This tool generates complex images from descriptive structural annotations. You provide a "scene graph" specifying objects, their attributes (like "red" or "large"), and how they relate to each other (e.g., "person riding a bicycle"). The output is a visual representation of that precise scene. This is ideal for researchers in computer vision or AI who need to create custom image datasets for experiments or model training.
No commits in the last 6 months.
Use this if you need to generate synthetic images with fine-grained control over objects, their properties, and relationships within a scene.
Not ideal if you're looking for a simple text-to-image generator without needing to define explicit object relationships.
Stars
77
Forks
18
Language
Python
License
—
Category
Last pushed
Dec 25, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/YangLing0818/SGDiff"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jayin92/Skyfall-GS
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery
Tencent-Hunyuan/Hunyuan3D-2
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
ActiveVisionLab/gaussctrl
[ECCV 2024] GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
caiyuanhao1998/Open-DiffusionGS
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D...
deepseek-ai/DreamCraft3D
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with...