cvlab-columbia/pix2gestalt

Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)

/ 100

Emerging

When working with images where objects are partially hidden, this tool helps you understand and process those objects as if they were fully visible. It takes an image with occluded objects and, using your input to outline the visible parts, it outputs a complete, unoccluded image of each object, along with precise boundaries for both the visible and imagined hidden parts. This is for computer vision researchers, robotics engineers, or anyone developing systems that need to 'see' and interact with objects even when they're partially blocked.

200 stars. No commits in the last 6 months.

Use this if you need to perform object recognition, scene understanding, or 3D reconstruction on images containing objects that are partially obscured.

Not ideal if you are looking for a tool that works in real-time on consumer-grade hardware or if your primary interest is simple visible-only object segmentation.

computer-vision-research robotics scene-understanding amodal-perception object-recognition

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 10 / 25

How are scores calculated?

Stars

200

Forks

Language

Python

License

—

Higher-rated alternatives

jayin92/Skyfall-GS

Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery

Tencent-Hunyuan/Hunyuan3D-2

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

ActiveVisionLab/gaussctrl

[ECCV 2024] GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

caiyuanhao1998/Open-DiffusionGS

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D...

deepseek-ai/DreamCraft3D

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with...

Explore Diffusion Models

All categories Trending Diffusion directory Insights