zubair-irshad/CenterSnap

PyTorch code for ICRA'22 paper: "Single-Shot Multi-Object 3D Shape Reconstruction and Categorical 6D Pose and Size Estimation"

/ 100

Emerging

This project helps roboticists and computer vision engineers identify and localize objects in 3D space. It takes a single RGB-D image (color plus depth) as input and outputs the 3D shape, 6D pose (3D rotation plus 3D translation), and size of each object detected in that image. It suits robotic manipulation systems, augmented reality applications, and inventory management solutions.
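To make "6D pose and size" concrete, the sketch below turns one rotation, one translation, and one set of box extents into the eight corners of an oriented 3D bounding box in the camera frame. It is only an illustration of the concept: the function names (`box_corners`, `rotation_z`), the matrix layout, and the sample values are assumptions chosen for this example and do not reflect CenterSnap's actual output format or API.

```python
import math


def box_corners(rotation, translation, size):
    """Return the 8 corners of an oriented 3D box in the camera frame.

    rotation:    3x3 rotation matrix as nested lists (hypothetical layout)
    translation: (tx, ty, tz) position of the box centre
    size:        (sx, sy, sz) full extents along the object's own axes
    """
    half = [s / 2.0 for s in size]
    corners = []
    for dx in (-1, 1):
        for dy in (-1, 1):
            for dz in (-1, 1):
                # Corner expressed in the object's local frame.
                local = (dx * half[0], dy * half[1], dz * half[2])
                # Rotate into the camera frame, then shift by the translation.
                corner = tuple(
                    sum(rotation[r][c] * local[c] for c in range(3)) + translation[r]
                    for r in range(3)
                )
                corners.append(corner)
    return corners


def rotation_z(angle_rad):
    """Rotation matrix for a turn of angle_rad about the z axis."""
    c, s = math.cos(angle_rad), math.sin(angle_rad)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


# Made-up example values: an object turned 90 degrees about z, 0.8 units from the camera.
pose_R = rotation_z(math.pi / 2)
pose_t = (0.1, 0.0, 0.8)
size = (0.2, 0.1, 0.3)
corners = box_corners(pose_R, pose_t, size)
```

Because the example object is rotated 90 degrees about the z axis, its extents along the camera's x and y axes swap relative to `size`, which is the kind of effect a pose estimate has to capture.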

325 stars. No commits in the last 6 months.

Use this if you need to quickly and accurately determine the 3D characteristics and spatial location of multiple objects from a single camera view.

Not ideal if you use a camera with custom parameters and cannot fine-tune the network on your own data.

robotics computer-vision 3d-reconstruction object-detection augmented-reality

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 21 / 25


Stars

325

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

3DOM-FBK/deep-image-matching

Multiview matching with deep-learning and hand-crafted local features for COLMAP and other SfM...

suhangpro/mvcnn

Multi-view CNN (MVCNN) for shape recognition

zouchuhang/LayoutNet

Torch implementation of our CVPR 18 paper: "LayoutNet: Reconstructing the 3D Room Layout from a...

andyzeng/tsdf-fusion-python

Python code to fuse multiple RGB-D images into a TSDF voxel volume.

andyzeng/tsdf-fusion

Fuse multiple depth frames into a TSDF voxel volume.
