NVlabs/VoxFormer

Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]

/ 100

Emerging

This tool helps autonomous vehicle developers and robotics engineers create complete 3D maps of their surroundings using only standard 2D camera images. It takes raw camera feeds and an estimated depth map, then generates a detailed 3D scene where every visible and occluded object is identified and categorized. The result is a richer understanding of the environment, crucial for navigation and interaction.

1,178 stars. No commits in the last 6 months.

Use this if you need to understand the full 3D geometry and semantics of a scene, including occluded areas, from camera images alone, for applications like self-driving or robotic perception.

Not ideal if your application requires real-time processing with extremely low latency on resource-constrained hardware, or if you already have access to high-fidelity LiDAR or radar data for 3D sensing.

autonomous-driving robotics-perception 3d-scene-reconstruction computer-vision semantic-mapping

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

1,178

Forks

Language

Python

License

—

Higher-rated alternatives

isl-org/Open3D

Open3D: A Modern Library for 3D Data Processing

cvg/Hierarchical-Localization

Visual localization made easy with hloc

gmberton/CosPlace

Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"

Vincentqyw/image-matching-webui

🤗 image matching webui

cvg/glue-factory

Training library for local feature detection and matching

Explore Computer Vision Tools

All categories Trending Computer Vision directory Insights