YunzeMan/Situation3D

[CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning

Experimental

This project helps researchers improve how AI systems understand and answer natural language questions about 3D environments. It takes 3D scene data (such as room scans) and natural language questions, then interprets the scene more accurately from the perspective of an "embodied agent" situated within it. It is aimed at AI researchers and engineers building 3D vision-language systems for tasks such as robotics or virtual assistants.

No commits in the last 6 months.

Use this if you are a researcher or engineer developing AI systems that need to interpret natural language questions about 3D environments from a specific, situated viewpoint.

Not ideal if you are looking for an off-the-shelf application for end-users, or if your primary focus is on 2D image or video analysis without a 3D environmental context.

3D-vision natural-language-processing robotics-perception embodied-AI scene-understanding

Stale (6 months), no package, no dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 5 / 25

Language: Python

License: MIT

Higher-rated alternatives

isl-org/Open3D

Open3D: A Modern Library for 3D Data Processing

cvg/Hierarchical-Localization

Visual localization made easy with hloc

gmberton/CosPlace

Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"

Vincentqyw/image-matching-webui

🤗 image matching webui

cvg/glue-factory

Training library for local feature detection and matching
