YunzeMan/Situation3D
[CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning
This project helps researchers working with 3D environments to improve how AI systems understand and respond to natural language questions about those spaces. It takes 3D scene data (like scans of rooms) and natural language questions, then provides a more accurate understanding of the scene from the perspective of an "embodied agent" within it. This is useful for AI researchers and engineers developing sophisticated 3D vision-language systems for tasks like robotics or virtual assistants.
No commits in the last 6 months.
Use this if you are a researcher or engineer developing AI systems that need to interpret natural language questions about 3D environments from a specific, situated viewpoint.
Not ideal if you are looking for an off-the-shelf application for end-users, or if your primary focus is on 2D image or video analysis without a 3D environmental context.
Stars
43
Forks
2
Language
Python
License
MIT
Category
Last pushed
Dec 09, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/YunzeMan/Situation3D"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
isl-org/Open3D
Open3D: A Modern Library for 3D Data Processing
cvg/Hierarchical-Localization
Visual localization made easy with hloc
gmberton/CosPlace
Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"
Vincentqyw/image-matching-webui
🤗 image matching webui
cvg/glue-factory
Training library for local feature detection and matching