sled-group/3D-GRAND
[CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs
This project offers a large-scale dataset and evaluation tools to improve how AI models understand and respond to instructions about physical 3D spaces. It pairs 3D scene data with grounded text descriptions, so that models trained and evaluated on it can more accurately describe and interact with real-world objects and environments. This is for researchers and developers building embodied AI agents and robots that need to perceive and act within the physical world.
No commits in the last 6 months.
Use this if you are developing AI systems for robotics, augmented reality, or virtual assistants that need to understand and generate language grounded in complex 3D environments.
Not ideal if your AI application solely processes text or 2D images, or if you do not require dense, explicit connections between language and 3D objects.
Stars: 53
Forks: 2
Language: —
License: —
Category: Computer Vision
Last pushed: Jun 13, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/sled-group/3D-GRAND"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
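If you have a key, a minimal sketch of an authenticated request is shown below; the X-API-Key header name is an assumption for illustration, not something documented on this page:

# Hypothetical sketch: the header name is a guess; check the API docs for the actual one.
curl -H "X-API-Key: $YOUR_KEY" \
  "https://pt-edge.onrender.com/api/v1/quality/computer-vision/sled-group/3D-GRAND"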
Higher-rated alternatives
col14m/cadrille
[ICLR2026] cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning
filaPro/cad-recode
[ICCV2025] CAD-Recode: Reverse Engineering CAD Code from Point Clouds
pengsongyou/openscene
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
worldbench/3EED
[NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D
cambrian-mllm/cambrian-s
Cambrian-S: Towards Spatial Supersensing in Video