cambrian-mllm/cambrian-s

Cambrian-S: Towards Spatial Supersensing in Video

/ 100

Emerging

This project offers models that dramatically improve how AI systems understand spatial relationships within video footage. It takes raw video data and outputs highly accurate spatial reasoning, enabling AI to better answer questions about object locations, movements, and interactions. This is ideal for researchers and developers building sophisticated video analysis tools, intelligent surveillance, or autonomous systems.

507 stars.

Use this if you need an AI model that can precisely understand where things are and how they move in videos, especially for complex spatial reasoning tasks.

Not ideal if your primary need is general video understanding without a strong focus on intricate spatial details or if you lack development resources to integrate advanced models.

video-analysis spatial-reasoning computer-vision AI-development autonomous-systems

No Package No Dependents

Maintenance 6 / 25

Adoption 10 / 25

Maturity 15 / 25

Community 11 / 25

How are scores calculated?

Stars

507

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

col14m/cadrille

[ICLR2026] cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning

filaPro/cad-recode

[ICCV2025] CAD-Recode: Reverse Engineering CAD Code from Point Clouds

pengsongyou/openscene

[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies

worldbench/3EED

[NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D

Gorilla-Lab-SCUT/PaDT

[ICLR 2026] Official implementation of "Patch-as-Decodable-Token: Towards Unified Multi-Modal...

Explore Computer Vision Tools

All categories Trending Computer Vision directory Insights