juletx/spatial-reasoning
Grounding Language Models for Compositional and Spatial Reasoning
This project evaluates and enhances AI models' ability to understand spatial relationships between objects described in text, especially when multiple objects are involved. It takes text descriptions (like "a person stands and a dog sits") and visual data to determine if the AI can accurately grasp complex spatial and compositional nuances. Researchers and AI developers working on improving the spatial intelligence of language models would find this valuable.
No commits in the last 6 months.
Use this if you are a researcher or AI developer working to improve how AI understands and represents spatial and compositional relationships from language and images.
Not ideal if you are looking for an end-user application to directly process images or text for everyday spatial analysis without deep AI model development.
Stars
18
Forks
3
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Oct 26, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/juletx/spatial-reasoning"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
MediaTek-NeuroPilot/mai21-learned-smartphone-isp
The official codebase for the Learned Smartphone ISP Challenge in MAI @ CVPR 2021
ashishpatel26/365-Days-Computer-Vision-Learning-Linkedin-Post
365 Days Computer Vision Learning Linkedin Post
amusi/daily-paper-computer-vision
记录每天整理的计算机视觉/深度学习/机器学习相关方向的论文
extreme-assistant/ICCV2023-Paper-Code-Interpretation
ICCV2021/2019/2017 论文/代码/解读/直播合集,极市团队整理
extreme-assistant/survey-computer-vision-2020
2020-2021年计算机视觉综述论文分方向整理