vaew/Awesome-spatial-visual-reasoning-MLLMs

Repository for awesome spatial/visual reasoning MLLMs. (focus more on embodied applications)

22
/ 100
Experimental

This project helps AI researchers and developers working on embodied AI applications by curating a list of cutting-edge research and open-source projects. It takes research papers, code repositories, and datasets focused on spatial and visual reasoning in Multimodal Large Language Models (MLLMs) as input. The output is an organized catalog that allows practitioners to quickly find relevant resources for building more intelligent, perception-aware AI systems.

No commits in the last 6 months.

Use this if you are an AI researcher or developer looking for the latest advancements and resources in spatial/visual reasoning for embodied MLLMs.

Not ideal if you are an end-user seeking a ready-to-use application or a non-technical person unfamiliar with AI research concepts.

AI Research Embodied AI Computer Vision Natural Language Processing Reinforcement Learning
No License Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 9 / 25
Maturity 7 / 25
Community 4 / 25

How are scores calculated?

Stars

71

Forks

2

Language

Python

License

Last pushed

Jun 26, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/vaew/Awesome-spatial-visual-reasoning-MLLMs"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.