vaew/Awesome-spatial-visual-reasoning-MLLMs
Repository for awesome spatial/visual reasoning MLLMs. (focus more on embodied applications)
This project helps AI researchers and developers working on embodied AI applications by curating a list of cutting-edge research and open-source projects. It takes research papers, code repositories, and datasets focused on spatial and visual reasoning in Multimodal Large Language Models (MLLMs) as input. The output is an organized catalog that allows practitioners to quickly find relevant resources for building more intelligent, perception-aware AI systems.
No commits in the last 6 months.
Use this if you are an AI researcher or developer looking for the latest advancements and resources in spatial/visual reasoning for embodied MLLMs.
Not ideal if you are an end-user seeking a ready-to-use application or a non-technical person unfamiliar with AI research concepts.
Stars
71
Forks
2
Language
Python
License
—
Category
Last pushed
Jun 26, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/vaew/Awesome-spatial-visual-reasoning-MLLMs"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
chrisliu298/awesome-llm-unlearning
A resource repository for machine unlearning in large language models
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
FoundationVision/Liquid
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
Paranioar/Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification),...
Yangyi-Chen/Multimodal-AND-Large-Language-Models
Paper list about multimodal and large language models, only used to record papers I read in the...