Multimodal Vision Language Models LLM Tools

Comprehensive surveys, benchmarks, and research collections on vision-language models, multimodal learning architectures, and their domain-specific applications (remote sensing, transportation, urban computing, weather). Does NOT include individual model implementations, fine-tuning techniques, or tools for building applications with these models.

There are 38 multimodal vision language model tools tracked. 3 score 50 or above (Established tier). The highest-rated is chrisliu298/awesome-llm-unlearning at 52/100 with 551 stars. 1 of the top 10 is actively maintained.

Get all 38 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=multimodal-vision-language-models&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
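For scripted use, the same request can be issued from Python. The following is a minimal sketch using only the standard library. The endpoint and query parameters are copied from the curl command above; the function names `build_url` and `fetch_projects` are illustrative, and the response schema is not documented here, so the sketch returns the parsed JSON without interpreting it. Whether `limit` accepts values above 20 is not stated in the source.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="llm-tools",
              subcategory="multimodal-vision-language-models",
              limit=20):
    """Return the request URL with an encoded query string."""
    query = urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """GET the URL and parse the body as JSON (no schema is assumed)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example (performs a network request, counts against the daily quota):
# payload = fetch_projects(build_url())
# print(json.dumps(payload, indent=2)[:500])
```

Building the URL separately from fetching keeps the query easy to inspect before spending one of the 100 keyless daily requests.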

| # | Tool | Description | Score | Tier |
|---|------|-------------|-------|------|
| 1 | chrisliu298/awesome-llm-unlearning | A resource repository for machine unlearning in large language models | 52 | Established |
| 2 | worldbench/awesome-vla-for-ad | 🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future | 50 | Established |
| 3 | hijkzzz/Awesome-LLM-Strawberry | A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓... | 50 | Established |
| 4 | zjukg/KG-MM-Survey | Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey | 49 | Emerging |
| 5 | worldbench/awesome-spatial-intelligence | 🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training... | 47 | Emerging |
| 6 | worldbench/DriveBench | [ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from... | 43 | Emerging |
| 7 | PeterGriffinJin/Awesome-Language-Model-on-Graphs | A curated list of papers and resources based on "Large Language Models on... | 42 | Emerging |
| 8 | sou350121/VLA-Handbook | This project aims to provide algorithm engineers seeking to enter the VLA (Vision-Language-Action) field with an all-Chinese, practice-oriented study and interview handbook. Unlike general-purpose... | 40 | Emerging |
| 9 | EmulationAI/awesome-large-audio-models | Collection of resources on the applications of Large Language Models (LLMs)... | 39 | Emerging |
| 10 | THUMNLab/awesome-large-graph-model | Papers about large graph models. | 38 | Emerging |
| 11 | MIT-SPARK/LP2 | Long-term Human Trajectory Prediction using 3D DSGs | 38 | Emerging |
| 12 | PJLab-ADG/awesome-knowledge-driven-AD | A curated list of awesome knowledge-driven autonomous driving (continually updated) | 38 | Emerging |
| 13 | llmbev/talk2bev | Talk2BEV: Language-Enhanced Bird's Eye View Maps (ICRA'24) | 38 | Emerging |
| 14 | RManLuo/Awesome-LLM-KG | Awesome papers about unifying LLMs and KGs | 38 | Emerging |
| 15 | WLiK/LLM4Rec-Awesome-Papers | A list of awesome papers and resources of recommender system on large... | 37 | Emerging |
| 16 | SuperBruceJia/Awesome-Large-Vision-Language-Model | Awesome Large Vision-Language Model: A Curated List of Large Vision-Language Model | 36 | Emerging |
| 17 | LJungang/Awesome-Video-Reasoning-Landscape | 🔥An open-source survey of the latest video reasoning tasks, paradigms, and... | 36 | Emerging |
| 18 | tongnie/awesome-llm4tr | Exploring the Roles of Large Language Models in Reshaping Transportation... | 35 | Emerging |
| 19 | vincentlux/Awesome-Multimodal-LLM | Reading list for Multimodal Large Language Models | 35 | Emerging |
| 20 | basiclab/TTSG | Traffic Scene Generation from Natural Language Description for Autonomous... | 34 | Emerging |
| 21 | archersama/awesome-recommend-system-pretraining-papers | Paper List for Recommend-system PreTrained Models | 32 | Emerging |
| 22 | Xiaohao-Liu/Awesome-Multi-Token-Prediction | A curated list of papers, tools, and resources on Multi-Token Prediction... | 32 | Emerging |
| 23 | Atomic-man007/Awesome_Multimodel_LLM | Awesome_Multimodel is a curated GitHub repository that provides a... | 31 | Emerging |
| 24 | westlake-repl/MicroLens | A Large Short-video Recommendation Dataset with Raw Text/Audio/Image/Videos... | 31 | Emerging |
| 25 | YingqingHe/Awesome-LLMs-meet-Multimodal-Generation | 🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image,... | 31 | Emerging |
| 26 | tongnie/IMPEL | TRE'25: Joint Estimation and Prediction of City-wide Delivery Demand: A... | 28 | Experimental |
| 27 | davendw49/Awesome-Long-Context-Language-Modeling | Papers of Long Context Language Model | 28 | Experimental |
| 28 | AdityaLab/MM4TSA | A professional list on Multi-Modalities For Time Series Analysis (MM4TSA)... | 27 | Experimental |
| 29 | OpenTSLab/TimeOmni | [ICLR 2026] Official implementation of SciTS: Scientific Time Series... | 26 | Experimental |
| 30 | ThomasVonWu/Awesome-VLMs-Strawberry | A collection of VLMs papers, blogs, and projects, with a focus on VLMs in... | 25 | Experimental |
| 31 | Tangkfan/Awesome-Temporal-Video-Grounding | paper list on Video Moment Retrieval (VMR), or Temporal Video Grounding... | 24 | Experimental |
| 32 | Nehs6xy3hgdguzjs/Awesome-Video-Reasoning | 🎥 Explore cutting-edge research focused on reasoning with video models,... | 24 | Experimental |
| 33 | edujbarrios/awesome-vision-ai-stack | A curated, builder-first list of Vision Language Models (VLMs), local... | 22 | Experimental |
| 34 | showlab/Awesome-Long-Context | A curated list of resources about long-context in large-language models and... | 21 | Experimental |
| 35 | HKUDS/Awesome-LLM4Urban-Papers | [ACM TIST] "LLM4Urban: Urban Computing in the Era of Large Language Models" | 19 | Experimental |
| 36 | bailynlove/Awesome-OCR-Vision-Based-Context-Compression | Awesome list of paper on vision-based context compression | 16 | Experimental |
| 37 | Shaadalam9/llms-av-crowdsourced | This repository contains the code and analysis for the research paper "Cross... | 13 | Experimental |
| 38 | Charmve/PuppyGo | vision language model and large language model powered embodied robot | 12 | Experimental |
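The Tier column appears to follow directly from the score. The sketch below reproduces that mapping from the scores listed above. The cutoffs are inferred from the listing rather than stated by the source: every score of 50 or more is Established, scores from 31 to 49 are Emerging, and scores of 28 or less are Experimental. No listed project scores 29 or 30, so placing the lower boundary at 30 is an assumption, as are the names `tier_for`, `ESTABLISHED_MIN`, and `EMERGING_MIN`.

```python
from collections import Counter

# Cutoffs inferred from the listing; the source does not publish them.
ESTABLISHED_MIN = 50
EMERGING_MIN = 30  # assumed: no listed score falls at 29 or 30

# Scores of the 38 listed projects, in rank order.
SCORES = [
    52, 50, 50, 49, 47, 43, 42, 40, 39, 38,
    38, 38, 38, 38, 37, 36, 36, 35, 35, 34,
    32, 32, 31, 31, 31, 28, 28, 27, 26, 25,
    24, 24, 22, 21, 19, 16, 13, 12,
]


def tier_for(score):
    """Map a 0-100 score to a tier name using the inferred cutoffs."""
    if score >= ESTABLISHED_MIN:
        return "Established"
    if score >= EMERGING_MIN:
        return "Emerging"
    return "Experimental"


# Tally how many listed projects fall into each tier.
tier_counts = Counter(tier_for(score) for score in SCORES)
print(dict(tier_counts))
```

Under these assumed cutoffs the tally is 3 Established, 22 Emerging, and 13 Experimental projects, which matches the tier labels in the listing.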