BradyFU/Awesome-Multimodal-Large-Language-Models

Latest Advances on Multimodal Large Language Models

53 / 100 (Established)

This resource helps AI researchers and practitioners stay current with the rapidly evolving field of Multimodal Large Language Models (MLLMs). It provides curated lists of significant research papers, comprehensive surveys, and evaluation benchmarks for MLLMs. The intended users are researchers, students, and engineers who are actively working on or studying advanced AI models that integrate different data types like text, images, and audio.

17,448 stars. Actively maintained with 14 commits in the last 30 days.

Use this if you are an AI researcher or developer looking for the latest academic papers, surveys, and evaluation methods related to Multimodal Large Language Models.

Not ideal if you are looking for ready-to-use software or applications built with MLLMs, as this project is a research compilation rather than a tool.

AI research, natural language processing, computer vision, multimodal AI, machine learning
No License, No Package, No Dependents
Maintenance 17 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 18 / 25
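
The four sub-scores above add up to the overall 53 / 100 shown at the top of the page. The page does not state the scoring formula, so treating the total as a plain sum of the four categories is an assumption, checked here only against the numbers listed:

```python
# Sub-scores as listed on this page, each out of 25.
# Assumption: the overall score is the simple sum of the four categories.
subscores = {"Maintenance": 17, "Adoption": 10, "Maturity": 8, "Community": 18}
MAX_PER_CATEGORY = 25

total = sum(subscores.values())
maximum = MAX_PER_CATEGORY * len(subscores)

print(f"{total} / {maximum}")  # prints "53 / 100"
```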


Stars: 17,448
Forks: 1,112
Language: (not listed)
License: none
Last pushed: Mar 12, 2026
Commits (30d): 14

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/BradyFU/Awesome-Multimodal-Large-Language-Models"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
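
For scripted use, the same request can be made from Python. This is a minimal sketch using only the standard library. It assumes the endpoint returns a JSON body, which the page does not confirm, and the names `build_quality_url`, `fetch_quality`, and `category` are hypothetical helpers chosen for illustration; only the URL itself comes from the curl example.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(owner: str, repo: str, category: str = "transformers") -> str:
    """Build the endpoint URL in the same shape as the curl example.

    "category" is a hypothetical name for the path segment that reads
    "transformers" in the curl example; the page does not document it.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(owner: str, repo: str, category: str = "transformers",
                  timeout: float = 10.0):
    """Send a GET request and decode the body as JSON (assumed format)."""
    url = build_quality_url(owner, repo, category)
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request; the keyless tier allows 100 requests/day.
    data = fetch_quality("BradyFU", "Awesome-Multimodal-Large-Language-Models")
    print(json.dumps(data, indent=2))
```

Keeping URL construction separate from the network call makes it possible to check the request path without spending any of the daily request quota.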