BradyFU/Awesome-Multimodal-Large-Language-Models
Latest Advances on Multimodal Large Language Models
This resource helps AI researchers and practitioners stay current with the rapidly evolving field of Multimodal Large Language Models (MLLMs). It provides curated lists of significant research papers, comprehensive surveys, and evaluation benchmarks for MLLMs. The intended users are researchers, students, and engineers who are actively working on or studying advanced AI models that integrate different data types like text, images, and audio.
17,448 stars. Actively maintained with 14 commits in the last 30 days.
Use this if you are an AI researcher or developer looking for the latest academic papers, surveys, and evaluation methods related to Multimodal Large Language Models.
Not ideal if you are looking for ready-to-use software or applications built with MLLMs, as this project is a research compilation rather than a tool.
Stars: 17,448
Forks: 1,112
Language: —
License: —
Category:
Last pushed: Mar 12, 2026
Commits (30d): 14
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/BradyFU/Awesome-Multimodal-Large-Language-Models"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
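For scripted use, the same request can be sketched in Python with only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the response is assumed to be JSON, which this page does not confirm. The sketch uses the keyless tier, since the page does not say how a key is passed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def build_quality_url(repo: str) -> str:
    """Return the endpoint URL for an 'owner/name' repository slug."""
    owner, name = repo.split("/", 1)
    # Percent-encode each path segment separately so the slash is preserved.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(name, safe='')}"


def fetch_quality(repo: str, timeout: float = 10.0):
    """Fetch the endpoint and decode the body.

    Assumption: the body is JSON. If it is not, json.loads raises
    json.JSONDecodeError and the raw bytes should be inspected instead.
    """
    url = build_quality_url(repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("BradyFU/Awesome-Multimodal-Large-Language-Models")` issues one request, so it counts against the 100 requests/day keyless limit stated above.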
Related models
FoundationVision/Liquid
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
Paranioar/Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification),...
Yangyi-Chen/Multimodal-AND-Large-Language-Models
Paper list about multimodal and large language models, only used to record papers I read in the...
thuml/AutoTimes
Official implementation for "AutoTimes: Autoregressive Time Series Forecasters via Large Language Models"
flixpar/med-ts-llm
MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis