vincentlux/Awesome-Multimodal-LLM

Reading list for Multimodal Large Language Models

/ 100

Emerging

This is a reading list for anyone deeply involved in or studying advanced AI, specifically focusing on how large language models (LLMs) can process and understand multiple types of data, like images, video, and text, simultaneously. It provides a structured collection of the latest academic papers, tutorials, and datasets in the field. Researchers, academics, and AI practitioners looking to stay current or explore specific areas within multimodal AI would use this resource.

No commits in the last 6 months.

Use this if you are an AI researcher, academic, or practitioner who needs to find cutting-edge research papers, datasets, or tutorials on multimodal large language models and their applications.

Not ideal if you are looking for ready-to-use software, code implementations for immediate development, or an introduction to basic AI concepts.

AI-research machine-learning-engineering natural-language-processing computer-vision multimodal-AI

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 11 / 25

How are scores calculated?

Stars

Forks

Language

—

License

MIT

Compare

Awesome-Multimodal-LLM and awesome-vla-for-ad Awesome-Multimodal-LLM and Awesome-Large-Vision-Language-Model

Higher-rated alternatives

chrisliu298/awesome-llm-unlearning

A resource repository for machine unlearning in large language models

worldbench/awesome-vla-for-ad

🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future

hijkzzz/Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

zjukg/KG-MM-Survey

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

worldbench/awesome-spatial-intelligence

🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems

Explore LLM Tools

All categories Trending LLM Tool directory Insights