vincentlux/Awesome-Multimodal-LLM
Reading list for Multimodal Large Language Models
This is a reading list on multimodal large language models (LLMs): models that process and understand several types of data, such as images, video, and text, together. It provides a structured collection of academic papers, tutorials, and datasets in the field. It is aimed at researchers, academics, and AI practitioners who want to stay current or explore specific areas within multimodal AI.
No commits in the last 6 months.
Use this if you are an AI researcher, academic, or practitioner who needs to find cutting-edge research papers, datasets, or tutorials on multimodal large language models and their applications.
Not ideal if you are looking for ready-to-use software, code implementations for immediate development, or an introduction to basic AI concepts.
Stars: 69
Forks: 7
Language: —
License: MIT
Category:
Last pushed: Aug 17, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/vincentlux/Awesome-Multimodal-LLM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
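The same request can be made from a script. The following is a minimal sketch in Python using only the standard library. The helper names `build_url` and `fetch_quality` are hypothetical, and the response format is not documented here, so the body is returned as raw text (assumed to be UTF-8) rather than parsed. No API key is sent, so the keyless daily limit applies.

```python
import urllib.request
from urllib.parse import quote

# Endpoint prefix taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def build_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper).

    Each path segment is percent-encoded so that unusual characters
    in an owner or repository name cannot alter the path.
    """
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the raw response body as text (hypothetical helper).

    Assumption: the body is UTF-8 text. Network or HTTP failures
    surface as urllib.error.URLError or urllib.error.HTTPError.
    """
    with urllib.request.urlopen(build_url(owner, repo), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


if __name__ == "__main__":
    # Performs one real request when run as a script.
    print(fetch_quality("vincentlux", "Awesome-Multimodal-LLM"))
```

With the keyless limit of 100 requests/day, a script that polls many repositories should cache results locally rather than refetch on every run.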
Higher-rated alternatives
chrisliu298/awesome-llm-unlearning
A resource repository for machine unlearning in large language models
worldbench/awesome-vla-for-ad
🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
zjukg/KG-MM-Survey
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey
worldbench/awesome-spatial-intelligence
🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems