vincentlux/Awesome-Multimodal-LLM

Reading list for Multimodal Large Language Models

35
/ 100
Emerging

This is a reading list for anyone deeply involved in or studying advanced AI, specifically focusing on how large language models (LLMs) can process and understand multiple types of data, like images, video, and text, simultaneously. It provides a structured collection of the latest academic papers, tutorials, and datasets in the field. Researchers, academics, and AI practitioners looking to stay current or explore specific areas within multimodal AI would use this resource.

No commits in the last 6 months.

Use this if you are an AI researcher, academic, or practitioner who needs to find cutting-edge research papers, datasets, or tutorials on multimodal large language models and their applications.

Not ideal if you are looking for ready-to-use software, code implementations for immediate development, or an introduction to basic AI concepts.

AI-research machine-learning-engineering natural-language-processing computer-vision multimodal-AI
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 11 / 25

How are scores calculated?

Stars

69

Forks

7

Language

License

MIT

Last pushed

Aug 17, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/vincentlux/Awesome-Multimodal-LLM"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.