Awesome-Multimodal-Large-Language-Models and Awesome-Multimodal-LLM
These projects are ecosystem siblings: both curate resources on multimodal large language models, with BradyFU's serving as a broader collection of the latest advances and HenryHZY's focusing more narrowly on research trends in LLM-guided multimodal learning.
About Awesome-Multimodal-Large-Language-Models
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles: Latest Advances on Multimodal Large Language Models
This resource helps AI researchers and practitioners stay current with the rapidly evolving field of Multimodal Large Language Models (MLLMs). It provides curated lists of significant research papers, comprehensive surveys, and evaluation benchmarks for MLLMs. The intended users are researchers, students, and engineers who are actively working on or studying advanced AI models that integrate different data types like text, images, and audio.
About Awesome-Multimodal-LLM
HenryHZY/Awesome-Multimodal-LLM
Research Trends in LLM-guided Multimodal Learning.
This resource provides an organized overview of the latest advancements in AI models that can understand and process information from various sources like text, images, and audio. It showcases how these advanced models are being guided by large language models to solve complex, real-world tasks. Researchers and practitioners in AI and machine learning will find this valuable for staying current with cutting-edge developments in multimodal AI.