HenryHZY/Awesome-Multimodal-LLM
Research Trends in LLM-guided Multimodal Learning.
This resource provides an organized overview of the latest advancements in AI models that can understand and process information from various sources like text, images, and audio. It showcases how these advanced models are being guided by large language models to solve complex, real-world tasks. Researchers and practitioners in AI and machine learning will find this valuable for staying current with cutting-edge developments in multimodal AI.
356 stars. No commits in the last 6 months.
Use this if you are an AI researcher or machine learning engineer looking to explore the most recent research and practical implementations of large language models combined with visual, auditory, and textual data.
Not ideal if you are an end-user seeking a ready-to-use application or a general introduction to AI concepts, as this resource focuses on advanced research trends.
Stars
356
Forks
16
Language
—
License
MIT
Category
Last pushed
Oct 17, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/HenryHZY/Awesome-Multimodal-LLM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
FoundationVision/Liquid
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
Paranioar/Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification),...
Yangyi-Chen/Multimodal-AND-Large-Language-Models
Paper list about multimodal and large language models, only used to record papers I read in the...
thuml/AutoTimes
Official implementation for "AutoTimes: Autoregressive Time Series Forecasters via Large Language Models"