Yangyi-Chen/Multimodal-AND-Large-Language-Models
Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.
This is a curated list of research papers focused on multimodal and large language models. It provides a categorized overview of recent advancements, allowing researchers and practitioners to quickly find relevant literature on topics like computer vision, natural language processing, and machine learning. The list acts as a personal reading log, updated regularly with significant contributions.
756 stars. Actively maintained with 4 commits in the last 30 days.
Use this if you are an AI researcher, machine learning engineer, or academic looking for a structured and up-to-date collection of papers on multimodal and large language models for your research or to stay current with the field.
Not ideal if you are a beginner looking for introductory materials or an application developer seeking code examples or practical implementation guides.
Stars
756
Forks
43
Language
—
License
—
Category
Last pushed
Jan 22, 2026
Commits (30d)
4
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Yangyi-Chen/Multimodal-AND-Large-Language-Models"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
FoundationVision/Liquid
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
Paranioar/Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification),...
thuml/AutoTimes
Official implementation for "AutoTimes: Autoregressive Time Series Forecasters via Large Language Models"
flixpar/med-ts-llm
MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis