IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
This project is a survey of research on multimodal large language models for autonomous driving. It curates papers and resources that show how these models combine inputs such as road images and spoken commands to make real-time driving decisions. Researchers and engineers in the autonomous vehicle field can use it to keep up with new work.
309 stars. No commits in the last 6 months.
Use this if you are an autonomous driving researcher or engineer looking for a centralized resource to understand the application of multimodal large language models in vehicle perception, planning, and control.
Not ideal if you are looking for ready-to-use software or a codebase to implement an autonomous driving feature directly, as this is a research survey.
Stars: 309
Forks: 13
Language: none listed
License: MIT
Category:
Last pushed: Mar 14, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
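For scripted access, the curl command above can be approximated in Python with the standard library. This is a minimal sketch, not the service's official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the reading of the URL path as category/owner/repo is inferred from the single example URL, and the assumption that the endpoint returns JSON is not confirmed by this page.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example; everything after it is assumed
# to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL (hypothetical helper, pattern inferred from one example)."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumes the response body is JSON; raises urllib.error.URLError or
    json.JSONDecodeError if the request fails or that assumption is wrong.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("transformers", "IrohXu", "Awesome-Multimodal-LLM-Autonomous-Driving")` requests the same URL as the curl command. Without a key, each call counts against the 100 requests/day limit, so caching results locally is worth considering.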
Higher-rated alternatives
BradyFU/Awesome-Multimodal-Large-Language-Models
Latest Advances on Multimodal Large Language Models
FoundationVision/Liquid
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
Paranioar/Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification),...
Yangyi-Chen/Multimodal-AND-Large-Language-Models
Paper list about multimodal and large language models, only used to record papers I read in the...
thuml/AutoTimes
Official implementation for "AutoTimes: Autoregressive Time Series Forecasters via Large Language Models"