worldbench/DriveBench
[ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives
DriveBench is a benchmark dataset for evaluating how well Vision-Language Models (VLMs) understand complex driving scenarios. It pairs images with text-based questions about driving situations; a VLM's answers to these questions reveal whether the model truly understands the visual context, especially under challenging conditions. Autonomous driving researchers and engineers can use it to rigorously test and improve the reliability of their VLM-powered systems.
Use this if you are developing or evaluating AI systems for autonomous vehicles and need to test how reliably Vision-Language Models interpret driving scenes under various conditions, including degraded visual input.
Not ideal if you are looking for a dataset to train general-purpose Vision-Language Models outside of the autonomous driving domain.
Stars: 232
Forks: 15
Language: Python
License: Apache-2.0
Category:
Last pushed: Dec 12, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/worldbench/DriveBench"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
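For programmatic use, the endpoint above can be queried from Python instead of curl. A minimal sketch is below; note that the response field names (`stars`, `forks`, `license`) and the Bearer-token auth scheme for keyed access are assumptions, not documented API behavior.

```python
import json
import urllib.request
from typing import Optional

API_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools/worldbench/DriveBench"


def fetch_repo_quality(url: str = API_URL, api_key: Optional[str] = None) -> dict:
    """Fetch quality data for a repository as parsed JSON.

    Without a key the API allows 100 requests/day; a free key raises
    the limit to 1,000/day. The auth header below is an assumption.
    """
    headers = {"Accept": "application/json"}
    if api_key:
        headers["Authorization"] = f"Bearer {api_key}"  # assumed auth scheme
    req = urllib.request.Request(url, headers=headers)
    with urllib.request.urlopen(req, timeout=10) as resp:
        return json.load(resp)


def summarize(payload: dict) -> str:
    """One-line summary from an assumed response shape."""
    stars = payload.get("stars", "?")
    forks = payload.get("forks", "?")
    license_name = payload.get("license", "unknown license")
    return f"{stars} stars, {forks} forks ({license_name})"
```

A quick local check with a sample payload matching the stats shown on this page: `summarize({"stars": 232, "forks": 15, "license": "Apache-2.0"})` yields `"232 stars, 15 forks (Apache-2.0)"`.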
Higher-rated alternatives
- chrisliu298/awesome-llm-unlearning — A resource repository for machine unlearning in large language models
- worldbench/awesome-vla-for-ad — 🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future
- hijkzzz/Awesome-LLM-Strawberry — A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques
- zjukg/KG-MM-Survey — Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey
- worldbench/awesome-spatial-intelligence — 🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems