llmbev/talk2bev
Talk2BEV: Language-Enhanced Bird's Eye View Maps (ICRA'24)
This project helps autonomous vehicle engineers interpret complex driving scenes using natural language. It takes bird's-eye view (BEV) maps, which are common in self-driving systems, and lets you ask questions about them in plain English. It returns answers that help explain the intentions of traffic actors and support decisions grounded in visual information. Autonomous vehicle developers and safety engineers would use this to enhance scene understanding.
119 stars. No commits in the last 6 months.
Use this if you need advanced visual and spatial reasoning over autonomous driving BEV maps via natural language queries, without training for specific object categories.
Not ideal if you only need a system built around predefined, closed sets of object categories and driving scenarios, with no natural language interaction.
Stars
119
Forks
11
Language
Python
License
BSD-3-Clause
Category
Last pushed
Nov 04, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/llmbev/talk2bev"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
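The same endpoint shown in the curl command can be called from a script. Below is a minimal Python sketch using only the standard library; the URL comes from the curl example above, but the response schema is not documented here, so the returned JSON should be inspected rather than assumed.

```python
import json
import urllib.request

# Base path taken from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def repo_quality_url(owner: str, repo: str) -> str:
    """Build the quality-API endpoint URL for an owner/repo slug."""
    return f"{API_BASE}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str) -> dict:
    """Fetch the quality record as JSON.

    The response fields are not documented on this page, so treat the
    returned dict as opaque and inspect its keys.
    """
    with urllib.request.urlopen(repo_quality_url(owner, repo)) as resp:
        return json.load(resp)


if __name__ == "__main__":
    print(repo_quality_url("llmbev", "talk2bev"))
    # Uncomment to hit the live endpoint (100 requests/day without a key):
    # print(fetch_quality("llmbev", "talk2bev"))
```

Without a key this is limited to 100 requests/day, so cache responses if you poll multiple repositories.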
Higher-rated alternatives
chrisliu298/awesome-llm-unlearning
A resource repository for machine unlearning in large language models
worldbench/awesome-vla-for-ad
🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
zjukg/KG-MM-Survey
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey
worldbench/awesome-spatial-intelligence
🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems