Utkarsh-Mishra444/Sparsely-Grounded-Long-Range-Navigation

AgentNav: Zero-shot sparsely grounded long-range visual navigation in real-world cities using Multimodal Large Language Models (MLLMs).

Quality score: 35 / 100 (Emerging)

This project helps urban planning researchers and autonomous navigation system designers understand how AI agents can navigate complex city environments using only visual cues. It takes street-level images from intersections and a destination, then outputs a sequence of navigation decisions and an estimated path, acting like a human driver or pedestrian navigating without GPS or maps. It's designed for those who need to simulate or evaluate advanced visual navigation capabilities in diverse real-world urban settings.

Use this if you are exploring advanced AI techniques for self-localization and pathfinding in real-world urban environments without relying on GPS, maps, or explicit instructions.

Not ideal if you need a simple, ready-to-deploy GPS-based navigation system or a tool for navigating with pre-defined maps and landmarks.

urban-robotics autonomous-navigation visual-pathfinding city-simulation AI-geospatial-reasoning
No License · No Package · No Dependents
Maintenance: 13 / 25
Adoption: 4 / 25
Maturity: 5 / 25
Community: 13 / 25


Stars: 8
Forks: 2
Language: Python
License: None
Last pushed: Mar 17, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/Utkarsh-Mishra444/Sparsely-Grounded-Long-Range-Navigation"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
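The same endpoint can be called from Python. A minimal sketch using only the standard library is below; the URL path segments mirror the curl example above, but the JSON field names (e.g. "score") are assumptions, since the response schema is not documented here.

```python
# Minimal sketch: query the quality API for a repo's score data.
# Field names in the response (e.g. "score") are assumed, not documented.
import json
import urllib.request

BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(category: str, repo: str) -> str:
    """Build the API URL for a repository's quality data."""
    return f"{BASE}/{category}/{repo}"

url = quality_url(
    "computer-vision",
    "Utkarsh-Mishra444/Sparsely-Grounded-Long-Range-Navigation",
)
print(url)

# Uncomment to perform the request (100 requests/day without a key):
# with urllib.request.urlopen(url) as resp:
#     data = json.load(resp)
#     print(data.get("score"))
```

With a free API key, the daily limit rises to 1,000 requests; how the key is passed (header or query parameter) is not specified here, so consult the API's documentation before adding it.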