MultimodalGeo/GeoText-1652
An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching
This project provides a unique dataset and model for developing natural language-guided drones. It takes in drone, satellite, or ground camera images and natural language descriptions, and outputs bounding boxes linked to specific textual elements, enabling drones to understand and act on spatial commands. It's intended for researchers and engineers working on autonomous drone navigation and control systems.
114 stars. No commits in the last 6 months.
Use this if you are developing or evaluating AI models that allow drones to interpret and navigate based on natural language instructions, particularly focusing on spatial relationships in real-world imagery.
Not ideal if you are looking for a plug-and-play drone control system, as this project provides a benchmark dataset and model for research and development, not an out-of-the-box solution.
Stars
114
Forks
7
Language
Python
License
—
Category
Last pushed
Jan 26, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/MultimodalGeo/GeoText-1652"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ymcui/cmrc2018
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)
thunlp/MultiRD
Code and data of the AAAI-20 paper "Multi-channel Reverse Dictionary Model"
princeton-nlp/DensePhrases
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval...
IndexFziQ/KMRC-Papers
A list of recent papers regarding knowledge-based machine reading comprehension.
danqi/rc-cnn-dailymail
CNN/Daily Mail Reading Comprehension Task