ccmdi/geobench
GeoGuessr benchmark for language models
This project helps evaluate how well different large language models can determine the real-world location of an image, similar to playing GeoGuessr. You provide images (or let the system generate them from GeoGuessr maps) and select the language models you want to test. The output shows you how accurately each model geolocated the images, helping you understand their spatial reasoning capabilities. This is for researchers and developers working with large language models to assess and compare their performance on visual geolocation tasks.
Use this if you need to benchmark and compare the spatial reasoning and image geolocation capabilities of various large language models.
Not ideal if you are looking for a tool to play GeoGuessr for entertainment or to solve general image recognition problems unrelated to geographic inference.
Stars
51
Forks
3
Language
Python
License
MIT
Category
Last pushed
Nov 20, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/ccmdi/geobench"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TsinghuaC3I/MARTI
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
zjunlp/KnowLM
An Open-sourced Knowledgable Large Language Model Framework.
cli99/llm-analysis
Latency and Memory Analysis of Transformer Models for Training and Inference
tanyuqian/redco
NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to...
stanleylsx/llms_tool
一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测,低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。