ccmdi/geobench

GeoGuessr benchmark for language models

37
/ 100
Emerging

This project helps evaluate how well different large language models can determine the real-world location of an image, similar to playing GeoGuessr. You provide images (or let the system generate them from GeoGuessr maps) and select the language models you want to test. The output shows you how accurately each model geolocated the images, helping you understand their spatial reasoning capabilities. This is for researchers and developers working with large language models to assess and compare their performance on visual geolocation tasks.

Use this if you need to benchmark and compare the spatial reasoning and image geolocation capabilities of various large language models.

Not ideal if you are looking for a tool to play GeoGuessr for entertainment or to solve general image recognition problems unrelated to geographic inference.

AI-research language-model-evaluation geolocation spatial-reasoning image-analysis
No Package No Dependents
Maintenance 6 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 7 / 25

How are scores calculated?

Stars

51

Forks

3

Language

Python

License

MIT

Last pushed

Nov 20, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/ccmdi/geobench"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.