opendatalab/UrBench

[AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios”

/ 100

Emerging

This tool helps urban planners, smart city developers, and researchers assess how well AI models understand complex urban environments. It takes in various urban image data, like street views and satellite images, and evaluates AI models on tasks ranging from urban planning analysis to everyday city issues, providing scores for their performance. This is for professionals and researchers working on AI applications for urban development.

No commits in the last 6 months.

Use this if you need a comprehensive and challenging way to benchmark large multimodal AI models specifically for multi-view urban scenarios.

Not ideal if you are evaluating general-purpose AI models or models for non-urban-specific tasks.

urban-planning smart-cities AI-evaluation geospatial-analysis urban-research

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

TsinghuaC3I/MARTI

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

zjunlp/KnowLM

An Open-sourced Knowledgable Large Language Model Framework.

cli99/llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference

tanyuqian/redco

NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to...

stanleylsx/llms_tool

一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测，低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。

Explore Transformer Models

All categories Trending Transformer directory Insights