ShiZhengyan/StepGame

[AAAI 2022] Dataset and PyTorch code for the paper "StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts" (AAAI 2022, Oral)


Emerging

This project provides a dataset and accompanying code for evaluating how well AI models reason about spatial relationships that are described across multiple sentences of text. Each example pairs a short text story with a question about the spatial arrangement it describes, and the expected output is a label giving the answer. It is primarily useful for AI researchers and natural language processing engineers who build and test reasoning models.
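As a rough illustration of the input and output shape described above, the following Python sketch scores a predictor against a handful of story/question/label examples. The field names (`story`, `question`, `label`), the label strings, and the two toy samples are assumptions made for illustration; they are not taken from the repository's data files, and the real format may differ.

```python
from typing import Callable, Dict, List

# Hypothetical sample layout: field names and label strings are assumptions,
# not the repository's documented schema.
SAMPLES: List[Dict[str, object]] = [
    {
        "story": ["A is to the left of B.", "B is above C."],
        "question": "What is the relation of A to C?",
        "label": "upper-left",
    },
    {
        "story": ["X is below Y.", "Y is to the right of Z."],
        "question": "What is the relation of X to Z?",
        "label": "lower-right",
    },
]

# A predictor maps (story sentences, question) to a relation label.
Predictor = Callable[[List[str], str], str]


def accuracy(samples: List[Dict[str, object]], predict: Predictor) -> float:
    """Return the fraction of samples whose predicted label matches the gold label."""
    if not samples:
        return 0.0
    correct = sum(
        1 for s in samples if predict(s["story"], s["question"]) == s["label"]
    )
    return correct / len(samples)


def constant_baseline(story: List[str], question: str) -> str:
    """Trivial stand-in for a model: always answers with the same label."""
    return "upper-left"


if __name__ == "__main__":
    print(f"accuracy = {accuracy(SAMPLES, constant_baseline):.2f}")
```

A real model would replace `constant_baseline`; the scoring loop only needs a callable that returns one label per question.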

No commits in the last 6 months.

Use this if you are developing or benchmarking AI models that need to perform complex spatial reasoning across several pieces of information within a text.

Not ideal if you are looking for a pre-trained model to directly apply to a business problem, as this is a research benchmark.

AI model evaluation, Natural Language Processing research, textual reasoning, spatial understanding, AI benchmark

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 16 / 25

Language: Python

License: MIT

Higher-rated alternatives

ymcui/cmrc2018

A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)

princeton-nlp/DensePhrases

[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval...

thunlp/MultiRD

Code and data of the AAAI-20 paper "Multi-channel Reverse Dictionary Model"

IndexFziQ/KMRC-Papers

A list of recent papers regarding knowledge-based machine reading comprehension.

danqi/rc-cnn-dailymail

CNN/Daily Mail Reading Comprehension Task
