shmsw25/AmbigQA

The original implementation of the EMNLP 2020 paper "AmbigQA: Answering Ambiguous Open-domain Questions"

/ 100

Emerging

This project helps natural language processing (NLP) researchers develop and evaluate systems that answer ambiguous open-domain questions. It provides a dataset, AmbigNQ, which includes questions that have multiple valid interpretations, each annotated either with its multiple answers or with rephrased, disambiguated questions paired with their answers. NLP researchers and machine learning engineers can use it to train and benchmark question answering models that must handle ambiguity.
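To make the idea of "multiple valid interpretations" concrete, here is a minimal Python sketch of how one ambiguous question with several rephrased question-answer pairs might be represented and scored. The record layout (`qa_pairs`, `answers`) and the `multi_answer_f1` helper are hypothetical names chosen for illustration, not the repository's actual schema or its official evaluation metric; the scoring is a simplified exact-match F1 over interpretations.

```python
from typing import Dict, List

# Hypothetical record layout (an assumption for illustration, not the
# dataset's confirmed schema): one ambiguous question, several rewrites.
example = {
    "question": "When did the Simpsons first air on television?",
    "qa_pairs": [
        {
            "question": "When did the Simpsons first air as a short on "
                        "The Tracey Ullman Show?",
            "answers": ["April 19, 1987"],
        },
        {
            "question": "When did the Simpsons first air as a half-hour "
                        "prime time show?",
            "answers": ["December 17, 1989"],
        },
    ],
}


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial variants compare equal."""
    return " ".join(text.lower().split())


def multi_answer_f1(predictions: List[str], qa_pairs: List[Dict]) -> float:
    """Simplified F1 between predicted answers and reference interpretations.

    An interpretation counts as matched when one of its accepted answers
    equals a not-yet-used prediction after normalization.
    """
    preds = sorted({normalize(p) for p in predictions})
    used = set()
    matched = 0
    for pair in qa_pairs:
        refs = {normalize(a) for a in pair["answers"]}
        hit = next((p for p in preds if p in refs and p not in used), None)
        if hit is not None:
            used.add(hit)
            matched += 1
    if matched == 0:
        return 0.0
    precision = matched / len(preds)
    recall = matched / len(qa_pairs)
    return 2 * precision * recall / (precision + recall)
```

Under these assumptions, a system that returns only one of the two dates gets full precision but half recall, which is the behavior a multi-answer benchmark is meant to expose.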

121 stars. No commits in the last 6 months.

Use this if you are an NLP researcher working on question answering systems and need a dataset to train and evaluate models on questions that might have more than one correct answer depending on interpretation.

Not ideal if you are looking for an out-of-the-box question answering system for direct use in an application or if your questions are always unambiguous with single, definitive answers.

Natural Language Processing Question Answering Machine Learning Research Data Annotation Information Retrieval

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 20 / 25


Stars 121

Forks

Language Python

License —

Higher-rated alternatives

ymcui/cmrc2018

A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)

thunlp/MultiRD

Code and data of the AAAI-20 paper "Multi-channel Reverse Dictionary Model"

princeton-nlp/DensePhrases

[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval...

IndexFziQ/KMRC-Papers

A list of recent papers regarding knowledge-based machine reading comprehension.

danqi/rc-cnn-dailymail

CNN/Daily Mail Reading Comprehension Task
