ASoleimaniB/NLQuAD
NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021
This project offers a specialized dataset for training and evaluating question-answering systems. It takes BBC news articles as input and provides long, non-factoid questions derived from sub-headings, along with their corresponding body paragraphs as answers. Data scientists and researchers in natural language processing would use this to develop and benchmark models capable of understanding and extracting comprehensive answers from lengthy texts.
No commits in the last 6 months.
Use this if you are a researcher or data scientist developing question-answering models and need a challenging dataset focused on extracting long, descriptive answers from news articles.
Not ideal if you are looking for a pre-built tool to answer questions from documents, or if your primary need is for short, fact-based answers.
Stars
13
Forks
1
Language
Python
License
—
Category
Last pushed
May 18, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/ASoleimaniB/NLQuAD"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PaddlePaddle/RocketQA
🚀 RocketQA, dense retrieval for information retrieval and question answering, including both...
shuaihuaiyi/QA
使用深度å¦ä¹ ç®—æ³•å®žçŽ°çš„ä¸æ–‡é—®ç”系统
allenai/deep_qa
A deep NLP library, based on Keras / tf, focused on question answering (but useful for other NLP too)
worldbank/iQual
iQual is a package that leverages natural language processing to scale up interpretative...
fhamborg/Giveme5W1H
Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did...