Question-Answering Systems NLP Tools

Datasets, benchmarks, and frameworks for building question answering systems across task types (open-domain, reading comprehension, commonsense, multilingual). Does NOT include general machine translation, information retrieval, or dialogue systems.

74 question answering tools are tracked. Three score above 50 (Established tier). The highest-rated is PaddlePaddle/RocketQA at 58/100 with 785 stars.

Get all 74 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=question-answering-systems&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
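
The same request can be made from Python. The sketch below uses only the standard library and reuses the URL and query parameters from the curl example. The function names `build_url` and `fetch_projects` are hypothetical helpers, not part of any published client. The response schema is not documented here, so the payload is returned as-is. Note that the example request sets limit=20; whether a larger limit returns all 74 projects in one call is an assumption that would need checking against the API.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="nlp", subcategory="question-answering-systems", limit=20):
    """Return the request URL with the same query parameters as the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Fetch the URL and return the decoded JSON payload, whatever its shape.

    The structure of the payload is not described on this page, so no
    fields are assumed here.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects(build_url())` performs the keyless request, which counts against the 100 requests/day allowance mentioned above.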

# Tool Score Tier
1 PaddlePaddle/RocketQA

🚀 RocketQA, dense retrieval for information retrieval and question...

58
Established
2 shuaihuaiyi/QA

A Chinese question answering system implemented with deep learning algorithms

51
Established
3 allenai/deep_qa

A deep NLP library, based on Keras / tf, focused on question answering (but...

51
Established
4 worldbank/iQual

iQual is a package that leverages natural language processing to scale up...

48
Emerging
5 seriousran/awesome-qa

😎 A curated list of the Question Answering (QA)

47
Emerging
6 fhamborg/Giveme5W1H

Extraction of the journalistic five W and one H questions (5W1H) from news...

47
Emerging
7 mandarjoshi90/triviaqa

Code for the TriviaQA reading comprehension dataset

45
Emerging
8 programmer290399/pyqna

A simple python package for question answering.

45
Emerging
9 huggingface/node-question-answering

Fast and production-ready question answering in Node.js

44
Emerging
10 21han/nlp_qa_project

Natural Language Processing Question Answering Final Project

44
Emerging
11 TheHamkerCat/python-arq

Asynchronous Python Wrapper For A.R.Q API.

44
Emerging
12 UKP-SQuARE/square-core

SQuARE: Software for question answering research.

42
Emerging
13 TmaxEdu/KorDPR

This repo implements "Dense Passage Retrieval for Open-Domain Question...

42
Emerging
14 Karthik-Bhaskar/Context-Based-Question-Answering

Context-Based-Question-Answering

42
Emerging
15 seominjoon/denspi

Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index (DenSPI)

42
Emerging
16 allenai/semanticilp

Question Answering as Global Reasoning over Semantic Abstractions (AAAI-18)

41
Emerging
17 dice-group/TeBaQA

A question answering system which utilises machine learning.

41
Emerging
18 CogComp/multirc

Reasoning over Multiple Sentences (Multi-RC)

40
Emerging
19 apple/ml-mkqa

We introduce MKQA, an open-domain question answering evaluation set...

40
Emerging
20 BDBC-KG-NLP/QA-Survey-CN

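The NLP research team at Beihang University's Advanced Innovation Center for Big Data summarizes its research on and applications of intelligent question answering, including knowledge-graph-based QA (KBQA), text-based QA systems (TextQA)...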

40
Emerging
21 ARBML/qawafi

Platform for Arabic Poetry Analysis using knowledge-based and deep learning...

40
Emerging
22 neuml/tldrstory

📊 Semantic search for headlines and story text

40
Emerging
23 anassinator/markov-sentence-correction

Markov Chains and Hidden Markov Models to generate and correct sentences

37
Emerging
24 soco-ai/SF-QA

Evaluation framework for open-domain question answering.

37
Emerging
25 IBM/sciqa-arcade198-dataset

ARCADE198 Dataset from the ACL 2018 MRQA Workshop

37
Emerging
26 Chia-Hsuan-Lee/KaggleDBQA

Introduction page of a challenging text-to-SQL dataset: KaggleDBQA

37
Emerging
27 AskNowQA/QA-Tutorial

The repo contains all the materials related to Question Answering.

34
Emerging
28 stanford-oval/schema2qa

Schema2QA Question Answering Dataset

34
Emerging
29 Mukhopadhyay/Amazon_QnA_Dataset

Amazon question/answer dataset.

33
Emerging
30 nhatsmrt/wiki-dpr

A simple tool to retrieve relevant Wikipedia passages

32
Emerging
31 siddharthkhincha/Inter-IIT-11-Devrev

IIT Guwahati's Gold Medal winning solution to DevRev’s Expert Answers in a...

32
Emerging
32 RDTvlokip/InfiniQA

The Official InfiniQA Dataset 📁📝

32
Emerging
33 google-research-datasets/query-wellformedness

25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with...

30
Emerging
34 hasanhuz/MentalQA

MentalQA: An Annotated Arabic Corpus for Questions and Answers of Mental Healthcare

30
Emerging
35 sriyavasudevan/Question-Answering-System

We built a Question Answer System using BERT. Based on our benchmark dataset...

30
Emerging
36 I-QA-UCT/IQA

Extensions to Yuan et al. QAit task.

30
Emerging
37 scruel/campusQA

A first small question answering AI built with the Deeplearning4J framework

30
Emerging
38 pln-fing-udelar/newsqa-es

Code to rebuild the NewsQA-es dataset: a Spanish version of the NewsQA dataset

30
Emerging
39 christianbitter/QA_and_QG

An inventory of data sets around Question Generation and Question Answering

29
Experimental
40 Chia-Hsuan-Lee/ODSQA

ODSQA: Open-Domain Spoken Question Answering Dataset

28
Experimental
41 di37/question-answering-api-llm

Question Answering System API based on all of the Harry Potter Books that...

28
Experimental
42 boostcampaitech3/level2-mrc-level2-nlp-09

Naver Boostcamp | Open-Domain Question Answering (ODQA)

28
Experimental
43 impyadav/QA-Annotator

A flask based web app for Question-Answering (Natural Language Processing)...

28
Experimental
44 boostcampaitech3/final-project-level3-nlp-09

Naver Boostcamp | Closed-Domain Question Answering (CDQA) using meeting minutes

27
Experimental
45 ZhiyunLab/CsQA

CommonsenseQA

26
Experimental
46 boostcampaitech2/mrc-level2-nlp-09

[2nd] KLUE Open-Domain Question Answering

26
Experimental
47 youngerous/Open-domain-QA

Presentation slides of ODQA

26
Experimental
48 louisowen6/quora_paraphrasing_id

Quora Paraphrasing Dataset Bahasa Indonesia Version

25
Experimental
49 spapicchio/QATCH

Official implementation of QATCH: Benchmarking SQL-centric tasks with Table...

25
Experimental
50 orsiluk/Answer-Ranking

Model to find relevant answers to questions on CQA (Community Question...

23
Experimental
51 AkariAsai/unanswerable_qa

The official implementation for ACL 2021 "Challenges in Information Seeking...

23
Experimental
52 Sparshjain25/SQuAD-2.0

NLP Team 12

22
Experimental
53 Dalia-Mahmoud-ElSayes/Gp-2022-ma3aref-Arabic-QA

Our graduation project: "Ma'aref", an Arabic question answering system for the Quran and fatwas.

22
Experimental
54 SockAndSandal/semantic-search-qa

Code for the Semantic Search QA Algorithm

22
Experimental
55 Wikidepia/SQuAD-id

Stanford Question Answering Dataset translated into Indonesian.

22
Experimental
56 gsh199449/productqa

Product-Aware Answer Generation in E-Commerce Question-Answering

21
Experimental
57 santhoshtr/wq

An experimental natural language based querying system for Wikipedia

21
Experimental
58 motazsaad/Quran-QA

Quran QA

21
Experimental
59 asaparov/fictionalgeoqa

Question-answering dataset to evaluate reasoning ability over short paragraphs.

20
Experimental
60 aklein4/ASKiT

Stanford CS224N Final Project. A text-based multi-hop reasoning...

19
Experimental
61 svjack/tableQA-Chinese

Unsupervised tableQA and databaseQA on Chinese finance questions and tabular data

19
Experimental
62 ASoleimaniB/NLQuAD

NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021

19
Experimental
63 reddrex/lingcomp_QA

A Spanish computational linguistics QA corpus (JSON format) with 1004 rows

19
Experimental
64 donderom/sqwat

TUI editor for the Stanford Question Answering Dataset (SQuAD) 💬

19
Experimental
65 mkearney/infoquality

Information Quality

18
Experimental
66 felixgiov/UDST-DurationQA

Dataset from the paper "Improving Event Duration Question Answering by...

17
Experimental
67 GUT-AI/qa

Question Answering (QA)

17
Experimental
68 muhammedshihab1001/quora-duplicate-question-detection

Detect duplicate questions using NLP techniques including TF-IDF + Logistic...

13
Experimental
69 aditi184/MultilingualQA

Chaii (Challenge in AI for India) Multilingual QnA - Google Research India

12
Experimental
70 vaibagga/BERT_QnA

QnA system using BERT

11
Experimental
71 blaze7451/Project-JaQUAd-QA-System

Extractive QA system using JaQUAd dataset

11
Experimental
72 lucadiliello/asnq-challenging

ASNQ without trivial negative answers.

10
Experimental
73 Wadaboa/squad-question-answering

Question answering on the SQuAD dataset, for NLP class at UNIBO

10
Experimental
74 TuozhenLiu/Chaii-QA

chaii - Hindi and Tamil Question Answering

10
Experimental
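
The tier labels in the listing above appear to follow score bands. The page states that scores above 50 are Established. The other cutoff used below is only inferred from the rows shown: the lowest Emerging score is 30 and the highest Experimental score is 29. The sketch treats those observations as thresholds. The function name `tier_for_score` is hypothetical, and how the scoring service labels a score of exactly 50 is not visible in this data, so its placement in Emerging here is an assumption.

```python
def tier_for_score(score: int) -> str:
    """Map a 0-100 score to a tier label using cutoffs inferred from the listing.

    Assumptions (not confirmed by the source):
    - "above 50" is read literally, so 50 itself is not Established.
    - Emerging starts at 30, the lowest Emerging score in the listing.
    """
    if score > 50:
        return "Established"
    if score >= 30:
        return "Emerging"
    return "Experimental"


# Spot checks against rows in the listing.
observed = {58: "Established", 51: "Established", 48: "Emerging",
            30: "Emerging", 29: "Experimental", 10: "Experimental"}
mismatches = {s: t for s, t in observed.items() if tier_for_score(s) != t}
```

With these cutoffs `mismatches` is empty for the sampled rows, which shows only that the bands are consistent with this page, not that they match the scoring service's actual rule.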