Task-Oriented Dialogue Systems NLP Tools

Datasets, frameworks, and evaluation tools for building goal-oriented conversational agents (e.g., task completion, dialogue state tracking, multi-domain dialogue). Does NOT include open-domain chitchat, emotion/sentiment analysis in dialogue, or general conversational AI without task-specific goals.

There are 45 task-oriented dialogue systems tools tracked. 1 score above 70 (verified tier). The highest-rated is gunthercox/chatterbot-corpus at 74/100 with 1,411 stars. 2 of the top 10 are actively maintained.

Get all 45 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=task-oriented-dialogue-systems&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 gunthercox/chatterbot-corpus

A multilingual dialog corpus

74
Verified
2 EdinburghNLP/awesome-hallucination-detection

List of papers on hallucination detection in LLMs.

56
Established
3 jfainberg/self_dialogue_corpus

The Self-dialogue Corpus - a collection of self-dialogues across music,...

45
Emerging
4 jkkummerfeld/irc-disentanglement

Dataset and model for disentangling chat on IRC

43
Emerging
5 Tomiinek/MultiWOZ_Evaluation

Unified MultiWOZ evaluation scripts for the context-to-response task.

42
Emerging
6 tae898/multimodal-datasets

Multimodal datasets.

40
Emerging
7 IBM/permuted-bAbI-dialog-tasks

Dataset for 'Learning End-to-End Goal-Oriented Dialog with Multiple Answers'...

39
Emerging
8 hsgodhia/hred

Implements the paper " Building End-To-End Dialogue Systems Using Generative...

38
Emerging
9 doheejin/ProTACT

This repository is the implementation of the ProTACT architecture,...

38
Emerging
10 IBM/modified-bAbI-dialog-tasks

'Learning End-to-End Goal-Oriented Dialog with maximal User task success and...

37
Emerging
11 anaistack/ai-teacher-test

Source code and data for the EDM 2022 paper

36
Emerging
12 vaskonov/negochat_corpus

Negochat Corpus - a dialogue corpus in the negotiation domain

36
Emerging
13 Sensente/Security-Attacks-on-LCCTs

Security Attacks on LLM-based Code Completion Tools (AAAI 2025)

35
Emerging
14 LCS2-IIITD/SPARTA_WSDM2022

This repository contains the code and dataset for our paper titled Speaker...

34
Emerging
15 TianboJi/Dialogue-Eval

Code and data for paper "Achieving Reliable Human Assessment of Open-Domain...

33
Emerging
16 chiachienhung/Multi2WOZ

Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for...

32
Emerging
17 yukyunglee/Awesome-Dialogue-State-Tracking

Dialogue State Tracking (DST) Papers, Datasets, Resources 🤩

32
Emerging
18 Reason-Wang/DialogueGLP

Code for our EACL 2023 findings paper "Global-Local Modeling with...

32
Emerging
19 bryanwilie/pick

PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue...

30
Emerging
20 passing2961/EmpGPT-3

Official code for our COLING 2022 paper: In-Context Learning for Empathetic...

30
Emerging
21 GiovanniTRA/UDCG

Code and Data of the paper: "Redefining Retrieval Evaluation in the Era of LLMs"

30
Emerging
22 umass-ml4ed/tiktoc

Official repo for "Test Case-Informed Knowledge Tracing for Open-ended...

30
Emerging
23 zengyan-97/Transformer-DST

A Generative Dialogue State Tracking Model

29
Experimental
24 mrzjy/GenshinDialog

Extracting character conversations in Genshin Project

29
Experimental
25 mrzjy/hoyo_public_wiki_parser

Parsing Hoyoverse game text corpus from public wikipedia

27
Experimental
26 convei-lab/BotsTalk

🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for...

27
Experimental
27 kaustubhdhole/natural-dont-know

Code for the paper: Saying No is An Art: Contextualized Fallback Responses...

26
Experimental
28 abhi1nandy2/yesbut_dataset

YesBut - Multimodal Satire Comprehension Dataset

26
Experimental
29 skywalker023/prosocial-dialog

🐥 Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial...

24
Experimental
30 nu-dialogue/real-persona-chat

RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own...

24
Experimental
31 UEC-InabaLab/KokoroChat

ロールプレイで収集した日本語のカウンセリング対話データセット

23
Experimental
32 M0gician/RaccoonBench

[ACL 2024] Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications

21
Experimental
33 mrzjy/StarrailDialog

A project that extracts Honkai: Star Rail text corpus

21
Experimental
34 mrzjy/ZZZDialog

A project that extracts ZenlessZoneZero text corpus

20
Experimental
35 mrzjy/WutheringDialog

Extracting character conversations in Wuthering Waves

20
Experimental
36 miguel-kjh/Improving-Dialogue-Management

A detailed analysis of task-oriented dialogue systems, emphasizing the...

19
Experimental
37 mrzjy/HonkaiImpact3rdDialog

A project that collects Honkai Impact 3rd text corpus

19
Experimental
38 kimdanny/user-simulation-t5

Official Code for SIGIR 2022 "A Multi-task Based Neural Model to Simulate...

18
Experimental
39 zenquiorra/M3LS

M3LS : Multi-lingual Multi-modal summarization dataset

18
Experimental
40 minsik-ai/PK-ICR

Persona-Knowledge Interactive Multi-Context Retrieval for Grounded Dialogue...

17
Experimental
41 evelynkyl/xRAD_multilingual_dialog_systems

Codes for master's thesis investigating approaches for building a...

17
Experimental
42 TomatenMarc/TACO

This repository contains the annotation framework, dataset and code used for...

10
Experimental
43 stuti-agrawal/Unsupervised-dialogue-schema-induction

Unsupervised approach for inducing dialogue schemas from domain-specific...

10
Experimental
44 BaluHarshavardan99/Hallucination-in-Chat-bots

Hallucination in Chat-bots: Faithful Benchmark for Information-Seeking Dialogue

10
Experimental
45 mrzjy/ArknightsDialog

Extracting character conversations in Arknights

10
Experimental