nu-dialogue/real-persona-chat
RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalities
This project provides a dataset of nearly 14,000 casual conversations in Japanese. Each dialogue comes with the speakers' own personality traits and characteristics, along with demographic information and chat experience, so you can train or analyze models on realistic conversational data. It is designed for researchers and AI developers building more human-like conversational AI systems.
No commits in the last 6 months.
Use this if you need a rich, realistic dataset of Japanese dialogues with detailed speaker profiles for developing or evaluating conversational AI.
Not ideal if you are looking for English dialogue data or conversational data without specific speaker personality traits.
Stars: 63
Forks: —
Language: —
License: CC-BY-SA-4.0
Category: (none listed)
Last pushed: Mar 13, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/nu-dialogue/real-persona-chat"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
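The same request can be made from a script. The following is a minimal sketch in Python using only the standard library, not an official client. It assumes the URL path follows a category/owner/repository pattern, which is inferred from the single curl example above, and it returns the response body as raw text because the response format is not documented here. The names build_quality_url and fetch_quality are hypothetical helpers introduced for illustration.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base of the endpoint shown in the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL from its path segments.

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    one documented example; other categories may not follow this pattern.
    """
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the listing and return the body as text.

    The response format is not stated in the listing, so nothing is parsed
    here; the caller decides how to interpret the returned string.
    """
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling fetch_quality("nlp", "nu-dialogue", "real-persona-chat") issues the same GET request as the curl command. Each call counts against the daily request limit, so cache the result if you need it more than once.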
Higher-rated alternatives
gunthercox/chatterbot-corpus: A multilingual dialog corpus
EdinburghNLP/awesome-hallucination-detection: List of papers on hallucination detection in LLMs.
jfainberg/self_dialogue_corpus: The Self-dialogue Corpus - a collection of self-dialogues across music, movies and sports
jkkummerfeld/irc-disentanglement: Dataset and model for disentangling chat on IRC
Tomiinek/MultiWOZ_Evaluation: Unified MultiWOZ evaluation scripts for the context-to-response task.