anaistack/ai-teacher-test
Source code and data for the EDM 2022 paper
This project helps researchers and educators evaluate how well AI models, like GPT-3, perform as educational tutors. It takes real student-teacher conversation data and simulated AI responses, then compares them based on how teacher-like, understanding, and helpful the AI is. This tool is for educational researchers, AI ethics reviewers, and curriculum designers who need to objectively measure an AI's pedagogical ability.
No commits in the last 6 months.
Use this if you need a systematic way to test and quantify the teaching effectiveness of AI conversational agents in educational settings.
Not ideal if you are looking for an off-the-shelf AI teaching assistant to deploy directly to students without further research and development.
Stars
12
Forks
4
Language
Python
License
—
Category
Last pushed
May 16, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/anaistack/ai-teacher-test"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
gunthercox/chatterbot-corpus
A multilingual dialog corpus
EdinburghNLP/awesome-hallucination-detection
List of papers on hallucination detection in LLMs.
jfainberg/self_dialogue_corpus
The Self-dialogue Corpus - a collection of self-dialogues across music, movies and sports
jkkummerfeld/irc-disentanglement
Dataset and model for disentangling chat on IRC
Tomiinek/MultiWOZ_Evaluation
Unified MultiWOZ evaluation scripts for the context-to-response task.