anaistack/ai-teacher-test

Source code and data for the EDM 2022 paper

/ 100

Emerging

This project helps researchers and educators evaluate how well AI models, like GPT-3, perform as educational tutors. It takes real student-teacher conversation data and simulated AI responses, then compares them based on how teacher-like, understanding, and helpful the AI is. This tool is for educational researchers, AI ethics reviewers, and curriculum designers who need to objectively measure an AI's pedagogical ability.

No commits in the last 6 months.

Use this if you need a systematic way to test and quantify the teaching effectiveness of AI conversational agents in educational settings.

Not ideal if you are looking for an off-the-shelf AI teaching assistant to deploy directly to students without further research and development.

educational-research AI-ethics pedagogical-evaluation conversational-AI-assessment learning-technologies

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

gunthercox/chatterbot-corpus

A multilingual dialog corpus

EdinburghNLP/awesome-hallucination-detection

List of papers on hallucination detection in LLMs.

jfainberg/self_dialogue_corpus

The Self-dialogue Corpus - a collection of self-dialogues across music, movies and sports

jkkummerfeld/irc-disentanglement

Dataset and model for disentangling chat on IRC

Tomiinek/MultiWOZ_Evaluation

Unified MultiWOZ evaluation scripts for the context-to-response task.

Explore NLP Tools

All categories Trending NLP directory Insights