anaistack/cefr-asag-corpus
A corpus of short answers written by learners of English and graded with CEFR levels
This dataset provides short English answers from non-native speakers, each linked to a specific language proficiency level defined by the Common European Framework of Reference for Languages (CEFR). Some answers also include CEFR levels assigned by certified examiners. It's designed for researchers, language educators, and computational linguists studying second language acquisition and automated assessment.
No commits in the last 6 months.
Use this if you are developing or evaluating systems for automatically grading English proficiency from short written responses, or for research into language learner errors at different CEFR levels.
Not ideal if you need a corpus of long-form essays or spoken language, or if you require proficiency grading outside of the CEFR framework.
Stars
12
Forks
—
Language
—
License
—
Category
Last pushed
Dec 17, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/anaistack/cefr-asag-corpus"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
nityansuman/marvin
Web app to automatically generate subjective or an objective test and evaluate user responses...
shibing624/judger
自动作文评分工具,支持中文、英文作文智能评分,支持评分模型自训练,支持WEKA处理模型数据,支持自定义评分算法。java开发。
shubhpawar/Automated-Essay-Scoring
Automated Essay Scoring on The Hewlett Foundation dataset on Kaggle
antrixsh/trusteval
Enterprise LLM Evaluation & Responsible AI Framework — Benchmark bias, hallucination, PII...
samiali12/debateai-server
A FastAPI-powered backend that manages structured debates, analyzes arguments, and generates...