anaistack/cefr-asag-corpus

A corpus of short answers written by learners of English and graded with CEFR levels

Experimental

This dataset provides short English answers from non-native speakers, each linked to a specific language proficiency level defined by the Common European Framework of Reference for Languages (CEFR). Some answers also include CEFR levels assigned by certified examiners. It's designed for researchers, language educators, and computational linguists studying second language acquisition and automated assessment.

No commits in the last 6 months.

Use this if you are developing or evaluating systems for automatically grading English proficiency from short written responses, or for research into language learner errors at different CEFR levels.

Not ideal if you need a corpus of long-form essays or spoken language, or if you require proficiency grading outside of the CEFR framework.
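
For the automatic-grading use case, one common way to compare a system's predicted CEFR levels with reference levels is exact agreement plus "adjacent" agreement (off by at most one level). The following is a minimal sketch under stated assumptions: the function name `cefr_agreement` and the sample labels are hypothetical and not taken from this corpus, whose file format and label fields are not described here.

```python
# CEFR levels in ascending order of proficiency.
CEFR_LEVELS = ["A1", "A2", "B1", "B2", "C1", "C2"]
LEVEL_INDEX = {level: i for i, level in enumerate(CEFR_LEVELS)}


def cefr_agreement(gold, predicted):
    """Return (exact, adjacent) agreement rates between two label lists.

    exact: share of answers where the predicted level equals the gold level.
    adjacent: share of answers where the levels differ by at most one step.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    if not gold:
        raise ValueError("need at least one label pair")
    distances = [
        abs(LEVEL_INDEX[g] - LEVEL_INDEX[p]) for g, p in zip(gold, predicted)
    ]
    exact = sum(d == 0 for d in distances) / len(distances)
    adjacent = sum(d <= 1 for d in distances) / len(distances)
    return exact, adjacent


# Made-up labels for illustration only; not drawn from the corpus.
gold = ["A2", "B1", "B1", "C1"]
predicted = ["A2", "B2", "A1", "C1"]
exact, adjacent = cefr_agreement(gold, predicted)
print(f"exact={exact:.2f} adjacent={adjacent:.2f}")
```

Adjacent agreement is reported alongside exact agreement because CEFR levels are ordered, so a B1 answer graded as B2 is a smaller error than one graded as A1.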

language-assessment english-learning cefr-grading educational-technology applied-linguistics

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 0 / 25

Higher-rated alternatives

nityansuman/marvin

Web app to automatically generate subjective or an objective test and evaluate user responses...

shibing624/judger

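An automated essay scoring tool that supports intelligent scoring of Chinese and English essays, self-training of scoring models, WEKA for processing model data, and custom scoring algorithms. Developed in Java.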

shubhpawar/Automated-Essay-Scoring

Automated Essay Scoring on The Hewlett Foundation dataset on Kaggle

antrixsh/trusteval

Enterprise LLM Evaluation & Responsible AI Framework — Benchmark bias, hallucination, PII...

samiali12/debateai-server

A FastAPI-powered backend that manages structured debates, analyzes arguments, and generates...
