madrugado/gia-corpus
Corpus of exam tests for 9-graders in Russia for NLP/ML purposes
This project provides a collection of Russian 9th-grade exam questions and answers, formatted for easy use. It takes raw text from official exam PDFs and provides a cleaned, structured version. Educators, researchers studying educational trends, or anyone interested in natural language processing with Russian educational content would find this useful.
No commits in the last 6 months.
Use this if you need structured, publicly available Russian 9th-grade exam data for educational research, language analysis, or developing teaching tools.
Not ideal if you need a comprehensive, up-to-date collection of all Russian educational materials or exams beyond 9th-grade multiple choice questions.
Stars
8
Forks
1
Language
—
License
—
Category
Last pushed
Feb 25, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/madrugado/gia-corpus"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Helsinki-NLP/OpusFilter
OpusFilter - Parallel corpus processing toolkit
natasha/corus
Links to Russian corpora + Python functions for loading and parsing
SergeyShk/ruTS
Библиотека для извлечения статистик из текстов на русском языке.
darija-open-dataset/dataset
darija <-> english dataset
omicsNLP/Auto-CORPus
Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London...