selimfirat/bilkent-turkish-writings-dataset

Compilation of Turkish writings dataset that promotes creativity, content, composition, grammar, spelling and punctuation.

35
/ 100
Emerging

This is a collection of over 9,000 Turkish creative writing samples from university students, gathered from courses focused on developing composition, grammar, spelling, and punctuation. It provides raw text entries, often with instructor feedback, to help researchers and educators analyze Turkish language development and creative expression. The dataset is ideal for linguists, educational researchers, or anyone studying Turkish natural language processing.

No commits in the last 6 months.

Use this if you need a large, categorized dataset of real-world Turkish student writings for linguistic analysis, educational research, or developing AI models for Turkish text.

Not ideal if you need informal Turkish text, conversational data, or a dataset for commercial use, as it's specifically for academic purposes.

Turkish language studies creative writing linguistic research educational assessment natural language processing
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 9 / 25

How are scores calculated?

Stars

54

Forks

4

Language

Python

License

Last pushed

May 26, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/selimfirat/bilkent-turkish-writings-dataset"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.