kuzgnlar/datasets
Turkish NER, Question-Answer and Sentence datasets
This project provides curated datasets for training AI models on Turkish language processing tasks. It offers collections of named entities (such as people, places, and organizations), question-answer pairs, and general sentences. Data scientists, NLP engineers, and researchers working with Turkish text can use these datasets to develop or improve natural language understanding applications.
No commits in the last 6 months.
Use this if you are building or fine-tuning AI models that need to understand or generate Turkish text, such as chatbots, information extractors, or search engines.
Not ideal if you are looking for ready-to-use AI models or tools for Turkish text analysis, rather than raw data for training them.
Stars: 16
Forks: not listed
Language: not listed
License: GPL-3.0
Category: not listed
Last pushed: Sep 26, 2020
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/kuzgnlar/datasets"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
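The same request can be made from Python. The following is a minimal sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical. The category/owner/repo path pattern is inferred from the single URL shown in the curl command. The response is assumed to be JSON, which the listing does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command in the listing.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    The category/owner/repo layout is an assumption generalized from
    the one example URL in the listing.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and decode the body.

    Assumption: the endpoint returns JSON. If it does not,
    json.loads raises a ValueError.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(build_quality_url("nlp", "kuzgnlar", "datasets"))
```

Calling `fetch_quality("nlp", "kuzgnlar", "datasets")` performs the network request and counts against the daily limit, so it is kept out of the default run.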
Higher-rated alternatives
hltcoe/turkle
Django-based clone of Amazon's Mechanical Turk service running in your local environment.
emres/turkish-deasciifier
Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs
ahmetaa/zemberek-nlp
NLP tools for Turkish.
ooguz/turkce-kufur-karaliste
A blacklist for Turkish
brolin59/trnlp
Natural language processing tools for Turkish