cdliai/durak
Durak is an open-source modular Turkish NLP preprocessing toolkit
Durak helps researchers, data scientists, and analysts working with Turkish text by cleaning, tokenizing, and preparing it for analysis. It takes raw Turkish text as input and produces processed tokens, allowing for tasks like sentiment analysis, information retrieval, or machine translation. This is for anyone who needs to make sense of large volumes of Turkish language data.
Use this if you need a high-performance, robust toolkit to preprocess Turkish text for natural language processing applications.
Not ideal if your primary language of interest is not Turkish, or if you need a full NLP suite with advanced capabilities like named entity recognition or dependency parsing built-in.
Stars
7
Forks
6
Language
Python
License
MIT
Category
Last pushed
Feb 24, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/cdliai/durak"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hltcoe/turkle
Django-based clone of Amazon's Mechanical Turk service running in your local environment.
emres/turkish-deasciifier
Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs
brolin59/trnlp
TÜRKÇE İÇİN DOĞAL DİL İŞLEME ARAÇLARI
ooguz/turkce-kufur-karaliste
Türkçe için bir kara liste (blacklist)
ahmetaa/zemberek-nlp
NLP tools for Turkish.