cdliai/durak

Durak is an open-source, modular preprocessing toolkit for Turkish NLP.

Score: 43 / 100 (Emerging)

Durak helps researchers, data scientists, and analysts who work with Turkish text by cleaning, tokenizing, and preparing it for analysis. It takes raw Turkish text as input and produces processed tokens that can feed downstream tasks such as sentiment analysis, information retrieval, or machine translation. It is aimed at anyone who needs to process large volumes of Turkish-language data.
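To make the "raw text in, processed tokens out" step concrete, here is a minimal sketch in plain Python. It does not use Durak's API, which this page does not document; the names `turkish_lower` and `tokenize` are hypothetical and chosen only for illustration. The sketch shows one reason Turkish needs dedicated preprocessing: the default `str.lower()` maps "I" to "i", whereas Turkish lowercases "I" to the dotless "ı" and "İ" to "i".

```python
import re

# Assumption: this is an illustrative stand-in, NOT Durak's actual API.
# Turkish casing differs from the Unicode default for two letters:
#   "I" (dotless capital) -> "ı",  "İ" (dotted capital) -> "i"
_TR_LOWER = str.maketrans({"I": "ı", "İ": "i"})

# Runs of word characters; punctuation and apostrophes act as separators.
_TOKEN_RE = re.compile(r"\w+")


def turkish_lower(text: str) -> str:
    """Lowercase text using Turkish rules for I/İ, then the default rules."""
    return text.translate(_TR_LOWER).lower()


def tokenize(text: str) -> list[str]:
    """Lowercase with Turkish rules and split into word tokens."""
    return _TOKEN_RE.findall(turkish_lower(text))


tokens = tokenize("İstanbul'da IŞIK yandı.")
print(tokens)  # ['istanbul', 'da', 'ışık', 'yandı']
```

A real toolkit would typically go further (for example, keeping apostrophe-attached suffixes together or removing stopwords); this sketch covers only casing and basic splitting.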

Use this if you need a high-performance, robust toolkit to preprocess Turkish text for natural language processing applications.

Not ideal if your primary language of interest is not Turkish, or if you need a full NLP suite with advanced capabilities like named entity recognition or dependency parsing built-in.

Tags: Turkish language processing, text analysis, data preprocessing, linguistics research
No Package, No Dependents
Maintenance 10 / 25
Adoption 4 / 25
Maturity 13 / 25
Community 16 / 25

Stars: 7
Forks: 6
Language: Python
License: MIT
Last pushed: Feb 24, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/cdliai/durak"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
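The same request can be made from Python with only the standard library. This is a sketch under two assumptions that the page does not confirm: that the endpoint returns JSON, and that the URL pattern is `/quality/<category>/<owner>/<repo>` as the example URL suggests. The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example; the path layout beyond that
# single example is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL, e.g. category="nlp", repo="cdliai/durak"."""
    return f"{BASE_URL}/{category}/{repo}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record and decode it, assuming a JSON response."""
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


# Usage (performs a network request, so it is left commented out):
# data = fetch_quality("nlp", "cdliai/durak")
# print(json.dumps(data, indent=2))
```

Because the response schema is not shown here, the sketch returns the decoded payload as-is rather than reading specific fields. With the keyless limit of 100 requests/day, caching the result locally is worth considering.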