ahmetaa/zemberek-nlp

NLP tools for Turkish.

50
/ 100
Established

This project offers a suite of tools for processing and understanding the Turkish language. You can input raw Turkish text and perform tasks like breaking it into words, checking for spelling errors, generating word forms, or identifying specific entities like names. It's designed for anyone needing to programmatically analyze, clean, or classify Turkish text data, such as data scientists, computational linguists, or market researchers.

1,314 stars. No commits in the last 6 months.

Use this if you need to build applications or conduct research that requires detailed programmatic analysis of Turkish text, including morphology, tokenization, or classification.

Not ideal if you need an out-of-the-box Named Entity Recognition (NER) model or if your application requires robust multi-threaded processing.

Turkish-language-processing text-analysis spell-checking information-extraction text-classification
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 24 / 25

How are scores calculated?

Stars

1,314

Forks

224

Language

Java

License

Last pushed

Feb 26, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/ahmetaa/zemberek-nlp"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.