ahmetaa/zemberek-nlp
NLP tools for Turkish.
This project offers a suite of tools for processing and understanding the Turkish language. You can input raw Turkish text and perform tasks like breaking it into words, checking for spelling errors, generating word forms, or identifying specific entities like names. It's designed for anyone needing to programmatically analyze, clean, or classify Turkish text data, such as data scientists, computational linguists, or market researchers.
1,314 stars. No commits in the last 6 months.
Use this if you need to build applications or conduct research that requires detailed programmatic analysis of Turkish text, including morphology, tokenization, or classification.
Not ideal if you need an out-of-the-box Named Entity Recognition (NER) model or if your application requires robust multi-threaded processing.
Stars
1,314
Forks
224
Language
Java
License
—
Category
Last pushed
Feb 26, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/ahmetaa/zemberek-nlp"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
hltcoe/turkle
Django-based clone of Amazon's Mechanical Turk service running in your local environment.
emres/turkish-deasciifier
Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs
brolin59/trnlp
TÜRKÇE İÇİN DOĞAL DİL İŞLEME ARAÇLARI
ooguz/turkce-kufur-karaliste
Türkçe için bir kara liste (blacklist)
obulat/zeyrek
Python morphological analyzer for Turkish language. Partial port of ZemberekNLP.