alstat/Yunir.jl
A lightweight Arabic NLP toolkit
This toolkit helps researchers and linguists process Arabic text by cleaning and standardizing it. You input raw Arabic text and get back versions that are easier to analyze or use in computational models, such as text with diacritics removed or different transliteration styles. It's designed for anyone working with Arabic language data who needs to prepare it for further study or application.
Use this if you need to standardize, clean, or transform Arabic text for linguistic analysis, machine learning, or other text processing tasks.
Not ideal if you require advanced deep learning models for Arabic NLP or a comprehensive suite of tools beyond basic text manipulation.
Stars
8
Forks
1
Language
Julia
License
MIT
Category
Last pushed
Jan 02, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/alstat/Yunir.jl"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
nert-nlp/streusle
STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)
bretttolbert/verbecc
Verbe Complete Conjugator (verbecc) supports Catalan, Spanish, French, Italian, Portuguese and...
natasha/yargy
Rule-based facts extraction for Russian language
bjascob/LemmInflect
A python module for English lemmatization and inflection.
google-research/turkish-morphology
A two-level morphological analyzer for Turkish.