AliOsm/shakkelha
Neural Arabic text diacritization
This project helps convert plain Arabic text, which often lacks short vowels (diacritics), into fully vocalized text. It takes unvocalized Arabic text as input and adds the correct diacritics, producing text that is easier to read and process accurately. This tool is for anyone working with Arabic text who needs to ensure proper pronunciation, grammar, and meaning, such as linguists, Arabic language educators, or researchers in computational linguistics.
No commits in the last 6 months.
Use this if you need to automatically add diacritics to large volumes of Arabic text for better readability, linguistic analysis, or as a preprocessing step for other natural language processing tasks.
Not ideal if you need to manually review and fine-tune every diacritic placement, as this is an automated tool.
Stars
94
Forks
15
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Mar 24, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/AliOsm/shakkelha"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
linuxscout/mishkal
Mishkal is an arabic text vocalization software
hb20007/greek-dialect-classifier
Classifier that identifies Greek text as Cypriot Greek or Standard Modern Greek
AliOsm/arabic-text-diacritization
Benchmark Arabic text diacritization dataset
mush42/libtashkeel
Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models
BasmaElhoseny01/Tashkeel
A system that takes a sentence and produces the same sentence after restoring the missing diacritics.