nipponjo/arabic_vocalizer
Deep-learning-based Arabic diacritization models (Shakkala, Shakkelha) in ONNX format.
Automatically adds diacritics ("tashkeel", such as short vowel marks) to unvocalized Arabic text. You provide Arabic text without these marks, and it returns the same text with the model's predicted diacritics applied. This is useful for linguists, educators, content creators, or anyone working with Arabic text who needs vocalization for readability, pronunciation, or language learning.
No commits in the last 6 months.
Use this if you need to quickly and accurately add Arabic diacritics to large volumes of unvocalized text for various applications.
Not ideal if you require manual, nuanced control over every diacritic choice or are working with highly specialized or poetic texts that might require human interpretation.
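To make the input/output relationship in the description above concrete without assuming anything about this repository's own API, here is a minimal sketch of the inverse operation: stripping tashkeel from vocalized text to obtain the kind of unvocalized input a diacritization model expects. The function name `strip_tashkeel` is hypothetical and not taken from the project. The sketch only covers the core marks in the Unicode range U+064B to U+0652 (tanwin, short vowels, shadda, sukun).

```python
import re

# Core Arabic diacritics: tanwin, short vowels, shadda and sukun
# occupy the contiguous Unicode range U+064B..U+0652.
TASHKEEL = re.compile("[\u064b-\u0652]")


def strip_tashkeel(text: str) -> str:
    """Remove core tashkeel marks, leaving base letters untouched.

    Hypothetical helper for illustration; not part of arabic_vocalizer.
    """
    return TASHKEEL.sub("", text)


# "kataba" written with a fatha after each of its three letters.
vocalized = "\u0643\u064e\u062a\u064e\u0628\u064e"
unvocalized = strip_tashkeel(vocalized)

# Six code points go in (3 letters + 3 marks), three letters come out.
lengths = (len(vocalized), len(unvocalized))
```

A diacritization model performs the reverse mapping, which is the hard direction: the same unvocalized word can correspond to several valid vocalizations depending on context.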
Stars: 12
Forks: -
Language: Python
License: MIT
Category:
Last pushed: Apr 21, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/nipponjo/arabic_vocalizer"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
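The same request can be made from Python using only the standard library. This is a sketch under two assumptions that the listing does not confirm: that the endpoint returns JSON, and that the path follows the pattern `/quality/<category>/<owner>/<repo>` seen in the single curl example above. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is assumed
# to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    segments = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(s, safe="") for s in segments)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response (assumed, not confirmed, to be JSON)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_quality("nlp", "nipponjo", "arabic_vocalizer")` would issue the same GET request as the curl command. Since the keyless tier is limited to 100 requests/day, it may be worth caching the result rather than refetching.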
Higher-rated alternatives
linuxscout/mishkal
Mishkal is Arabic text vocalization software
hb20007/greek-dialect-classifier
Classifier that identifies Greek text as Cypriot Greek or Standard Modern Greek
AliOsm/arabic-text-diacritization
Benchmark Arabic text diacritization dataset
mush42/libtashkeel
Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models
AliOsm/shakkelha
Neural Arabic text diacritization