VietHoang1512/khmer-nltk
Khmer language processing toolkit
This toolkit helps you analyze Khmer text by breaking sentences into individual words and identifying the grammatical role of each word (such as noun, verb, or adjective). It takes raw Khmer text as input and returns the segmented words with their part-of-speech tags, which makes large volumes of Khmer content easier to process. It is designed for linguists, researchers, and anyone working with digital Khmer text who needs basic text analysis.
Use this if you need to programmatically segment Khmer sentences and words or assign parts of speech for further linguistic analysis.
Not ideal if you are looking for advanced features like named entity recognition or text classification out-of-the-box.
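As a rough illustration of the segment-then-tag workflow described above, here is a minimal Python sketch. It assumes the package is installed with `pip install khmer-nltk` and that the `khmernltk` module exposes `sentence_tokenize` and `pos_tag`, where `pos_tag` takes a raw string and returns `(word, tag)` pairs; check these names and signatures against the project's README. The `analyze` helper and its optional parameters are hypothetical and are not part of the library.

```python
def analyze(text, sentence_tokenize=None, pos_tag=None):
    """Split text into sentences, then tag every word in each sentence."""
    if sentence_tokenize is None or pos_tag is None:
        # Assumption: module name and function names as described in the lead-in.
        import khmernltk

        sentence_tokenize = sentence_tokenize or khmernltk.sentence_tokenize
        pos_tag = pos_tag or khmernltk.pos_tag

    # Result: one list of (word, tag) pairs per sentence.
    return [pos_tag(sentence) for sentence in sentence_tokenize(text)]
```

With the library installed, `analyze("ខ្ញុំស្រឡាញ់ភាសាខ្មែរ")` should return one list of tagged words per detected sentence. The optional parameters exist only so a different segmenter or tagger can be swapped in, or the helper exercised without the library.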
Stars: 81
Forks: 19
Language: Python
License: Apache-2.0
Category:
Last pushed: Mar 17, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/VietHoang1512/khmer-nltk"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
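The same request can be made from Python using only the standard library. This is a sketch under two assumptions that the page does not confirm: that the endpoint returns JSON, and that the URL path follows the pattern `quality/<category>/<owner>/<repo>`, as inferred from the single example above. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, repo):
    """Build the endpoint URL, e.g. category='nlp', repo='owner/name'."""
    return f"{BASE_URL}/{category}/{repo}"


def fetch_quality(category, repo, timeout=10):
    """Fetch the record for one repository (assumes a JSON response body)."""
    with urllib.request.urlopen(quality_url(category, repo), timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_quality("nlp", "VietHoang1512/khmer-nltk")` issues the same GET request as the curl command; unauthenticated calls count against the 100 requests/day limit.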
Related tools
PyThaiNLP/attacut
A Fast and Accurate Neural Thai Word Segmenter
UlugbekSalaev/UzTransliterator
UzTransliterator | State-of-the-art machine transliteration tool for Uzbek language
seanghay/KhmerOCR
A Fast Khmer Optical Character Recognition (KhmerOCR)
seanghay/khmerphonemizer
A Free, Standalone and Open-Source Khmer Grapheme-to-Phonemes.
ionite34/Aquila-Resolve
Augmented Recurrent Neural Grapheme-to-Phoneme conversion with Inflectional Orthography.