seanghay/khmer-neural-segmenter
Khmer Neural Segmenter
This library helps anyone working with Khmer language text to break down sentences into individual words quickly and accurately. You provide a block of Khmer text, and it outputs a list of separate words, making it easier to analyze, search, or process the language. It's designed for linguists, data analysts, or software developers building applications that need to understand Khmer text.
Use this if you need to programmatically separate continuous Khmer text into its constituent words for further processing or analysis.
Not ideal if you need a tool that handles complex grammatical parsing or semantic understanding beyond basic word segmentation.
Stars
12
Forks
—
Language
C
License
MIT
Category
Last pushed
Feb 05, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/seanghay/khmer-neural-segmenter"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
VietHoang1512/khmer-nltk
Khmer language processing toolkit
PyThaiNLP/attacut
A Fast and Accurate Neural Thai Word Segmenter
UlugbekSalaev/UzTransliterator
UzTransliterator | State-of-the-art machine transliteration tool for Uzbek language
seanghay/KhmerOCR
A Fast Khmer Optical Character Recognition (KhmerOCR)
seanghay/khmerphonemizer
A Free, Standalone and Open-Source Khmer Grapheme-to-Phonemes.