Socret360/joint-khmer-word-segmentation-and-pos-tagging
A Keras implementation of a deep learning network to simultaneously perform Word Segmentation and Part-of-Speech (POS) Tagging introduced by Bouy et al. in the paper Joint Khmer Word Segmentation and Part-of-Speech Tagging Using Deep Learning.
This project helps natural language processing (NLP) researchers and computational linguists analyze Khmer text by simultaneously splitting sentences into individual words and identifying the grammatical role (like noun, verb, or adjective) of each word. You provide raw Khmer text, and it outputs the text with clear word boundaries and part-of-speech tags for each word. It's designed for those building advanced Khmer language understanding systems.
No commits in the last 6 months.
Use this if you need an efficient and accurate way to perform both word segmentation and part-of-speech tagging on Khmer text for research or application development.
Not ideal if you are looking for an out-of-the-box, end-user application for Khmer text analysis without any programming or deep learning knowledge.
Stars
11
Forks
1
Language
Python
License
—
Category
Last pushed
Apr 16, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Socret360/joint-khmer-word-segmentation-and-pos-tagging"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
VietHoang1512/khmer-nltk
Khmer language processing toolkit
PyThaiNLP/attacut
A Fast and Accurate Neural Thai Word Segmenter
UlugbekSalaev/UzTransliterator
UzTransliterator | State-of-the-art machine transliteration tool for Uzbek language
seanghay/KhmerOCR
A Fast Khmer Optical Character Recognition (KhmerOCR)
seanghay/khmerphonemizer
A Free, Standalone and Open-Source Khmer Grapheme-to-Phonemes.