dogterbox/thai-word-segmentation
Thai word segmentation using deep learning
This project uses deep learning to split Thai sentences and paragraphs into individual words. It takes raw Thai text as input and outputs the same text with explicit word boundaries. Because written Thai does not separate words with spaces, this step is a prerequisite for tasks such as search, analysis, and translation. The tool is aimed at linguists, researchers, content managers, and anyone else who needs to process unstructured Thai text.
No commits in the last 6 months.
Use this if you need to reliably segment Thai text into its constituent words to enable further linguistic analysis or information retrieval.
Not ideal if you are working with languages other than Thai or require advanced natural language understanding beyond basic word segmentation.
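To make the input and output of word segmentation concrete, here is a minimal sketch in Python. It is not this project's method: the repository uses a deep learning model, whereas this sketch swaps in a simple dictionary-based longest-match rule. The function name `segment_longest_match` and the tiny vocabulary are hypothetical and exist only for illustration.

```python
# Illustrative only: dictionary-based longest matching, NOT the deep
# learning model used by this repository. Names and vocabulary are made up.

def segment_longest_match(text, vocab):
    """Split text greedily, always taking the longest vocabulary word
    that starts at the current position. Characters that match no
    vocabulary word are emitted one at a time."""
    max_len = max((len(word) for word in vocab), default=1)
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink down to one character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in vocab:
                words.append(candidate)
                i += size
                break
    return words


# Toy vocabulary: "I", "eat", "rice".
VOCAB = {"ฉัน", "กิน", "ข้าว"}

if __name__ == "__main__":
    # Raw Thai text has no spaces between words.
    print(" | ".join(segment_longest_match("ฉันกินข้าว", VOCAB)))
```

Longest matching fails on ambiguous strings and on words missing from the dictionary, which is one reason learned models are used for this task.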
Stars: 14
Forks: 2
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Jul 01, 2019
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/dogterbox/thai-word-segmentation"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
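The same request can be made from Python with only the standard library. This is a sketch under two assumptions that the page does not confirm: that the URL follows a `category/owner/repo` pattern (inferred from the single curl example), and that the endpoint returns JSON. The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL. The category/owner/repo layout is an
    assumption inferred from the one example URL shown on the page."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the record and parse it as JSON (assumed response format)."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request; counts against the daily quota.
    data = fetch_quality("nlp", "dogterbox", "thai-word-segmentation")
    print(json.dumps(data, indent=2, ensure_ascii=False))
```

Keeping URL construction separate from the network call makes the former easy to check without spending any of the 100 keyless requests per day.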
Higher-rated alternatives
PyThaiNLP/pythainlp
Thai natural language processing in Python
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named...
jacksonllee/pycantonese
Cantonese Linguistics and NLP
dongrixinyu/JioNLP
A Chinese NLP preprocessing and parsing package: accurate, efficient, and easy to use. www.jionlp.com
hankcs/pyhanlp
Chinese word segmentation