dogterbox/thai-word-segmentation

Thai word segmentation using deep learning

31 / 100 (Emerging)

This project segments Thai text into individual words. It takes raw Thai text as input and outputs the same text with explicit word boundaries. Because written Thai does not put spaces between words, this step is a prerequisite for tasks such as search, analysis, and translation. It is aimed at linguists, researchers, content managers, and anyone else who needs to process unstructured Thai text.

No commits in the last 6 months.

Use this if you need to reliably segment Thai text into its constituent words to enable further linguistic analysis or information retrieval.

Not ideal if you are working with languages other than Thai or require advanced natural language understanding beyond basic word segmentation.
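To make the input and output described above concrete, here is a minimal sketch of word segmentation. It is not this project's method: the project uses deep learning, while the sketch swaps in a much simpler greedy longest-match lookup against a tiny hand-written vocabulary. The function name `segment` and the `VOCAB` set are hypothetical and exist only for illustration.

```python
def segment(text, vocab):
    """Greedy longest-match segmentation (illustrative, not the project's model).

    At each position, take the longest substring found in `vocab`;
    if nothing matches, emit a single character and move on.
    """
    max_len = max(map(len, vocab))  # assumes a non-empty vocabulary
    tokens, i = [], 0
    while i < len(text):
        # Try candidate substrings from longest to shortest.
        for j in range(min(len(text), i + max_len), i, -1):
            if text[i:j] in vocab:
                tokens.append(text[i:j])
                i = j
                break
        else:
            # No vocabulary entry starts here: fall back to one character.
            tokens.append(text[i])
            i += 1
    return tokens


# Hypothetical toy vocabulary; a real system needs far broader coverage.
VOCAB = {"ฉัน", "รัก", "ภาษา", "ไทย", "ภาษาไทย"}

print("|".join(segment("ฉันรักภาษาไทย", VOCAB)))  # ฉัน|รัก|ภาษาไทย
```

A dictionary lookup like this fails on unknown words and ambiguous boundaries, which is the kind of case a learned model is meant to handle better.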

Thai-language-processing text-analysis information-retrieval linguistics content-management
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 10 / 25
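The four sub-scores above add up to the headline score. The short check below assumes the overall score is a plain sum of the four categories, each out of 25. The listing does not state the formula, so treat this as an observation about these numbers, not a description of the scoring method.

```python
# Sub-scores copied from the listing; each category is out of 25.
subscores = {"Maintenance": 0, "Adoption": 5, "Maturity": 16, "Community": 10}

# Assumption: the headline score is the unweighted sum of the categories.
total = sum(subscores.values())
maximum = 25 * len(subscores)

print(f"{total} / {maximum}")  # 31 / 100
```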


Stars: 14
Forks: 2
Language: Jupyter Notebook
License: MIT
Last pushed: Jul 01, 2019
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/dogterbox/thai-word-segmentation"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
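The same request can be made from a script. The sketch below uses only the Python standard library and rests on two assumptions not confirmed by the listing: that the endpoint returns JSON, and that the path follows a category/owner/repository pattern inferred from the single URL shown above. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example in the listing.
BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL (path pattern inferred from one example)."""
    return f"{BASE}/{quote(category)}/{quote(owner)}/{quote(repo)}"


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch and decode the response, assuming the body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Prints the same URL used by the curl command; no request is sent here.
    print(quality_url("nlp", "dogterbox", "thai-word-segmentation"))
```

Calling `fetch_quality("nlp", "dogterbox", "thai-word-segmentation")` sends the actual request and counts against the daily limit.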