rkcosmos/deepcut

A Thai word tokenization library using Deep Neural Network

60 / 100 (Established)

This tool splits raw Thai text into individual words, a prerequisite for most analysis of Thai because the written language does not put spaces between words. You pass in a block of Thai text and get back a list of words. It is aimed at anyone working with Thai language data, such as linguists, researchers, and data analysts.
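The text-in, word-list-out behaviour described above can be sketched as follows. This is a minimal illustration, not part of this listing: it assumes the package is installed with `pip install deepcut` and exposes `deepcut.tokenize`, and the wrapper name `segment_thai` and its `tokenizer` parameter are hypothetical.

```python
from typing import Callable, List, Optional


def segment_thai(
    text: str,
    tokenizer: Optional[Callable[[str], List[str]]] = None,
) -> List[str]:
    """Split Thai text into words, dropping whitespace-only tokens."""
    if tokenizer is None:
        # Imported lazily so the heavy neural-network dependency is only
        # loaded when it is actually needed.
        # Assumption: installed via `pip install deepcut`.
        import deepcut

        tokenizer = deepcut.tokenize
    # Keep only tokens that contain something other than whitespace.
    return [token for token in tokenizer(text) if token.strip()]
```

Calling `segment_thai("ตัดคำได้ดีมาก")` would return a list of word strings. Passing a different callable as `tokenizer` lets the same wrapper be exercised without loading the model.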

427 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.

Use this if you need to accurately split Thai sentences and paragraphs into their constituent words for further processing or analysis.

Not ideal if your primary need is for languages other than Thai, as this tool is specifically designed for Thai word segmentation.

Thai-language-processing text-analysis natural-language-processing linguistics data-preparation
Stale 6m
Maintenance 0 / 25
Adoption 11 / 25
Maturity 25 / 25
Community 24 / 25


Stars: 427
Forks: 98
Language: Python
License: MIT
Last pushed: Oct 23, 2020
Commits (30d): 0
Dependencies: 6
Reverse dependents: 1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/rkcosmos/deepcut"

Open to everyone: 100 requests/day with no key. A free key raises the limit to 1,000/day.
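The same request can be made from Python with the standard library. This is a sketch under assumptions: the path is taken to follow a category/owner/repo pattern inferred from the single URL above, the response is assumed to be JSON, and its fields are not documented here. The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record and decode it.

    Assumption: the endpoint returns a JSON body; its schema is not
    described in this listing.
    """
    with urlopen(build_quality_url(category, owner, repo), timeout=timeout) as response:
        return json.load(response)
```

For this project the call would be `fetch_quality("nlp", "rkcosmos", "deepcut")`. Unauthenticated requests count against the 100 requests/day limit.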