mathsyouth/awesome-word-segmentation
A curated list of resources dedicated to word segmentation
A curated list of word segmentation tools for anyone who needs to process Chinese text. Written Chinese does not separate words with spaces, so these tools split sentences into individual words automatically, a prerequisite for tasks such as search, analysis, and translation. The list is aimed at researchers, data analysts, and developers who work with Chinese language data.
No commits in the last 6 months.
Use this if you need to find a tool to automatically identify and separate words within continuous Chinese text.
Not ideal if you are looking for tools to process languages other than Chinese or need a ready-to-use, integrated Natural Language Processing solution rather than a list of options.
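To make concrete what "separating words within continuous Chinese text" involves, here is a minimal sketch of forward maximum matching, one classic dictionary-based approach. It is an illustration only and is not taken from any tool in the list; the function name `forward_max_match`, the `max_len` parameter, and the toy vocabulary are all hypothetical.

```python
def forward_max_match(text, vocab, max_len=4):
    """Greedy left-to-right segmentation against a word list.

    At each position, take the longest substring (up to max_len
    characters) found in vocab; fall back to a single character
    when nothing matches.
    """
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in vocab:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Toy vocabulary, invented for this example only.
TOY_VOCAB = {"我们", "喜欢", "自然", "语言", "自然语言", "处理"}

tokens = forward_max_match("我们喜欢自然语言处理", TOY_VOCAB)
print(tokens)
```

Greedy matching like this is simple but picks the wrong split whenever the longest dictionary entry is not the intended word, which is one reason to reach for a dedicated segmentation tool rather than a hand-rolled matcher.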
Stars: 12
Forks: 1
Language: —
License: Apache-2.0
Category:
Last pushed: Jan 09, 2019
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/mathsyouth/awesome-word-segmentation"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
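The same request can be made from a script. The sketch below uses only the Python standard library and mirrors the curl command above. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and treating the `nlp`, owner, and repository path segments as parameters is an assumption based on the shape of that one URL. The response format is not described here, so the sketch returns the raw body as text (assumed to be UTF-8) instead of parsing it.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl example; everything after it is
# assumed to follow the pattern <domain>/<owner>/<repo>.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(owner, repo, domain="nlp"):
    """Assemble the endpoint URL, percent-encoding each path segment."""
    parts = [quote(part, safe="") for part in (domain, owner, repo)]
    return API_BASE + "/" + "/".join(parts)


def fetch_quality(owner, repo, domain="nlp", timeout=10.0):
    """Fetch the endpoint and return the body as text.

    The body is returned undecoded beyond UTF-8 (an assumption)
    because the response schema is not documented on this page.
    """
    url = build_quality_url(owner, repo, domain)
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


# Example call (performs a network request, so it is left commented out):
# body = fetch_quality("mathsyouth", "awesome-word-segmentation")
```

Keeping URL construction separate from the network call makes the request easy to check offline and keeps scripts from spending the daily quota by accident.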
Higher-rated alternatives
PyThaiNLP/pythainlp
Thai natural language processing in Python
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named...
jacksonllee/pycantonese
Cantonese Linguistics and NLP
dongrixinyu/JioNLP
A Chinese NLP preprocessing and parsing toolkit: accurate, efficient, and easy to use. www.jionlp.com
hankcs/pyhanlp
Chinese word segmentation