jsrpy/Chinese-NLP-Jieba

This is an introduction to Chinese word segmentation using Jieba.


Experimental

This helps you break down Chinese text into individual words, which is a crucial first step for many types of language analysis. You provide raw Chinese sentences or documents, and it gives you a list of separated words. This is used by anyone working with Chinese language data, such as researchers, linguists, or data analysts.
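The input-to-output flow described above can be sketched in a few lines. This is a minimal illustration, not the repository's own notebook code: it assumes the third-party `jieba` package is installed, and `segment` is a hypothetical helper name introduced here. If `jieba` is missing, the sketch falls back to a per-character split so that it still runs; that fallback is a placeholder, not real word segmentation.

```python
from typing import List

# Assumption: the third-party `jieba` package is installed (pip install jieba).
try:
    import jieba
except ImportError:  # keep the sketch runnable without the dependency
    jieba = None


def segment(text: str) -> List[str]:
    """Split Chinese text into a list of words (hypothetical helper)."""
    if jieba is not None:
        # jieba.lcut returns the tokens as a list, using the default mode.
        return jieba.lcut(text)
    # Placeholder fallback, NOT real segmentation: one token per character.
    return list(text)


# Example sentence: "I came to Tsinghua University in Beijing."
sentence = "我来到北京清华大学"
words = segment(sentence)
print(" / ".join(words))
```

The tokens are expected to concatenate back to the original sentence, which makes a quick sanity check before passing the list on to later analysis steps.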

No commits in the last 6 months.

Use this if you need to prepare Chinese text for analysis by accurately segmenting it into words.

Not ideal if you work with languages other than Chinese, or if you need higher-level NLP tasks such as sentiment analysis rather than word segmentation alone.

Chinese-language-processing text-segmentation linguistic-analysis data-preparation text-mining

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 6 / 25


Language: Jupyter Notebook

License: MIT

Higher-rated alternatives

PyThaiNLP/pythainlp

Thai natural language processing in Python

hankcs/HanLP

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named...

jacksonllee/pycantonese

Cantonese Linguistics and NLP

dongrixinyu/JioNLP

A Chinese NLP preprocessing and parsing package: accurate, efficient, and easy to use. www.jionlp.com

hankcs/pyhanlp

Chinese word segmentation
