qinwf/jiebaR
Chinese text segmentation with R. R语言中文分词 (文档已更新 🎉 :https://qinwenfeng.com/jiebaR/ )
This helps you break down Chinese text into individual words or phrases, which is crucial for any text analysis. You provide raw Chinese text, and it outputs segmented words, important keywords, or identifies parts of speech. This is for researchers, marketers, or anyone needing to analyze Chinese language data in R.
363 stars. No commits in the last 6 months.
Use this if you need to prepare Chinese text for analysis, such as counting word frequencies, understanding themes, or building predictive models in R.
Not ideal if you primarily work with other programming languages or need to segment non-Chinese text.
Stars
363
Forks
108
Language
C++
License
—
Category
Last pushed
Jul 13, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/qinwf/jiebaR"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
PyThaiNLP/pythainlp
Thai natural language processing in Python
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named...
jacksonllee/pycantonese
Cantonese Linguistics and NLP
dongrixinyu/JioNLP
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
hankcs/pyhanlp
中文分词