shibing624/crf-seg
crf-seg:用于生产环境的中文分词处理工具,可自定义语料、可自定义模型、架构清晰,分词效果好。java编写。
This tool helps Java developers integrate high-quality Chinese word segmentation into their applications. It takes raw Chinese text as input and outputs the text segmented into individual words, which is crucial for natural language processing tasks. Developers working on applications that require precise understanding or analysis of Chinese text, such as search engines, content analysis tools, or machine translation systems, would find this project useful.
No commits in the last 6 months.
Use this if you are a Java developer building an application that needs accurate and efficient Chinese word segmentation, especially for new words, proper nouns, and traditional characters.
Not ideal if you are looking for a pre-built application or a non-Java solution, as this is a Java toolkit for developers.
Stars
14
Forks
9
Language
Java
License
Apache-2.0
Category
Last pushed
Dec 11, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/shibing624/crf-seg"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PyThaiNLP/pythainlp
Thai natural language processing in Python
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named...
jacksonllee/pycantonese
Cantonese Linguistics and NLP
dongrixinyu/JioNLP
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
hankcs/pyhanlp
中文分词