NLPchina/ansj_seg
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
When you have Chinese text and need to break it down into meaningful words, identify names, or understand the type of each word, this tool can help. It takes raw Chinese sentences or documents and outputs the text segmented into words, with proper nouns like names recognized, and each word tagged with its grammatical category. This is ideal for linguists, data analysts working with Chinese text, or anyone processing Chinese content for search, analysis, or information extraction.
6,544 stars. No commits in the last 6 months.
Use this if you need fast and highly accurate Chinese word segmentation, especially for tasks requiring precise identification of names and word types.
Not ideal if your primary need is for non-Chinese language processing or if you require advanced features like full syntactic parsing beyond basic word tagging.
Stars
6,544
Forks
2,291
Language
Java
License
Apache-2.0
Category
Last pushed
Nov 19, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/NLPchina/ansj_seg"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PyThaiNLP/pythainlp
Thai natural language processing in Python
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named...
jacksonllee/pycantonese
Cantonese Linguistics and NLP
dongrixinyu/JioNLP
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
hankcs/pyhanlp
中文分词