NLPchina/ansj_seg

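Ansj segmenter: a true Java implementation of the ICT (ICTCLAS) segmenter. Segmentation accuracy and speed both exceed the open-source version of ICT. Chinese word segmentation, person-name recognition, part-of-speech tagging, user-defined dictionaries.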

51 / 100
Established

When you have Chinese text and need to split it into meaningful words, identify names, or determine each word's part of speech, this tool can help. It takes raw Chinese sentences or documents and outputs the text segmented into words, with proper nouns such as person names recognized and each word tagged with its grammatical category. It suits linguists, data analysts working with Chinese text, and anyone processing Chinese content for search, analysis, or information extraction.
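To make the idea of word segmentation concrete, here is a minimal sketch of dictionary-based forward maximum matching. It is a toy illustration only and is not ansj_seg's algorithm or API; the class name `MaxMatchSegmenter` and the four-word dictionary are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Toy forward maximum matching segmenter (hypothetical, NOT ansj_seg's method).
// At each position it takes the longest dictionary word that starts there,
// falling back to a single character when nothing matches.
final class MaxMatchSegmenter {
    private final Set<String> dictionary;
    private final int maxWordLength;

    MaxMatchSegmenter(Set<String> dictionary) {
        this.dictionary = dictionary;
        int longest = 1;
        for (String word : dictionary) {
            longest = Math.max(longest, word.length());
        }
        this.maxWordLength = longest;
    }

    // Lengths are counted in UTF-16 code units, which is sufficient for
    // common Chinese characters in the Basic Multilingual Plane.
    List<String> segment(String text) {
        List<String> words = new ArrayList<>();
        int pos = 0;
        while (pos < text.length()) {
            int end = Math.min(text.length(), pos + maxWordLength);
            // Shrink the candidate until it is a dictionary word or one character.
            while (end > pos + 1 && !dictionary.contains(text.substring(pos, end))) {
                end--;
            }
            words.add(text.substring(pos, end));
            pos = end;
        }
        return words;
    }

    public static void main(String[] args) {
        MaxMatchSegmenter segmenter =
                new MaxMatchSegmenter(Set.of("我", "爱", "北京", "天安门"));
        // prints: 我 / 爱 / 北京 / 天安门
        System.out.println(String.join(" / ", segmenter.segment("我爱北京天安门")));
    }
}
```

A production segmenter additionally resolves ambiguous splits, recognizes names that are absent from the dictionary, and assigns part-of-speech tags, none of which this sketch attempts.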

6,544 stars. No commits in the last 6 months.

Use this if you need fast and highly accurate Chinese word segmentation, especially for tasks requiring precise identification of names and word types.

Not ideal if you mainly process non-Chinese text or if you need advanced features such as full syntactic parsing beyond part-of-speech tagging.

Chinese-language-processing text-analytics information-extraction natural-language-understanding content-analysis
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

Stars: 6,544
Forks: 2,291
Language: Java
License: Apache-2.0
Last pushed: Nov 19, 2023
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/NLPchina/ansj_seg"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
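The same request can be made from Java. The sketch below uses the standard `java.net.http` client (Java 11+). It assumes the path follows a category/owner/repository pattern, inferred only from the single URL above, and it prints the response body as raw text because the response format is not documented here. The class and method names are hypothetical.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

// Hypothetical helper for the quality endpoint shown in the curl command.
final class QualityApiClient {
    static final String BASE = "https://pt-edge.onrender.com/api/v1/quality";

    // Assumption: the path is <category>/<owner>/<repo>, as in the curl example.
    static URI qualityUri(String category, String owner, String repo) {
        return URI.create(BASE + "/" + category + "/" + owner + "/" + repo);
    }

    // Sends a GET request and returns the body as an unparsed string.
    static String fetch(URI uri) throws Exception {
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder(uri).GET().build();
        HttpResponse<String> response =
                client.send(request, HttpResponse.BodyHandlers.ofString());
        if (response.statusCode() != 200) {
            // A non-200 status may indicate that the daily request limit was reached.
            throw new IllegalStateException("HTTP " + response.statusCode());
        }
        return response.body();
    }

    public static void main(String[] args) throws Exception {
        System.out.println(fetch(qualityUri("nlp", "NLPchina", "ansj_seg")));
    }
}
```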