hankcs/pyhanlp

中文分词

60
/ 100
Established

This tool helps you automatically break down Chinese text into meaningful words and analyze its grammatical structure, which is essential for understanding and processing large amounts of text. You input Chinese sentences or documents, and it outputs segmented words, their parts of speech, and how they relate to each other grammatically. Anyone working with Chinese text data, like linguists, data analysts, or content managers, would find this useful.

3,211 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to accurately process and understand Chinese language data by segmenting text, tagging parts of speech, or analyzing sentence structure.

Not ideal if you primarily work with languages other than Chinese or require deep learning-based, cutting-edge multilingual NLP which HanLP2.x offers.

Chinese-text-analysis natural-language-processing linguistics content-analysis data-preprocessing
Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 25 / 25

How are scores calculated?

Stars

3,211

Forks

803

Language

Python

License

Apache-2.0

Last pushed

Jan 16, 2025

Commits (30d)

0

Dependencies

2

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/hankcs/pyhanlp"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.