hankcs/pyhanlp
中文分词
This tool helps you automatically break down Chinese text into meaningful words and analyze its grammatical structure, which is essential for understanding and processing large amounts of text. You input Chinese sentences or documents, and it outputs segmented words, their parts of speech, and how they relate to each other grammatically. Anyone working with Chinese text data, like linguists, data analysts, or content managers, would find this useful.
3,211 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to accurately process and understand Chinese language data by segmenting text, tagging parts of speech, or analyzing sentence structure.
Not ideal if you primarily work with languages other than Chinese or require deep learning-based, cutting-edge multilingual NLP which HanLP2.x offers.
Stars
3,211
Forks
803
Language
Python
License
Apache-2.0
Category
Last pushed
Jan 16, 2025
Commits (30d)
0
Dependencies
2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/hankcs/pyhanlp"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PyThaiNLP/pythainlp
Thai natural language processing in Python
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named...
jacksonllee/pycantonese
Cantonese Linguistics and NLP
dongrixinyu/JioNLP
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
ownthink/Jiagu
Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类