hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
HanLP helps you analyze text in over 100 languages, breaking it down into individual words, identifying parts of speech, and recognizing entities like names or locations. This allows you to understand the structure and meaning of documents. It's used by researchers and businesses to process text data for tasks like information extraction or content analysis.
36,209 stars. Used by 1 other package. Available on PyPI.
Use this if you need to perform detailed linguistic analysis on text in multiple languages, such as identifying key terms or understanding sentence structure.
Not ideal if you only need very basic text operations or are looking for a simple keyword search tool.
Stars
36,209
Forks
10,909
Language
Python
License
Apache-2.0
Category
Last pushed
Nov 15, 2025
Commits (30d)
0
Dependencies
9
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/hankcs/HanLP"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PyThaiNLP/pythainlp
Thai natural language processing in Python
jacksonllee/pycantonese
Cantonese Linguistics and NLP
dongrixinyu/JioNLP
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
hankcs/pyhanlp
中文分词
ownthink/Jiagu
Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类