baidu/lac
百度NLP:分词,词性标注,命名实体识别,词重要性
LAC helps you automatically break down Chinese text into meaningful units, identify the type of each word (like a noun or verb), and recognize specific entities such as names of people, places, or organizations. It takes raw Chinese sentences as input and outputs segmented words, their grammatical tags, and their importance within the sentence. This tool is ideal for anyone working with large volumes of Chinese text, like linguists, data analysts, or content strategists.
3,992 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.
Use this if you need to accurately process and understand the structure and key information within Chinese text for tasks like text analysis or content categorization.
Not ideal if your primary need is for languages other than Chinese, or if you require extremely complex, domain-specific semantic understanding beyond basic lexical analysis and named entity recognition.
Stars
3,992
Forks
592
Language
C++
License
Apache-2.0
Category
Last pushed
May 25, 2021
Commits (30d)
0
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/baidu/lac"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PyThaiNLP/pythainlp
Thai natural language processing in Python
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named...
jacksonllee/pycantonese
Cantonese Linguistics and NLP
dongrixinyu/JioNLP
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
hankcs/pyhanlp
中文分词