samzshi0529/HanziNLP
An NLP package for Chinese text: preprocessing, tokenization, Chinese fonts, word embeddings, text similarity, and sentiment analysis. A lightweight Chinese natural language processing package.
HanziNLP is a user-friendly tool for analyzing, modeling, and visualizing Chinese text. It takes raw Chinese text as input and helps you preprocess it, break it into words and sentences, identify key topics, and understand its sentiment. This is ideal for researchers, marketers, or anyone who needs to gain insights from large volumes of Chinese written content.
No commits in the last 6 months. Available on PyPI.
Use this if you need to perform detailed natural language processing tasks on Chinese text, such as sentiment analysis, topic modeling, or preparing text for further analysis and visualization.
Not ideal if your primary need is for English-only text analysis or if you require advanced, highly specialized NLP models not included in this package.
Stars: 36
Forks: 3
Language: Python
License: MIT
Category:
Last pushed: Nov 03, 2024
Commits (30d): 0
Dependencies: 13
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/samzshi0529/HanziNLP"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
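For use from a script rather than the shell, the same request might look like the minimal Python sketch below. It uses only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, the reading of the path as category/owner/repo is inferred from the curl URL above, and the assumption that the endpoint returns JSON is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL (hypothetical helper).

    Assumes the path segments are category/owner/repo, as the
    curl example suggests.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the data for one repository (hypothetical helper).

    Assumes the response body is JSON; the page does not document
    the response format, so adjust the parsing if it differs.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "samzshi0529", "HanziNLP")` would issue the same GET request as the curl command. Without a key, each call counts against the 100 requests/day limit.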
Higher-rated alternatives
nltk/nltk
NLTK Source
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
undertheseanlp/underthesea
Underthesea - Vietnamese NLP Toolkit
stanfordnlp/stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many...
flairNLP/flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)