thunlp/THUCTC
An Efficient Chinese Text Classifier
This tool helps organize large collections of Chinese text by automatically sorting documents into predefined categories. You provide a set of labeled example texts, and the tool learns to classify new, uncategorized Chinese documents. This is ideal for anyone who regularly deals with large volumes of Chinese-language content, like news editors, market researchers, or data analysts.
211 stars. No commits in the last 6 months.
Use this if you need to automatically categorize large volumes of Chinese documents, like news articles, customer feedback, or market reports, into specific topics.
Not ideal if you primarily work with English texts, or if you need to process extremely large datasets on Windows, where memory limitations can get in the way.
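The workflow described above (labeled examples in, predicted category out) can be illustrated with a small sketch. This is not THUCTC's API and does not reproduce its algorithm: it is a toy character-bigram Naive Bayes classifier, and the class name `BigramNaiveBayes`, its methods, and the sample sentences are all hypothetical, chosen only to show the train-then-classify pattern.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

/** Toy character-bigram Naive Bayes classifier. Hypothetical; not THUCTC's API. */
public class BigramNaiveBayes {
    // label -> (bigram -> occurrence count)
    private final Map<String, Map<String, Integer>> bigramCounts = new HashMap<>();
    // label -> total number of bigrams seen for that label
    private final Map<String, Integer> totalBigrams = new HashMap<>();
    // label -> number of training documents
    private final Map<String, Integer> docCounts = new HashMap<>();
    private final Set<String> vocabulary = new HashSet<>();
    private int totalDocs = 0;

    /** Adds one labeled example to the model. */
    public void train(String label, String text) {
        docCounts.merge(label, 1, Integer::sum);
        totalDocs++;
        Map<String, Integer> counts =
                bigramCounts.computeIfAbsent(label, k -> new HashMap<>());
        // Overlapping two-character windows avoid the need for a word segmenter.
        // (Works on UTF-16 code units, which is fine for common Chinese characters.)
        for (int i = 0; i + 1 < text.length(); i++) {
            String bigram = text.substring(i, i + 2);
            counts.merge(bigram, 1, Integer::sum);
            totalBigrams.merge(label, 1, Integer::sum);
            vocabulary.add(bigram);
        }
    }

    /** Returns the label with the highest log posterior, or null if untrained. */
    public String classify(String text) {
        String best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (String label : docCounts.keySet()) {
            // Log prior: share of training documents carrying this label.
            double score = Math.log(docCounts.get(label) / (double) totalDocs);
            Map<String, Integer> counts = bigramCounts.get(label);
            int total = totalBigrams.getOrDefault(label, 0);
            for (int i = 0; i + 1 < text.length(); i++) {
                int c = counts.getOrDefault(text.substring(i, i + 2), 0);
                // Laplace (add-one) smoothing keeps unseen bigrams from zeroing the score.
                score += Math.log((c + 1.0) / (total + vocabulary.size()));
            }
            if (score > bestScore) {
                bestScore = score;
                best = label;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        BigramNaiveBayes model = new BigramNaiveBayes();
        model.train("sports", "球队赢得了比赛冠军");
        model.train("sports", "球员在比赛中进球");
        model.train("finance", "股票市场今日上涨");
        model.train("finance", "银行下调贷款利率");
        System.out.println(model.classify("比赛进球")); // prints "sports"
        System.out.println(model.classify("股票利率")); // prints "finance"
    }
}
```

A real classifier would train on far more data and would typically add feature selection and a stronger model; the sketch only shows the shape of the workflow.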
Stars: 211
Forks: 68
Language: Java
License: MIT
Last pushed: Sep 30, 2018
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/thunlp/THUCTC"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
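The same request as the curl command above can be made from Java. The following is a minimal sketch, assuming Java 11 or later for `java.net.http`. The class name `QualityApiClient` and the helper `qualityUri` are hypothetical, and the `domain/owner/repo` path layout is inferred from the single URL shown here rather than from documented API behavior. The response format is not described on this page, so the body is printed as-is without parsing.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

/** Hypothetical helper for the quality endpoint shown in the curl example. */
public class QualityApiClient {
    private static final String BASE = "https://pt-edge.onrender.com/api/v1/quality";

    /** Builds the endpoint URI. The path layout is an assumption based on one URL. */
    static URI qualityUri(String domain, String owner, String repo) {
        return URI.create(BASE + "/" + domain + "/" + owner + "/" + repo);
    }

    /** Performs a plain GET and returns the raw response body. */
    static String fetch(URI uri) throws IOException, InterruptedException {
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder(uri).GET().build();
        HttpResponse<String> response =
                client.send(request, HttpResponse.BodyHandlers.ofString());
        if (response.statusCode() != 200) {
            throw new IOException("Unexpected HTTP status: " + response.statusCode());
        }
        return response.body();
    }

    public static void main(String[] args) throws Exception {
        // No API key is sent, matching the keyless tier described above.
        System.out.println(fetch(qualityUri("nlp", "thunlp", "THUCTC")));
    }
}
```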
Higher-rated alternatives
gaussic/text-classification-cnn-rnn
CNN-RNN Chinese text classification, based on TensorFlow
TobiasLee/Text-Classification
Implementation of papers for text classification task on DBpedia
prakashpandey9/Text-Classification-Pytorch
Text classification using deep learning models in Pytorch
ShawnyXiao/TextClassification-Keras
Text classification models implemented in Keras, including: FastText, TextCNN, TextRNN,...
FreedomIntelligence/TextClassificationBenchmark
A Benchmark of Text Classification in PyTorch