thunlp/THUCTC

An Efficient Chinese Text Classifier

49 / 100 (Emerging)

This tool helps organize large collections of Chinese text by automatically sorting documents into predefined categories. You provide a set of labeled example texts, and the tool learns to classify new, uncategorized Chinese documents. This is ideal for anyone who regularly deals with large volumes of Chinese-language content, like news editors, market researchers, or data analysts.

211 stars. No commits in the last 6 months.

Use this if you need to automatically categorize large volumes of Chinese documents, like news articles, customer feedback, or market reports, into specific topics.

Not ideal if you primarily work with English texts, or if you need to process extremely large datasets on Windows, where memory limitations get in the way.

Chinese-text-analysis news-categorization content-management market-research data-organization
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 23 / 25

Stars: 211
Forks: 68
Language: Java
License: MIT
Last pushed: Sep 30, 2018
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/thunlp/THUCTC"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
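
For scripted use, the same request can be sketched in Python with only the standard library. This is a minimal sketch under two assumptions the page does not confirm: that the path segments after `/quality/` are a category, an owner, and a repository name (inferred from the single curl example), and that the endpoint returns JSON. The helper names `quality_url` and `fetch_quality` are hypothetical and chosen for illustration.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path layout is /<category>/<owner>/<repo>,
    inferred from the one curl example on this page.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for one repository.

    Assumption: the response body is JSON. If it is not,
    json.loads raises a ValueError (json.JSONDecodeError).
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "thunlp", "THUCTC")` issues the same GET request as the curl command. Since unauthenticated access is limited to 100 requests/day, caching the result locally is likely preferable to polling.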