yuzhimanhua/MetaCat

Minimally Supervised Categorization of Text with Metadata (SIGIR'20)

31
/ 100
Emerging

This tool helps organize large collections of text documents, like product reviews or social media posts, into predefined categories. You provide your documents along with any relevant associated information (like author, tags, or product IDs), and it automatically assigns a category to each document, even if you only have a few examples for each category. It's designed for data analysts, content managers, or researchers who need to classify text using minimal labeled data.

No commits in the last 6 months.

Use this if you have a substantial collection of text documents and associated metadata that you need to categorize, but only have a small number of hand-labeled examples for each category.

Not ideal if you have no metadata accompanying your text documents or if you require a fully unsupervised clustering approach.

content-categorization document-classification text-analytics data-organization information-management
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 7 / 25

How are scores calculated?

Stars

47

Forks

3

Language

Python

License

Apache-2.0

Last pushed

Apr 02, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/yuzhimanhua/MetaCat"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.