yuzhimanhua/MetaCat

Minimally Supervised Categorization of Text with Metadata (SIGIR'20)

/ 100

Emerging

This tool helps organize large collections of text documents, like product reviews or social media posts, into predefined categories. You provide your documents along with any relevant associated information (like author, tags, or product IDs), and it automatically assigns a category to each document, even if you only have a few examples for each category. It's designed for data analysts, content managers, or researchers who need to classify text using minimal labeled data.

No commits in the last 6 months.

Use this if you have a substantial collection of text documents and associated metadata that you need to categorize, but only have a small number of hand-labeled examples for each category.

Not ideal if you have no metadata accompanying your text documents or if you require a fully unsupervised clustering approach.

content-categorization document-classification text-analytics data-organization information-management

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 7 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

kk7nc/HDLTex

HDLTex: Hierarchical Deep Learning for Text Classification

richliao/textClassifier

Text classifier for Hierarchical Attention Networks for Document Classification

RandolphVI/Hierarchical-Multi-Label-Text-Classification

The code of CIKM'19 paper《Hierarchical Multi-label Text Classification: An Attention-based...

yumeng5/LOTClass

[EMNLP 2020] Text Classification Using Label Names Only: A Language Model Self-Training Approach

sgrvinod/a-PyTorch-Tutorial-to-Text-Classification

Hierarchical Attention Networks | a PyTorch Tutorial to Text Classification

Explore NLP Tools

All categories Trending NLP directory Insights