aws-samples/sample-extreme-text-classifier

A Python text classifier for large-scale multi-class classification using Amazon Bedrock. Supports classification of 1000+ classes with LLM reranking and attribute validation.

/ 100

Emerging

This tool helps you automatically sort documents or text snippets into thousands of categories with high accuracy. You provide text (or PDF documents) and a list of your custom categories, and it outputs the best matching category along with a confidence score. It's ideal for anyone managing large volumes of varied text, like operations specialists, compliance officers, or data analysts.

Use this if you need to reliably classify a large number of texts or PDF documents into 1000+ distinct categories and want to ensure classifications meet specific business rules.

Not ideal if you only have a few dozen categories or don't require the advanced validation features for business-critical accuracy.

document-management information-extraction workflow-automation data-categorization compliance-automation

No Package No Dependents

Maintenance 10 / 25

Adoption 5 / 25

Maturity 15 / 25

Community 16 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT-0

Higher-rated alternatives

codelion/adaptive-classifier

A flexible, adaptive classification system for dynamic text classification

jiegzhan/multi-class-text-classification-cnn-rnn

Classify Kaggle San Francisco Crime Description into 39 classes. Build the model with CNN, RNN...

jiegzhan/multi-class-text-classification-cnn

Classify Kaggle Consumer Finance Complaints into 11 classes. Build the model with CNN...

cbaziotis/datastories-semeval2017-task4

Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention...

iamaziz/ar-embeddings

Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic) using word2vec

Explore Embedding Tools

All categories Trending Embeddings directory Insights