aws-samples/sample-extreme-text-classifier
A Python text classifier for large-scale multi-class classification using Amazon Bedrock. Supports classification of 1000+ classes with LLM reranking and attribute validation.
This tool helps you automatically sort documents or text snippets into thousands of categories with high accuracy. You provide text (or PDF documents) and a list of your custom categories, and it outputs the best matching category along with a confidence score. It's ideal for anyone managing large volumes of varied text, like operations specialists, compliance officers, or data analysts.
Use this if you need to reliably classify a large number of texts or PDF documents into 1000+ distinct categories and want to ensure classifications meet specific business rules.
Not ideal if you only have a few dozen categories or don't require the advanced validation features for business-critical accuracy.
Stars
10
Forks
6
Language
Python
License
MIT-0
Category
Last pushed
Feb 14, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/aws-samples/sample-extreme-text-classifier"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
codelion/adaptive-classifier
A flexible, adaptive classification system for dynamic text classification
jiegzhan/multi-class-text-classification-cnn-rnn
Classify Kaggle San Francisco Crime Description into 39 classes. Build the model with CNN, RNN...
jiegzhan/multi-class-text-classification-cnn
Classify Kaggle Consumer Finance Complaints into 11 classes. Build the model with CNN...
cbaziotis/datastories-semeval2017-task4
Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention...
iamaziz/ar-embeddings
Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic) using word2vec