conditionedstimulus/DocumentClassifier
FastAPI application for document classification using a multimodal LayoutLM model, designed to classify PDF documents into RVL-DCIP categories.
Quickly sort your PDF documents into predefined categories like invoices, resumes, or reports. You provide a PDF file, and it tells you what type of document it is along with the probability of it belonging to each category. This is for professionals who need to automatically categorize incoming documents, such as administrative staff or data entry specialists.
No commits in the last 6 months.
Use this if you need an automated way to classify PDF documents into common business categories to streamline your document management.
Not ideal if you need highly accurate classification for modern, high-quality documents, as this model performs better on older document styles and has an 86% accuracy.
Stars
12
Forks
—
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Apr 29, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/conditionedstimulus/DocumentClassifier"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
paperless-ngx/paperless-ngx
A community-supported supercharged document management system: scan, index and archive all your documents
GoogleCloudPlatform/document-ai-samples
Sample applications and demos for Document AI, the end-to-end document processing platform on...
aws-solutions/document-understanding-solution
Example of integrating & using Amazon Textract, Amazon Comprehend, Amazon Comprehend Medical,...
naiveHobo/InvoiceNet
Deep neural network to extract intelligent information from invoice documents.
aphp/edspdf
EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides...