Yashsonaar/LayoutLMv3-Fine-Tuning
Welcome to the LayoutLMv3 Fine-Tuning project! 🚀 This project focuses on extracting structured data from invoices and PDFs using LayoutLMv3, PaddleOCR, and Label Studio. The system extracts key fields like invoice number, date, vendor GSTIN, PAN, product description, rate, quantity, and amount.
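One step that connects the OCR stage to the model can be sketched as follows. LayoutLMv3 expects each word's bounding box as [x0, y0, x1, y1] on a 0 to 1000 scale, while OCR engines such as PaddleOCR typically return four corner points in pixels. The function name, the sample coordinates, and the page size below are illustrative assumptions and are not taken from the repository.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def quad_to_layoutlm_box(
    quad: Sequence[Point], page_width: float, page_height: float
) -> List[int]:
    """Convert a four-point OCR quadrilateral (pixels) to [x0, y0, x1, y1] on a 0-1000 scale."""
    xs = [point[0] for point in quad]
    ys = [point[1] for point in quad]

    def scale(value: float, size: float) -> int:
        # Clamp so boxes that slightly overshoot the page stay within 0-1000.
        return max(0, min(1000, int(1000 * value / size)))

    return [
        scale(min(xs), page_width),
        scale(min(ys), page_height),
        scale(max(xs), page_width),
        scale(max(ys), page_height),
    ]


# Hypothetical OCR result for one word on an 800 x 1000 pixel page.
example_quad = [(80.0, 100.0), (240.0, 100.0), (240.0, 130.0), (80.0, 130.0)]
print(quad_to_layoutlm_box(example_quad, page_width=800, page_height=1000))
# prints [100, 100, 300, 130]
```

Taking the min and max of the corners turns a slightly rotated quadrilateral into an axis-aligned box, which is a simplification that may lose precision on heavily skewed scans.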
This project helps businesses automate the extraction of specific data from invoice PDFs, both digital and scanned. It pulls out key details such as invoice numbers, dates, vendor information, product descriptions, rates, quantities, and amounts. It is aimed at accounts payable teams, finance departments, and anyone who regularly processes a high volume of invoices.
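As a rough illustration of how per-word predictions could become the fields listed above, the sketch below collapses BIO-style tags into named values. It assumes the fine-tuned model emits tags such as B-DESCRIPTION and I-DESCRIPTION; the label names, the function name, and the sample words are hypothetical and may not match the labels used in the repository.

```python
from typing import Dict, List, Optional


def collapse_bio(words: List[str], tags: List[str]) -> Dict[str, List[str]]:
    """Group consecutive B-/I- tagged words into field values keyed by label name."""
    fields: Dict[str, List[str]] = {}
    label: Optional[str] = None
    span: List[str] = []

    # A trailing ("", "O") sentinel flushes the last open span.
    for word, tag in list(zip(words, tags)) + [("", "O")]:
        prefix, _, name = tag.partition("-")
        if prefix == "I" and name == label:
            span.append(word)  # continue the current field
            continue
        if label is not None:
            fields.setdefault(label, []).append(" ".join(span))
        if prefix in ("B", "I"):
            # A stray I- tag with a new label is treated as the start of a span.
            label, span = name, [word]
        else:
            label, span = None, []
    return fields


# Hypothetical predictions for one invoice line.
words = ["INV-1042", "Steel", "bolts", "M8", "25", "40.00", "1000.00"]
tags = [
    "B-INVOICE_NUMBER",
    "B-DESCRIPTION", "I-DESCRIPTION", "I-DESCRIPTION",
    "B-QUANTITY", "B-RATE", "B-AMOUNT",
]
print(collapse_bio(words, tags))
```

Each label maps to a list because an invoice can contain several line items, so fields like description, rate, and quantity may repeat.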
No commits in the last 6 months.
Use this if you need to quickly and accurately extract structured data from diverse invoice formats to streamline your accounting or record-keeping workflows.
Not ideal if you only process a handful of invoices occasionally or need to extract data from document types other than invoices.
Stars: 12
Forks: 3
Language: Python
License: -
Category:
Last pushed: Jan 06, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Yashsonaar/LayoutLMv3-Fine-Tuning"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
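The same request can be made from Python using only the standard library. This is a minimal sketch: the category/owner/repo path layout is inferred from the single URL shown above, and the assumption that the endpoint returns JSON is not confirmed by this page. The helper names are illustrative.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    segments = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(segments)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository (assumes a JSON response body)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Building the URL needs no network access.
print(quality_url("ml-frameworks", "Yashsonaar", "LayoutLMv3-Fine-Tuning"))
```

Calling fetch_quality("ml-frameworks", "Yashsonaar", "LayoutLMv3-Fine-Tuning") performs the actual request and counts against the daily request limit mentioned above.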
Higher-rated alternatives
paperless-ngx/paperless-ngx
A community-supported supercharged document management system: scan, index and archive all your documents
GoogleCloudPlatform/document-ai-samples
Sample applications and demos for Document AI, the end-to-end document processing platform on...
aws-solutions/document-understanding-solution
Example of integrating & using Amazon Textract, Amazon Comprehend, Amazon Comprehend Medical,...
naiveHobo/InvoiceNet
Deep neural network to extract intelligent information from invoice documents.
aphp/edspdf
EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides...