microsoft/OCR-Form-Tools

A set of tools to use in Microsoft Azure Form Recognizer and OCR services.

53 / 100 (Established)

This tool helps businesses and researchers convert scanned or digital forms (like PDFs, JPEGs, or TIFFs) into structured, usable data. You feed it images of forms, visually label the key information on them, and it learns to automatically extract that data, outputting predictions of key-value pairs. It's designed for anyone who regularly processes large volumes of similar forms and needs to automate data extraction.

537 stars. No commits in the last 6 months.

Use this if you need to extract specific information from a consistent set of document types, such as invoices, receipts, or application forms, and want to automate this process.

Not ideal if you're looking for a tool to extract data from highly unstructured documents or need a solution that doesn't involve Azure cloud services.

data-extraction document-processing form-automation record-management optical-character-recognition
Stale 6m, No Package, No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

Stars: 537
Forks: 176
Language: TypeScript
License: MIT
Category: latex-ocr-tools
Last pushed: Jul 07, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/microsoft/OCR-Form-Tools"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
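
The same request can be made from a script. The following is a minimal Python sketch, not an official client. It assumes the endpoint returns a JSON body (the response schema is not documented on this page) and that other repositories follow the same `category/owner/repo` path layout as the curl example. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path copied from the curl example; everything after it is
# category/owner/repo (an assumption inferred from that one URL).
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL using the path layout of the curl example."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body, assuming it is JSON.

    No API key is sent, so the call counts against the keyless
    100 requests/day limit.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_quality("ml-frameworks", "microsoft", "OCR-Form-Tools")` requests the same URL as the curl command. The `ml-frameworks` path segment is taken from that command as written, even though the listing shows a different category label.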