microsoft/OCR-Form-Tools
A set of tools to use in Microsoft Azure Form Recognizer and OCR services.
This tool helps businesses and researchers convert scanned or digital forms (like PDFs, JPEGs, or TIFFs) into structured, usable data. You feed it images of forms, visually label the key information on them, and it learns to automatically extract that data, outputting predictions of key-value pairs. It's designed for anyone who regularly processes large volumes of similar forms and needs to automate data extraction.
537 stars. No commits in the last 6 months.
Use this if you need to extract specific information from a consistent set of document types, such as invoices, receipts, or application forms, and want to automate this process.
Not ideal if you're looking for a tool to extract data from highly unstructured documents or need a solution that doesn't involve Azure cloud services.
Stars: 537
Forks: 176
Language: TypeScript
License: MIT
Category:
Last pushed: Jul 07, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/microsoft/OCR-Form-Tools"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
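The same request can be made from a script. The following is a minimal Python sketch, not an official client. The helper names `build_quality_url` and `fetch_quality` are hypothetical, the category/owner/repo path pattern is inferred from the single curl example above, and the sketch assumes the endpoint returns JSON, which this page does not confirm.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example; everything after it is
# assumed to follow a <category>/<owner>/<repo> pattern.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual names stay valid.
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record without a key and decode it.

    Assumption: the response body is JSON; adjust the decoding if not.
    """
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Keyless access is limited to 100 requests/day, so call sparingly.
    print(fetch_quality("ml-frameworks", "microsoft", "OCR-Form-Tools"))
```

Keeping URL construction separate from the network call lets the path logic be checked without spending any of the daily request quota.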
Related frameworks
ogkalu2/comic-translate
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a...
naptha/tesseract.js
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
mayocream/koharu
ML-powered manga translator, written in Rust.
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
zyddnys/manga-image-translator
Translate manga/image (one-click translation of text in all kinds of images) https://cotrans.touhou.ai/ (no longer working)