archity/doc-scanner
Computer Vision and NLP-based document scanner, text extractor, and summarizer.
This tool helps individuals convert physical documents, like books or printed pages, into clear, searchable digital text and summarized content. It takes an image of a document as input and outputs a neatly scanned digital image, the extracted text, and a concise summary. Anyone who needs to digitize and quickly understand key information from paper documents will find this useful.
No commits in the last 6 months.
Use this if you need to transform hardcopy documents into clean, readable digital files, extract the text, and get a quick overview of the content.
Not ideal if you primarily work with existing digital documents or only need basic image cropping without text extraction or summarization.
Stars: 11
Forks: —
Language: Python
License: MIT
Category:
Last pushed: Aug 24, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/archity/doc-scanner"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
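The same request can be made from Python. The following is a minimal sketch using only the standard library, not an official client. It assumes the path layout is `quality/<category>/<owner>/<repo>` (inferred from the single curl example above, where `nlp` is taken to be the category) and that the endpoint returns JSON; neither is confirmed by this page. The function names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from
    the one curl example; the real API may be laid out differently.
    """
    # Percent-encode each segment so unusual names cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. No API key is sent, so the
    keyless daily limit stated on this page would apply.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("nlp", "archity", "doc-scanner")
# print(json.dumps(data, indent=2))
```

Keeping URL construction separate from the network call makes the path assumption easy to check or change without issuing requests against the daily quota.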
Higher-rated alternatives
deepdoctection/deepdoctection
A Repo For Document AI
deanmalmgren/textract
extract text from any document. no muss. no fuss.
eikek/docspell
Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources...
zzzDavid/ICDAR-2019-SROIE
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic...