archity/doc-scanner

Computer Vision and NLP based document scanner, text extractor and summarizer.

/ 100

Experimental

This tool helps individuals convert physical documents, like books or printed pages, into clear, searchable digital text and summarized content. It takes an image of a document as input and outputs a neatly scanned digital image, the extracted text, and a concise summary. Anyone who needs to digitize and quickly understand key information from paper documents will find this useful.

No commits in the last 6 months.

Use this if you need to transform hardcopy documents into clean, readable digital files, extract the text, and get a quick overview of the content.

Not ideal if you primarily work with existing digital documents or only need basic image cropping without text extraction or summarization.

document-digitization information-extraction text-summarization knowledge-management

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

MIT

Higher-rated alternatives

deepdoctection/deepdoctection

A Repo For Document AI

deanmalmgren/textract

extract text from any document. no muss. no fuss.

eikek/docspell

Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources...

zzzDavid/ICDAR-2019-SROIE

ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction

clovaai/donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic...

Explore NLP Tools

All categories Trending NLP directory Insights