FreeOCR-AI/layoutreader

A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.

/ 100

Emerging

This tool helps you accurately extract text from scanned documents, PDFs, or images by determining the correct reading order of content blocks. It takes in the bounding box locations of text sections and outputs the numerical sequence in which they should be read, ensuring a coherent and logical reconstruction of the document's content. Anyone who needs to process large volumes of unstructured document images, such as data entry specialists, archivists, or legal professionals, will find this project useful.

314 stars. No commits in the last 6 months.

Use this if you need to reliably convert visual document layouts into sequential, readable text, especially from diverse or complex document types.

Not ideal if your primary need is simple OCR without complex layout understanding, or if you only process documents with very basic, uniform layouts.

document-processing data-extraction digital-archiving information-capture content-digitization

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

314

Forks

Language

Python

License

—

Higher-rated alternatives

clusterzx/paperless-ai

An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure...

kha-white/manga-ocr

Optical character recognition for Japanese text, with the main focus being Japanese manga

alephpi/Texo-web

The web application for Texo, a minimalist SOTA LaTeX OCR model which contains only 20M...

bytefer/ollama-ocr

Implementing OCR with a local visual model run by ollama.

alephpi/Texo

A minimalist SOTA LaTeX OCR model with only 20M parameters, running in browser. Full training...

Explore Transformer Models

All categories Trending Transformer directory Insights