muhd-umer/pyramidtabnet

Official PyTorch implementation of PyramidTabNet: Transformer-based Table Recognition in Image-based Documents

/ 100

Emerging

PyramidTabNet helps you automatically extract structured table data from scanned documents, images, and PDFs. It takes an image-based document containing tables as input and precisely identifies the tables and their internal structure (rows and columns). This is ideal for data entry specialists, researchers, and operations teams who need to convert visual table information into an editable, structured format for analysis or database entry.

No commits in the last 6 months.

Use this if you need to accurately detect tables and understand their layout in various image-based documents, saving significant manual data entry time.

Not ideal if your documents are already in a structured, machine-readable format like Excel or CSV, or if you only need to extract plain text without table recognition.

document-automation data-extraction digitization information-capture document-analysis

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 7 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Compare

pyramidtabnet and TableNet-pytorch

Higher-rated alternatives

Psarpei/Multi-Type-TD-TSR

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and...

Layout-Parser/layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis

Sudhanshu1304/table-transformer

🔍 Table Extraction Tool: A powerful open-source solution combining OCR and computer vision for...

ses4255/Versatile-OCR-Program

Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)

asagar60/TableNet-pytorch

Pytorch Implementation of TableNet

Explore ML Frameworks

All categories Trending ML Framework directory Insights