Sudhanshu1304/table-transformer

🔍 Table Extraction Tool: A powerful open-source solution combining OCR and computer vision for extracting structured tabular data from images. Ideal for LLM preprocessing, data analysis, and automation. 🚀

/ 100

Emerging

This tool helps data analysts, researchers, or operations managers automatically extract structured data from images. You can input an image containing a table, and it will output the data in a clean, organized format like a spreadsheet, HTML, or CSV. This is ideal for anyone needing to convert information locked in image-based tables into usable, editable data.

No commits in the last 6 months.

Use this if you regularly need to pull tabular data from scanned documents, screenshots, or other image files for analysis or automation.

Not ideal if your primary need is to extract unstructured text from documents or if you only deal with already digital, machine-readable tables.

data-extraction document-processing data-entry-automation information-retrieval business-intelligence

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

Psarpei/Multi-Type-TD-TSR

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and...

Layout-Parser/layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis

asagar60/TableNet-pytorch

Pytorch Implementation of TableNet

ses4255/Versatile-OCR-Program

Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)

JG1VPP/MuTabNet

ICDAR 2024 Table OCR Model

Explore ML Frameworks

All categories Trending ML Framework directory Insights