Sudhanshu1304/table-transformer
🔍 Table Extraction Tool: A powerful open-source solution combining OCR and computer vision for extracting structured tabular data from images. Ideal for LLM preprocessing, data analysis, and automation. 🚀
This tool helps data analysts, researchers, or operations managers automatically extract structured data from images. You can input an image containing a table, and it will output the data in a clean, organized format like a spreadsheet, HTML, or CSV. This is ideal for anyone needing to convert information locked in image-based tables into usable, editable data.
No commits in the last 6 months.
Use this if you regularly need to pull tabular data from scanned documents, screenshots, or other image files for analysis or automation.
Not ideal if your primary need is to extract unstructured text from documents or if you only deal with already digital, machine-readable tables.
Stars
93
Forks
21
Language
Python
License
MIT
Category
Last pushed
Feb 22, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Sudhanshu1304/table-transformer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Psarpei/Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and...
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
asagar60/TableNet-pytorch
Pytorch Implementation of TableNet
ses4255/Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
JG1VPP/MuTabNet
ICDAR 2024 Table OCR Model