poloclub/tsr-convstem

High-Performance Transformers for Table Structure Recognition Need Early Convolutions

33 / 100 Emerging

This project converts images of tables, such as those found in scanned documents or PDFs, into a machine-readable format such as HTML. You input an image containing a table, and it outputs the table's structure: its cells, rows, and columns. This is ideal for data entry specialists, researchers, or anyone who needs to extract structured data from visual documents quickly and accurately.

No commits in the last 6 months.

Use this if you need to efficiently and accurately convert tables embedded in images or documents into a structured, editable format.

Not ideal if you primarily work with already digital, machine-readable tables (e.g., CSV, Excel) and don't need to process them from images.

data-extraction document-processing table-digitization information-capture data-entry-automation
Stale 6m · No Package · No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 9 / 25
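
The four category scores above add up to the overall score shown for this project. The short sketch below checks that arithmetic. It assumes the overall score is a plain sum of the four categories, which matches the figures on this page but is not a formula the page states.

```python
# Category scores as listed on this page, each out of 25.
subscores = {"Maintenance": 0, "Adoption": 8, "Maturity": 16, "Community": 9}
MAX_PER_CATEGORY = 25

# Assumption: the overall score is the unweighted sum of the categories.
total = sum(subscores.values())
maximum = MAX_PER_CATEGORY * len(subscores)

print(f"{total} / {maximum}")  # prints "33 / 100"
```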


Stars: 45
Forks: 4
Language: Python
License: MIT
Last pushed: Apr 03, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/poloclub/tsr-convstem"

Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
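
The same request can be made from Python using only the standard library. This is a minimal sketch: the helper names `quality_url` and `fetch_quality` are hypothetical, the reading of the path as category, owner, and repository is inferred from the curl URL above, and the response is assumed to be JSON, which this page does not confirm.

```python
import json
import urllib.request

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper; path layout inferred from the curl URL)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record without an API key.

    Assumption: the endpoint returns a JSON body; its fields are not documented here.
    """
    with urllib.request.urlopen(quality_url(category, owner, repo), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("transformers", "poloclub", "tsr-convstem")` issues the same GET request as the curl command. At 100 keyless requests per day, it is worth caching the result rather than refetching on every run.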