abdoelsayed2016/TNCR_Dataset

Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classification. https://www.sciencedirect.com/science/article/pii/S0925231221018142

32
/ 100
Emerging

This dataset helps organize and extract information from scanned documents by identifying and categorizing tables. It takes images of documents as input and outputs detected tables, classified into different types like 'full lined' or 'no lines'. This is valuable for data entry specialists, archivists, or researchers who need to process large volumes of document scans to extract tabular data.

No commits in the last 6 months.

Use this if you need to automatically locate tables within scanned document images and categorize them by their visual structure for further processing.

Not ideal if you are working with digitally native documents (e.g., PDFs with selectable text) where table data can be extracted directly without image processing.

document-processing data-extraction information-management scanned-documents table-recognition
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 8 / 25

How are scores calculated?

Stars

69

Forks

4

Language

Python

License

MIT

Last pushed

Feb 24, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/abdoelsayed2016/TNCR_Dataset"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.