andreagemelli/doc2graph

Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.

51 / 100 Established

This project helps people who need to extract specific information from documents such as forms, invoices, or scanned papers. It takes a document image as input, identifies key information (for example, key-value relationships in forms, or tables), and outputs the extracted data in a structured format such as a JSON file. This is useful for data entry specialists, administrative staff, or anyone processing large volumes of documents.

137 stars.

Use this if you need to automatically extract structured data from scanned documents or images, such as identifying key fields on a form or detecting tables in an invoice.

Not ideal if you need to train the system on your own specific, private datasets without a developer's help, as this feature is still under development.

document-processing data-extraction invoice-automation form-understanding layout-analysis
No Package No Dependents
Maintenance 6 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25

Stars: 137
Forks: 25
Language: Jupyter Notebook
License: MIT
Last pushed: Oct 18, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/andreagemelli/doc2graph"

Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
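
The same request can be made from a script. The following is a minimal Python sketch, not an official client. It assumes the URL path follows a category/owner/repository pattern (inferred only from the single curl example above) and that the endpoint returns JSON; the response fields are not documented here, so the code returns the decoded body without interpreting it. The function names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository.

    Assumption: the path layout is <category>/<owner>/<repo>,
    inferred from the single example URL on this page.
    """
    parts = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(p, safe="") for p in parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Request the quality data and decode it.

    Assumption: the response body is JSON. Its schema is not
    described on this page, so it is returned as-is.
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("ml-frameworks", "andreagemelli", "doc2graph")` would issue the same request as the curl command. With the keyless limit of 100 requests/day, it is worth caching the result rather than fetching it on every run.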