AdemBoukhris457/Documents-Parsing-Lab

Jupyter notebooks testing different OCR models for document parsing (Dolphin, MonkeyOCR, Marker, Nanonets, ...)

/ 100

Emerging

This project helps you automate the extraction of key information from various documents like PDFs, scanned images, and more. You provide the documents, and it gives you structured text, tables, and even data from charts, ready for analysis or database entry. This is ideal for data entry clerks, compliance officers, researchers, or anyone dealing with large volumes of documents that need to be digitized and analyzed efficiently.

Use this if you need to automatically pull specific data, tables, or charts from digital or scanned documents and process them systematically.

Not ideal if you only need simple text conversion without needing to understand document structure or extract specific data fields.

document-processing data-extraction compliance research-data-management digitization

No License No Package No Dependents

Maintenance 6 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

jupyterlab/jupyter-ai

A generative AI extension for JupyterLab

aws-samples/generative-ai-ml-latam-samples

This repo provides Generative AI and AI/ML code samples, blueprints (end-to-end solutions) and...

dkanungo/Probabilistic-ML-for-finance-and-investing

Probabilistic Machine Learning for Finance and Investing: A Primer to Generative AI with Python

morganstanley/MSML

Repo for Morgan Stanley Machine Learning Research group's publications

Yash-Kavaiya/GenAI-Learning

Up-to-Date Content: We regularly update our repository with new courses, articles, and tutorials...

Explore Generative AI Tools

All categories Trending Generative AI directory Insights