ses4255/Versatile-OCR-Program

Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)

/ 100

Emerging

This system helps students and researchers convert complex educational documents like exam papers into structured, AI-ready data. It takes PDFs containing multilingual text, math equations, tables, and diagrams, and outputs semantically enriched JSON or Markdown. The output includes natural language descriptions of images and tables, making it easier to create high-quality datasets for training machine learning models.

682 stars. No commits in the last 6 months.

Use this if you need to extract and semantically annotate content from scientific or academic PDFs, especially those with dense layouts, for machine learning training or advanced study.

Not ideal if you're looking for a simple OCR to digitize basic text documents or an out-of-the-box solution that doesn't require further processing or integration into an ML workflow.

academic-research education-technology exam-preparation data-extraction scientific-documentation

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 16 / 25

How are scores calculated?

Stars

682

Forks

Language

Python

License

—

Higher-rated alternatives

Psarpei/Multi-Type-TD-TSR

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and...

Layout-Parser/layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis

Sudhanshu1304/table-transformer

🔍 Table Extraction Tool: A powerful open-source solution combining OCR and computer vision for...

asagar60/TableNet-pytorch

Pytorch Implementation of TableNet

JG1VPP/MuTabNet

ICDAR 2024 Table OCR Model

Explore ML Frameworks

All categories Trending ML Framework directory Insights