liunian-Jay/MU-GOT

PDF Parsing Tool: GOT's vLLM acceleration implementation, MinerU for layout recognition, and GOT for table formula parsing.

25
/ 100
Experimental

This tool helps professionals convert PDF documents into a more usable text format. It takes a PDF document as input and produces a text file that combines Markdown formatting for general content with LaTeX formatting for tables. This is ideal for anyone who needs to extract and work with both regular text and structured table data from PDFs.

No commits in the last 6 months.

Use this if you need to quickly extract content, including complex tables, from PDFs into a text-based format for further analysis or editing.

Not ideal if you require a pure Markdown output without any LaTeX formatting for tables, or if you need to retain the original visual layout of the PDF.

document-processing data-extraction pdf-conversion research-analysis information-management
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 8 / 25
Community 9 / 25

How are scores calculated?

Stars

65

Forks

5

Language

Python

License

Last pushed

Nov 07, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/liunian-Jay/MU-GOT"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.