jiangnanboy/pdf_multimodal_rag

pdf multimodal rag 【pdf多模态rag问答】

22
/ 100
Experimental

This tool helps you quickly get answers from complex PDF documents that contain both text and visual information like charts and tables. You input a PDF file and a question, and it processes everything within the document to give you a comprehensive answer, including relevant images and text snippets from the original PDF. It's designed for analysts, researchers, or anyone who needs to extract detailed insights from lengthy, graphically rich reports.

No commits in the last 6 months.

Use this if you need to ask specific questions and get accurate, context-rich answers from large PDF reports that include a mix of text, images, and tables.

Not ideal if you only need to search plain text documents or prefer to manually scan PDFs for information.

document analysis research information extraction report comprehension data retrieval
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 8 / 25
Community 7 / 25

How are scores calculated?

Stars

27

Forks

2

Language

Python

License

Last pushed

Feb 26, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/jiangnanboy/pdf_multimodal_rag"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.