David-Lolly/ViewRAG

图文并茂的 PDF RAG 系统：支持版式感知分块、图表深度理解与精准视觉溯源。 Multimodal PDF RAG: Features layout-aware chunking, visual chart understanding, and precise inline image citations.

/ 100

Emerging

This tool helps professionals working with PDFs to quickly get answers to their questions, even when the information is in images or tables. You input PDF documents and ask questions in natural language, and it provides accurate answers with inline images and precise citations to the original document pages. Anyone who regularly needs to extract information from complex PDFs, like researchers, analysts, or legal professionals, would find this valuable.

Use this if you need to understand and extract detailed information from PDFs, including content locked in images and tables, and require traceable, trustworthy answers.

Not ideal if your workflow involves only plain text documents or if you primarily need to summarize very short, simple texts without complex layouts.

document-analysis research-assist knowledge-retrieval pdf-interrogation information-extraction

No Package No Dependents

Maintenance 10 / 25

Adoption 6 / 25

Maturity 11 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

thiswillbeyourgithub/wdoc

Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype,...

Arterning/DeepParseX

DeepParseX 是一个强大的多模态文档解析与知识管理平台，支持 PDF、Word、Excel、PPT、图片、视频、音频等多种文件格式的智能解析，自动提取关键信息，并构建...

NoEdgeAI/pdfdeal

A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall...

laxmimerit/RAGWire

Production-grade RAG toolkit — ingest PDFs, DOCX, XLSX into Qdrant with LLM metadata extraction,...

atpuxiner/docsloader

This is a documents loader. (文档解析加载器，rag文档解析，rag知识库构建)

Explore RAG Tools

All categories Trending RAG directory Insights