preprocess-co/rag-document-viewer
RAG Document Viewer is an open-source library that generates high-fidelity file previews for seamless integration into your applications. It provides desktop-level file viewing capabilities for a wide range of document formats.
This project helps application developers display various document types—like PDFs, Word, PowerPoint, and Excel files—directly within their web or desktop applications. It takes a document file and converts it into a self-contained HTML preview, which can then be embedded to provide users with a desktop-level viewing experience. Developers would use this to integrate rich document previews, potentially with highlighted sections, into their software.
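As a rough illustration of the embedding step described above, the sketch below builds an `<iframe>` element that points at a preview which has already been generated and is served by your application. It does not call the library itself: the function name `iframe_snippet`, the preview URL, and the default height are all hypothetical and chosen only for this example.

```python
from html import escape


def iframe_snippet(preview_url: str, title: str, height_px: int = 800) -> str:
    """Return an <iframe> element pointing at an already generated HTML preview.

    preview_url is assumed to be wherever your application serves the
    preview from; it is not a path defined by the library.
    """
    return (
        f'<iframe src="{escape(preview_url, quote=True)}" '
        f'title="{escape(title, quote=True)}" '
        f'style="width: 100%; height: {int(height_px)}px; border: 0;" '
        f'loading="lazy"></iframe>'
    )


# Hypothetical usage: the URL below is a placeholder, not a real route.
snippet = iframe_snippet("/previews/report/index.html", "Quarterly report")
```

Escaping both attribute values keeps a file name or title that contains quotes from breaking out of the attribute.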
No commits in the last 6 months. Available on PyPI.
Use this if you need to embed high-fidelity, interactive previews of documents directly into your application, allowing users to view and navigate files without leaving your platform.
Not ideal if your primary need is simply to store or manage documents without requiring an integrated, interactive previewing experience within an application.
Stars: 13
Forks: 1
Language: Python
License: MIT
Category:
Last pushed: Sep 23, 2025
Commits (30d): 0
Dependencies: 1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/preprocess-co/rag-document-viewer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
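The same request can be made from Python with only the standard library. This is a minimal sketch: the endpoint URL is taken from the curl command above, but the helper names `quality_url` and `fetch_quality` are made up for this example, and the assumption that the response body is JSON is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/rag"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository, percent-encoding each path segment."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the endpoint and decode the body as JSON (assumed response format)."""
    request = urllib.request.Request(
        quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("preprocess-co", "rag-document-viewer")
# print(json.dumps(data, indent=2))
```

With the keyless limit of 100 requests per day, it is worth caching the result rather than calling `fetch_quality` on every page load.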
Higher-rated alternatives
thiswillbeyourgithub/wdoc
Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype,...
Arterning/DeepParseX
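DeepParseX is a powerful multimodal document parsing and knowledge management platform that supports intelligent parsing of many file formats, including PDF, Word, Excel, PPT, images, video, and audio, automatically extracts key information, and builds...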
NoEdgeAI/pdfdeal
A Python wrapper for the Doc2X API that comes with native text processing (to improve PDF recall...
laxmimerit/RAGWire
Production-grade RAG toolkit — ingest PDFs, DOCX, XLSX into Qdrant with LLM metadata extraction,...
David-Lolly/ViewRAG
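A richly illustrated PDF RAG system with layout-aware chunking, deep understanding of charts and figures, and precise visual source tracing. Multimodal PDF RAG: Features layout-aware chunking,...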