laxmimerit/RAGWire

Production-grade RAG toolkit — ingest PDFs, DOCX, XLSX into Qdrant with LLM metadata extraction, hybrid search, and SHA256 deduplication.


Emerging

This tool makes large collections of internal documents, such as PDFs or spreadsheets, searchable using AI. You point it at entire folders of files, and it indexes them, using an LLM to extract key details such as company names or fiscal periods. The result is a search capability that lets you find specific information across all your documents. It is aimed at knowledge managers, researchers, and anyone who needs to make extensive document archives queryable.
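The project description mentions SHA256 deduplication during ingestion. The general idea can be sketched as below, using only the Python standard library. This is a minimal illustration and not RAGWire's actual API: the function names `sha256_of_file` and `select_new_files` and the in-memory `seen_hashes` set are hypothetical, and a real pipeline would more likely persist the hashes alongside the indexed documents.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large files are never loaded whole.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def select_new_files(paths, seen_hashes):
    """Return the files whose content hash is not yet in seen_hashes.

    seen_hashes is a hypothetical in-memory set; it is updated in place
    so that duplicates within the same batch are skipped as well.
    """
    new_files = []
    for path in paths:
        path = Path(path)
        file_hash = sha256_of_file(path)
        if file_hash in seen_hashes:
            continue  # identical content was already ingested
        seen_hashes.add(file_hash)
        new_files.append(path)
    return new_files
```

Because the hash is computed over file content rather than file name, a renamed copy of a document that was already ingested would be skipped, while an edited version would be treated as new.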

Used by 1 other package. Available on PyPI.

Use this if you need to build a robust, AI-powered search system over your company's large collection of documents like reports, manuals, or financial statements.

Not ideal if you only have a few documents to search or if you don't require advanced metadata extraction or hybrid search capabilities.

knowledge-management document-intelligence enterprise-search information-retrieval data-organization

Maintenance 13 / 25

Adoption 5 / 25

Maturity 18 / 25

Community 8 / 25



Language: Python

License: MIT

Higher-rated alternatives

thiswillbeyourgithub/wdoc

Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype,...

Arterning/DeepParseX

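DeepParseX is a powerful multimodal document parsing and knowledge management platform that supports intelligent parsing of PDF, Word, Excel, PPT, image, video, audio, and other file formats, automatically extracts key information, and builds...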

NoEdgeAI/pdfdeal

A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall...

David-Lolly/ViewRAG

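A PDF RAG system that combines text and images: supports layout-aware chunking, in-depth chart understanding, and precise visual source tracing. Multimodal PDF RAG: Features layout-aware chunking,...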

atpuxiner/docsloader

This is a document loader. (Document parsing loader, RAG document parsing, RAG knowledge base construction)
