Document Processing Platforms Vector Databases

Tools for converting, parsing, and indexing unstructured documents (PDFs, Word, PowerPoint, images, audio) into searchable, queryable formats with vector embeddings and semantic search. Does NOT include general chatbots, code documentation generators, or vector database infrastructure itself.

There are 34 document processing platforms tools tracked. The highest-rated is AmadeusITGroup/docs2vecs at 47/100 with 6 stars.

Get all 34 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=vector-db&subcategory=document-processing-platforms&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 AmadeusITGroup/docs2vecs

CLI that helps with docs splitting, embedding and exposing them in a seamless manner

47
Emerging
2 in-c0/updAPI

Free, open-source collection of latest public API documentations - Update...

42
Emerging
3 AlexisBalayre/RagDocs

An AI-powered search engine to interact with documentation using RAG and...

34
Emerging
4 LikithMeruvu/Framework-Docs-AI

Framework Docs AI is a powerful SaaS solution for managing framework...

30
Emerging
5 lh0x00/docsifer

Docsifer is a powerful tool for converting various data formats into...

29
Experimental
6 dhruvkshah75/docstream

Turn static PDF archives into an interactive, searchable AI knowledge base

29
Experimental
7 Surya-Hariharan/DocuQueryAI

Built for HackRx 6.0 โ€“ Bajaj Finservโ€™s Annual Hackathon, this backend system...

28
Experimental
8 CodebyKumar/QueryWise

AI Document assistant

27
Experimental
9 existential-birds/pearl

Open-source DeepWiki alternative: AI-generated documentation and natural...

22
Experimental
10 AvishkaGihan/documind-ai

๐Ÿง  Secure AI Assistant to chat with your documents. Isolated vector data...

22
Experimental
11 Milesdexter/docschat-rag

๐Ÿ“š Enhance technical documentation queries with DocsChat RAG, a robust...

21
Experimental
12 bcfeen/DocMine

Knowledge-centric document ingestion with stable IDs, provenance, entities,...

21
Experimental
13 SOHAIL-IQB/DocQuerry

AI-powered document Q&A platform built with a Retrieval-Augmented Generation...

21
Experimental
14 humanhady/DocMine

๐Ÿ“„ Transform documents into queryable knowledge with exact recall and entity...

21
Experimental
15 AIAfterDark/AI-URL-Read

A Python-based documentation assistant that uses local LLMs to crawl...

19
Experimental
16 heyshivamjaiswal/Folio

RAG-powered knowledge library โ€” save articles, YouTube videos, PDFs & text,...

15
Experimental
17 kanugurajesh/SmartDoc

A rag application to chat with your documents with the help of gemini,...

14
Experimental
18 AbdulRehman393/DocuMind-Nexus

๐Ÿง  DocuMind Nexus โ€” A docs-first RAG assistant (FastAPI + Streamlit +...

14
Experimental
19 oaht9412/DocSage

Process unstructured documents intelligently using DocSage, a serverless...

13
Experimental
20 Thamizh0206/DocuMind-AI

DocuMind AI is a RAG-powered application that lets users chat with multiple...

13
Experimental
21 dineshjsd/smart-doc-ai

A production-ready RAG system built with Next.js and Node.js. Uses MongoDB...

13
Experimental
22 rajj28/DocPilot

๐Ÿค– AI-powered browser extension that summarizes documentation pages and...

13
Experimental
23 franjofranjic27/knomi

knomi is a CLI tool that indexes your documents into a vector database and...

13
Experimental
24 muaaz-ur-habibi/fthedocs

A documentation querying engine, useful for scanning the docs in a...

13
Experimental
25 noelmarior/arivagam-cloud-rag

A Full Stack, RAG application which acts as a workspace for students to...

13
Experimental
26 rishirochan/DocVaultAI

A privacy-centric document intelligence platform designed for secure, local...

13
Experimental
27 pritom169/documind-ai

AI-powered document analysis platform with multi-agent RAG, hybrid vector...

13
Experimental
28 Janmesh23/sidequest

SideQuest is an AI assistant that helps query big documents/pdfs/files with...

13
Experimental
29 Kirill89/source-to-docs

AI-powered code documentation generator with RAG-based semantic search for...

13
Experimental
30 ChanikyaSaiL/AI-Document-Search

This project is an AI-powered Document Intelligence System that enables...

13
Experimental
31 elchibek5/DocuMind

A RAG-based AI Research Assistant that enables natural language querying of...

12
Experimental
32 Mani0606/Document-QA

A Document Question-Answering System built with Spring Boot, designed to...

12
Experimental
33 MITHILESHK11/IntelProject

Intel Nexus โ€“ An enterprise-grade document intelligence platform that...

12
Experimental
34 peterdays/retrieva

Retrieva: Smart Documentation Retrieval based on LLMs๐Ÿค“

11
Experimental

Comparisons in this category