Document Chunking Embedding Pipelines Vector Databases

There are 30 document chunking embedding pipelines tools tracked. The highest-rated is Siddhant-K-code/distill at 46/100 with 136 stars.

Get all 30 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=vector-db&subcategory=document-chunking-embedding-pipelines&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 Siddhant-K-code/distill

Reliable LLM outputs start with clean context. Deterministic deduplication,...

46
Emerging
2 pesu-dev/ask-pesu

A RAG pipeline for question answering about PES University

44
Emerging
3 louisbrulenaudet/ragoon

High level library for batched embeddings generation, blazingly-fast...

42
Emerging
4 B-A-M-N/FlockParser

Distributed document RAG system with intelligent GPU/CPU orchestration....

40
Emerging
5 namtroi/RAGBase

Open Source RAG ETL Platform. Turns PDFs, Docs & Slides into queryable...

37
Emerging
6 aws-samples/rag-with-amazon-postgresql-using-pgvector-and-sagemaker

Question Answering application with Large Language Models (LLMs) and Amazon...

37
Emerging
7 aws-samples/rag-with-amazon-opensearch-and-sagemaker

Question Answering Generative AI application with Large Language Models...

32
Emerging
8 pashpashpash/python-rag-scaffold

A comprehensive RAG FastAPI service that handles document uploads and...

26
Experimental
9 Ashish-Abraham/DocWhisperer-Qdrant

A Retrieval-Augmented Generation (RAG) System for PDF Chat using Qdrant...

25
Experimental
10 gurbaj5124871/rag-app-deepseek

A RAG (Retrieval-Augmented Generation) application which combines...

22
Experimental
11 mpessis/rag-doc-search

Semantic search over technical documentation using natural language. RAG...

22
Experimental
12 B-A-M-N/FlockParser-legacy

Legacy version of FlockParser PDF processing system

22
Experimental
13 Amayes985-stack/Mimir

Privacy-first RAG pipeline application that transforms personal documents...

21
Experimental
14 josephsenior/Microbione

Multimodal RAG system for microbiome data analysis with cross-modal search,...

21
Experimental
15 SrijanShovit/HomeoRAG

A RAG application to search documents for homeopathic remedies based on...

19
Experimental
16 Farhaj499/RAG_with_Weaviate_DB

This project implements a Retrieval Augmented Generation (RAG) system that...

18
Experimental
17 RijuSaha-01/RAG-Document-Assistant-with-Azure-Cosmos-DB

A RAG pipeline implementation using Azure Cosmos DB (MongoDB vCore) and...

17
Experimental
18 bharghavaram/rag-knowledge-assistant

A lightweight Retrieval-Augmented Generation (RAG) system for answering...

17
Experimental
19 RAK0152/doc-watch-rag

Async document watcher that keeps your RAG index hot. Automatically ingests...

17
Experimental
20 LEADisDEAD/Vector-Forge

Production-style Retrieval-Augmented Generation (RAG) system with...

14
Experimental
21 felix-dowl/ResearchPal

Basic RAG pipeline for uploading documents and making natural language queries

14
Experimental
22 Abs01ute000/policymind-rag-showcase

Semantic search and RAG showcase built with FastAPI, ChromaDB,...

14
Experimental
23 ankit123nag/pdf-rag-assistant

Production-grade RAG backend for document ingestion and semantic retrieval...

14
Experimental
24 Vaibhavii3/AI-Knowlendge-Base-RAG

Built a Retrieval-Augmented Generation system that allows users to upload...

13
Experimental
25 razevedo1994/paper-rag-pipeline

A complete RAG ingestion pipeline for scientific papers.

13
Experimental
26 DRJ-14/context-aware-email-assistant-RAG

RAG system to query Gmail Takeout (.mbox) with semantic search + local LLM...

13
Experimental
27 tahamohmadf19-dev/rag-document-search

Document search with retrieval-augmented generation using FastAPI, Qdrant...

13
Experimental
28 bijay-odyssey/Personal-Knowledge-Base-RAG-API

Personal Knowledge Base RAG API – FastAPI-based RAG system for querying...

13
Experimental
29 mxmarchal/archivist

Archivist is a local-first macOS file indexer + RAG search engine. It...

12
Experimental
30 Ashish-Abraham/QueReyDB

RAG (Retrieval-Augmented Generation) and vector search to transform plain...

11
Experimental