Document Chunking Embedding Pipelines Vector Databases
There are 30 document chunking embedding pipelines tools tracked. The highest-rated is Siddhant-K-code/distill at 46/100 with 136 stars.
Get all 30 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=vector-db&subcategory=document-chunking-embedding-pipelines&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
Siddhant-K-code/distill
Reliable LLM outputs start with clean context. Deterministic deduplication,... |
|
Emerging |
| 2 |
pesu-dev/ask-pesu
A RAG pipeline for question answering about PES University |
|
Emerging |
| 3 |
louisbrulenaudet/ragoon
High level library for batched embeddings generation, blazingly-fast... |
|
Emerging |
| 4 |
B-A-M-N/FlockParser
Distributed document RAG system with intelligent GPU/CPU orchestration.... |
|
Emerging |
| 5 |
namtroi/RAGBase
Open Source RAG ETL Platform. Turns PDFs, Docs & Slides into queryable... |
|
Emerging |
| 6 |
aws-samples/rag-with-amazon-postgresql-using-pgvector-and-sagemaker
Question Answering application with Large Language Models (LLMs) and Amazon... |
|
Emerging |
| 7 |
aws-samples/rag-with-amazon-opensearch-and-sagemaker
Question Answering Generative AI application with Large Language Models... |
|
Emerging |
| 8 |
pashpashpash/python-rag-scaffold
A comprehensive RAG FastAPI service that handles document uploads and... |
|
Experimental |
| 9 |
Ashish-Abraham/DocWhisperer-Qdrant
A Retrieval-Augmented Generation (RAG) System for PDF Chat using Qdrant... |
|
Experimental |
| 10 |
gurbaj5124871/rag-app-deepseek
A RAG (Retrieval-Augmented Generation) application which combines... |
|
Experimental |
| 11 |
mpessis/rag-doc-search
Semantic search over technical documentation using natural language. RAG... |
|
Experimental |
| 12 |
B-A-M-N/FlockParser-legacy
Legacy version of FlockParser PDF processing system |
|
Experimental |
| 13 |
Amayes985-stack/Mimir
Privacy-first RAG pipeline application that transforms personal documents... |
|
Experimental |
| 14 |
josephsenior/Microbione
Multimodal RAG system for microbiome data analysis with cross-modal search,... |
|
Experimental |
| 15 |
SrijanShovit/HomeoRAG
A RAG application to search documents for homeopathic remedies based on... |
|
Experimental |
| 16 |
Farhaj499/RAG_with_Weaviate_DB
This project implements a Retrieval Augmented Generation (RAG) system that... |
|
Experimental |
| 17 |
RijuSaha-01/RAG-Document-Assistant-with-Azure-Cosmos-DB
A RAG pipeline implementation using Azure Cosmos DB (MongoDB vCore) and... |
|
Experimental |
| 18 |
bharghavaram/rag-knowledge-assistant
A lightweight Retrieval-Augmented Generation (RAG) system for answering... |
|
Experimental |
| 19 |
RAK0152/doc-watch-rag
Async document watcher that keeps your RAG index hot. Automatically ingests... |
|
Experimental |
| 20 |
LEADisDEAD/Vector-Forge
Production-style Retrieval-Augmented Generation (RAG) system with... |
|
Experimental |
| 21 |
felix-dowl/ResearchPal
Basic RAG pipeline for uploading documents and making natural language queries |
|
Experimental |
| 22 |
Abs01ute000/policymind-rag-showcase
Semantic search and RAG showcase built with FastAPI, ChromaDB,... |
|
Experimental |
| 23 |
ankit123nag/pdf-rag-assistant
Production-grade RAG backend for document ingestion and semantic retrieval... |
|
Experimental |
| 24 |
Vaibhavii3/AI-Knowlendge-Base-RAG
Built a Retrieval-Augmented Generation system that allows users to upload... |
|
Experimental |
| 25 |
razevedo1994/paper-rag-pipeline
A complete RAG ingestion pipeline for scientific papers. |
|
Experimental |
| 26 |
DRJ-14/context-aware-email-assistant-RAG
RAG system to query Gmail Takeout (.mbox) with semantic search + local LLM... |
|
Experimental |
| 27 |
tahamohmadf19-dev/rag-document-search
Document search with retrieval-augmented generation using FastAPI, Qdrant... |
|
Experimental |
| 28 |
bijay-odyssey/Personal-Knowledge-Base-RAG-API
Personal Knowledge Base RAG API – FastAPI-based RAG system for querying... |
|
Experimental |
| 29 |
mxmarchal/archivist
Archivist is a local-first macOS file indexer + RAG search engine. It... |
|
Experimental |
| 30 |
Ashish-Abraham/QueReyDB
RAG (Retrieval-Augmented Generation) and vector search to transform plain... |
|
Experimental |