Document Chunking Embedding Pipelines RAG Tools

43 document chunking and embedding pipeline tools are tracked. The highest-rated is wangxb96/RAG-QA-Generator, at 42/100 with 263 stars.

Get all 43 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=rag&subcategory=document-chunking-embedding-pipelines&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
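The same request can be made from Python. The following is a minimal sketch using only the standard library: the endpoint and query parameters are copied from the curl command above, while the function names `build_url` and `fetch_projects` are hypothetical, and the structure of the returned JSON is not documented here, so the code only parses it and makes no assumptions about its fields.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example; everything else is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Build the query URL with the same parameters as the curl example."""
    params = {
        "domain": "rag",
        "subcategory": "document-chunking-embedding-pipelines",
        "limit": limit,
    }
    return f"{BASE_URL}?{urlencode(params)}"


def fetch_projects(limit=20, timeout=10):
    """Fetch and parse the JSON response (no API key, so the 100/day limit applies)."""
    with urlopen(build_url(limit), timeout=timeout) as resp:
        return json.load(resp)


# Example usage (performs a network request):
# data = fetch_projects()
# print(json.dumps(data, indent=2)[:500])
```

Whether a larger `limit` value returns all 43 projects in one response is an assumption worth checking against the API itself.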

| # | Tool | Description | Score | Tier |
|---|------|-------------|-------|------|
| 1 | wangxb96/RAG-QA-Generator | RAG-QA-Generator... | 42 | Emerging |
| 2 | aws-samples/rag-with-amazon-opensearch-serverless-and-sagemaker | Question Answering Generative AI application with Large Language Models... | 36 | Emerging |
| 3 | PerciValXIII/CAFB-food-wise-ai | AI-powered content automation tool for the Capital Area Food Bank (CAFB),... | 35 | Emerging |
| 4 | libraryofcelsus/LLM_File_Parser | AutoML/Unstructured Data Processing for RAG and LLM Dataset Creation. ... | 30 | Emerging |
| 5 | manthan410/multimodal-RAG-ResearchQA-bot | Using multimodal RAG to query texts, images and tables from PDF for QA | 30 | Emerging |
| 6 | tuitige/fijian-rag-app | Public-benefit GenAI platform for the Fijian language, combining Claude +... | 29 | Experimental |
| 7 | MohammedAly22/GenQuest-RAG | A Question Generation Application leveraging RAG and Weaviate vector store... | 27 | Experimental |
| 8 | himaenshuu/Multi_modal_rag-application | A powerful, easy-to-use platform for question answering over documents, web... | 27 | Experimental |
| 9 | CarlosManuelDiaz/rag-ready-extractor | Stop indexing noise. Turn messy websites and PDFs into clean, structured... | 24 | Experimental |
| 10 | ajitsingh98/Building-RAG-System-with-Deepseek-R1-Locally | This repository contains an end-to-end Retrieval-Augmented Generation (RAG)... | 23 | Experimental |
| 11 | tanmay271/RAG-Qdrant-AI | High-performance RAG pipeline engineered to eliminate LLM hallucinations... | 21 | Experimental |
| 12 | Daddy-Myth/D-RAGon_System | Local Retrieval-Augmented Generation (RAG) system for PDF question answering... | 21 | Experimental |
| 13 | neehanthreddym/doc_query_rag | A basic RAG pipeline which uses gpt-oss-20b model to answer the user query... | 21 | Experimental |
| 14 | noaman680/rag-from-scratch | Production-ready RAG (Retrieval Augmented Generation) system built from... | 21 | Experimental |
| 15 | Abdellatif404/Eigen-Field | A local Retrieval-Augmented Generation (RAG) system for agricultural... | 21 | Experimental |
| 16 | johnIT56/STAR-RAG | STAR-RAG is a self-reflective, retrieval-augmented question answering system... | 20 | Experimental |
| 17 | srinivas-sateesh/RAG-query-classifier | Smart Query Classifier to earn user trust and save $$$ | 19 | Experimental |
| 18 | devangvyas-it/fastapi-rag-starter | Lightweight, self-contained RAG application built with FastAPI. It enables... | 19 | Experimental |
| 19 | Debasish-87/rag-based-document-qa | rag-based-document-qa is a Retrieval-Augmented Generation (RAG) based... | 19 | Experimental |
| 20 | Powerostad/talk_to_github | A Retrieval-Augmented Generation (RAG) system enabling natural language... | 18 | Experimental |
| 21 | tolios/XPL | A simple CLI tool for RAG on documents | 17 | Experimental |
| 22 | daviaraujocc/rag-docs | A simple project about implementing RAG (Retrieval-Augmented Generation) for... | 17 | Experimental |
| 23 | olexmal/ragu | RAGU - Retrieval-Augmented Generation Universal. A privacy-focused RAG... | 17 | Experimental |
| 24 | Mohamed-samy2/Arabic-Islamic-Assessment | This repository implements a compact, efficient Retrieval-Augmented... | 17 | Experimental |
| 25 | jy02140251/rag-document-loader | Load documents for RAG pipelines: PDF, DOCX, HTML, Markdown. Smart chunking,... | 15 | Experimental |
| 26 | RoodyCode/rag | A modular, self-hosted RAG pipeline for building a private, searchable... | 15 | Experimental |
| 27 | QuantumDrizzy/rag-scientific-papers | Full RAG pipeline over 30 seminal AI/ML papers · FAISS vector store · ReAct... | 14 | Experimental |
| 28 | ramyasri-m/RAG_Property_Document_Pipeline | A RAG pipeline for property documents using Weaviate, sentence-transformers,... | 14 | Experimental |
| 29 | smoothemerson/ragscope | Q&A over documents using RAG (FastAPI + ChromaDB + Ollama + MLflow) | 14 | Experimental |
| 30 | sjlewis25/rag-pipeline | Hybrid RAG pipeline with local/cloud LLM support for semantic document... | 14 | Experimental |
| 31 | rithunkp/RAG-Codebase | Retrieval-Augmented Generation (RAG) assistant that lets users ask natural... | 14 | Experimental |
| 32 | PrinceKay145/multiDocRAG | Multi-Document RAG System with source attribution and query logging | 14 | Experimental |
| 33 | Boney-massiveness357/ragscope | Build a Q&A API that indexes PDFs and text using RAG, logging queries with... | 14 | Experimental |
| 34 | raza242k5-sys/rag-ai-system | Retrieval-Augmented Generation (RAG) based Intelligent QA System using... | 14 | Experimental |
| 35 | shubham5027/RAG-Qwen-2.5-72b-instruct | I built a production-style RAG system focused on grounded generation, not... | 14 | Experimental |
| 36 | alunoshacker-beep/ragscope | Build an offline Q&A API using RAG to query PDFs and texts, with automated... | 13 | Experimental |
| 37 | GowriPriyanka27/adaptive-rag-auto-optimizer | Adaptive Retrieval-Augmented Generation (RAG) system with dynamic... | 13 | Experimental |
| 38 | thendralmagudapathi/RAG-for-NCERT | A professional-grade Retrieval-Augmented Generation (RAG) system designed... | 13 | Experimental |
| 39 | Selam1431/Rag-Document-Search | AI-powered document search system using Retrieval-Augmented Generation (RAG)... | 13 | Experimental |
| 40 | AbhashK1/Verbo | RAG based document query system that performs OCR (Tesseract) for text... | 13 | Experimental |
| 41 | ashankgupta/rag-flow | A visual, node-based RAG (Retrieval-Augmented Generation) pipeline builder... | 13 | Experimental |
| 42 | abd-km/StreamRAG | StreamRAG is a real-time news processing system that streams, embeds, and... | 12 | Experimental |
| 43 | HemaKumar0077/TableRAG | TableRAG is an advanced question-answering framework that combines... | 11 | Experimental |