Perpetue237/rag-api-template
A template to set up, build, and deploy a RAG API, including frontend and backend, with Docker Compose.
This is a template for developers who want to set up a Retrieval-Augmented Generation (RAG) system quickly. It ingests unstructured data such as PDF files, processes it, and lets users ask questions that a pre-trained language model answers using the uploaded documents as context. The primary users are developers building and deploying their own RAG applications with a user interface.
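The retrieve-then-answer flow described above can be illustrated with a minimal sketch. This is not the template's code: the function names (`tokenize`, `cosine`, `retrieve`, `build_prompt`), the sample chunks, and the word-count similarity are all assumptions chosen to keep the example dependency-free. A real deployment would more likely use learned embeddings and a vector store, and would send the final prompt to a language model.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a string and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, chunks, k=2):
    """Return the k chunks most similar to the question (toy retriever)."""
    q = Counter(tokenize(question))
    ranked = sorted(
        chunks,
        key=lambda c: cosine(q, Counter(tokenize(c))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question, context_chunks):
    """Assemble the prompt that would be passed to a language model."""
    context = "\n".join(f"- {c}" for c in context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


# Hypothetical document chunks, standing in for text extracted from PDFs.
chunks = [
    "The warranty covers parts and labour for two years.",
    "Shipping takes three to five business days.",
    "Returns are accepted within thirty days of purchase.",
]
question = "How long is the warranty?"
top = retrieve(question, chunks, k=1)
prompt = build_prompt(question, top)
```

Here `prompt` contains only the warranty chunk as context, which is the core idea of RAG: the model sees the most relevant passages rather than the whole document set.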
No commits in the last 6 months.
Use this if you are a developer looking for a comprehensive, ready-to-use boilerplate to build and deploy a GPU-accelerated RAG API with both a frontend and backend.
Not ideal if you are an end-user seeking a ready-made application to answer questions from your documents without any development work, or if you don't have access to an NVIDIA GPU.
Stars: 8
Forks: 1
Language: Jupyter Notebook
License: Apache-2.0
Category:
Last pushed: Jul 20, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/Perpetue237/rag-api-template"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
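The same request can be made from Python with only the standard library. This is a sketch under stated assumptions: the path layout (category, then owner, then repository) is inferred from the single URL shown above, the response is assumed to be JSON, and the helper names `quality_url` and `fetch_quality` are hypothetical. The response schema is not documented here, so the code returns the parsed body without interpreting it.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example; everything after it is assumed
# to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """GET the endpoint and parse the body as JSON (assumed format)."""
    req = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return json.load(resp)


# Usage (performs a network request, so it is left commented out):
# data = fetch_quality("rag", "Perpetue237", "rag-api-template")
# print(json.dumps(data, indent=2))
```

With the keyless limit of 100 requests/day, it is worth caching results locally rather than calling `fetch_quality` in a loop.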
Higher-rated alternatives
datawhalechina/all-in-rag
🔍 LLM Application Development in Practice, Part 1: A Full-Stack Guide to RAG Technology. Read online: https://datawhalechina.github.io/all-in-rag/
bakrianoo/mini-rag
A step-by-step educational project that teaches how to build a production-ready RAG application.
Sstobo/Claude-Code-Game-Master
Total conversion for Claude Code. Use RAG and the RPG ruleset apis to play a persistent...
BastinFlorian/RAG-on-GCP-with-VertexAI
Create a Chatbot app on your own data with GCP tools
oracle-devrel/oci-rag-vectordb
Improve insights to make smarter decisions by tapping into real-time data with...