LlamaEdge/rag-api-server
A RAG API server written in Rust following OpenAI specs
This project provides an API server that helps developers build applications that can chat with their own documents using Retrieval-Augmented Generation (RAG). You can upload text files, have them processed into searchable segments, and then use those segments to enhance a language model's ability to answer questions based on your specific content. This is useful for developers who need to integrate custom knowledge bases into AI chatbots or question-answering systems.
No commits in the last 6 months.
Use this if you are a developer looking for an OpenAI-compatible API to manage files, chunk documents, and power RAG-based AI applications with your proprietary data.
Not ideal if you are an end-user without programming knowledge looking for a ready-to-use application to chat with your documents.
Stars
61
Forks
13
Language
Rust
License
Apache-2.0
Category
Last pushed
Apr 14, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/LlamaEdge/rag-api-server"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
copilot-extensions/rag-extension
An example extension in go using retrevial-augmented generation
wangle201210/go-rag
基于eino+gf+vue实现知识库的rag
timescale/pgai
A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL
ca-srg/ragent
RAGent - A CLI tool for building RAG systems with hybrid search (BM25 + vector) using Amazon S3...
eryajf/langchaingo-ollama-rag
学习基于langchaingo结合ollama实现的rag应用流程