2dogsandanerd/Knowledge-Base-Self-Hosting-Kit
A Docker-powered RAG system that understands the difference between code and prose. Ingest your codebase and documentation, then query them with full privacy and zero configuration.
This kit helps teams and individuals create a private, self-hosted knowledge base from internal documents and code. You put in various file types like PDFs, markdown, or entire code repositories, and it allows you to query them with advanced search capabilities. The output is accurate answers with source citations, making it ideal for developers, researchers, or anyone needing to quickly find specific information within their own extensive datasets.
220 stars.
Use this if you need to build a secure, internal question-answering system over your company's documentation and codebase, ensuring data privacy and quick retrieval of information.
Not ideal if you're looking for a simple, cloud-based search solution without the need for self-hosting or integrating with advanced AI agents.
Stars
220
Forks
24
Language
Python
License
—
Category
Last pushed
Mar 01, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/2dogsandanerd/Knowledge-Base-Self-Hosting-Kit"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ItzCrazyKns/Vane
Vane is an AI-powered answering engine.
ConardLi/easy-dataset
A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval
xuwei95/ezdata
基于python和llm大模型开发的数据处理和任务调度系统。...
ModelEngine-Group/DataMate
DataMate is an enterprise-level data processing platform designed for model fine-tuning and RAG...
DS4SD/deepsearch-toolkit
Interact with the Deep Search platform for new knowledge explorations and discoveries