MultiX0/last-archive
A local-first RAG engine for web archival and semantic search. Crawl, embed, and query your own knowledge base entirely offline.
This project helps you create a private, searchable collection of web content entirely on your own computer. You feed it websites, and it processes them so you can ask natural language questions and get answers based only on your archived data, without using external AI services. It's designed for anyone who needs to build a personal, offline knowledge base from web sources, like researchers, journalists, or individual knowledge managers.
Use this if you need to build a secure, private, and semantic search engine over web content you've personally archived, without relying on cloud services or external APIs.
Not ideal if you're looking for an instant, 'magic' solution without investing time in crawling a substantial amount of data, or if you don't have basic familiarity with Docker.
Stars
22
Forks
8
Language
Go
License
MIT
Category
Last pushed
Jan 31, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/MultiX0/last-archive"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ItzCrazyKns/Vane
Vane is an AI-powered answering engine.
ConardLi/easy-dataset
A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval
xuwei95/ezdata
基于python和llm大模型开发的数据处理和任务调度系统。...
ModelEngine-Group/DataMate
DataMate is an enterprise-level data processing platform designed for model fine-tuning and RAG...
DS4SD/deepsearch-toolkit
Interact with the Deep Search platform for new knowledge explorations and discoveries