MultiX0/last-archive

A local-first RAG engine for web archival and semantic search. Crawl, embed, and query your own knowledge base entirely offline.

/ 100

Emerging

This project helps you create a private, searchable collection of web content entirely on your own computer. You feed it websites, and it processes them so you can ask natural language questions and get answers based only on your archived data, without using external AI services. It's designed for anyone who needs to build a personal, offline knowledge base from web sources, like researchers, journalists, or individual knowledge managers.

Use this if you need to build a secure, private, and semantic search engine over web content you've personally archived, without relying on cloud services or external APIs.

Not ideal if you're looking for an instant, 'magic' solution without investing time in crawling a substantial amount of data, or if you don't have basic familiarity with Docker.

knowledge-management personal-archiving private-research information-retrieval semantic-search

No Package No Dependents

Maintenance 10 / 25

Adoption 6 / 25

Maturity 11 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

License

MIT

Higher-rated alternatives

ItzCrazyKns/Vane

Vane is an AI-powered answering engine.

ConardLi/easy-dataset

A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval

xuwei95/ezdata

基于python和llm大模型开发的数据处理和任务调度系统。...

ModelEngine-Group/DataMate

DataMate is an enterprise-level data processing platform designed for model fine-tuning and RAG...

DS4SD/deepsearch-toolkit

Interact with the Deep Search platform for new knowledge explorations and discoveries

Explore RAG Tools

All categories Trending RAG directory Insights