philippe2803/contentmap
Build a RAG dataset for your domain in just a few lines of codes, using your XML sitemap
This helps content strategists and SEO specialists organize all the content from a website into a single, searchable database. By taking your website's XML sitemap, it creates a structured SQLite database that lists every page and can even include powerful search features. This is ideal for anyone needing a comprehensive overview and easy retrieval of their site's information.
No commits in the last 6 months.
Use this if you need to quickly consolidate all your website's content into a structured, searchable database for analysis or internal tools, leveraging your existing XML sitemap.
Not ideal if you're looking for a full-fledged content management system or a real-time content syndication tool like an RSS feed.
Stars
48
Forks
3
Language
Python
License
—
Category
Last pushed
Aug 24, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/philippe2803/contentmap"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
OpenBMB/UltraRAG
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
Quansight/ragna
RAG orchestration framework ⛵️
microsoft/rag-time
RAG Time: A 5-week Learning Journey to Mastering RAG
AnkitNayak-eth/EpsteinFiles-RAG
A RAG pipeline implementation built on the 'Epstein Files 20K' dataset from Hugging Face (Teyler).
apify/apify-haystack
The official integration for Apify and Haystack 2.0