CarlosManuelDiaz/rag-ready-extractor
Stop indexing noise. Turn messy websites and PDFs into clean, structured data for RAG pipelines with semantic importance scoring and token optimization.
Stars: 3
Forks: n/a
Language: n/a
License: MIT
Last pushed: Mar 08, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/CarlosManuelDiaz/rag-ready-extractor"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
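The same request can be made from a script. The sketch below is a minimal Python equivalent of the curl call above, using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, the owner/repo path layout is inferred from the single URL shown, and the assumption that the endpoint returns JSON is not confirmed by this page.

```python
import json
import urllib.parse
import urllib.request

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/rag"


def build_quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository.

    The /<owner>/<repo> layout is inferred from the one URL shown on this page.
    """
    owner_part = urllib.parse.quote(owner, safe="")
    repo_part = urllib.parse.quote(repo, safe="")
    return f"{BASE_URL}/{owner_part}/{repo_part}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. The page does not document the
    response schema, so the parsed value is returned without interpretation.
    """
    url = build_quality_url(owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request and counts against the daily quota):
# data = fetch_quality("CarlosManuelDiaz", "rag-ready-extractor")
```

With the keyless limit of 100 requests/day, it may be worth caching results locally rather than calling `fetch_quality` repeatedly for the same repository.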
Higher-rated alternatives
wangxb96/RAG-QA-Generator
RAG-QA-Generator...
aws-samples/rag-with-amazon-opensearch-serverless-and-sagemaker
Question Answering Generative AI application with Large Language Models (LLMs) and Amazon...
PerciValXIII/CAFB-food-wise-ai
AI-powered content automation tool for the Capital Area Food Bank (CAFB), using RAG and LLMs to...
libraryofcelsus/LLM_File_Parser
AutoML/Unstructured Data Processing for RAG and LLM Dataset Creation. Current Database Options...
manthan410/multimodal-RAG-ResearchQA-bot
Uses multimodal RAG to query text, images, and tables from PDFs for QA