LexiestLeszek/scrapeGPT
ScrapeGPT is a RAG-based Telegram bot that scrapes and analyzes websites, then answers questions based on the scraped content. It combines web scraping with Retrieval-Augmented Generation (RAG) to return natural-language answers to the user's queries.
This tool helps business analysts, marketers, or anyone needing to quickly understand a website's content and extract specific information without manually reading through every page. You provide website URLs (or even PDFs), and it processes the content. You can then ask questions in natural language, and it provides concise answers based on the site's information. It's ideal for tasks like competitor research or preparing for client meetings.
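The retrieve-then-answer flow described above can be sketched in a few lines. This is a minimal illustration, not ScrapeGPT's actual implementation: the function names (`chunk_text`, `retrieve`, `build_prompt`), the fixed-size word chunking, and the bag-of-words cosine scoring are all assumptions chosen to keep the example dependency-free. A real RAG system would typically use learned embeddings and pass the prompt to an LLM.

```python
import math
import re
from collections import Counter


def chunk_text(text: str, max_words: int = 40) -> list[str]:
    """Split scraped text into fixed-size word windows (hypothetical chunking rule)."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def _vectorize(text: str) -> Counter:
    """Lowercased bag-of-words counts; a stand-in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(question: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most similar to the question, best first."""
    q = _vectorize(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, _vectorize(c)), reverse=True)
    return ranked[:k]


def build_prompt(question: str, context_chunks: list[str]) -> str:
    """Assemble the text that would be sent to a language model (the model call is omitted)."""
    context = "\n---\n".join(context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"
```

In use, the scraped page text would go through `chunk_text`, the user's question through `retrieve`, and the result of `build_prompt` to whichever language model the bot is configured with.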
No commits in the last 6 months.
Use this if you need to rapidly ingest information from multiple web pages or documents and get specific answers to your questions, saving significant time compared to manual research.
Not ideal if you need to perform highly sensitive data analysis on confidential internal documents, or if your primary requirement is structured data extraction for database population.
Stars: 87
Forks: 15
Language: Python
License: MIT
Category:
Last pushed: Feb 17, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/LexiestLeszek/scrapeGPT"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
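The same request can be made from Python with only the standard library. This is a sketch under two assumptions that the listing does not confirm: that the endpoint returns a JSON body, and that other repositories follow the same `/quality/rag/<owner>/<repo>` path pattern. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the per-repo suffix is an assumed pattern.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/rag"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository, percent-encoding each path segment."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0) -> dict:
    """Fetch and decode the response, assuming it is a JSON object."""
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("LexiestLeszek", "scrapeGPT")` issues the same GET request as the curl command. Unauthenticated calls count against the 100 requests/day limit, so results are worth caching locally.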
Higher-rated alternatives
gpt-open/rag-gpt: RAG-GPT, leveraging LLM and RAG technology, learns from user-customized knowledge bases to...
leon0204/fast-rag: LLM Rag Intelligent Q&A Robot
gptscript-ai/gptparse: Document parser for RAG
maanvithag/thinkai: An LLM app with Retrieval Augmented Generation (RAG) built using OpenAI GPT models, Langchain...
PatentTRIZbasedAI20260226110030/Patent-GPT: Patent-GPT is an Agentic RAG-based invention copilot combining TRIZ methodology with LLMs. It...