ScrapeGraphAI/Scrapegraph-ai
Python scraper based on AI
This tool helps you gather specific information from websites or local documents like HTML, XML, or JSON files. You simply tell it what data you need, and it extracts and organizes it for you. This is ideal for data analysts, marketers, or researchers who need to quickly collect structured data from various online or offline sources without manual copy-pasting.
22,929 stars. Actively maintained with 15 commits in the last 30 days.
Use this if you need to quickly and accurately extract specific pieces of information, such as company descriptions, founder details, or social media links, from webpages or documents.
Not ideal if you need a simple copy-paste of an entire document, or if you're looking for a general-purpose web browser rather than a data extraction tool.
Stars
22,929
Forks
2,000
Language
Python
License
MIT
Category
Last pushed
Feb 24, 2026
Commits (30d)
15
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/ScrapeGraphAI/Scrapegraph-ai"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Recent Releases
Related tools
any4ai/AnyCrawl
AnyCrawl π: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts...
kreuzberg-dev/html-to-markdown
High performance and CommonMark compliant HTML to Markdown converter. Maintained by the...
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping,...
paulpierre/markdown-crawler
A multithreaded πΈοΈ web crawler that recursively crawls a website and creates a π½ markdown file...
lightfeed/extractor
Using LLMs and AI browser automation to robustly extract web data