carlosplanchon/spidercreator
Automated web scraping spider generation using Browser Use and LLMs. Streamline the creation of Playwright-based spiders with minimal manual coding. Ideal for large enterprises with recurring data extraction needs.
This tool helps businesses and data analysts automate the creation of web scraping 'spiders' to collect data from websites. You provide a description of the data you want and the website you want it from, and it generates the code to extract that information. This is ideal for anyone who needs to regularly gather structured data from the web, such as for market research, competitor analysis, or inventory tracking.
217 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to repeatedly extract specific data from websites and want to automate the process without extensive manual coding.
Not ideal if you only need to scrape data from a website once or twice, or if your data extraction needs are highly dynamic and change frequently.
Stars
217
Forks
22
Language
Python
License
AGPL-3.0
Category
Last pushed
Aug 25, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/carlosplanchon/spidercreator"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
raznem/parsera
Lightweight library for scraping web-sites with LLMs
rednafi/html-to-text
Extract pure text from any webpage
supadata-ai/js
Official TypeScript/JavaScript SDK for the Supadata API.
yeahhe365/JustSearch
基于 Playwright 的自主 AI 搜索智能体。支持迭代式任务规划、深度网页爬取,以及带引用来源的多源知识整合。
Riddhish1/CogniScrape
Intelligent Web Scraping Library with LLMs