carlosplanchon/spidercreator

Automated web scraping spider generation using Browser Use and LLMs. Streamline the creation of Playwright-based spiders with minimal manual coding. Ideal for large enterprises with recurring data extraction needs.

/ 100

Established

This tool helps businesses and data analysts automate the creation of web scraping 'spiders' to collect data from websites. You provide a description of the data you want and the website you want it from, and it generates the code to extract that information. This is ideal for anyone who needs to regularly gather structured data from the web, such as for market research, competitor analysis, or inventory tracking.

217 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to repeatedly extract specific data from websites and want to automate the process without extensive manual coding.

Not ideal if you only need to scrape data from a website once or twice, or if your data extraction needs are highly dynamic and change frequently.

data-extraction market-intelligence competitor-monitoring e-commerce-data business-intelligence

Stale 6m No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 14 / 25

How are scores calculated?

Stars

217

Forks

Language

Python

License

AGPL-3.0

Related tools

raznem/parsera

Lightweight library for scraping web-sites with LLMs

rednafi/html-to-text

Extract pure text from any webpage

supadata-ai/js

Official TypeScript/JavaScript SDK for the Supadata API.

yeahhe365/JustSearch

基于 Playwright 的自主 AI 搜索智能体。支持迭代式任务规划、深度网页爬取，以及带引用来源的多源知识整合。

Riddhish1/CogniScrape

Intelligent Web Scraping Library with LLMs

Explore LLM Tools

All categories Trending LLM Tool directory Insights