brendonboshell/supercrawler

A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.

Established

This tool helps developers automate the process of systematically browsing websites and extracting specific information. You provide a starting web address and define what content to look for (like links, images, or text), and the crawler will navigate the site according to rules like robots.txt, gathering the specified data. It's for developers who need to collect large amounts of publicly available web data for analysis, research, or integration into other applications.
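
The flow described above (start from one address, skip paths that robots.txt disallows, pass each fetched page to a handler chosen by content type, and queue the links it returns) can be sketched in plain JavaScript. This is a minimal illustration under stated assumptions, not Supercrawler's actual API: the `site` object, `disallowedPrefixes`, `isAllowed`, `htmlLinkHandler`, and `crawl` are all hypothetical names, pages come from an in-memory map instead of the network, and rate limiting and concurrency limits are left out.

```javascript
// Hypothetical in-memory "site": path -> { contentType, body }. No network access.
const site = {
  "/": {
    contentType: "text/html",
    body: '<a href="/about">About</a> <a href="/private/x">X</a>',
  },
  "/about": {
    contentType: "text/html",
    body: '<a href="/">Home</a> <a href="/logo.png">Logo</a>',
  },
  "/private/x": { contentType: "text/html", body: "secret" },
  "/logo.png": { contentType: "image/png", body: "" },
};

// Simplified stand-in for robots.txt: only Disallow path prefixes are modelled.
const disallowedPrefixes = ["/private/"];

function isAllowed(path) {
  return !disallowedPrefixes.some((prefix) => path.startsWith(prefix));
}

// Handler for HTML pages: returns the href targets found in the body.
function htmlLinkHandler(page) {
  const links = [];
  const pattern = /href="([^"]+)"/g;
  let match;
  while ((match = pattern.exec(page.body)) !== null) {
    links.push(match[1]);
  }
  return links;
}

// Breadth-first crawl: visit each allowed page once, up to maxPages.
function crawl(startPath, handlers, maxPages) {
  const queue = [startPath];
  const seen = new Set(queue);
  const visited = [];
  while (queue.length > 0 && visited.length < maxPages) {
    const path = queue.shift();
    if (!isAllowed(path)) continue; // respect the disallow rules
    const page = site[path];
    if (!page) continue; // unknown path, comparable to a 404
    visited.push(path);
    // Pick the handler registered for this content type, if any.
    const handler = handlers[page.contentType];
    const discovered = handler ? handler(page) : [];
    for (const link of discovered) {
      if (!seen.has(link)) {
        seen.add(link);
        queue.push(link);
      }
    }
  }
  return visited;
}

const visited = crawl("/", { "text/html": htmlLinkHandler }, 10);
console.log(visited);
```

In this sketch the page under `/private/` is discovered but never visited, and the image is visited but yields no further links because no handler is registered for its content type. A real crawler would also fetch pages asynchronously and space out its requests.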

382 stars. No commits in the last 6 months. Available on npm.

Use this if you need to programmatically explore a website, respecting site rules, to extract specific content or links.

Not ideal if you need a simple tool for occasional, manual data extraction without writing code.

web-scraping data-collection web-automation content-extraction developer-tool

Maintenance 0 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 20 / 25

Stars: 382

Forks

Language: JavaScript

License: Apache-2.0

Featured in

Giving AI Agents Eyes: Browser Automation in 2026

Related tools

scrapy/scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Altimis/Scweet

A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers,...

lexiforest/curl_cffi

Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser...

plabayo/rama

modular service framework to move and transform network packets

scrapinghub/spidermon

Scrapy Extension for monitoring spiders execution.