RuedigerVoigt/exoskeleton

A Python framework to build polite, but tenacious crawlers / scrapers with a MariaDB backend

/ 100

Emerging

This tool helps you reliably gather large amounts of information from websites without overwhelming the source servers. It takes a list of web addresses, then systematically downloads files, saves page content, or creates PDF copies, storing everything neatly in a database. Data analysts, researchers, or anyone needing to collect and organize extensive web data over time will find this useful.

Available on PyPI.

Use this if you need to build a persistent, fault-tolerant system for archiving web content or extracting data from many websites over an extended period.

Not ideal if you only need to download a few files quickly or are looking for a simple, one-off web scraping script without database management.

web-archiving data-collection market-research competitive-intelligence research-data

Maintenance 10 / 25

Adoption 6 / 25

Maturity 25 / 25

Community 4 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Featured in

Giving AI Agents Eyes: Browser Automation in 2026

Higher-rated alternatives

scrapy/scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Altimis/Scweet

A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers,...

lexiforest/curl_cffi

Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser...

plabayo/rama

modular service framework to move and transform network packets

scrapinghub/spidermon

Scrapy Extension for monitoring spiders execution.

Explore Perception Tools

All categories Trending Perception directory Insights