McWashr/pyscrapify

PyScrapify is a modular Python web scraper framework built on top of Selenium and BeautifulSoup. Easily extendable with new scrapers off of BaseScraper class. Avoid the bloat of creating web scrapers in Python!

/ 100

Emerging

This tool helps you gather specific information from websites by automating the extraction of data. You provide it with the web addresses you want to monitor and what data patterns to look for. It then outputs the extracted data, typically in a structured format like a CSV, making it useful for researchers, marketers, or anyone needing to collect data from public web pages.

No commits in the last 6 months.

Use this if you frequently need to collect structured data from specific public web pages and want a way to automate and manage these data extraction tasks efficiently.

Not ideal if you need to extract data from websites with highly inconsistent layouts, or if you're not comfortable with some technical configuration to define new data extraction rules.

data-collection market-research competitive-intelligence content-monitoring lead-generation

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Featured in

Giving AI Agents Eyes: Browser Automation in 2026

Higher-rated alternatives

seleniumbase/SeleniumBase

APIs for browser automation, testing, and bypassing bot-detection.

apify/crawlee-python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....

intoli/user-agents

A JavaScript library for generating random user agents with data that's updated daily.

apify/crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...

Kaliiiiiiiiii-Vinyzu/patchright

Undetected version of the Playwright testing and automation library.

Explore Perception Tools

All categories Trending Perception directory Insights