therealthoren/python-selenium-scraper-template
Watch your python scraper with selenium in a docker container
This project helps developers automate web scraping tasks by providing a ready-to-use template. You provide your custom scraping logic and configuration, and it outputs the extracted data, which can then be saved or sent to a webhook. It's designed for developers who need to reliably run web scrapers on a schedule, potentially using proxy services for anonymity.
No commits in the last 6 months.
Use this if you are a developer looking for a straightforward way to containerize, schedule, and run web scraping scripts with Selenium, especially if you need proxy integration (Tor or ProtonVPN).
Not ideal if you are not a developer and don't feel comfortable writing Python code or working with Docker.
Stars
16
Forks
8
Language
Python
License
MIT
Category
Last pushed
May 18, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/therealthoren/python-selenium-scraper-template"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.