alexferrari88/scrapeblocks
Scraping automation framework based on Playwright
This tool helps developers and technical users automate web data collection with less code. You provide a web address and specify what information you need (like product prices or article text), along with any pre-steps required (like clicking a button or typing into a search bar). It then navigates the website and returns the requested data, simplifying complex web scraping tasks.
No commits in the last 6 months. Available on npm.
Use this if you need to quickly set up automated data extraction from websites without deep expertise in browser automation, or if you want to streamline complex scraping workflows.
Not ideal if you prefer a no-code solution or if your primary goal is simple, one-off manual data extraction.
Stars
14
Forks
—
Language
TypeScript
License
MIT
Category
Last pushed
Jun 29, 2022
Commits (30d)
0
Dependencies
4
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/alexferrari88/scrapeblocks"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.