nirantak/scraper
Python web scrapers
This helps you gather specific information from websites automatically, turning live web pages into structured data you can use for analysis or other tasks. You tell it what websites to look at and what pieces of information you need, and it delivers that content in a usable format. This is for anyone who regularly needs to collect data from many different web pages, like researchers, marketers, or business analysts.
No commits in the last 6 months.
Use this if you need to systematically extract data from websites that might require interaction (like clicking buttons or filling forms) or are difficult to scrape with simple tools.
Not ideal if you only need to grab data from a few static web pages, as the setup might be more involved than necessary.
Stars
17
Forks
8
Language
Python
License
GPL-3.0
Category
Last pushed
Jul 15, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/nirantak/scraper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.