aleroot/scrapy-scraperapi-proxy

Scrapy middleware interface to a scraper API proxy service

35
/ 100
Emerging

This tool helps web scraping practitioners who use Scrapy to gather data from websites without being blocked. It integrates your Scrapy project with a 'man-in-the-middle' proxy service, sending your web requests through it. This means you feed it your Scrapy requests, and it returns the website data, bypassing common anti-scraping measures.

No commits in the last 6 months.

Use this if you are a data professional or researcher using Scrapy and frequently encounter blocks or captchas when trying to collect data from websites.

Not ideal if you are not using Scrapy for your web scraping tasks or if you prefer to manage your proxy infrastructure manually.

web-scraping data-collection market-research competitive-intelligence
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 16 / 25
Community 15 / 25

How are scores calculated?

Stars

8

Forks

4

Language

Python

License

MIT

Category

scraper

Last pushed

Aug 02, 2018

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/aleroot/scrapy-scraperapi-proxy"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.