ScrapingBee/scrapy-scrapingbee

JavaScript support and proxy rotation for Scrapy with ScrapingBee.

35
/ 100
Emerging

This tool helps web scraping developers extract data from websites that use JavaScript or have strong anti-bot measures. It takes a list of URLs and, through integration with the ScrapingBee API, provides the full, rendered HTML content of those pages, bypassing common scraping hurdles. Web scraping engineers, data scientists, or anyone building data collection systems would use this to reliably get data from complex websites.

No commits in the last 6 months. Available on PyPI.

Use this if you are a Python developer building a web scraper with Scrapy and need to handle JavaScript-heavy sites or require robust proxy rotation without managing it yourself.

Not ideal if you are not using Scrapy for your web scraping project or if you prefer to manage proxies and headless browsers manually.

web-scraping data-extraction bot-mitigation-bypass dynamic-content-scraping
No License Stale 6m
Maintenance 0 / 25
Adoption 9 / 25
Maturity 17 / 25
Community 9 / 25

How are scores calculated?

Stars

93

Forks

6

Language

Python

License

Category

scraper

Last pushed

May 14, 2024

Commits (30d)

0

Dependencies

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/ScrapingBee/scrapy-scrapingbee"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.