BaristaLabs/skrapr
.NET Core-based declarative web scraping using the Chrome Debugger Protocol
This tool helps you automate interactions with websites, turning unstructured web pages into organized data. It takes instructions describing what data to extract from a website or what actions to perform on it, and outputs the results in a structured format. This is ideal for anyone who needs to regularly collect specific data from websites that don't offer a direct download or an API.
No commits in the last 6 months.
Use this if you frequently need to collect specific information or perform repetitive tasks on websites that lack an official API or easy data export options.
Not ideal if you only need to scrape data once or if you are comfortable with manual copy-pasting for small, infrequent tasks.
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/BaristaLabs/skrapr"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
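The same request can be made from a script. Below is a minimal Python sketch using only the standard library. The helper names `perception_url` and `fetch_perception` are hypothetical and are not part of any published client. The response format is not described on this page, so the sketch returns the raw body without parsing it and assumes it is UTF-8 text.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Endpoint prefix taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/perception"


def perception_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters cannot break the URL.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_perception(owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the raw response body as text.

    Assumption: the body is UTF-8 text. Its structure is not documented
    on this page, so no parsing is attempted here.
    """
    with urlopen(perception_url(owner, repo), timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_perception("BaristaLabs", "skrapr")` issues the same GET request as the curl command. Each call counts against the daily request limit, so a script that polls many repositories may want to cache results locally.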
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited Twitter scraper: scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. An HTTP client that can impersonate browser...
plabayo/rama
Modular service framework to move and transform network packets.
scrapinghub/spidermon
Scrapy Extension for monitoring spiders execution.