crawlbase/crawlbase-java
A lightweight, java class that acts as wrapper for Crawlbase API
This is a Java library that acts as a wrapper for the Crawlbase API. It helps developers easily integrate powerful web scraping and crawling capabilities into their Java applications. You input website URLs and optional parameters, and it outputs the content of those web pages, including status codes and body content. Developers building data-intensive applications will find this tool useful.
No commits in the last 6 months.
Use this if you are a Java developer and need to programmatically extract data from websites, including those built with JavaScript, or capture website screenshots.
Not ideal if you are not a Java developer or if you need a no-code solution for web scraping.
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/crawlbase/crawlbase-java"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser...
plabayo/rama
modular service framework to move and transform network packets
scrapinghub/spidermon
Scrapy Extension for monitoring spiders execution.