hrbrmstr/htmlunit

🕸🧰☕️ Tools to Scrape Dynamic Web Content via the 'HtmlUnit' Java Library

Emerging

This tool helps non-programmers extract information from websites that plain HTTP-based scraping handles poorly, such as pages that build their content with JavaScript or interactive elements. Given a web address (URL), it returns structured data such as tables or text, much as a browser would render it. Digital marketers, researchers, and anyone else who needs to gather public information from dynamic websites may find it useful.

No commits in the last 6 months.

Use this if you need to pull data from websites that load content with JavaScript or AJAX, or that require form submissions and link clicks to reveal their content.

Not ideal if you mainly need to scrape static HTML from simple websites that have no dynamic elements or complex interactions.

web-scraping market-research data-collection competitor-analysis content-extraction

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 14 / 25

License

Apache-2.0

Featured in

Giving AI Agents Eyes: Browser Automation in 2026

Higher-rated alternatives

scrapy/scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Altimis/Scweet

A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers,...

lexiforest/curl_cffi

Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser...

plabayo/rama

modular service framework to move and transform network packets

scrapinghub/spidermon

Scrapy Extension for monitoring spiders execution.
