hrbrmstr/htmlunit

🕸🧰☕️Tools to Scrape Dynamic Web Content via the 'HtmlUnit' Java Library

37
/ 100
Emerging

This tool helps non-programmers extract information from websites that are difficult to access with standard methods, such as those with interactive elements or JavaScript. It takes a web address (URL) and provides structured data like tables or text, similar to how a browser sees it. Digital marketers, researchers, or anyone needing to gather public information from dynamic websites would find this useful.

No commits in the last 6 months.

Use this if you need to reliably pull data from websites that use JavaScript, AJAX, or require form submissions and link clicks to reveal their content.

Not ideal if you primarily need to scrape static HTML content from simple websites without dynamic elements or complex interactions.

web-scraping market-research data-collection competitor-analysis content-extraction
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 14 / 25

How are scores calculated?

Stars

36

Forks

6

Language

R

License

Apache-2.0

Category

scraper

Last pushed

Apr 12, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/hrbrmstr/htmlunit"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.