andytyler/gethtml

Utility for web scraping that fetches the HTML from a URL by trying several strategies in a 'waterfall' approach.

Experimental

This tool helps you reliably collect content from websites, even from complex sites that use JavaScript or block standard access methods. You provide a web address (URL), and it attempts various strategies until it successfully retrieves the full HTML content of that page. It's designed for anyone who needs to gather data from websites, such as market researchers, data analysts, or content aggregators.
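The waterfall idea described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual API: the names `Strategy`, `WaterfallResult`, and `fetchWithWaterfall` are hypothetical, and the strategies themselves (for example a plain HTTP request followed by a headless browser) are assumed to be supplied by the caller.

```typescript
// Hypothetical sketch of a waterfall fetcher; none of these names come from gethtml itself.

// One way of retrieving HTML, e.g. a plain HTTP request or a headless browser.
type Strategy = {
  name: string;
  fetchHtml: (url: string) => Promise<string>;
};

// The HTML that was retrieved, plus the name of the strategy that produced it.
type WaterfallResult = {
  html: string;
  strategy: string;
};

// Try each strategy in order and return the first non-empty result.
// A strategy that throws or returns blank content is recorded and skipped.
async function fetchWithWaterfall(
  url: string,
  strategies: Strategy[],
): Promise<WaterfallResult> {
  const errors: string[] = [];

  for (const strategy of strategies) {
    try {
      const html = await strategy.fetchHtml(url);
      if (html.trim().length > 0) {
        return { html, strategy: strategy.name };
      }
      errors.push(`${strategy.name}: empty response`);
    } catch (err) {
      const message = err instanceof Error ? err.message : String(err);
      errors.push(`${strategy.name}: ${message}`);
    }
  }

  // Every strategy failed; report all collected reasons together.
  throw new Error(`All strategies failed for ${url}: ${errors.join("; ")}`);
}
```

Ordering the strategies from cheapest to most expensive means simple static pages are served by the first attempt, and heavier approaches are only used when the lighter ones fail or return nothing.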

Use this if you need a dependable way to extract information from a variety of websites, especially those that are dynamic or employ anti-scraping measures.

Not ideal if you need a full browser automation tool for complex interactions beyond fetching HTML, or if you only ever fetch from simple, static pages.

web-data-collection market-intelligence content-aggregation competitor-monitoring data-acquisition

No Package, No Dependents

Maintenance 6 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 0 / 25

Language: TypeScript

License: MIT

Featured in

Giving AI Agents Eyes: Browser Automation in 2026

Higher-rated alternatives

seleniumbase/SeleniumBase

APIs for browser automation, testing, and bypassing bot-detection.

apify/crawlee-python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....

intoli/user-agents

A JavaScript library for generating random user agents with data that's updated daily.

apify/crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...

Kaliiiiiiiiii-Vinyzu/patchright

Undetected version of the Playwright testing and automation library.
