andytyler/gethtml
Utility for web scraping and fetching the html from a url, using various strategies in a 'waterfall' approach.
This tool helps you reliably collect content from websites, even from complex sites that use JavaScript or block standard access methods. You provide a web address (URL), and it attempts various strategies until it successfully retrieves the full HTML content of that page. It's designed for anyone who needs to gather data from websites, such as market researchers, data analysts, or content aggregators.
Use this if you need a dependable way to extract information from a variety of websites, especially those that are dynamic or employ anti-scraping measures.
Not ideal if you need a full browser automation tool for complex interactions beyond fetching HTML, or if you only ever fetch from simple, static pages.
Stars
8
Forks
—
Language
TypeScript
License
MIT
Category
Last pushed
Nov 27, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/andytyler/gethtml"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.