danielnieto/scrapman

Retrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs

42
/ 100
Emerging

This tool helps developers efficiently gather website content, even from pages that load dynamic information using JavaScript. You provide a list of URLs, and it returns the complete, rendered HTML content for each, just as a web browser would see it. It's designed for developers building applications that need to process or analyze web page content.

No commits in the last 6 months. Available on npm.

Use this if you need to programmatically fetch the fully-rendered HTML from many web pages quickly, especially those that rely heavily on JavaScript to display their content.

Not ideal if you need to interact with a web page like a user (e.g., clicking buttons, filling forms) beyond just retrieving its HTML, as this is not a browser automation tool.

web-scraping data-extraction content-gathering web-development backend-engineering
Stale 6m
Maintenance 0 / 25
Adoption 6 / 25
Maturity 25 / 25
Community 11 / 25

How are scores calculated?

Stars

22

Forks

3

Language

JavaScript

License

MIT

Category

scraper

Last pushed

Apr 13, 2018

Commits (30d)

0

Dependencies

3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/danielnieto/scrapman"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.