apify/actor-page-analyzer
Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSON-LD metadata, analyzes AJAX requests, etc.
This tool helps marketing specialists, SEO managers, or competitive intelligence analysts understand the full content and structure of any public webpage. You provide a URL and optional keywords, and it returns a detailed breakdown of all visible text, hidden metadata, structured data (like product details or recipes), and data loaded by background requests, highlighting where your keywords appear. This helps you audit your own site or dissect competitor strategies.
153 stars. No commits in the last 6 months.
Use this if you need a comprehensive view of everything loaded and present on a webpage, including content generated by JavaScript, structured data, and metadata, to inform SEO, content strategy, or competitive analysis.
Not ideal if you only need basic information like a page's title or description, or if you primarily want to interact with a page (e.g., click buttons, fill forms) rather than just analyze its static and dynamic content.
Stars
153
Forks
22
Language
JavaScript
License
—
Category
Last pushed
Feb 27, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/apify/actor-page-analyzer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.