apify/actor-legacy-phantomjs-crawler
The actor implements the legacy Apify Crawler product. It uses PhantomJS headless browser to recursively crawl websites and extract data from them using a piece of JavaScript code.
This tool helps developers gather specific information from websites by simulating a user browsing. You provide initial website addresses and JavaScript code, and it navigates through pages, executes your code to extract data, and outputs it in a structured format. It's designed for developers building automated data collection pipelines.
No commits in the last 6 months.
Use this if you are a developer with existing website crawling projects that rely on the older PhantomJS technology and need to migrate them to a more modern platform without re-writing your data extraction logic completely.
Not ideal if you are starting a new web scraping project, as newer, more robust tools based on modern browsers are available and recommended.
Stars
9
Forks
5
Language
JavaScript
License
—
Category
Last pushed
Apr 14, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/apify/actor-legacy-phantomjs-crawler"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.