vdutts7/gpt4V-scraper
AI agent that can SEE 👁️, control, navigate, & do stuff for you on your browser.
This tool helps anyone who needs to capture exact website content, including full-page screenshots, and then extract specific information from those images using AI. You provide a web address and a question about the content, and it gives you a screenshot and the precise answer you asked for. It's designed for market researchers, content analysts, or anyone who regularly needs to gather structured data from diverse web pages.
294 stars.
Use this if you need to automate the process of visually capturing website content and extracting targeted information from it, especially from sites with anti-bot measures or requiring login.
Not ideal if you only need basic text scraping or don't require visual context (screenshots) for your data extraction tasks.
Stars
294
Forks
28
Language
JavaScript
License
—
Category
Last pushed
Mar 01, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/vdutts7/gpt4V-scraper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
alibaba/page-agent
JavaScript in-page GUI agent. Control web interfaces with natural language.
steel-dev/steel-browser
🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser...
4ier/neo
Turn any web app into an API. Chrome extension captures browser traffic, auto-generates schemas,...
violettoolssite/CFspider
Cloudflare Workers 代理 IP 池,VLESS 动态 IP,内置 AI 智能浏览器支持自然语言控制
actionbook/actionbook
Browser action engine for AI agents. 10× faster, resilient by design.