jamesturk/scrapeghost
👻 Experimental library for scraping websites using OpenAI's GPT API.
This tool helped developers extract specific data from websites using AI. You would define the structure of the information you needed (like a list of product names and prices), feed it a webpage, and it would output that data in a clean, structured format. It was built for programmers who wanted to leverage large language models for web scraping tasks.
1,444 stars.
Use this if you are a developer looking for an experimental way to extract structured data from web pages using an LLM, and are comfortable with potentially high API costs.
Not ideal if you are looking for a maintained, cost-effective, or non-developer-centric solution for web scraping.
Stars
1,444
Forks
88
Language
Python
License
—
Category
Last pushed
Jan 14, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/jamesturk/scrapeghost"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
Priyanshu-hawk/ChatGPT-unofficial-api-selenium
This is unofficial ChatGPT API using selenium for prompt testing and flow testing purposes
3281448091/easyChatGPT
An unofficial yet elegant interface of the ChatGPT API using browser automation that bypasses...
ryuseisan/auto-chatgpt
Automate interaction with the browser version of ChatGPT.
Ryaang/gpt-web-crawler
A web crawler for GPTs to build knowledge bases 用于GPT构建知识库的网站爬虫
djb-gt/gpt-automated-web-scraper
The GPT-based Universal Web Scraper MVP is a solution that leverages GPT models and web scraping...