proxidize/x-scraper
A Twitter/X scraper built with Playwright for browser automation and OpenAI GPT-4 for AI-powered tweet analysis. Features timeline scraping, historical search, keyword search, checkpoint/resume, proxy support, and comprehensive data extraction.
This tool helps researchers, marketers, or analysts gather public posts from X (formerly Twitter) for in-depth understanding. It takes a specific username, keywords, or date range, navigates X like a human, and outputs structured data of tweets and user information. Additionally, it can use AI to automatically extract sentiment, topics, or summaries from the collected posts.
Use this if you need to systematically collect large volumes of public tweets from specific users, by keyword, or within historical date ranges, and want automated AI analysis like sentiment or topic extraction.
Not ideal if you only need a few tweets, prefer a simple point-and-click interface without any setup, or require real-time data streaming.
Stars
26
Forks
3
Language
Python
License
MIT
Category
Last pushed
Oct 30, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/proxidize/x-scraper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.