bjesus/pipet
Swiss-army tool for scraping and extracting data from online assets, made for hackers
Pipet is a command-line tool that extracts specific information from websites, whether the source is plain HTML, JSON data, or content loaded by JavaScript. You define what data you want using simple selector rules, and Pipet fetches it from online sources. It’s ideal for anyone who needs to regularly pull public information from websites, such as market researchers, analysts, or anyone tracking online changes.
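To make the idea of selector rules concrete, here is a minimal Python sketch of selector-style extraction using only the standard library. It is not Pipet's syntax or implementation: the `ClassTextExtractor` class and the `extract_by_class` helper are hypothetical names invented for this illustration, and the sketch only handles the simplest kind of rule, "give me the text of every element with this class".

```python
from html.parser import HTMLParser

# Tags that never have a closing tag; skipped so nesting depth stays balanced.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class ClassTextExtractor(HTMLParser):
    """Hypothetical helper: collect the text of elements carrying a CSS class."""

    def __init__(self, css_class):
        super().__init__()
        self.css_class = css_class
        self.depth = 0        # > 0 while inside a matching element
        self.results = []     # one text string per matching element
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        classes = (dict(attrs).get("class") or "").split()
        if self.depth > 0:
            self.depth += 1   # nested tag inside a match
        elif self.css_class in classes:
            self.depth = 1    # entering a new match
            self._buffer = []

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or self.depth == 0:
            return
        self.depth -= 1
        if self.depth == 0:   # leaving the matching element
            self.results.append("".join(self._buffer).strip())

    def handle_data(self, data):
        if self.depth > 0:
            self._buffer.append(data)


def extract_by_class(html, css_class):
    """Return the text of every element whose class list contains css_class."""
    parser = ClassTextExtractor(css_class)
    parser.feed(html)
    parser.close()
    return parser.results
```

Calling `extract_by_class(page_html, "title")` on a fetched page returns a list with one string per matching element. The sketch assumes reasonably well-formed markup; a real tool would also handle full CSS selectors, JSON sources, and JavaScript-rendered content.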
4,662 stars. No commits in the last 6 months.
Use this if you need to reliably pull specific data points from websites for tracking, analysis, or monitoring, and you're comfortable with command-line tools.
Not ideal if you prefer a graphical user interface for web scraping or need to interact extensively with complex web forms.
Stars: 4,662
Forks: 216
Language: Go
License: MIT
Category:
Last pushed: Oct 12, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/bjesus/pipet"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
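The same request can be made from a script. The following is a minimal Python sketch using only the standard library. The endpoint path is taken from the curl command above; the helper names `build_url` and `fetch_quality` are hypothetical, and the sketch assumes the endpoint returns a JSON body, which the listing does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example in the listing.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/perception"


def build_url(owner, repo):
    """Build the endpoint URL for a given repository owner and name."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner, repo, timeout=10.0):
    """Fetch and decode the response, assuming (unverified) a JSON body."""
    with urllib.request.urlopen(build_url(owner, repo), timeout=timeout) as resp:
        return json.load(resp)
```

For example, `fetch_quality("bjesus", "pipet")` requests the same URL as the curl command. Keyless access is limited to 100 requests/day, so a script that polls many repositories should cache results or throttle itself.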
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.