sewcio543/soupsavvy
Powerful and flexible web scraping Search Engine
This tool helps software developers efficiently extract specific information from websites. You feed it web pages (HTML content), and it provides a consistent, structured way to define and apply rules to pull out the data you need. It's for developers who build web scraping applications and want to streamline their data extraction processes.
Available on PyPI.
Use this if you are a developer building web scraping solutions and want a consistent, scalable, and maintainable way to define how data is extracted from various web sources.
Not ideal if you need a complete web scraping framework that handles crawling, proxies, or rendering, as this focuses specifically on the data selection logic.
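To make the idea of "defining and applying rules" to HTML concrete, here is a minimal sketch using only the Python standard library. It does not use soupsavvy's own API, which is not shown on this page. The names `Rule`, `extract`, `PAGE`, and `RULES` are hypothetical and exist only for illustration. It assumes each rule matches a tag name and, optionally, one CSS class.

```python
from dataclasses import dataclass
from html.parser import HTMLParser
from typing import Dict, List, Optional

# Elements that never have a closing tag; they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


@dataclass(frozen=True)
class Rule:
    """Hypothetical selection rule: a tag name plus an optional CSS class."""
    tag: str
    css_class: Optional[str] = None

    def matches(self, tag, attrs) -> bool:
        if tag != self.tag:
            return False
        if self.css_class is None:
            return True
        classes = (dict(attrs).get("class") or "").split()
        return self.css_class in classes


class _Collector(HTMLParser):
    """Collects the text content of every element matching one rule."""

    def __init__(self, rule: Rule):
        super().__init__()
        self.rule = rule
        self.results: List[str] = []
        self._depth = 0          # nesting depth inside the current match
        self._buffer: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self._depth:
            # Already inside a match; nested elements belong to the outer one.
            self._depth += 1
        elif self.rule.matches(tag, attrs):
            self._depth = 1
            self._buffer = []

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self._depth:
            return
        self._depth -= 1
        if self._depth == 0:
            self.results.append("".join(self._buffer).strip())

    def handle_data(self, data):
        if self._depth:
            self._buffer.append(data)


def extract(html: str, rules: Dict[str, Rule]) -> Dict[str, List[str]]:
    """Apply every named rule to the same HTML and return one structured record."""
    record = {}
    for name, rule in rules.items():
        collector = _Collector(rule)
        collector.feed(html)
        collector.close()
        record[name] = collector.results
    return record


PAGE = """
<div class="product">
  <h2 class="title">Blue Kettle</h2>
  <span class="price">19.99</span>
</div>
<div class="product">
  <h2 class="title">Red Toaster</h2>
  <span class="price sale">24.50</span>
</div>
"""

RULES = {"titles": Rule("h2", "title"), "prices": Rule("span", "price")}
data = extract(PAGE, RULES)
print(data)
```

Keeping the rules in a plain dictionary, separate from the parsing code, is the point of the sketch: the same `extract` call can be reused across pages by swapping the rule set. A dedicated selection library would presumably offer richer selectors and combinators than this toy matcher.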
Stars: 9
Forks: n/a
Language: Python
License: MIT
Category:
Last pushed: Mar 16, 2026
Commits (30d): 0
Dependencies: 1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/sewcio543/soupsavvy"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.