xianhu/PSpider
简单易用的Python爬虫框架,QQ交流群:597510560
PSpider helps developers collect specific information from websites by automatically navigating pages, extracting data, and saving it. You provide it with the initial web addresses and what data points to look for, and it delivers the extracted information in a structured format. This tool is for Python developers who need to build custom web data collection scripts quickly and efficiently.
1,839 stars. No commits in the last 6 months.
Use this if you are a Python developer needing a straightforward, multi-threaded framework to build custom web scrapers for extracting structured data from websites.
Not ideal if you need a no-code solution, a pre-built web scraping service, or a tool that can execute JavaScript on web pages for data extraction.
Stars
1,839
Forks
498
Language
Python
License
BSD-2-Clause
Category
Last pushed
Jun 10, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/xianhu/PSpider"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser...
plabayo/rama
modular service framework to move and transform network packets
scrapinghub/spidermon
Scrapy Extension for monitoring spiders execution.