KaidiGuo/keyword_based_Sina_weibo_crawler
A web crawler for Sina, search and retrieve microblogs that contain certain keywords 一个简单的python爬虫实践,爬取包含关键词的新浪微博
This tool helps researchers, marketers, or anyone needing to analyze public sentiment or trends by automatically collecting Sina Weibo posts containing specific keywords. You provide a list of keywords, and it outputs raw JSON data of microblogs, organized by date, which can then be processed for further insights. It's designed for individuals who need to gather focused social media data for analysis projects.
100 stars. No commits in the last 6 months.
Use this if you need to systematically collect Sina Weibo posts based on a predefined list of keywords for research or analysis, especially for moderate data volumes (30,000-60,000 posts daily).
Not ideal if you require an interactive user interface, need to retrieve very large historical datasets quickly, or if your primary interest is in real-time streaming data rather than daily batches.
Stars
100
Forks
32
Language
Python
License
—
Category
Last pushed
Oct 25, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/KaidiGuo/keyword_based_Sina_weibo_crawler"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser...
plabayo/rama
modular service framework to move and transform network packets
scrapinghub/spidermon
Scrapy Extension for monitoring spiders execution.