yeahhe365/LLM-Online-Assistant
A PyQt5-based web scraping tool that supports multi-engine search across Google, Bing, and Baidu, automatically fetches keyword-related content, and saves it locally.
This tool helps researchers, marketers, or anyone needing quick information by automatically gathering content from major search engines like Google and Bing. You input keywords or questions, specify how many search result pages to scan, and the tool outputs organized text files with all the retrieved information. It's designed for individuals who need to compile comprehensive web-based research without manually clicking through many search results.
No commits in the last 6 months.
Use this if you frequently need to collect extensive information on specific topics from multiple search engines and want to automate the data gathering process into easily reviewable text files.
Not ideal if you require real-time data monitoring, need highly structured data (like tables or specific page elements), or work with websites that actively block automated scraping tools.
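The "keywords plus number of result pages" workflow described above can be sketched roughly as follows. This is only an illustration, not the project's actual code: the function name `result_page_urls`, the `ENGINES` table, and the assumption of 10 results per page are all hypothetical, and the query parameters are the ones commonly seen in each engine's result URLs, which may change or differ from what the tool uses.

```python
from urllib.parse import urlencode

# Hypothetical lookup table: (base URL, query parameter, paging parameter,
# offset of the first page). These reflect commonly seen result-page URLs,
# not anything confirmed from the project's source.
ENGINES = {
    "google": ("https://www.google.com/search", "q", "start", 0),
    "bing": ("https://www.bing.com/search", "q", "first", 1),
    "baidu": ("https://www.baidu.com/s", "wd", "pn", 0),
}

# Assumption: each engine shows 10 results per page.
RESULTS_PER_PAGE = 10


def result_page_urls(engine, keywords, pages):
    """Build one search URL per requested result page."""
    base, query_key, offset_key, first_offset = ENGINES[engine]
    urls = []
    for page in range(pages):
        params = {
            query_key: keywords,
            offset_key: first_offset + page * RESULTS_PER_PAGE,
        }
        urls.append(f"{base}?{urlencode(params)}")
    return urls


if __name__ == "__main__":
    for url in result_page_urls("bing", "pyqt5 tutorial", 2):
        print(url)
```

A scraper built along these lines would then download each URL, extract the text, and write it to a local file. Engines that block automated requests may still reject such traffic.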
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/yeahhe365/LLM-Online-Assistant"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
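For scripted use, the same request can be made from Python. The sketch below is a minimal example using only the standard library. The helper names `perception_url` and `fetch_perception` are hypothetical, and it assumes the endpoint returns JSON, which the page does not state. No API key handling is shown because the way a key is passed is not documented here.

```python
import json
from urllib.request import urlopen

# Base path taken from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality/perception"


def perception_url(owner, repo):
    """Build the endpoint URL for a given owner/repo pair."""
    return f"{API_BASE}/{owner}/{repo}"


def fetch_perception(owner, repo, timeout=10):
    """Fetch the record for a repository.

    Assumption: the response body is JSON; the schema is not documented
    on this page, so the parsed object is returned as-is.
    """
    with urlopen(perception_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    print(fetch_perception("yeahhe365", "LLM-Online-Assistant"))
```

With the stated limit of 100 keyless requests per day, it is worth caching responses locally rather than re-fetching on every run.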
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited Twitter scraper: scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. An HTTP client that can impersonate browser...
plabayo/rama
modular service framework to move and transform network packets
scrapinghub/spidermon
Scrapy Extension for monitoring spiders execution.