pkb01/scrape_bbc_news_hindi
This project (using python and scrapy) contains 2 spiders for BBC news to scrape all the news(in Hindi script) data available in the website as per users choice, One spider will scrape all the news data available in the website and other will scrape news only for the recent day. You can find sample scrapped data in 'first_file.txt', 'third_file.txt' respectively, 'second_file.txt' will give you all hindi news headlines.
No commits in the last 6 months.
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/pkb01/scrape_bbc_news_hindi"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser...
plabayo/rama
modular service framework to move and transform network packets
scrapinghub/spidermon
Scrapy Extension for monitoring spiders execution.