fhamborg/news-please
news-please - an integrated web crawler and information extractor for news that just works
This tool helps researchers, analysts, or anyone tracking specific topics by automatically collecting news articles from various websites. You provide a list of news website URLs or access a large news archive, and it returns structured information like headlines, main text, authors, and publication dates. It's designed for anyone who needs to gather and process news content efficiently for analysis or reporting.
2,402 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to automatically collect and extract specific information from news articles across many different sources or from historical archives.
Not ideal if you only need to manually read a few articles or are looking for a tool to perform in-depth sentiment or event analysis directly.
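The workflow described above (pass in an article URL, get back structured fields) might look roughly like the sketch below. It assumes the `news-please` package is installed from PyPI and uses its `NewsPlease.from_url` entry point. The helper names `fetch_article` and `summarize` are illustrative only and are not part of the library.

```python
def fetch_article(url):
    """Download and parse one article with news-please (needs network access)."""
    # Imported lazily so summarize() below works without the package installed.
    from newsplease import NewsPlease
    return NewsPlease.from_url(url)


def summarize(article):
    """Collect headline, authors, date, and a text preview from an article-like object."""
    # maintext and authors may be missing when extraction fails, so fall back to empty values.
    text = article.maintext or ""
    return {
        "headline": article.title,
        "authors": list(article.authors or []),
        "published": article.date_publish,
        "text_preview": text[:200],
    }
```

Calling `summarize(fetch_article(url))` with a real article URL would return a small dictionary suitable for printing or storing. Which fields are populated depends on the page being crawled.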
Stars: 2,402
Forks: 450
Language: Python
License: Apache-2.0
Category:
Last pushed: Sep 21, 2025
Commits (30d): 0
Dependencies: 25
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/fhamborg/news-please"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
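For scripted use, the same request can be sketched in Python with only the standard library. Two things here are assumptions rather than documented behavior: that the path segments after `quality/` are a category followed by the `owner/repo` slug, and that the response body is JSON. The function names are hypothetical.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, repo):
    """Build the endpoint URL, e.g. category 'nlp' and repo 'fhamborg/news-please'."""
    # quote() keeps '/' by default, so the owner/repo slug passes through intact.
    return f"{BASE_URL}/{quote(category)}/{quote(repo)}"


def fetch_quality(category, repo, timeout=10):
    """GET the endpoint and decode the body, assuming it is JSON."""
    with urlopen(quality_url(category, repo), timeout=timeout) as response:
        return json.load(response)
```

`fetch_quality("nlp", "fhamborg/news-please")` would issue the same GET request as the curl command above, subject to the daily request limit.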
Related tools
flairNLP/fundus
A very simple news crawler with a funny name
affjljoo3581/canrevan
A library for collecting large volumes of Naver news articles.
FreeDiscovery/FreeDiscovery
Web Service for E-Discovery Analytics
tirthajyoti/Web-Database-Analytics
Web scraping and related analytics using Python tools
Multiverse-of-Projects/NewsAI
A dynamic NewsAI dashboard that uses NLP to analyze news articles, visualize sentiment trends,...