fhamborg/news-please

news-please - an integrated web crawler and information extractor for news that just works

61
/ 100
Established

This tool helps researchers, analysts, or anyone tracking specific topics by automatically collecting news articles from various websites. You provide a list of news website URLs or access a large news archive, and it returns structured information like headlines, main text, authors, and publication dates. It's designed for anyone who needs to gather and process news content efficiently for analysis or reporting.

2,402 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to automatically collect and extract specific information from news articles across many different sources or from historical archives.

Not ideal if you only need to manually read a few articles or are looking for a tool to perform in-depth sentiment or event analysis directly.

news-monitoring market-intelligence media-analysis research-data-collection competitor-tracking
Stale 6m
Maintenance 2 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 24 / 25

How are scores calculated?

Stars

2,402

Forks

450

Language

Python

License

Apache-2.0

Last pushed

Sep 21, 2025

Commits (30d)

0

Dependencies

25

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/fhamborg/news-please"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.