fhamborg/news-please

news-please - an integrated web crawler and information extractor for news that just works

/ 100

Established

This tool helps researchers, analysts, or anyone tracking specific topics by automatically collecting news articles from various websites. You provide a list of news website URLs or access a large news archive, and it returns structured information like headlines, main text, authors, and publication dates. It's designed for anyone who needs to gather and process news content efficiently for analysis or reporting.

2,402 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to automatically collect and extract specific information from news articles across many different sources or from historical archives.

Not ideal if you only need to manually read a few articles or are looking for a tool to perform in-depth sentiment or event analysis directly.

news-monitoring market-intelligence media-analysis research-data-collection competitor-tracking

Stale 6m

Maintenance 2 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 24 / 25

How are scores calculated?

Stars

2,402

Forks

450

Language

Python

License

Apache-2.0

Related tools

flairNLP/fundus

A very simple news crawler with a funny name

affjljoo3581/canrevan

대량의 네이버 뉴스 기사를 수집하는 라이브러리입니다.

FreeDiscovery/FreeDiscovery

Web Service for E-Discovery Analytics

tirthajyoti/Web-Database-Analytics

Web scrapping and related analytics using Python tools

Multiverse-of-Projects/NewsAI

A dynamic NewsAI dashboard that uses NLP to analyze news articles, visualize sentiment trends,...

Explore NLP Tools

All categories Trending NLP directory Insights