vipul-sharma20/sharingan
Tool to extract news articles from newspapers and give context about the news
This tool helps you analyze physical newspaper clippings by extracting news articles as text from images of newspapers. You provide an image of a newspaper page, and it returns the digitized text of the articles and identifies key entities and contexts mentioned. This is useful for researchers, journalists, or archivists who need to process historical or physical newspaper content.
214 stars. No commits in the last 6 months.
Use this if you need to quickly digitize newspaper articles from images and understand the main topics and entities discussed within them.
Not ideal if you need to process large volumes of digital newspaper archives or require highly advanced sentiment analysis or complex relationship extraction.
Stars: 214
Forks: 26
Language: Python
License: —
Category:
Last pushed: Aug 17, 2017
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/vipul-sharma20/sharingan"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
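The same request can be sketched in Python with only the standard library. This is a minimal sketch under assumptions: the path layout (category, then owner, then repository) is inferred from the single curl URL above, the helper names `build_quality_url` and `fetch_quality` are hypothetical, and the response is assumed to be JSON, which this page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumes the path is <category>/<owner>/<repo>, as in the curl example.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository (hypothetical helper).

    Assumes the body is JSON; the response schema is not documented here,
    so the parsed value is returned as-is.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request, counted against the daily limit.
    print(fetch_quality("nlp", "vipul-sharma20", "sharingan"))
```

Keeping URL construction separate from the network call makes it possible to check the URL without spending any of the 100 keyless requests per day.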
Higher-rated alternatives
flairNLP/fundus
A very simple news crawler with a funny name
fhamborg/news-please
news-please - an integrated web crawler and information extractor for news that just works
affjljoo3581/canrevan
A library for collecting large volumes of Naver news articles.
FreeDiscovery/FreeDiscovery
Web Service for E-Discovery Analytics
tirthajyoti/Web-Database-Analytics
Web scraping and related analytics using Python tools