Duke-Chronicle-Project/awesome-historical-newspaper-analysis
Awesome historical newspaper analysis tools and literature
This curated list helps researchers, historians, and digital humanities scholars analyze digitized historical newspapers. It gathers software and literature for processing raw newspaper scans, converting them into searchable text, analyzing page layout, and extracting information from decades or centuries of publications. It is aimed at anyone working with large archives of scanned historical newspapers.
No commits in the last 6 months.
Use this if you need to transform scanned images of historical newspapers into structured data for text analysis, identify different sections within a newspaper page, or evaluate the quality of your digitized text.
Not ideal if you are working with modern, born-digital news articles or if your primary need is general-purpose, non-historical text analysis.
Stars: 8
Forks: —
Language: —
License: —
Category
Last pushed: Jan 13, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Duke-Chronicle-Project/awesome-historical-newspaper-analysis"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
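For scripted use, the same request can be sketched in Python with only the standard library. This is a minimal sketch, not a documented client: the helper names `quality_url` and `fetch_quality` are hypothetical, the path layout (a category segment such as `nlp`, then owner, then repository) is inferred from the single curl example above, and the response is assumed to be JSON, which the page does not state.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, as inferred
    from the one example URL shown on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the endpoint returns a JSON body. No API key is sent,
    so the keyless limit of 100 requests/day applies.
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    data = fetch_quality(
        "nlp",
        "Duke-Chronicle-Project",
        "awesome-historical-newspaper-analysis",
    )
    # The response schema is not documented here, so just pretty-print it.
    print(json.dumps(data, indent=2))
```

Keeping URL construction separate from the network call makes it easy to check the generated address against the curl example before spending any of the daily request quota.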
Higher-rated alternatives
deepdoctection/deepdoctection
A Repo For Document AI
deanmalmgren/textract
extract text from any document. no muss. no fuss.
eikek/docspell
Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources...
zzzDavid/ICDAR-2019-SROIE
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic...