ericbrasiln/pyHDB
pyHDB - Ferramenta de auxílio metodológico para pesquisas na interface da Hemeroteca Digital Brasileira da Biblioteca Nacional. Desenvolvida por Eric Brasil (IHLM-UNILAB) como parte de pesquisa acadêmica da área de História Digital.
This tool helps researchers in fields like History or Social Sciences systematically collect information from the Brazilian Digital Newspaper Library (Hemeroteca Digital Brasileira). You provide keywords or criteria, and it automates the process of finding and documenting relevant historical newspaper content. It’s designed for academics and historians who need to ensure methodological rigor in their digital research by accurately logging data collection steps.
No commits in the last 6 months.
Use this if you are a historian or academic regularly searching and documenting historical newspaper content from the Hemeroteca Digital Brasileira and need a robust way to record your research steps.
Not ideal if you are looking for a simple, browser-based search interface or do not work with the Hemeroteca Digital Brasileira specifically.
Stars
11
Forks
10
Language
Python
License
MIT
Category
Last pushed
Oct 03, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/ericbrasiln/pyHDB"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.