IliaZenkov/async-pubmed-scraper
PubMed scraper for async search on a list of keywords and concurrent extraction of all found URLs, returning a DataFrame/CSV containing all article data (title, abstract, authors, affiliations, etc)
This tool helps researchers, scientists, and medical professionals quickly gather information on scholarly articles from PubMed. You input a list of keywords and desired publication years, and it outputs a spreadsheet (CSV) containing article titles, abstracts, authors, affiliations, journal names, and publication dates for all found results. It's designed for anyone needing to efficiently collect large datasets of medical and life sciences literature for analysis.
No commits in the last 6 months.
Use this if you need to rapidly extract detailed information from thousands of PubMed articles based on multiple keywords and specific date ranges, significantly faster than using standard methods.
Not ideal if you only need to look up a few articles or prefer using PubMed's official API with its rate limits for smaller, less time-sensitive data collection tasks.
Stars
45
Forks
17
Language
Python
License
MIT
Category
Last pushed
Nov 05, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/IliaZenkov/async-pubmed-scraper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser...
plabayo/rama
modular service framework to move and transform network packets
scrapinghub/spidermon
Scrapy Extension for monitoring spiders execution.