edamontology/pubfetcher

A Java command-line tool and library to download and store publications with metadata by combining content from various online resources

/ 100

Emerging

This tool helps researchers, curators, or information scientists gather comprehensive details about academic publications, especially in biomedical and life sciences. It takes publication identifiers (like PMIDs or DOIs) and fetches titles, abstracts, full texts, keywords, and other metadata from various online sources. The result is a well-organized collection of publication data, stored locally or exported as JSON, ready for further analysis.

Use this if you need to gather detailed and complete information for thousands of biomedical or life sciences publications, pulling content from multiple sources like Europe PMC, PubMed, and even publisher websites.

Not ideal if you need to process millions of publications or if author lists are a critical piece of metadata for your analysis, as this is not currently supported.

biomedical-research literature-review scientific-data-curation bibliometrics life-sciences

No Package No Dependents

Maintenance 10 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Java

License

GPL-3.0

Featured in

Giving AI Agents Eyes: Browser Automation in 2026

Higher-rated alternatives

scrapy/scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Altimis/Scweet

A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers,...

lexiforest/curl_cffi

Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser...

plabayo/rama

modular service framework to move and transform network packets

scrapinghub/spidermon

Scrapy Extension for monitoring spiders execution.

Explore Perception Tools

All categories Trending Perception directory Insights