vgupta123/sumpubmed

SUMPUBMED: Summarization Dataset of PubMed Scientific Article

35
/ 100
Emerging

This is a specialized dataset designed to help develop and test tools that automatically summarize scientific articles. It takes full biomedical research papers from PubMed and provides both short and long summaries. This is primarily useful for researchers and machine learning engineers working on natural language processing tasks, specifically in the domain of scientific text summarization.

No commits in the last 6 months.

Use this if you are building or evaluating an AI model that needs to generate concise summaries from long scientific texts, particularly in the biomedical field.

Not ideal if you are looking for a tool to summarize documents directly, as this is a dataset for training and testing, not a summarization application.

biomedical-research natural-language-processing academic-publishing text-summarization scientific-literature
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 12 / 25

How are scores calculated?

Stars

34

Forks

5

Language

License

MIT

Last pushed

May 31, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/vgupta123/sumpubmed"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.