sergiog95/csabstracts

Dataset of scientific abstracts for the purpose of sentence classification

26
/ 100
Experimental

This dataset helps researchers, NLP engineers, and data scientists working with scientific literature to automatically categorize sentences within computer science abstracts. It takes raw abstract sentences and provides them pre-labeled with categories like 'Background,' 'Objective,' 'Methods,' 'Results,' and 'Conclusions.' This is ideal for training and evaluating machine learning models designed to understand the structure of scientific papers.

No commits in the last 6 months.

Use this if you need a pre-labeled collection of computer science abstract sentences to train or test models for automatic text summarization, information extraction, or scientific document understanding.

Not ideal if you are looking for abstracts outside of computer science or if you need full-text articles rather than just abstract sentences.

scientific-text-analysis NLP-research academic-document-structuring information-extraction computer-science-literature
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 13 / 25

How are scores calculated?

Stars

10

Forks

2

Language

License

Last pushed

Sep 17, 2019

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/sergiog95/csabstracts"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.