zonghui0228/BioMedical-NLP-corpus

Biomedical NLP Corpus or Datasets.

Experimental

This project is a collection of datasets for anyone working with biomedical text, such as research scientists, clinical informaticists, and bio-data analysts. The datasets support tasks like identifying specific medical terms, extracting relationships between concepts, and standardizing (normalizing) biological entities. The underlying text comes from scientific papers, clinical notes, and patents, and is provided as structured data ready for computational analysis.

No commits in the last 6 months.

Use this if you need pre-labeled data to train or evaluate models for understanding biomedical text, such as identifying diseases, genes, drugs, or symptoms from medical literature or patient records.

Not ideal if you are looking for a ready-to-use software application or an API; this project provides raw data collections rather than an executable tool.
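
As an illustration of what working with pre-labeled data of this kind can involve, here is a minimal sketch that collapses token-level BIO tags into entity spans. It assumes a BIO tagging scheme, which is common for biomedical named entity recognition corpora but is not confirmed for any particular dataset in this collection. The function name `bio_to_spans`, the example sentence, and the `Chemical` and `Disease` labels are hypothetical.

```python
from typing import List, Optional, Tuple


def bio_to_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Collapse BIO tags into (entity_text, entity_type) pairs.

    Assumption: tags look like "B-Type", "I-Type", or "O". An "I-" tag that
    does not continue an open entity of the same type is treated as "O".
    """
    spans: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None

    def close_entity() -> None:
        # Emit the entity collected so far, if any, and reset the state.
        nonlocal current_tokens, current_type
        if current_tokens and current_type is not None:
            spans.append((" ".join(current_tokens), current_type))
        current_tokens = []
        current_type = None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A new entity begins; finish any entity that was open.
            close_entity()
            current_tokens = [token]
            current_type = tag[2:]
        elif tag.startswith("I-") and current_tokens and tag[2:] == current_type:
            # Continuation of the currently open entity.
            current_tokens.append(token)
        else:
            # "O" or an inconsistent "I-" tag ends the open entity.
            close_entity()

    close_entity()
    return spans


# Hypothetical example sentence, not taken from any dataset in the collection.
tokens = ["Aspirin", "reduces", "the", "risk", "of", "myocardial", "infarction", "."]
tags = ["B-Chemical", "O", "O", "O", "O", "B-Disease", "I-Disease", "O"]
print(bio_to_spans(tokens, tags))
# [('Aspirin', 'Chemical'), ('myocardial infarction', 'Disease')]
```

Datasets distributed in other formats (for example standoff annotations with character offsets) would need a different reader, so check each dataset's own documentation before reusing this sketch.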

biomedical-research clinical-informatics medical-data-analysis pharmacovigilance bio-ontology

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 8 / 25

Higher-rated alternatives

hellohaptik/chatbot_ner

chatbot_ner: Named Entity Recognition for chatbots.

openeventdata/mordecai

Full text geoparsing as a Python library

Rostlab/nalaf

NLP framework in python for entity recognition and relationship extraction

mpuig/spacy-lookup

Named Entity Recognition based on dictionaries

NorskRegnesentral/skweak

skweak: A software toolkit for weak supervision applied to NLP tasks
