practikpharma/PGxCorpus

PGxCorpus, a manually annotated corpus, designed for the extraction of pharmacogenomic relations from text.

/ 100

Emerging

Pharmacogenomics researchers or clinical pharmacologists can use this manually annotated collection of scientific sentences to understand relationships between genes, drugs, and diseases. It takes text from PubMed abstracts and highlights key pharmacogenomic entities and their connections. This resource is ideal for those studying drug response variability based on genetic factors.

No commits in the last 6 months.

Use this if you need a meticulously categorized dataset of pharmacogenomic information to train or validate systems that automatically extract drug-gene interactions from scientific literature.

Not ideal if you are looking for a tool to perform live text analysis or to directly query a database of pharmacogenomic facts rather than a corpus for machine learning.

pharmacogenomics drug-discovery biomedical-research clinical-pharmacology literature-mining

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Lua

License

—

Higher-rated alternatives

Helsinki-NLP/OpusFilter

OpusFilter - Parallel corpus processing toolkit

natasha/corus

Links to Russian corpora + Python functions for loading and parsing

darija-open-dataset/dataset

darija <-> english dataset

omicsNLP/Auto-CORPus

Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London...

SergeyShk/ruTS

Библиотека для извлечения статистик из текстов на русском языке.

Explore NLP Tools

All categories Trending NLP directory Insights