PNNL-CompBio/coderdata

Dataset package for facile training and testing of machine learning/AI algorithms that predict drug response in cancer model systems.

56
/ 100
Established

This project provides a standardized collection of cancer-related molecular and drug sensitivity data. It takes raw omics data and drug response measurements, processes them, and outputs harmonized datasets. Cancer researchers and computational biologists can use this to develop and test machine learning models for predicting how cancer cells will respond to different drug treatments.

Available on PyPI.

Use this if you need a reliable, pre-processed benchmark dataset to train and validate machine learning algorithms that predict drug outcomes in cancer models.

Not ideal if you are looking for raw, uncurated data or if your research is outside the domain of cancer drug response prediction.

cancer-research drug-discovery computational-biology genomics precision-medicine
Maintenance 10 / 25
Adoption 6 / 25
Maturity 25 / 25
Community 15 / 25

How are scores calculated?

Stars

20

Forks

4

Language

Jupyter Notebook

License

BSD-2-Clause

Last pushed

Feb 11, 2026

Commits (30d)

0

Dependencies

5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/PNNL-CompBio/coderdata"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.