phuijse/bagging_pu

Simple sklearn based python implementation of Positive-Unlabeled (PU) classification using bagging based ensembles

44
/ 100
Emerging

This tool helps you train a classification model when you only have examples of one class (positive examples) and a large pool of unlabeled data, some of which might also be positive. It takes your positive and unlabeled datasets and outputs a model that can predict whether new, unseen data points are positive or not. This is ideal for data scientists, machine learning engineers, and researchers working with incomplete datasets.

No commits in the last 6 months.

Use this if you need to build a classifier but only have a small number of confirmed 'positive' examples and a large amount of data where the 'negative' examples are not explicitly labeled.

Not ideal if you have clearly labeled examples for both your positive and negative classes, as standard supervised classification methods would be more straightforward.

imbalanced-data-classification machine-learning-engineering data-science semi-supervised-learning
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

93

Forks

20

Language

Jupyter Notebook

License

MIT

Last pushed

Jan 03, 2017

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/phuijse/bagging_pu"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.