scikit-learn-contrib/imbalanced-learn

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

77
/ 100
Verified

This tool helps data scientists and machine learning engineers build more accurate predictive models when their datasets have unequal numbers of examples across different categories. It takes a raw, imbalanced dataset and processes it using various re-sampling techniques to create a more balanced dataset, which then leads to improved model performance, especially for the under-represented categories. This is particularly useful for tasks where correctly identifying rare events is critical.

7,090 stars. Used by 23 other packages. Actively maintained with 1 commit in the last 30 days. Available on PyPI.

Use this if your machine learning model struggles to accurately predict outcomes for minority classes because your training data has significantly fewer examples for those classes.

Not ideal if your dataset is already well-balanced across all categories, or if your primary goal is not classification on imbalanced data.

predictive-modeling data-preprocessing imbalanced-data-classification fraud-detection medical-diagnosis
Maintenance 13 / 25
Adoption 15 / 25
Maturity 25 / 25
Community 24 / 25

How are scores calculated?

Stars

7,090

Forks

1,328

Language

Python

License

MIT

Last pushed

Feb 02, 2026

Commits (30d)

1

Dependencies

6

Reverse dependents

23

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/scikit-learn-contrib/imbalanced-learn"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.