imbalanced-learn and imbalanced-ensemble

Imbalanced-learn provides foundational resampling and individual algorithm techniques (SMOTE, random undersampling, etc.), while imbalanced-ensemble builds specialized ensemble methods on top of these primitives to handle class imbalance at the ensemble level, making them complementary tools that can be used together in a machine learning pipeline.

imbalanced-learn
77
Verified
imbalanced-ensemble
64
Established
Maintenance 13/25
Adoption 15/25
Maturity 25/25
Community 24/25
Maintenance 10/25
Adoption 10/25
Maturity 25/25
Community 19/25
Stars: 7,090
Forks: 1,328
Downloads:
Commits (30d): 1
Language: Python
License: MIT
Stars: 418
Forks: 58
Downloads:
Commits (30d): 0
Language: Python
License: MIT
No risk flags
No risk flags

About imbalanced-learn

scikit-learn-contrib/imbalanced-learn

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

This tool helps data scientists and machine learning engineers build more accurate predictive models when their datasets have unequal numbers of examples across different categories. It takes a raw, imbalanced dataset and processes it using various re-sampling techniques to create a more balanced dataset, which then leads to improved model performance, especially for the under-represented categories. This is particularly useful for tasks where correctly identifying rare events is critical.

predictive-modeling data-preprocessing imbalanced-data-classification fraud-detection medical-diagnosis

About imbalanced-ensemble

ZhiningLiu1998/imbalanced-ensemble

🛠️ Class-imbalanced Ensemble Learning Toolbox. | 类别不平衡/长尾机器学习库 [NeurIPS'25]

This tool helps data professionals build more accurate predictive models when their datasets have very skewed categories, like detecting a rare disease or fraudulent transactions. It takes your existing tabular data and applies advanced techniques to balance the categories, outputting a more robust and reliable classification model. Data scientists, machine learning engineers, and analysts working with real-world, messy datasets will find this valuable.

predictive-modeling fraud-detection medical-diagnosis customer-churn risk-assessment

Scores updated daily from GitHub, PyPI, and npm data. How scores work