ishida-lab/irreducible

[ICLR 2023] Is the Performance of My Deep Network Too Good to Be True? A Direct Approach to Estimating the Bayes Error in Binary Classification

/ 100

Experimental

This helps machine learning practitioners determine the best possible performance for a binary classification model, considering the inherent uncertainty in the data. It takes in datasets with labels that reflect this uncertainty (e.g., multiple human annotations) and outputs an estimate of the Bayes error, which is the theoretical lower bound for classification error. Data scientists, machine learning engineers, and researchers can use this to benchmark their models and understand data difficulty.

No commits in the last 6 months.

Use this if you need to understand the theoretical limits of a classification model's performance on a specific dataset, especially when evaluating state-of-the-art deep networks or identifying test set overfitting.

Not ideal if you're looking for a tool to improve your model's accuracy directly or if you only have standard, single-label datasets without any information about label uncertainty.

machine-learning model-evaluation classification data-analysis deep-learning

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

GPL-3.0

Higher-rated alternatives

EmuKit/emukit

A Python-based toolbox of various methods in decision making, uncertainty quantification and...

google/uncertainty-baselines

High-quality implementations of standard and SOTA methods on a variety of tasks.

nielstron/quantulum3

Library for unit extraction - fork of quantulum for python3

IBM/UQ360

Uncertainty Quantification 360 (UQ360) is an extensible open-source toolkit that can help you...

aamini/evidential-deep-learning

Learn fast, scalable, and calibrated measures of uncertainty using neural networks!

Explore ML Frameworks

All categories Trending ML Framework directory Insights