antoninschrab/mmdfuse

MMD-FUSE package implementing the MMD-FUSE test proposed in MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data Splitting by Biggs, Schrab, and Gretton: https://arxiv.org/abs/2306.08777

/ 100

Experimental

This package helps machine learning researchers or statisticians determine if two sets of data samples come from the same underlying distribution. You input two arrays of data, and it outputs a 0 if the distributions are considered the same, or a 1 if they are different, along with an optional p-value. This is useful for validating models or comparing datasets without needing to split your data.

No commits in the last 6 months.

Use this if you need to quickly and reliably compare two datasets to see if they originate from the same statistical process, especially when data splitting is not ideal.

Not ideal if you are not comfortable working with Python code and installing packages from GitHub, or if you don't have access to a GPU for faster processing of large datasets.

statistical-analysis machine-learning-research data-comparison distribution-testing hypothesis-testing

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

MIT

Higher-rated alternatives

iamDecode/sklearn-pmml-model

A library to parse and convert PMML models into Scikit-learn estimators.

vecxoz/vecstack

Python package for stacking (machine learning technique)

yzhao062/combo

(AAAI' 20) A Python Toolbox for Machine Learning Model Combination

flennerhag/mlens

ML-Ensemble – high performance ensemble learning

aws-samples/aws-machine-learning-university-dte

Machine Learning University: Decision Trees and Ensemble Methods

Explore ML Frameworks

All categories Trending ML Framework directory Insights