SAP/knn-sampler

Machine learning imputation method to recover the distribution of missing values, based on kNN. This method can be enabled to be used as multiple imputation and provide uncertainty quantification.

39
/ 100
Emerging

This tool helps data scientists and analysts clean up datasets that have gaps or missing entries. You provide a dataset with incomplete information, and it fills in those blanks using a sophisticated kNN-based method. The output is a complete dataset that better reflects the original data's underlying patterns, enabling more reliable analysis and modeling.

Use this if you need to fill in missing values in your datasets accurately, especially when preserving the original data distribution and understanding the uncertainty of those imputed values is important.

Not ideal if you're looking for a simple, quick fix for missing data without concern for statistical rigor or the underlying data distribution.

data-cleaning statistical-analysis dataset-preparation machine-learning-engineering data-quality
No Package No Dependents
Maintenance 10 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 8 / 25

How are scores calculated?

Stars

9

Forks

1

Language

Jupyter Notebook

License

Apache-2.0

Last pushed

Mar 13, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/SAP/knn-sampler"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.