AmirhosseinHonardoust/Missing-Data-Doctor

Missing Data Doctor is a diagnostic and treatment toolkit for missing values in machine learning datasets. It profiles missingness patterns, visualizes gaps, applies multiple imputation strategies, and evaluates their impact on model performance. Includes automated plots, metrics, and a full HTML report.

25
/ 100
Experimental

Effectively managing missing data is crucial for reliable machine learning models. This toolkit helps data scientists diagnose missing data patterns, understand which features are most affected, and visualize where the gaps are. It takes a raw dataset with missing values and produces a comprehensive HTML report, charts, and metrics comparing how different imputation strategies impact your model's performance.

Use this if you are a data scientist working with tabular datasets and need to understand, impute, and evaluate the impact of missing values on your predictive models.

Not ideal if you need to process streaming data or very large datasets that don't fit into memory, or if you require highly specialized, domain-specific imputation methods not covered by common strategies.

data-cleaning feature-engineering machine-learning-preparation data-quality predictive-modeling
No Package No Dependents
Maintenance 6 / 25
Adoption 6 / 25
Maturity 13 / 25
Community 0 / 25

How are scores calculated?

Stars

20

Forks

Language

Python

License

MIT

Last pushed

Nov 15, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/AmirhosseinHonardoust/Missing-Data-Doctor"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.