rhiever/datacleaner

A Python tool that automatically cleans data sets and readies them for analysis.

60
/ 100
Established

This tool helps data analysts and scientists quickly prepare raw tabular datasets for further analysis. It takes your CSV or similar file, identifies common issues like missing values and text-based categories, and outputs a cleaned version where these issues are addressed, making it ready for statistical models or machine learning. It's designed for anyone who regularly works with structured data and needs to streamline their data preparation.

1,078 stars. No commits in the last 6 months. Available on PyPI.

Use this if you routinely deal with datasets containing missing values or non-numerical categorical features that need to be transformed for analysis.

Not ideal if your data is unstructured text, images, or requires complex domain-specific parsing before it can be represented in a table.

data-preparation data-wrangling feature-engineering statistical-analysis machine-learning-prep
Stale 6m No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 25 / 25

How are scores calculated?

Stars

1,078

Forks

206

Language

Python

License

MIT

Last pushed

May 22, 2019

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/rhiever/datacleaner"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.