Livingston-k/cleanPyData

cleanPyData is a Python package for data cleaning and preprocessing. It handles missing values, normalizes data, extracts features, and detects outliers, making your data ready for analysis or machine learning.

43
/ 100
Emerging

When preparing data for analysis or machine learning, you often encounter messy datasets with gaps, inconsistencies, or unusual entries. This tool helps you transform raw, incomplete data into a clean, standardized format ready for modeling. It takes your unrefined tabular data and outputs a polished dataset, making it ideal for data scientists, analysts, and machine learning engineers.

No commits in the last 6 months. Available on PyPI.

Use this if you need to quickly and systematically clean, normalize, and refine your tabular data before using it for predictive modeling or insightful reports.

Not ideal if your primary need is complex feature engineering for unstructured data like text or images, or if you're looking for advanced statistical modeling capabilities.

data-preparation data-analysis machine-learning-prep data-quality dataset-refinement
Stale 6m
Maintenance 0 / 25
Adoption 4 / 25
Maturity 25 / 25
Community 14 / 25

How are scores calculated?

Stars

8

Forks

3

Language

Python

License

MIT

Last pushed

May 25, 2024

Commits (30d)

0

Dependencies

2

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Livingston-k/cleanPyData"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.