elisim/hydra-sklearn-pipelines
Code accompanying the blog post "Creating Configurable Data Pre-Processing Pipelines by Combining Hydra and Sklearn" by Eli Simhayev and Benjamin Bodner.
This project helps machine learning engineers and data scientists quickly configure and run different data preprocessing workflows. It takes raw or structured datasets and applies a sequence of cleaning, transformation, and feature engineering steps, producing a ready-to-model dataset. This is ideal for practitioners who need to experiment with various data preparation strategies before training a machine learning model.
No commits in the last 6 months.
Use this if you are a machine learning engineer or data scientist who needs a structured and repeatable way to define and execute different data preprocessing pipelines for your experiments.
Not ideal if you are looking for a general-purpose data transformation tool for business intelligence or simple data cleaning that doesn't involve machine learning.
Stars: 28
Forks: 4
Language: Jupyter Notebook
License: —
Category:
Last pushed: Jun 26, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/elisim/hydra-sklearn-pipelines"
Open to everyone: 100 requests/day with no key needed, or 1,000/day with a free key.