Pinak-Datta/wiz-craft
A CLI-based dataset preprocessing tool for machine learning tasks. Features include data exploration, null value handling, one-hot encoding, and feature scaling. The modified dataset can then be downloaded.
This tool helps data scientists prepare raw datasets for machine learning models. You supply a dataset with mixed data types, and it outputs a cleaned, transformed dataset ready for model training. It is aimed at handling common data preparation tasks quickly.
No commits in the last 6 months. Available on PyPI.
Use this if you need to quickly inspect, clean, and transform your datasets using a command-line interface before training machine learning models.
Not ideal if you prefer a graphical user interface or need highly customized, advanced data manipulation techniques beyond standard preprocessing.
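The preprocessing steps named in the description (null value handling, one-hot encoding, feature scaling) can be sketched in plain Python. This is only an illustration of those steps, not wiz-craft's own code or API. The function names, the mean-fill and min-max choices, and the sample rows are all assumptions made for the example.

```python
from statistics import mean

def fill_missing(rows, column):
    """Replace None in a numeric column with the mean of the observed values."""
    observed = [r[column] for r in rows if r[column] is not None]
    fill = mean(observed)
    return [{**r, column: fill if r[column] is None else r[column]} for r in rows]

def one_hot(rows, column):
    """Replace a categorical column with one 0/1 column per category."""
    categories = sorted({r[column] for r in rows})
    encoded = []
    for r in rows:
        new_row = {k: v for k, v in r.items() if k != column}
        for c in categories:
            new_row[f"{column}_{c}"] = 1 if r[column] == c else 0
        encoded.append(new_row)
    return encoded

def min_max_scale(rows, column):
    """Rescale a numeric column to the [0, 1] range."""
    values = [r[column] for r in rows]
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1  # avoid division by zero for a constant column
    return [{**r, column: (r[column] - lo) / span} for r in rows]

# Made-up sample data, used only to exercise the three steps.
rows = [
    {"age": 20, "city": "Pune"},
    {"age": None, "city": "Delhi"},
    {"age": 40, "city": "Pune"},
]
cleaned = min_max_scale(one_hot(fill_missing(rows, "age"), "city"), "age")
```

Each function returns new rows instead of mutating its input, so the steps can be chained in any order that suits the dataset.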
Stars: 16
Forks: 4
Language: Python
License: MIT
Category:
Last pushed: Oct 18, 2023
Commits (30d): 0
Dependencies: 5
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Pinak-Datta/wiz-craft"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
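The same request can be made from Python with only the standard library. This is a minimal sketch: the category/owner/repo path layout is inferred from the single URL above, the helper names are hypothetical, and the response is returned as raw text because its format is not documented here.

```python
from urllib.parse import quote
from urllib.request import urlopen

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"

def build_url(category, owner, repo):
    """Build the endpoint URL; the path layout is assumed from the curl example."""
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"

def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the raw response body as text (UTF-8 decoding is an assumption)."""
    with urlopen(build_url(category, owner, repo), timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_quality("ml-frameworks", "Pinak-Datta", "wiz-craft")` issues one request, which counts against the daily limit stated above.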
Higher-rated alternatives
open-edge-platform/datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage...
explosion/ml-datasets
🌊 Machine learning dataset loaders for testing and example scripts
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with...
tensorflow/datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
mlcommons/croissant
Croissant is a high-level format for machine learning datasets that brings together four rich layers.