dataclr/dataclr

Feature selection for tabular datasets using advanced filter and wrapper methods

39
/ 100
Emerging

When building predictive models from large tables of data, it's often hard to know which columns (features) are most important. This tool helps data scientists and machine learning engineers intelligently identify the most impactful features from their raw tabular datasets. You input your raw data and a predictive model, and it outputs a prioritized list of the most relevant features, improving your model's accuracy and simplicity.

No commits in the last 6 months. Available on PyPI.

Use this if you are a data scientist or ML engineer struggling to select the best features from a complex tabular dataset for your classification or regression models.

Not ideal if you are working with unstructured data like images, text, or audio, or if you need a simple, manual feature selection method.

predictive-modeling machine-learning data-preparation model-optimization feature-engineering
Stale 6m
Maintenance 0 / 25
Adoption 6 / 25
Maturity 25 / 25
Community 8 / 25

How are scores calculated?

Stars

20

Forks

2

Language

Python

License

Apache-2.0

Last pushed

Mar 09, 2025

Commits (30d)

0

Dependencies

12

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/dataclr/dataclr"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.