dataclr/dataclr

Feature selection for tabular datasets using advanced filter and wrapper methods

/ 100

Emerging

When building predictive models from large tables of data, it's often hard to know which columns (features) are most important. This tool helps data scientists and machine learning engineers intelligently identify the most impactful features from their raw tabular datasets. You input your raw data and a predictive model, and it outputs a prioritized list of the most relevant features, improving your model's accuracy and simplicity.

No commits in the last 6 months. Available on PyPI.

Use this if you are a data scientist or ML engineer struggling to select the best features from a complex tabular dataset for your classification or regression models.

Not ideal if you are working with unstructured data like images, text, or audio, or if you need a simple, manual feature selection method.

predictive-modeling machine-learning data-preparation model-optimization feature-engineering

Stale 6m

Maintenance 0 / 25

Adoption 6 / 25

Maturity 25 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

feature-engine/feature_engine

Feature engineering and selection open-source Python library compatible with sklearn.

alteryx/featuretools

An open source python library for automated feature engineering

cod3licious/autofeat

Linear Prediction Model with Automated Feature Engineering and Selection Capabilities

abess-team/abess

Fast Best-Subset Selection Library

rodrigo-arenas/Sklearn-genetic-opt

ML hyperparameters tuning and features selection, using evolutionary algorithms.

Explore ML Frameworks

All categories Trending ML Framework directory Insights