for0nething/RECON

Coresets over Multiple Tables for Feature-rich and Data-efficient Machine Learning

/ 100

Emerging

This tool helps data scientists and machine learning engineers build predictive models faster and more efficiently when dealing with large, complex datasets spread across multiple tables. It takes your raw, multi-table data and outputs a smaller, representative 'coreset' that preserves the key characteristics of the original data. This coreset can then be used for training classification or regression models, saving significant computation time and resources.

No commits in the last 6 months.

Use this if you need to train machine learning models on very large datasets composed of many joined tables, and you want to reduce training time and computational cost without sacrificing model accuracy.

Not ideal if your datasets are small, or if your machine learning workflow does not involve complex joins across multiple data sources.

data-efficiency predictive-modeling large-scale-data relational-data machine-learning-optimization

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

feature-engine/feature_engine

Feature engineering and selection open-source Python library compatible with sklearn.

alteryx/featuretools

An open source python library for automated feature engineering

cod3licious/autofeat

Linear Prediction Model with Automated Feature Engineering and Selection Capabilities

abess-team/abess

Fast Best-Subset Selection Library

rodrigo-arenas/Sklearn-genetic-opt

ML hyperparameters tuning and features selection, using evolutionary algorithms.

Explore ML Frameworks

All categories Trending ML Framework directory Insights