a-partovii/Gini-Impurity

This repository contains Python scripts for calculating the Gini Impurity measure for each feature in a relational dataset, great for feature selection, data preprocessing, decision tree construction, binary classification tasks.

29
/ 100
Experimental

When analyzing your data, this tool helps you understand which features, like 'age' or 'income', are most impactful for making decisions, especially in binary situations (e.g., 'yes'/'no' outcomes). It takes your structured dataset (CSV or Excel) and outputs an impurity score for each feature, helping data analysts or students identify key variables. This is particularly useful for preparing data for decision-making models.

Use this if you need to quickly identify which features in your dataset are most relevant for predicting a simple 'yes' or 'no' outcome, or any two-category target.

Not ideal if you're building a production-ready system or need to work with more complex, multi-category outcomes or unstructured data.

data-analysis feature-selection data-preprocessing decision-tree-preparation binary-classification-support
No License No Package No Dependents
Maintenance 6 / 25
Adoption 9 / 25
Maturity 8 / 25
Community 6 / 25

How are scores calculated?

Stars

76

Forks

3

Language

Python

License

Last pushed

Nov 02, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/a-partovii/Gini-Impurity"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.