a-partovii/Gini-Impurity
This repository contains Python scripts for calculating the Gini Impurity measure for each feature in a relational dataset, great for feature selection, data preprocessing, decision tree construction, binary classification tasks.
When analyzing your data, this tool helps you understand which features, like 'age' or 'income', are most impactful for making decisions, especially in binary situations (e.g., 'yes'/'no' outcomes). It takes your structured dataset (CSV or Excel) and outputs an impurity score for each feature, helping data analysts or students identify key variables. This is particularly useful for preparing data for decision-making models.
Use this if you need to quickly identify which features in your dataset are most relevant for predicting a simple 'yes' or 'no' outcome, or any two-category target.
Not ideal if you're building a production-ready system or need to work with more complex, multi-category outcomes or unstructured data.
Stars
76
Forks
3
Language
Python
License
—
Category
Last pushed
Nov 02, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/a-partovii/Gini-Impurity"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
google/yggdrasil-decision-forests
A library to train, evaluate, interpret, and productionize decision forest models such as Random...
parrt/dtreeviz
A python library for decision tree visualization and model interpretation.
tensorflow/decision-forests
A collection of state-of-the-art algorithms for the training, serving and interpretation of...
neurodata/treeple
Scikit-learn compatible decision trees beyond those offered in scikit-learn
winkjs/wink-regression-tree
Decision Tree to predict the value of a continuous target variable