feature-engine/feature_engine

Feature engineering and selection open-source Python library compatible with sklearn.

74
/ 100
Verified

This tool helps data scientists and machine learning engineers prepare their raw datasets for building better predictive models. It takes your raw data, with its missing values, messy categories, and complex numeric features, and systematically transforms it. The output is a cleaned, structured dataset optimized for machine learning algorithms, making your models more accurate and robust.

2,211 stars. Used by 3 other packages. Actively maintained with 1 commit in the last 30 days. Available on PyPI.

Use this if you need to systematically clean, transform, and select features from your datasets before training machine learning models.

Not ideal if you are looking for a fully automated 'black-box' solution without needing to understand or customize data transformation steps.

data preprocessing machine learning predictive modeling feature engineering data cleaning
Maintenance 13 / 25
Adoption 13 / 25
Maturity 25 / 25
Community 23 / 25

How are scores calculated?

Stars

2,211

Forks

338

Language

Python

License

BSD-3-Clause

Last pushed

Mar 09, 2026

Commits (30d)

1

Dependencies

5

Reverse dependents

3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/feature-engine/feature_engine"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.