kedro-org/kedro

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.

77
/ 100
Verified

Kedro helps data scientists and engineers build robust, reproducible, and maintainable data pipelines. It takes raw data from various sources (like files or cloud storage) and processes it through a series of steps to produce clean datasets, features for machine learning, or analytical reports. This is for data science teams who need to move beyond one-off scripts and collaborate on production-ready data projects.

10,786 stars. Used by 5 other packages. Actively maintained with 17 commits in the last 30 days. Available on PyPI.

Use this if you need to transform raw data into a clean, structured format for analysis or machine learning, ensuring your data workflows are organized, testable, and can be easily updated or shared.

Not ideal if you're only performing quick, exploratory data analysis on small datasets that don't require a structured, multi-step pipeline or team collaboration.

data-pipeline-development machine-learning-engineering data-workflow-automation reproducible-research data-quality-assurance
Maintenance 17 / 25
Adoption 15 / 25
Maturity 25 / 25
Community 20 / 25

How are scores calculated?

Stars

10,786

Forks

1,014

Language

Python

License

Apache-2.0

Last pushed

Mar 12, 2026

Commits (30d)

17

Dependencies

18

Reverse dependents

5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/mlops/kedro-org/kedro"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.