drivendataorg/cookiecutter-data-science

A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.

Score: 70 / 100 (Verified)

Setting up a data science project means organizing many files and folders. This tool helps data scientists quickly create a standardized, logical structure for new projects, with a consistent layout for raw data, processed data, notebooks, models, and reports from the start. A shared layout makes the project easier for every team member to understand and navigate.
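As a rough illustration of what such a standardized layout amounts to, the sketch below creates a small folder tree with the Python standard library. It is not the tool's own code or template: the `scaffold` helper and the folder names in `LAYOUT` are hypothetical, chosen only to mirror the categories listed above.

```python
from pathlib import Path
import tempfile

# Hypothetical folder names mirroring the categories named in the description;
# the layout generated by the real template may differ.
LAYOUT = (
    "data/raw",
    "data/processed",
    "notebooks",
    "models",
    "reports",
)


def scaffold(root: Path, layout=LAYOUT) -> list[Path]:
    """Create each folder in `layout` under `root` and return the created paths."""
    created = []
    for rel in layout:
        path = root / rel
        # parents=True builds intermediate folders such as "data";
        # exist_ok=True makes the call safe to repeat.
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created


if __name__ == "__main__":
    # Build the tree in a throwaway directory so nothing is left behind.
    with tempfile.TemporaryDirectory() as tmp:
        for p in scaffold(Path(tmp) / "my-project"):
            print(p.relative_to(tmp))
```

The real tool generates far more than empty folders, so this only shows the idea of fixing the layout once and reusing it for every project.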

9,723 stars. Available on PyPI.

Use this if you are a data scientist starting a new project and want to ensure a consistent, well-organized file structure for your data, code, models, and reports.

Not ideal if you are a beginner looking for a simple, one-off script, or if you already have a deeply established and satisfactory project organization system.

Tags: data-science-project-management, data-organization, ml-project-setup, research-workflow, data-pipeline-structure
Maintenance 10 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 25 / 25

Stars: 9,723
Forks: 2,628
Language: Python
License: MIT
Last pushed: Mar 03, 2026
Commits (30d): 0
Dependencies: 3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/drivendataorg/cookiecutter-data-science"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
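The same request can be made from Python. The sketch below is a minimal, unofficial example using only the standard library. It assumes the three path segments in the curl URL are a category, an owner, and a repository name, and that the endpoint returns JSON; neither is confirmed on this page. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl command above.
BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL. The category/owner/repo split is an assumption."""
    parts = (category, owner, repo)
    return BASE + "/" + "/".join(quote(p, safe="") for p in parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming (unverified) that it is JSON."""
    with urlopen(quality_url(category, owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(...) to make the network request.
    print(quality_url("ml-frameworks", "drivendataorg", "cookiecutter-data-science"))
```

With the keyless limit of 100 requests/day, it is worth caching the result locally instead of re-fetching on every run.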