DataTalksClub/data-engineering-zoomcamp
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼
This is a free, 9-week course designed to teach you how to build robust, end-to-end data pipelines from the ground up. You'll gain practical experience with industry-standard tools for moving, storing, and transforming data, resulting in the ability to create production-ready data systems. This course is ideal for aspiring data engineers, data analysts looking to specialize, or developers wanting to transition into data roles.
39,193 stars. Actively maintained with 3 commits in the last 30 days.
Use this if you want to master the fundamentals of data engineering and gain hands-on experience building complex data pipelines with modern tools.
Not ideal if you're looking for a quick introduction to data concepts without in-depth, hands-on pipeline construction.
Stars
39,193
Forks
7,884
Language
Jupyter Notebook
License
—
Category
Last pushed
Mar 19, 2026
Commits (30d)
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/DataTalksClub/data-engineering-zoomcamp"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.