data-engineering-community/data-engineering-wiki
The best place to learn data engineering. Built and maintained by the data engineering community.
This wiki serves as a comprehensive learning platform for anyone looking to understand or advance their skills in data engineering. It takes various data engineering concepts, FAQs, tools, and guides, and transforms them into organized notes and learning resources. This resource is for aspiring or current data engineers who want to deepen their knowledge and make informed decisions in their work.
1,915 stars. Actively maintained with 2 commits in the last 30 days.
Use this if you are a data engineer or aspiring data engineer looking for a centralized, community-curated knowledge base to learn new concepts, find answers to common questions, or explore tools and best practices.
Not ideal if you are a business user or a non-technical professional seeking high-level explanations without diving into the technical specifics of data engineering.
Stars
1,915
Forks
232
Language
CSS
License
CC0-1.0
Category
Last pushed
Mar 27, 2026
Commits (30d)
2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/data-engineering-community/data-engineering-wiki"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.