richban/opendata-stack-platform
Open Data Stack Platform: a collection of projects and pipelines built with open data stack tools for scalable, observable data platform.
This project helps engineering and analytics teams build robust, scalable platforms for analyzing large datasets. It demonstrates how to ingest raw data, transform it, and prepare it for advanced analytics and machine learning. You feed it raw data from sources like NYC's Open Data portal, and it produces clean, structured data ready for dashboards and models, along with insights into data quality and pipeline performance.
Use this if you are an engineering leader or data analytics professional looking to build or enhance a modern, observable data platform with open-source tools.
Not ideal if you are looking for an out-of-the-box business intelligence solution without needing to build or manage underlying data infrastructure.
Stars
22
Forks
2
Language
Jupyter Notebook
License
—
Category
Last pushed
Mar 22, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/richban/opendata-stack-platform"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.