dataform-co/dataform
Dataform is a framework for managing SQL based data operations in BigQuery
This helps data teams manage and transform data stored in Google BigQuery using SQL. You put in raw data in BigQuery and define your transformation steps with SQL, and it outputs structured, tested, and documented data tables ready for analysis. Data engineers and data analysts who work with large datasets in BigQuery will find this useful.
967 stars. Actively maintained with 15 commits in the last 30 days.
Use this if you need to build robust, scalable, and well-managed data transformation pipelines in Google BigQuery, ensuring data quality and clear documentation.
Not ideal if your data is not in Google BigQuery or if you prefer a graphical user interface over code-based SQL transformations.
Stars
967
Forks
196
Language
TypeScript
License
Apache-2.0
Category
Last pushed
Mar 17, 2026
Commits (30d)
15
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/dataform-co/dataform"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.