catalyst-cooperative/pudl
The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.
PUDL makes US government energy data easier to use. It takes raw, complex spreadsheets and databases from agencies like EIA and EPA, cleans them, and provides them as unified, analysis-ready datasets, so you can spend your time on analysis rather than on data preparation.
577 stars. Actively maintained with 55 commits in the last 30 days.
Use this if you need consistent, high-quality US energy data for analysis without spending days cleaning and integrating disparate government sources.
Not ideal if you only need a single, small piece of US energy data that is already perfectly clean and readily available from its original source.
Stars: 577
Forks: 133
Language: Python
License: MIT
Category:
Last pushed: Mar 19, 2026
Commits (30d): 55
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/catalyst-cooperative/pudl"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
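The same request can be made from Python. The sketch below is a minimal example using only the standard library. It makes two assumptions that this page does not confirm: that the three trailing path segments of the URL are a category, a repository owner, and a repository name, and that the endpoint returns JSON. The function names `build_url` and `fetch_quality` are hypothetical helpers, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL (hypothetical helper).

    Assumption: the path is /<category>/<owner>/<repo>, inferred only
    from the shape of the curl example.
    """
    segments = (quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/{'/'.join(segments)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response (hypothetical helper).

    Assumption: the body is JSON. Its schema is not documented here,
    so the decoded value is returned unchanged.
    """
    url = build_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example call (performs a network request and counts against the daily limit):
# data = fetch_quality("data-engineering", "catalyst-cooperative", "pudl")
# print(data)
```

Because unauthenticated access is limited to 100 requests/day, it may be worth caching the response locally rather than calling the endpoint repeatedly.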
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.