gbazad93/AirFlow-ML-Data-Integration
An Airflow-based pipeline that fetches data from a free API, cleans and transforms it, and saves it to a database—ready for downstream machine learning.
This project helps data professionals automate the daily collection of weather data. It takes raw weather information from a public API, cleans and organizes it, and then stores it in a database. This prepared data is then ready for further analysis, like building predictive models or creating dashboards. Data engineers or analysts who need a reliable, automated way to feed external weather data into their systems for reporting or machine learning would find this useful.
No commits in the last 6 months.
Use this if you need to set up a robust, automated daily pipeline to fetch, clean, and store external weather data in a PostgreSQL database for downstream analytics or machine learning.
Not ideal if you are looking for a pre-built machine learning model or a dashboard, as this project focuses solely on the data integration pipeline.
Stars
16
Forks
1
Language
Python
License
—
Category
Last pushed
Mar 26, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/gbazad93/AirFlow-ML-Data-Integration"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
elyra-ai/pipeline-editor
Common pipeline-editor components used in different clients (e.g. Elyra application, Web browser...
orchest/orchest
Build data pipelines, the easy way 🛠️
stitchfix/hamilton
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN...
chayansraj/Python-ETL-pipeline-using-Airflow-on-AWS
This project demonstrates how to build and automate an ETL pipeline written in Python and...
Codex56799/dataengineering
🚀 Build a containerized data engineering workflow for NYC Yellow Taxi Trip Data using Apache...