airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
This platform helps data professionals gather information from sources such as business applications, databases, and files, and centralize it in a single data warehouse or data lake. It takes raw data from hundreds of different systems and delivers clean, organized data ready for analysis. Data engineers, analysts, and operations teams use it to build reliable data pipelines.
20,904 stars. Actively maintained with 897 commits in the last 30 days.
Use this if you need to regularly pull data from many different operational systems and consolidate it for reporting, analytics, or machine learning projects.
Not ideal if you only need to move a small amount of data manually or perform simple data transformations within a single system.
Stars: 20,904
Forks: 5,097
Language: Python
License: (not listed)
Category: (not listed)
Last pushed: Mar 19, 2026
Commits (30d): 897
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/airbytehq/airbyte"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
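For scripted use, the same request can be made from Python. The following is a minimal sketch using only the standard library. The path pattern (domain, owner, repository) is inferred from the single curl URL above, the helper names `quality_url` and `fetch_quality` are hypothetical, and the response is assumed to be JSON, which this page does not confirm. How an API key is supplied is not stated here, so the sketch sends none.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(domain: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, assuming the pattern /<domain>/<owner>/<repo>."""
    # Percent-encode each segment so unusual characters cannot alter the path.
    parts = [quote(part, safe="") for part in (domain, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(domain: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a keyless GET request and decode the body.

    Assumption: the endpoint returns JSON. If it does not, json.loads raises
    a ValueError. HTTP errors (for example, after exceeding the daily request
    limit) surface as urllib.error.HTTPError.
    """
    request = urllib.request.Request(
        quality_url(domain, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("data-engineering", "airbytehq", "airbyte")` would issue the same request as the curl command. The fields of the returned object are not documented on this page, so inspect the result before relying on any particular key.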
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.