airbytehq/airbyte

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

76
/ 100
Verified

This platform helps data professionals gather information from various sources like business applications, databases, or files, and centralize it into a single data warehouse or data lake. It takes raw data from hundreds of different systems and delivers clean, organized data ready for analysis. Data engineers, analysts, and operations teams use this to build reliable data pipelines.

20,904 stars. Actively maintained with 897 commits in the last 30 days.

Use this if you need to regularly pull data from many different operational systems and consolidate it for reporting, analytics, or machine learning projects.

Not ideal if you only need to move a small amount of data manually or perform simple data transformations within a single system.

data-integration data-warehousing business-intelligence data-pipeline ETL
No Package No Dependents
Maintenance 25 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

How are scores calculated?

Stars

20,904

Forks

5,097

Language

Python

License

Last pushed

Mar 19, 2026

Commits (30d)

897

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/airbytehq/airbyte"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.