MTSWebServices/onetl
One ETL tool to rule them all
This tool helps data engineers and analysts build and manage data pipelines to move and transform large datasets efficiently. It takes raw data from various databases and file systems like HDFS or S3, processes it using Apache Spark, and delivers cleaned, structured data to target data stores. Data professionals who regularly integrate and prepare data from disparate sources will find this valuable.
Use this if you need a flexible Python library to build batch-oriented Extract, Transform, Load (ETL) or Extract, Load, Transform (ELT) pipelines using Apache Spark.
Not ideal if you require real-time data streaming capabilities or a full-fledged data orchestration framework with built-in scheduling.
Stars
87
Forks
7
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 18, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/MTSWebServices/onetl"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.