TianLangStudio/DataXServer
为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能
This tool helps data engineers and operations teams manage and scale their data synchronization and migration tasks. It takes DataX configurations, which define how data moves between different sources and destinations, and allows you to submit them remotely for execution. The output is your data successfully transferred, with options to monitor task status and performance metrics.
144 stars.
Use this if you need to run DataX jobs across multiple machines, integrate DataX into existing systems via HTTP or Thrift, or require dynamic scaling of your data transfer operations.
Not ideal if you only need to run simple, one-off data transfers on a single machine without remote management or distributed processing requirements.
Stars
144
Forks
72
Language
Scala
License
Apache-2.0
Category
Last pushed
Mar 24, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/TianLangStudio/DataXServer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.