GregoryKogan/yt-framework
Build scalable data pipelines on YTsaurus with automatic stage management, local development simulation, and more.
This framework helps data engineers build and manage complex data processing workflows on YTsaurus (YT) clusters. You define your data tasks as 'stages,' and the framework takes care of running them efficiently, whether you're prototyping on your local machine or deploying to a large cluster. It simplifies moving raw data through various processing steps to produce refined datasets or analytical results.
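To make the "stages" idea concrete, here is a minimal sketch in plain Python. It does not use the yt-framework API: `Stage`, `run_pipeline`, `drop_empty`, and `add_length` are hypothetical names chosen for illustration, and the real framework's classes and configuration may look quite different. The sketch only shows the general pattern of chaining small processing steps so that each stage consumes the previous stage's output.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List

Row = Dict[str, object]


@dataclass
class Stage:
    """Hypothetical stand-in for a pipeline stage (not the yt-framework API)."""
    name: str
    run: Callable[[List[Row]], List[Row]]


def run_pipeline(stages: Iterable[Stage], rows: List[Row]) -> List[Row]:
    """Run the stages in order, feeding each stage's output to the next."""
    for stage in stages:
        rows = stage.run(rows)
    return rows


def drop_empty(rows: List[Row]) -> List[Row]:
    # Keep only rows whose "text" field is present and non-empty.
    return [r for r in rows if r.get("text")]


def add_length(rows: List[Row]) -> List[Row]:
    # Add a derived "length" column to every row.
    return [{**r, "length": len(str(r["text"]))} for r in rows]


PIPELINE = [Stage("drop_empty", drop_empty), Stage("add_length", add_length)]

if __name__ == "__main__":
    raw = [{"text": "hello"}, {"text": ""}, {"text": "yt"}]
    print(run_pipeline(PIPELINE, raw))
```

In this toy version the data lives in memory as a list of dictionaries; on a real cluster each stage would read from and write to YT tables instead.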
Available on PyPI.
Use this if you are a data engineer or MLOps engineer needing to build, test, and deploy robust, scalable data pipelines on a YTsaurus cluster.
Not ideal if you don't use YTsaurus for your data infrastructure or if you're looking for a low-code data integration tool.
Stars: 19
Forks: not reported
Language: Python
License: Apache-2.0
Category: not shown
Last pushed: Mar 28, 2026
Monthly downloads: 406
Commits (30d): 0
Dependencies: 6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/GregoryKogan/yt-framework"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
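The same request can be made from Python with only the standard library. This is a sketch under two assumptions that the page does not confirm: that the URL path follows a category/owner/repository pattern (inferred from the single example above), and that the endpoint returns JSON. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; the path layout is inferred from the curl example."""
    return f"{BASE_URL}/{quote(category)}/{quote(owner)}/{quote(repo)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository (assumes a JSON response body)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Performs a real network request and counts against the daily limit.
    print(fetch_quality("data-engineering", "GregoryKogan", "yt-framework"))
```

Keeping URL construction separate from the network call makes the helper easy to check without spending any of the daily request quota.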
Higher-rated alternatives
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.