vmware/versatile-data-kit
One framework to develop, deploy and operate data workflows with Python and SQL.
Versatile Data Kit (VDK) helps data engineers and analysts build, deploy, and manage their data pipelines efficiently. It simplifies extracting data from various sources like databases or APIs, transforming it using Python or SQL, and loading it into a chosen destination. VDK turns raw data into clean, ready-to-use information for reporting, analytics, or machine learning.
478 stars.
Use this if you need a unified way to develop, deploy, and operate your data ingestion and transformation workflows with Python and SQL.
Not ideal if you prefer a low-code or no-code drag-and-drop interface for building data pipelines.
Stars
478
Forks
66
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 23, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/vmware/versatile-data-kit"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.