runprism/prism
Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.
Prism helps data professionals organize and automate complex data workflows, such as cleaning, transforming, and loading data, often called ETL. It takes individual data tasks written in Python and intelligently stitches them together, producing a complete, ordered data pipeline. This tool is for data scientists, data engineers, and data analysts who manage multi-step data projects.
No commits in the last 6 months.
Use this if you need to build, schedule, and run data transformations and analyses written in Python, especially if they involve multiple steps or dependencies on various data sources.
Not ideal if your data tasks are very simple, single-step operations, or if you prefer a low-code/no-code environment.
Stars
87
Forks
2
Language
Python
License
Apache-2.0
Category
Last pushed
Nov 25, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/runprism/prism"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
vaexio/vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of...
alibaba/feathub
FeatHub - A stream-batch unified feature store for real-time machine learning
mindsdb/dbt-mindsdb
dbt adapter for connecting to MindsDB
kevin-hanselman/dud
A lightweight CLI tool for versioning data alongside source code and building data pipelines.