galafis/distributed-data-processing-pipeline
Enterprise-grade distributed data processing pipeline with Apache Spark (Scala + Python), Delta Lake, and Airflow orchestration
Stars
1
Forks
—
Language
Python
License
MIT
Category
Last pushed
Mar 16, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/galafis/distributed-data-processing-pipeline"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hi-primus/optimus
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
fal-ai/dbt-fal
do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts,...
hiazevedo/databricks-portfolio
Portfólio de projetos práticos de Data Engineering e ML com Databricks — Delta Lake, MLflow,...
joekakone/db-analytics-tools
Databases Analytics Tools - Data Integration - Data Visualization - Machine Learning