dataflint/spark
Drop-in replacement for Apache Spark UI
This tool enhances the standard Apache Spark Web UI, making it much easier to monitor and debug big data processing tasks. It takes raw Spark application metrics and displays them in a clear, intuitive tab within the existing Spark UI, offering real-time insights, performance alerts, and query breakdowns. Data engineers and data scientists who work with Apache Spark will find it invaluable for optimizing their data workflows.
420 stars.
Use this if you need to quickly understand and improve the performance of your Apache Spark jobs without sifting through complex native Spark UI metrics.
Not ideal if you are not using Apache Spark for big data processing or if you prefer command-line monitoring tools.
Stars
420
Forks
50
Language
TypeScript
License
Apache-2.0
Category
Last pushed
Mar 18, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/dataflint/spark"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.