kubeflow/mcp-apache-spark-history-server

MCP Server for Apache Spark History Server. The bridge between Agentic AI and Apache Spark.

66
/ 100
Established

This project helps data engineers and ML operations teams intelligently monitor and analyze their Apache Spark jobs. It connects AI agents to your Spark History Server, allowing you to ask questions in natural language about job performance, identify bottlenecks, compare jobs, and investigate failures. The input is your existing Spark History Server data, and the output is AI-driven insights and answers about your Spark application's execution.

135 stars. Available on PyPI.

Use this if you need to use AI agents to understand, troubleshoot, or optimize Apache Spark job performance and resource utilization.

Not ideal if you are looking for a standalone Spark monitoring solution that doesn't involve AI agents or require natural language interaction.

Spark-job-monitoring MLOps data-engineering performance-optimization AI-assisted-troubleshooting
Maintenance 10 / 25
Adoption 10 / 25
Maturity 24 / 25
Community 22 / 25

How are scores calculated?

Stars

135

Forks

46

Language

Python

License

Apache-2.0

Last pushed

Mar 03, 2026

Commits (30d)

0

Dependencies

7

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/mcp/kubeflow/mcp-apache-spark-history-server"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.