Llm Evaluation Platforms MLOps Tools

There are 20 llm evaluation platforms tools tracked. 1 score above 70 (verified tier). The highest-rated is kserve/kserve at 73/100 with 5,200 stars. 1 of the top 10 are actively maintained.

Get all 20 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=mlops&subcategory=llm-evaluation-platforms&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 kserve/kserve

Standardized Distributed Generative and Predictive AI Inference Platform for...

73
Verified
2 omegaml/omegaml

MLOps simplified. One-stop AI delivery platform, all the features you need.

59
Established
3 awslabs/aiops-modules

AIOps modules is a collection of reusable Infrastructure as Code (IaC)...

56
Established
4 GoogleCloudDataproc/dataproc-ml-python

Library to simplify running distributed ML workloads with Apache Spark

51
Established
5 jina-ai/serve

☁️ Build multimodal AI applications with cloud-native stack

46
Emerging
6 george0st/qgate-sln-mlrun

MLRun/Iguazio/Nuclio quality gate solution. The solution checks a quality of...

46
Emerging
7 aishwaryaprabhat/BigBertha

BigBertha is an architecture design that demonstrates how automated LLMOps...

39
Emerging
8 demml/opsml

Quality Control for AI Artifact Management

39
Emerging
9 awslabs/fmbench-orchestrator

Run FMBench simultaneously across multiple Amazon EC2 machines to benchmark...

38
Emerging
10 Impesud/ai-mlops-project

AI MLOps Project – A production-grade MLOps pipeline for scalable,...

32
Emerging
11 botanu-ai/botanu-sdk-python

SDK to track cost-per-outcome for AI workflows

32
Emerging
12 Ratnesh-181998/Production-Ready-MLOps-Pipelines

Production-grade MLOps pipelines with real-world ML and NLP projects.Covers...

22
Experimental
13 xxxihrmn/llmops

🚀 Discover top tools and resources for Large Language Model Operations...

21
Experimental
14 sochaty/llm-governance-engine

A robust LLM Governance & ROI Evaluation platform designed to benchmark...

21
Experimental
15 jthiruveedula/llmops-evaluation-framework

Production LLMOps platform with automated evaluation, A/B testing, prompt...

14
Experimental
16 jthiruveedula/llmops-mlflow-vertexai

LLMOps platform integrating MLflow experiment tracking, Vertex AI model...

14
Experimental
17 jthiruveedula/real-time-llm-streaming-platform

Kafka + Spark Streaming + LLM inference pipeline for real-time document...

14
Experimental
18 oriolrius/from-mlops-to-llmops

Educational materials for understanding the evolution from MLOps to LLMOps....

13
Experimental
19 rawatshaurya/llm-drift-monitor

Production-style LLM drift monitoring: semantic, structural, safety, and...

13
Experimental
20 lopezleandro03/LLMs-as-a-Service

Deploy LLMs using Azure Model-as-a-Service (MaaS) and Terraform

12
Experimental

Comparisons in this category