Model Inference Serving MLOps Tools
35 model inference serving tools are tracked. One scores above 70 (Verified tier). The highest-rated is feast-dev/feast at 84/100 with 6,793 stars. Two of the top 10 are actively maintained.
Get all 35 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=mlops&subcategory=model-inference-serving&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
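For scripted access, the same request can be made from Python with only the standard library. This is a minimal sketch rather than an official client: the helper names `build_url` and `fetch_projects` are hypothetical, the query parameters are copied from the curl command above, and the shape of the returned JSON is not documented on this page, so the code only decodes the payload without assuming any field names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="mlops", subcategory="model-inference-serving", limit=20):
    """Build the dataset URL with the same query parameters as the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Fetch the URL and decode the JSON body.

    The payload structure is an unknown here, so it is returned as-is
    (whatever json.load produces) for the caller to inspect.
    """
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_projects(build_url())` performs the anonymous request, which counts against the 100 requests/day limit. The curl example uses `limit=20`; whether a larger value returns all 35 projects in one call is an assumption to check against the API itself.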
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 | feast-dev/feast: The Open Source Feature Store for AI/ML | 84 | Verified |
| 2 | clearml/clearml-serving: ClearML - Model-Serving Orchestration and Repository Solution | | Established |
| 3 | lakehq/sail: LakeSail's computation framework with a mission to unify batch processing,... | | Established |
| 4 | PaddlePaddle/Serving: A flexible, high-performance carrier for machine learning models (PaddlePaddle serving deployment framework) | | Established |
| 5 | SeldonIO/MLServer: An inference server for your machine learning models, including support for... | | Established |
| 6 | sustainable-computing-io/kepler-model-server: Model Server for Kepler | | Established |
| 7 | pytorch/serve: Serve, optimize and scale PyTorch models in production | | Established |
| 8 | raptor-ml/raptor: Transform your pythonic research to an artifact that engineers can deploy easily. | | Emerging |
| 9 | sustainable-computing-io/kepler-model-db: Repository containing up-to-date models to be used by the kepler-model-server | | Emerging |
| 10 | tugraz-isds/systemds: An open source ML system for the end-to-end data science lifecycle | | Emerging |
| 11 | george0st/qgate-model: ML/AI meta-model, used in MLRun/Iguazio/Nuclio, see qgate-sln- | | Emerging |
| 12 | fuseml/fuseml-core: FuseML APIs and core service. This repo include the FuseML client useful to... | | Emerging |
| 13 | Kenza-AI/kenza: Open-Source Machine Learning Platform | | Emerging |
| 14 | bioinformatist/cml: A Framework for Production-Ready Continuous Machine Learning | | Emerging |
| 15 | aporia-ai/inferencedb: 🚀 Stream inferences of real-time ML models in production to any data lake... | | Emerging |
| 16 | eora-ai/inferoxy: Service for quick deploying and using dockerized Computer Vision models | | Emerging |
| 17 | gasparian/ml-serving-template: Serving large ml models independently and asynchronously via message queue... | | Experimental |
| 18 | datamass-io/ml-kraken: Machine-Learning orchestration framework. Cloud-based models management environment. | | Experimental |
| 19 | mmziyad/flink-ms: Serving layer for large machine learning models on Apache Flink | | Experimental |
| 20 | puneethkotha/Falcon: Production ML inference platform. Multi-worker · Nginx load balancing ·... | | Experimental |
| 21 | JohnJTK/crucible_train: 🚀 Accelerate ML training on the BEAM with CrucibleTrain's unified... | | Experimental |
| 22 | HighviewOne/ml-model-registry: ML Model Registry & Deployment Dashboard - AI Dev Tools Zoomcamp 2025 | | Experimental |
| 23 | gdroguski/MLServe: Docker-based Machine Learning models serving | | Experimental |
| 24 | ameron-ai/model-serving-sidecar-service-example: A simple Python example of a Model Service that can be fronted by the Model Sidecar | | Experimental |
| 25 | ameron-ai/model-serving-sidecar: A lightweight adapter that handles all the cross-cutting concerns for model serving | | Experimental |
| 26 | North-Shore-AI/crucible_model_registry: ML model registry for the Crucible ecosystem. Artifact storage, model... | | Experimental |
| 27 | North-Shore-AI/crucible_train: ML training orchestration for the Crucible ecosystem. Distributed training,... | | Experimental |
| 28 | North-Shore-AI/crucible_feedback: ML feedback loop management for the Crucible ecosystem. Quality monitoring,... | | Experimental |
| 29 | North-Shore-AI/crucible_deployment: ML model deployment for the Crucible ecosystem. vLLM and Ollama integration,... | | Experimental |
| 30 | galafis/distributed-model-inference-engine: Distributed model inference engine with REST/gRPC serving, circuit breaker,... | | Experimental |
| 31 | man4ish/omnibioai-model-registry: Production-grade model registry for the OmniBioAI ecosystem, providing... | | Experimental |
| 32 | arthurhzna/Golang_AI_Pipeline: A scalable AI task queue and processing pipeline in Go, integrating Redis,... | | Experimental |
| 33 | narphu/modelcache-operator: Model caching in Kubernetes | | Experimental |
| 34 | marekgalovic/photon: ML model serving service. | | Experimental |
| 35 | markdouthwaite/apogee-server: Probabilistic Graphical Models on Kubernetes | | Experimental |