cncf/llm-in-action
🤖 Discover how to apply your LLM app skills on Kubernetes!
This project helps developers learn how to deploy and manage large language model applications within a Kubernetes environment. It takes locally installed LLMs and deploys them to a local Kubernetes cluster, outputting a web service accessible in your browser. This is for developers or platform engineers who want to gain practical experience with LLM deployments on Kubernetes.
146 stars.
Use this if you are a developer looking for hands-on experience deploying and operating LLM applications in a cloud-native, Kubernetes-based environment.
Not ideal if you are a data scientist or end-user who just wants to use an LLM without managing the underlying infrastructure.
Stars
146
Forks
9
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 09, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mlops/cncf/llm-in-action"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
nndeploy/nndeploy
一款简单易用和高性能的AI部署框架 | An Easy-to-Use and High-Performance AI Deployment Framework
bentoml/BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps,...
kubeflow/trainer
Distributed AI Model Training and LLM Fine-Tuning on Kubernetes
llmcloud24/de.KCD-Summer-School-2024
Learn how to deploy your own LLM in the de.NBI cloud via a step-by-step guided journey...
ray-project/llms-in-prod-workshop-2023
Deploy and Scale LLM-based applications