scitix/arks
Arks is a cloud-native inference framework running on Kubernetes
Arks helps MLOps engineers, platform architects, and AI infrastructure teams deploy and manage large language models (LLMs) in cloud environments. It takes various LLMs and configurations as input and provides a scalable, distributed, and multi-tenant inference service. This enables robust and efficient delivery of AI-powered applications.
Use this if you need to run multiple LLMs efficiently across different hardware, manage access for many users, and scale your AI applications on a Kubernetes cluster.
Not ideal if you are a single user experimenting with LLMs locally or do not use Kubernetes for your cloud infrastructure.
Stars
46
Forks
6
Language
Go
License
Apache-2.0
Category
Last pushed
Jan 14, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mlops/scitix/arks"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
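For scripted access, the curl command above can be wrapped in a few lines of Python. This is a minimal sketch: the `category/owner/repo` path shape is inferred from the single example URL shown here, so treat the `api_url` helper as an assumption rather than documented API behavior.

```python
# Minimal sketch of fetching repo quality data from the pt-edge API.
# The endpoint path shape (category/owner/repo) is inferred from the
# one example URL above -- an assumption, not documented behavior.
import json
import urllib.request

BASE = "https://pt-edge.onrender.com/api/v1/quality"


def api_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-data endpoint URL for a repository."""
    return f"{BASE}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """GET the endpoint and decode the JSON body.

    The anonymous tier allows 100 requests/day; a free key raises
    that to 1,000/day (how the key is sent is not shown here).
    """
    with urllib.request.urlopen(api_url(category, owner, repo)) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Same request as the curl example above.
    print(api_url("mlops", "scitix", "arks"))
```

Swapping in a different `owner`/`repo` pair lets the same helper query any listed project, subject to the daily rate limit.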
Higher-rated alternatives
kubeflow/katib
Automated Machine Learning on Kubernetes
kubeai-project/kubeai
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports...
sgl-project/rbg
A workload for deploying LLM inference services on Kubernetes
beam-cloud/beta9
Ultrafast serverless GPU inference, sandboxes, and background jobs
optimizeroracle/ondine
The LLM Dataset Engine — batch process millions of rows with 100+ providers. Multi-row batching...