tensorchord/openmodelz

Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)

Score: 40 / 100 (Emerging)

This project helps data scientists and SREs quickly deploy large language models (LLMs) and other machine learning models for live use. It takes a trained model and automatically sets up all the necessary infrastructure, like monitoring, scaling, and public access, providing a ready-to-use public endpoint. It is for anyone who needs to take a machine learning model from development to a production environment without getting bogged down in complex infrastructure setup.

281 stars. No commits in the last 6 months.

Use this if you need to deploy machine learning models, especially large language models, to a production environment quickly and efficiently without manually configuring all the underlying infrastructure.

Not ideal if you need extremely fine-grained control over every aspect of your infrastructure setup or are working with very small, simple models that don't require autoscaling.

Tags: MLOps, model deployment, AI infrastructure, large language models, SRE
Status: Stale (6 months), No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 14 / 25

Stars: 281
Forks: 25
Language: Go
License: Apache-2.0
Last pushed: Nov 03, 2023
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/mlops/tensorchord/openmodelz"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
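The same request can be made from a script. The following is a minimal sketch, not an official client. It assumes the URL follows the pattern `/quality/{category}/{owner}/{repo}`, which is inferred only from the single example above, and that the response body is JSON, which the page does not state. The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example; the path layout below is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, assuming the pattern /quality/{category}/{owner}/{repo}."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for a repository.

    Returns parsed JSON when the body is valid JSON (an assumption about
    the API), otherwise the raw response text.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        body = resp.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body


if __name__ == "__main__":
    # Same repository as the curl example; counts against the daily request limit.
    print(fetch_quality("mlops", "tensorchord", "openmodelz"))
```

Keeping the URL construction separate from the network call makes it easy to check the request target without spending one of the 100 keyless requests per day.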