g1ibby/llm-deploy
Tool to manage Ollama models on vast.ai
This tool helps developers quickly set up and manage large language models (LLMs) like Llama on cloud servers through vast.ai. You provide configuration details for your desired LLMs, and the tool automates their deployment and lifecycle. It's designed for developers who want to experiment with or host LLMs without manual server configuration.
No commits in the last 6 months.
Use this if you are a developer looking for an automated way to deploy and manage Ollama-compatible LLMs on vast.ai for experimentation or hosting.
Not ideal if you prefer a graphical user interface for managing cloud instances or if you're not comfortable with command-line tools and YAML configurations.
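The listing mentions that the tool is driven by YAML configuration rather than a GUI. The project's actual schema isn't shown on this page, so the following is only a hypothetical sketch of what a deployment config for an Ollama model on vast.ai might look like; every key name (`model`, `gpu`, `disk_gb`, `max_price`) is an assumption for illustration, not the tool's real format:

```yaml
# Hypothetical llm-deploy configuration (field names are illustrative,
# not taken from the actual project).
deployments:
  - name: llama3-dev          # label for this instance
    model: llama3:8b          # Ollama model tag to pull on startup
    gpu: RTX_4090             # vast.ai GPU type to search for
    disk_gb: 40               # disk needed for model weights
    max_price: 0.45           # max $/hr bid on the vast.ai market
```

A real config would need to match whatever schema the repository documents; check its README before adapting this sketch.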
Stars
19
Forks
1
Language
Python
License
MIT
Category
Last pushed
Apr 19, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/g1ibby/llm-deploy"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PaddlePaddle/FastDeploy
High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
skyzh/tiny-llm
A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny...
ServerlessLLM/ServerlessLLM
Serverless LLM Serving for Everyone.
AXERA-TECH/ax-llm
Explore LLM model deployment based on AXera's AI chips