lordmathis/llamactl

Unified management and routing for llama.cpp, MLX and vLLM models with web dashboard.

/ 100

Emerging

This tool helps AI engineers and MLOps professionals efficiently manage and deploy multiple open-source large language models (LLMs) like Llama, MLX, and vLLM. It allows you to download models, serve them through a unified API compatible with OpenAI and Anthropic, and route requests to different instances, all controlled via an intuitive web dashboard. You get a central place to manage diverse models, monitor their health, and handle distributed deployments.

Use this if you need a centralized system to manage, route requests, and monitor multiple open-source LLMs across various backends and potentially different machines.

Not ideal if you only ever run a single LLM instance or are looking for a platform that handles model training and fine-tuning.

AI-inference-management LLM-deployment model-serving MLOps API-routing

No Package No Dependents

Maintenance 10 / 25

Adoption 9 / 25

Maturity 15 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

License

MIT

Higher-rated alternatives

containers/ramalama

RamaLama is an open-source developer tool that simplifies the local serving of AI models from...

av/harbor

One command brings a complete pre-wired LLM stack with hundreds of services to explore.

RunanywhereAI/runanywhere-sdks

Production ready toolkit to run AI locally

runpod-workers/worker-vllm

The RunPod worker template for serving our large language model endpoints. Powered by vLLM.

foldl/chatllm.cpp

Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)

Explore LLM Tools

All categories Trending LLM Tool directory Insights