bentoml/OpenLLM
Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.
This project helps software developers, machine learning engineers, and data scientists host and serve open-source Large Language Models (LLMs) on their own cloud infrastructure. It serves open-source LLMs behind an OpenAI-compatible API endpoint, making it straightforward to integrate these models into applications. The primary users are developers building AI-powered applications who need to deploy and manage LLMs.
12,161 stars. Available on PyPI.
Use this if you are a developer who wants to run and expose open-source or custom LLMs via an OpenAI-compatible API, either on your local machine or in the cloud.
Not ideal if you are an end-user looking for a ready-to-use application and do not have programming or cloud infrastructure experience.
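Because the server speaks the OpenAI API shape, any OpenAI-style client can talk to it. A minimal sketch, assuming a locally running OpenLLM server on `http://localhost:3000/v1` (the default port in OpenLLM's docs) and a hypothetical model tag; only standard-library modules are used:

```python
import json
import urllib.request

# Assumptions: OpenLLM is serving locally at this base URL, and the model
# tag below is a hypothetical example -- substitute whatever you deployed.
BASE_URL = "http://localhost:3000/v1"
MODEL = "llama3.2:1b"  # hypothetical model tag


def build_chat_request(prompt: str) -> dict:
    """Build an OpenAI-style /chat/completions request payload."""
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
    }


def chat(prompt: str) -> str:
    """POST the payload to the local server and return the assistant reply."""
    req = urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(build_chat_request(prompt)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    # OpenAI-compatible servers return choices[].message.content
    return body["choices"][0]["message"]["content"]


if __name__ == "__main__":
    # Requires a running server; building the payload alone does not.
    print(build_chat_request("Hello!"))
```

The same payload works with the official `openai` Python client by pointing its `base_url` at the server, which is the usual integration path.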
Stars
12,161
Forks
803
Language
Python
License
Apache-2.0
Last pushed
Mar 09, 2026
Commits (30d)
0
Dependencies
15
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/bentoml/OpenLLM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
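The curl example above can be reproduced from Python. A small sketch that builds the same endpoint URL from its path components (the `quality_url` helper and its parameter names are illustrative, not part of the API):

```python
from urllib.parse import quote

API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(ecosystem: str, owner: str, repo: str) -> str:
    """Build the quality-endpoint URL, mirroring the curl example:
    /api/v1/quality/<ecosystem>/<owner>/<repo>."""
    return f"{API_BASE}/{quote(ecosystem)}/{quote(owner)}/{quote(repo)}"


# Same URL as the curl command above:
print(quality_url("transformers", "bentoml", "OpenLLM"))
```

Fetch it with `urllib.request.urlopen` or any HTTP client; within the 100-requests/day anonymous tier no key header is needed.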
Related models
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
withcatai/node-llama-cpp
Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema...
mudler/LocalAI
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and...
zhudotexe/kani
kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 2023)
SciSharp/LLamaSharp
A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.