tensorchord/modelz-llm

OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)

Score: 50 / 100 (Established)

This is an inference server for developers who want to self-host popular open-source large language models (LLMs) such as LLaMA, Vicuna, and ChatGLM. Code already written against OpenAI's Python SDK or LangChain can query these models as if they were OpenAI's API and receive generated text or embeddings in the same response format. The primary users are machine learning engineers, data scientists, and AI developers.
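Because the server mirrors OpenAI's REST surface, a client sends the same request shape it would send to api.openai.com. A minimal sketch of that request body, assuming a server at http://localhost:8000 and a model named "llama-7b" (both placeholders, not taken from the project's docs; the path may also need a /v1 prefix depending on configuration):

```python
import json

# Hypothetical address of a locally hosted modelz-llm server (assumption).
BASE_URL = "http://localhost:8000"

# An OpenAI-style chat-completion request body; the model name is a
# placeholder for whichever model the server was started with.
payload = {
    "model": "llama-7b",
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Summarize what an inference server does."},
    ],
    "temperature": 0.7,
}

# The same bytes OpenAI's SDK would POST to the chat-completions endpoint.
body = json.dumps(payload).encode("utf-8")
url = f"{BASE_URL}/chat/completions"
print(url)
```

In practice you would not build this by hand: pointing the OpenAI SDK's base URL (or LangChain's equivalent setting) at the server produces exactly this request.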

276 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to integrate and self-host various open-source LLMs into your applications using a familiar OpenAI-compatible API.

Not ideal if you are a non-technical end-user looking for a pre-built application that leverages LLMs without any coding.

Tags: AI-application-development, machine-learning-engineering, large-language-models, natural-language-processing, model-deployment
Status: Stale (no activity in 6 months)
Maintenance: 0 / 25
Adoption: 10 / 25
Maturity: 25 / 25
Community: 15 / 25


Stars: 276
Forks: 27
Language: Python
License: Apache-2.0
Last pushed: Oct 11, 2023
Commits (30d): 0
Dependencies: 4

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/tensorchord/modelz-llm"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
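The same call can be made from Python with only the standard library. The URL below is taken verbatim from the curl command; the actual network request is left commented out so the sketch stays within the 100 requests/day limit:

```python
import json
import urllib.request

# Same endpoint as the curl example above.
url = "https://pt-edge.onrender.com/api/v1/quality/transformers/tensorchord/modelz-llm"

req = urllib.request.Request(url, headers={"Accept": "application/json"})

# Uncomment to perform the request:
# with urllib.request.urlopen(req) as resp:
#     data = json.load(resp)
#     print(data)

print(req.full_url)
```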