rtp-llm and LightLLM

rtp-llm (Verified)

Maintenance: 22/25
Adoption: 10/25
Maturity: 16/25
Community: 22/25
Stars: 1,065
Forks: 159
Downloads: —
Commits (30d): 163
Language: Cuda
License: Apache-2.0

LightLLM (Established)

Maintenance: 20/25
Adoption: 10/25
Maturity: 16/25
Community: 19/25
Stars: 3,944
Forks: 307
Downloads: —
Commits (30d): 23
Language: Python
License: Apache-2.0

No Package No Dependents

About rtp-llm

alibaba/rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

RTP-LLM is a high-performance engine for deploying large language models (LLMs) in production. It takes a trained LLM, optionally one that accepts multimodal inputs such as images alongside text, and serves responses efficiently to a large number of users. It is designed for engineers and AI product managers who run LLM-powered services, such as AI assistants or smart search features, at scale.

AI-application-deployment, LLM-serving, AI-platform-operations, conversational-AI, enterprise-search

About LightLLM

ModelTC/LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

LightLLM helps machine learning engineers and MLOps teams efficiently deploy and manage Large Language Models (LLMs). It takes a trained LLM as input and provides a high-speed, scalable serving framework, enabling applications to quickly get responses from the model. This is for professionals building and maintaining systems that rely on fast, reliable LLM interactions.

LLM deployment, model serving, AI infrastructure, machine learning operations, real-time AI

Related comparisons

rtp-llm and vllm, rtp-llm and xllm, rtp-llm and FastFlowLM, rtp-llm and ZhiLight, rtp-llm and PowerInfer, rtp-llm and sglang

Scores updated daily from GitHub, PyPI, and npm data.