rtp-llm and LightLLM

rtp-llm
70
Verified
LightLLM
65
Established
Maintenance 22/25
Adoption 10/25
Maturity 16/25
Community 22/25
Maintenance 20/25
Adoption 10/25
Maturity 16/25
Community 19/25
Stars: 1,065
Forks: 159
Downloads:
Commits (30d): 163
Language: Cuda
License: Apache-2.0
Stars: 3,944
Forks: 307
Downloads:
Commits (30d): 23
Language: Python
License: Apache-2.0
No Package No Dependents
No Package No Dependents

About rtp-llm

alibaba/rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

This is a high-performance engine for deploying large language models (LLMs) in real-world applications. It takes your trained LLM, potentially with multimodal inputs like images and text, and efficiently generates responses for a large number of users. It is designed for engineers and AI product managers responsible for running LLM-powered services like AI assistants or smart search features at scale.

AI-application-deployment LLM-serving AI-platform-operations conversational-AI enterprise-search

About LightLLM

ModelTC/LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

LightLLM helps machine learning engineers and MLOps teams efficiently deploy and manage Large Language Models (LLMs). It takes a trained LLM as input and provides a high-speed, scalable serving framework, enabling applications to quickly get responses from the model. This is for professionals building and maintaining systems that rely on fast, reliable LLM interactions.

LLM deployment model serving AI infrastructure machine learning operations real-time AI

Scores updated daily from GitHub, PyPI, and npm data. How scores work