rtp-llm and xllm

rtp-llm (Verified)

Maintenance 22/25

Adoption 10/25

Maturity 16/25

Community 22/25

Stars: 1,065

Forks: 159

Downloads: —

Commits (30d): 163

Language: Cuda

License: Apache-2.0

xllm (Established)

Maintenance 22/25

Adoption 10/25

Maturity 15/25

Community 22/25

Stars: 1,081

Forks: 149

Downloads: —

Commits (30d): 123

Language: C++

License: —

No Package No Dependents

About rtp-llm

alibaba/rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

RTP-LLM is a high-performance engine for deploying large language models (LLMs) in production applications. It serves a trained LLM, optionally with multimodal inputs such as images alongside text, and generates responses efficiently for a large number of users. It is designed for engineers and AI product managers who run LLM-powered services, such as AI assistants or smart search features, at scale.

AI-application-deployment LLM-serving AI-platform-operations conversational-AI enterprise-search

About xllm

jd-opensource/xllm

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

This project helps businesses and organizations deploy large language models (LLMs) such as DeepSeek-V3.1 or Qwen2/3, especially on Chinese AI accelerators. It takes these pre-trained models and makes them run faster and more cost-effectively, generating text responses for applications such as intelligent customer service, risk control, and ad recommendations. Its intended users are AI solution architects, MLOps engineers, and IT infrastructure managers responsible for deploying and managing AI applications.

AI-application-deployment large-language-model-inference AI-infrastructure-optimization enterprise-AI-solutions AI-acceleration-hardware

Related comparisons

rtp-llm and vllm, rtp-llm and LightLLM, rtp-llm and FastFlowLM, rtp-llm and ZhiLight, rtp-llm and PowerInfer

Scores updated daily from GitHub, PyPI, and npm data.