ModelTC/LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Score: 65 / 100 (Established)

LightLLM helps machine learning engineers and MLOps teams deploy and serve Large Language Models (LLMs) efficiently. It takes a trained LLM as input and exposes it through a high-speed, scalable serving framework, so applications can get responses from the model quickly. It is aimed at professionals who build and maintain systems that depend on fast, reliable LLM interactions.

3,944 stars. Actively maintained with 23 commits in the last 30 days.

Use this if you need to serve large language models with high performance and scalability, ensuring quick responses for your applications.

Not ideal if you are looking for a tool to train LLMs, or for a pre-built application that uses LLMs, rather than serving infrastructure.

Tags: LLM deployment, model serving, AI infrastructure, machine learning operations, real-time AI
No package, no dependents
Maintenance 20 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25
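
The four category scores listed above happen to add up to the overall 65 / 100. The sketch below only checks that arithmetic; it assumes the overall score is a plain sum of the categories, which the listing does not state explicitly.

```python
# Category scores as listed above, each out of 25.
# Assumption: the overall score is the unweighted sum of these four values.
scores = {
    "Maintenance": 20,
    "Adoption": 10,
    "Maturity": 16,
    "Community": 19,
}

total = sum(scores.values())
maximum = 25 * len(scores)

print(f"{total} / {maximum}")
```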


Stars: 3,944
Forks: 307
Language: Python
License: Apache-2.0
Last pushed: Mar 13, 2026
Commits (30d): 23

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/ModelTC/LightLLM"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
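
The same request can be made from Python. This is a minimal sketch using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, and the URL layout (category, then owner, then repository) is inferred from the single curl example above. The response format is not documented here, so the body is returned as raw text rather than parsed.

```python
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(owner: str, repo: str, category: str = "transformers") -> str:
    """Build the endpoint URL (hypothetical helper; layout inferred from the curl example)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str, category: str = "transformers",
                  timeout: float = 10.0) -> str:
    """Fetch the quality data and return the response body as text.

    No API key is sent, so the 100 requests/day limit applies.
    """
    with urllib.request.urlopen(quality_url(owner, repo, category), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


# Example call (performs a network request):
# print(fetch_quality("ModelTC", "LightLLM"))
```

Because the keyless tier is capped at 100 requests/day, it may be worth caching the returned text locally instead of calling `fetch_quality` repeatedly.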