romitjain/awesome-llm-systems

This repository aims to consolidate resources for learning about systems for LLM

/ 100

Experimental

This collection of resources helps you understand how large language models (LLMs) are built and optimized for performance. It compiles essential blogs and papers covering topics from GPU computing to advanced inference techniques. This is for AI/ML engineers and researchers who want to gain fundamental or intermediate knowledge in designing and scaling LLM systems.

Use this if you are an AI/ML engineer or researcher looking for curated resources to deepen your understanding of LLM system architecture, optimization, and deployment.

Not ideal if you are looking for an off-the-shelf tool or code library to implement an LLM, rather than educational materials on its underlying systems.

LLM-development AI-infrastructure machine-learning-engineering GPU-optimization deep-learning-systems

No License No Package No Dependents

Maintenance 10 / 25

Adoption 5 / 25

Maturity 7 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

—

License

—

Higher-rated alternatives

thu-pacman/chitu

High-performance inference framework for large language models, focusing on efficiency,...

NotPunchnox/rkllama

Ollama alternative for Rockchip NPU: An efficient solution for running AI and Deep learning...

sophgo/LLM-TPU

Run generative AI models in sophgo BM1684X/BM1688

Deep-Spark/DeepSparkHub

DeepSparkHub selects hundreds of application algorithms and models, covering various fields of...

howard-hou/VisualRWKV

VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle...

Explore LLM Tools

All categories Trending LLM Tool directory Insights