HuaizhengZhang/AI-Infra-from-Zero-to-Hero
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials.
This project helps developers and engineers understand and build robust infrastructure for machine learning, including Large Language Models (LLMs) and Generative AI. It collects research papers, industry practices, and existing system designs into a curated knowledge base with practical guidance for building AI systems. Anyone responsible for the architecture, deployment, or scaling of AI applications will find this resource valuable.
3,763 stars. No commits in the last 6 months.
Use this if you are a system engineer, MLOps professional, or researcher looking to design, implement, or optimize the underlying infrastructure for AI models and applications.
Not ideal if you are a data scientist primarily focused on model training and experimentation, or a business user seeking high-level AI concepts without diving into technical system architecture.
Stars: 3,763
Forks: 370
Language: —
License: MIT
Category:
Last pushed: Jul 25, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/HuaizhengZhang/AI-Infra-from-Zero-to-Hero"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
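The same request can be sketched in Python for use inside a script. This is a minimal sketch, not a documented client: the helper names `quality_url` and `fetch_quality` are hypothetical, and the assumption that the endpoint returns JSON is not confirmed by this page. Only the URL shape is taken from the curl command above.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; everything after it is owner/repo.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters cannot alter the path.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. urlopen raises
    urllib.error.HTTPError on a non-2xx status, which is how an exceeded
    daily limit would most likely surface.
    """
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("HuaizhengZhang", "AI-Infra-from-Zero-to-Hero")
```

With the keyless limit of 100 requests/day, a script that polls many repositories would want to cache responses locally instead of refetching on every run.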
Higher-rated alternatives
thu-pacman/chitu
High-performance inference framework for large language models, focusing on efficiency,...
NotPunchnox/rkllama
Ollama alternative for Rockchip NPU: An efficient solution for running AI and Deep learning...
sophgo/LLM-TPU
Run generative AI models in sophgo BM1684X/BM1688
Deep-Spark/DeepSparkHub
DeepSparkHub selects hundreds of application algorithms and models, covering various fields of...
howard-hou/VisualRWKV
VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle...