zjhellofss/KuiperInfer

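A great project for campus recruiting, autumn and spring hiring rounds, and internships! Learn to implement a high-performance deep learning inference library from scratch, step by step, with support for inference on large models such as llama2 as well as Unet, Yolov5, and Resnet.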

/ 100

Emerging

This project is a comprehensive educational course designed for students and junior professionals looking to deeply understand and build high-performance deep learning inference frameworks from scratch. It takes you through the process of developing a custom framework capable of running popular models like Llama2/3, Qwen2.5, Unet, and YOLOv5. By completing the course, you gain practical experience in modern C++ development and deep learning system design, making you a stronger candidate in technical interviews.

3,353 stars. No commits in the last 6 months.

Use this if you are an aspiring machine learning engineer or a student preparing for interviews, eager to build a deep learning inference engine from the ground up to understand its inner workings.

Not ideal if you are looking for an off-the-shelf, ready-to-use deep learning inference framework for immediate deployment in production.

deep-learning-engineering machine-learning-systems AI-model-deployment software-engineering-education performance-optimization

Stale 6m, No Package, No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

Stars 3,353

Forks 357

Language C++

License MIT

Higher-rated alternatives

microsoft/onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

onnx/onnx

Open standard for machine learning interoperability

PINTO0309/onnx2tf

Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The...

NVIDIA/TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This...

onnx/onnxmltools

ONNXMLTools enables conversion of models to ONNX
