OpenPPL/ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
This tool helps AI engineers optimize neural networks for deployment on resource-constrained hardware such as edge devices. It takes a pre-trained model (e.g., in ONNX, PyTorch, or Caffe format) and converts its floating-point weights and activations to low-bit fixed-point arithmetic, producing a smaller, faster model with lower power consumption. The output is a quantized model ready for deployment on specific hardware platforms, making AI applications more efficient.
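The core idea behind the float-to-fixed-point conversion can be sketched in a few lines of plain Python. This is an illustrative example of symmetric per-tensor int8 quantization, not PPQ's actual API; the function names and the sample weights are invented for the illustration.

```python
# Illustrative sketch of post-training quantization (NOT PPQ's API):
# map float values to int8 codes via a per-tensor scale, then
# dequantize to see the rounding error a quantization tool must manage.

def quantize_int8(values):
    """Symmetric per-tensor quantization of floats to int8 codes."""
    max_abs = max(abs(v) for v in values)
    scale = max_abs / 127.0 if max_abs else 1.0  # one scale for the whole tensor
    q = [max(-128, min(127, round(v / scale))) for v in values]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float values from int8 codes."""
    return [v * scale for v in q]

weights = [0.82, -1.27, 0.003, 0.5]       # hypothetical layer weights
q, scale = quantize_int8(weights)          # e.g. small values collapse to 0
approx = dequantize(q, scale)              # close to, but not equal to, weights
```

Tools like PPQ go well beyond this sketch (per-channel scales, calibration on real data, graph-level optimization), but the trade-off is the same: integer storage and arithmetic in exchange for bounded rounding error.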
1,788 stars. No commits in the last 6 months.
Use this if you need to significantly reduce the computational cost, memory footprint, and power consumption of your neural network models for efficient deployment on edge devices or specialized hardware.
Not ideal if your neural network models are already optimized for your target hardware, or if you do not require a reduction in model size or power usage.
Stars: 1,788
Forks: 274
Language: Python
License: Apache-2.0
Category: ml-frameworks
Last pushed: Mar 28, 2024
Commits (last 30 days): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/OpenPPL/ppq"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
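The same endpoint can be queried from Python using only the standard library. The URL pattern is taken from the curl example above; the `repo_quality` helper and the assumption that the response is JSON are illustrative, not part of the documented API.

```python
# Sketch of querying the quality API with Python's standard library.
# The endpoint path comes from the curl example; the JSON response
# shape is an assumption and should be checked against real output.
import json
from urllib.request import urlopen

BASE = "https://pt-edge.onrender.com/api/v1/quality"

def build_url(category: str, owner: str, repo: str) -> str:
    """Build the per-repository quality endpoint URL."""
    return f"{BASE}/{category}/{owner}/{repo}"

def repo_quality(category: str, owner: str, repo: str) -> dict:
    """Fetch and decode the quality record for one repository."""
    with urlopen(build_url(category, owner, repo)) as resp:
        return json.load(resp)

# Example (performs a network request; 100 requests/day without a key):
# data = repo_quality("ml-frameworks", "OpenPPL", "ppq")
```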
Higher-rated alternatives
open-mmlab/mmengine
OpenMMLab Foundational Library for Training Deep Learning Models
Xilinx/brevitas
Brevitas: neural network quantization in PyTorch
google/qkeras
QKeras: a quantization deep learning library for Tensorflow Keras
fastmachinelearning/qonnx
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
tensorflow/model-optimization
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization...