tensorflow/model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

/ 100

Established

This toolkit helps machine learning engineers and researchers make their trained Keras and TensorFlow models smaller and faster. It takes an existing, functional machine learning model and applies optimization techniques like quantization or pruning. The output is a more efficient model that performs similarly but requires less computational power and memory, ideal for deploying on devices with limited resources.

1,565 stars. Actively maintained with 1 commit in the last 30 days.

Use this if you need to deploy a Keras or TensorFlow machine learning model to environments with tight constraints on processing power, memory, or battery life, such as mobile phones or embedded systems.

Not ideal if your primary goal is to improve model accuracy or if you are not working with Keras or TensorFlow models.

ML model deployment edge AI model optimization resource-constrained devices embedded ML

No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25

How are scores calculated?

Stars

1,565

Forks

346

Language

Python

License

Apache-2.0

Compare

model-optimization and mct-model-optimization model-optimization and neural-compressor

Related frameworks

open-mmlab/mmengine

OpenMMLab Foundational Library for Training Deep Learning Models

Xilinx/brevitas

Brevitas: neural network quantization in PyTorch

google/qkeras

QKeras: a quantization deep learning library for Tensorflow Keras

fastmachinelearning/qonnx

QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX

lucidrains/vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Explore ML Frameworks

All categories Trending ML Framework directory Insights