google/qkeras
QKeras: a quantization deep learning library for Tensorflow Keras
This tool helps machine learning engineers and researchers optimize deep learning models for deployment on resource-constrained hardware like edge devices. By applying quantization techniques to Keras models, it significantly reduces the memory footprint and computational cost of neural networks. You provide an existing Keras deep learning model, and it outputs a quantized version that runs more efficiently.
Available on PyPI.
Use this if you need to deploy deep learning models on hardware with limited memory and processing power, such as embedded systems or specialized accelerators, without sacrificing too much accuracy.
Not ideal if your primary goal is rapid prototyping or if you are running models on powerful cloud GPUs where performance and memory constraints are not a critical concern.
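The core idea behind this kind of quantization is "fake quantization": values are rounded to a small fixed-point grid during training so the model learns to tolerate low precision, while arithmetic stays in floating point. Below is a minimal, self-contained sketch of that idea in plain Python. It is an illustration only, not QKeras's actual implementation; the function name, signature, and bit-allocation convention (one sign bit, `int_bits` integer bits, the rest fractional) are assumptions for the example.

```python
def fake_quantize(x, bits=8, int_bits=0):
    """Illustrative fixed-point 'fake quantization' (not the QKeras API):
    round x to the nearest representable value on a signed fixed-point grid
    and clip to the representable range, returning a float."""
    frac_bits = bits - int_bits - 1          # one bit reserved for the sign
    scale = 2.0 ** frac_bits                 # grid step is 1/scale
    lo = -(2.0 ** int_bits)                  # most negative representable value
    hi = (2.0 ** int_bits) - 1.0 / scale     # most positive representable value
    q = round(x * scale) / scale             # snap to the nearest grid point
    return min(max(q, lo), hi)               # saturate out-of-range values

# With 8 bits and no integer bits, the grid step is 1/128 and the
# representable range is [-1.0, 0.9921875]:
print(fake_quantize(0.1))    # snaps to 13/128
print(fake_quantize(3.0))    # saturates at the upper bound
```
In a quantization-aware training library, a function like this is applied to weights and activations on the forward pass, while gradients flow through as if it were the identity (the straight-through estimator).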
Stars: 578
Forks: 109
Language: Python
License: Apache-2.0
Category: ml-frameworks
Last pushed: Feb 23, 2026
Commits (30d): 0
Dependencies: 9
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/google/qkeras"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
open-mmlab/mmengine
OpenMMLab Foundational Library for Training Deep Learning Models
Xilinx/brevitas
Brevitas: neural network quantization in PyTorch
fastmachinelearning/qonnx
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
tensorflow/model-optimization
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization...
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch