SonySemiconductorSolutions/mct-model-optimization

Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.

/ 100

Established

Deploying neural networks on devices with limited computational power can be challenging. This tool helps optimize your pre-trained PyTorch or Keras models by reducing their size and computational demands, making them efficient for edge devices. It takes your existing floating-point model and outputs a compressed, quantized model suitable for deployment, benefiting AI/ML engineers and researchers working with resource-constrained hardware.

431 stars.

Use this if you need to deploy your neural network models on edge devices or hardware with limited memory and processing capabilities.

Not ideal if you are developing models for high-performance computing environments without strict hardware constraints, as the optimization process introduces complexity.

edge-ai model-deployment embedded-systems deep-learning-optimization computer-vision-hardware

No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 22 / 25

How are scores calculated?

Stars

431

Forks

Language

Python

License

Apache-2.0

Compare

mct-model-optimization and model-optimization

Related frameworks

open-mmlab/mmengine

OpenMMLab Foundational Library for Training Deep Learning Models

Xilinx/brevitas

Brevitas: neural network quantization in PyTorch

google/qkeras

QKeras: a quantization deep learning library for Tensorflow Keras

fastmachinelearning/qonnx

QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX

tensorflow/model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization...

Explore ML Frameworks

All categories Trending ML Framework directory Insights