jack-willturner/deep-compression

Learning both Weights and Connections for Efficient Neural Networks https://arxiv.org/abs/1506.02626

/ 100

Emerging

When deploying deep learning models, this project helps reduce the model's size and computational requirements. It takes an existing neural network, such as a ResNet, and optimizes it to use fewer parameters and connections without significantly losing accuracy. Data scientists and machine learning engineers working on edge devices or resource-constrained environments would find this useful.

181 stars. No commits in the last 6 months.

Use this if you need to deploy large neural networks to environments with limited memory or processing power, such as mobile phones, embedded systems, or IoT devices.

Not ideal if your primary concern is developing new model architectures from scratch, as this tool focuses on optimizing existing ones.

model-optimization edge-ai deep-learning-deployment resource-constrained-ml neural-network-compression

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 21 / 25

How are scores calculated?

Stars

181

Forks

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

open-mmlab/mmengine

OpenMMLab Foundational Library for Training Deep Learning Models

Xilinx/brevitas

Brevitas: neural network quantization in PyTorch

google/qkeras

QKeras: a quantization deep learning library for Tensorflow Keras

fastmachinelearning/qonnx

QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX

tensorflow/model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization...

Explore ML Frameworks

All categories Trending ML Framework directory Insights