foundation-model-stack/fms-model-optimizer

FMS Model Optimizer is a framework for developing reduced precision neural network models.

Established

This tool helps AI practitioners optimize large neural network models such as those used in vision, speech, and natural language processing. It takes an existing PyTorch model and applies quantization and related reduced-precision techniques to cut its size and computational cost. The result is a "reduced precision" model that runs faster and uses less memory, which suits deployment in resource-constrained environments. It is aimed at AI/ML engineers and researchers who need to deploy models more efficiently.
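To make "reduced precision" concrete, the sketch below shows symmetric int8 quantization of a handful of weights in plain Python. It is a generic illustration of the idea only, not FMS Model Optimizer's API: the function names `quantize_int8` and `dequantize` and the sample weights are hypothetical, and a real framework would operate on PyTorch tensors rather than lists.

```python
def quantize_int8(values):
    """Symmetric per-tensor int8 quantization: returns (integer codes, scale)."""
    max_abs = max(abs(v) for v in values)
    # Guard against an all-zero input, where any positive scale works.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the int8 range used here.
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate float values."""
    return [c * scale for c in codes]


# Hypothetical weights, chosen only for illustration.
weights = [0.82, -1.27, 0.003, 0.5, -0.31]
codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)

# The rounding error per value is bounded by half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(codes, scale, max_error)
```

Each weight is stored as one small integer plus a shared scale, which is where the memory saving comes from. The cost is a rounding error of at most half a step per value, which is why very small weights such as 0.003 collapse to zero.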

Used by 1 other package. Available on PyPI.

Use this if you need to make your large neural network models (especially LLMs) smaller and faster for deployment without significantly losing accuracy.

Not ideal if you are working with small models that don't require significant optimization or if you are not familiar with deep learning model quantization techniques.

AI model deployment, Deep learning optimization, Natural language processing, Computer vision, Large language models

Maintenance 10 / 25

Adoption 7 / 25

Maturity 25 / 25

Community 19 / 25

Language: Python

License: Apache-2.0

Related frameworks

fangwei123456/spikingjelly

SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.

neuromorphs/NIR

Neuromorphic Intermediate Representation reference implementation

BindsNET/bindsnet

Simulation of spiking neural networks (SNNs) using PyTorch.

norse/norse

Deep learning with spiking neural networks (SNNs) in PyTorch.

jeshraghian/snntorch

Deep and online learning with spiking neural networks in Python
