foundation-model-stack/fms-model-optimizer

FMS Model Optimizer is a framework for developing reduced precision neural network models.

61 / 100 (Established)

This tool helps AI practitioners optimize large neural network models such as those used in vision, speech, and natural language processing. It takes existing PyTorch models and applies quantization techniques to reduce their size and computational requirements. The output is a "reduced precision" model that runs faster and uses less memory, which suits deployment in resource-constrained environments. It is aimed at AI/ML engineers and researchers who need to deploy models more efficiently.
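To make "reduced precision" concrete, here is a minimal sketch of symmetric per-tensor int8 quantization in plain Python. It is an illustration of the general idea only, not the fms-model-optimizer API; the function names `quantize_int8` and `dequantize` and the sample weights are hypothetical.

```python
# Illustrative only: generic symmetric int8 quantization, NOT fms-model-optimizer code.

def quantize_int8(weights):
    """Map floats to integer codes in [-127, 127] using one shared scale."""
    max_abs = max(abs(w) for w in weights)
    # Guard against an all-zero tensor, where any scale works.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate float values from integer codes."""
    return [c * scale for c in codes]


# Hypothetical weights chosen for the example.
weights = [0.52, -1.27, 0.003, 0.9]
codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)

# The rounding error per weight is bounded by half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(codes, scale, max_error)
```

Each weight is stored in 8 bits instead of 32, at the cost of a rounding error of at most half the scale. Small values such as 0.003 collapse to zero, which is one reason real quantization schemes often use per-channel scales or calibration.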

Used by 1 other package. Available on PyPI.

Use this if you need to make your large neural network models (especially LLMs) smaller and faster for deployment without significantly losing accuracy.

Not ideal if you are working with small models that don't require significant optimization or if you are not familiar with deep learning model quantization techniques.

AI model deployment, Deep learning optimization, Natural language processing, Computer vision, Large language models
Maintenance 10 / 25
Adoption 7 / 25
Maturity 25 / 25
Community 19 / 25

Stars: 21
Forks: 18
Language: Python
License: Apache-2.0
Last pushed: Feb 23, 2026
Commits (30d): 0
Dependencies: 9
Reverse dependents: 1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/foundation-model-stack/fms-model-optimizer"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
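The same request can be sketched in Python with the standard library. This assumes the endpoint returns a JSON body and that the path follows a category/owner/repo pattern, which is inferred only from the single URL above; the helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the request URL (path layout is an assumption from one example)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch and decode the response, assuming it is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("ml-frameworks", "foundation-model-stack", "fms-model-optimizer")
# print(data)
```

With the keyless limit of 100 requests/day, it is worth caching the result locally rather than fetching on every run.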