AkliluYirgalem/live-quantization
real-time model quantization directly in the browser
This tool helps machine learning practitioners or researchers evaluate how their models perform under different compression levels directly within a web browser. You input an existing machine learning model, and it shows you the model's performance and size as it's 'quantized' in real-time. This is useful for anyone exploring efficient deployment of AI models.
No commits in the last 6 months.
Use this if you need to quickly see the trade-offs between model size, speed, and accuracy when preparing a machine learning model for deployment.
Not ideal if you need to perform full, production-grade model optimization or have complex, custom quantization requirements beyond a basic demonstration.
Stars
29
Forks
1
Language
CSS
License
MIT
Category
Last pushed
Aug 23, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/AkliluYirgalem/live-quantization"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
open-mmlab/mmengine
OpenMMLab Foundational Library for Training Deep Learning Models
Xilinx/brevitas
Brevitas: neural network quantization in PyTorch
google/qkeras
QKeras: a quantization deep learning library for Tensorflow Keras
fastmachinelearning/qonnx
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
tensorflow/model-optimization
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization...