angelolamonaca/PyTorch-Precision-Converter
A flexible utility for converting tensor precision in PyTorch models and safetensors files, enabling efficient deployment across various platforms.
This tool helps machine learning engineers and researchers optimize their deep learning models for deployment. It takes existing PyTorch model checkpoints or safetensors files and converts their internal data precision (e.g., from full precision to half-precision). The output is a more memory-efficient and faster-running model, suitable for environments with limited resources like mobile devices or specialized hardware.
No commits in the last 6 months.
Use this if you need to reduce the memory footprint or improve the inference speed of your PyTorch models for deployment, especially on resource-constrained platforms.
Not ideal if you are solely focused on model training and do not require precision optimization for deployment.
Stars
11
Forks
—
Language
Python
License
MIT
Category
Last pushed
Aug 24, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/angelolamonaca/PyTorch-Precision-Converter"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
gpu-mode/Triton-Puzzles
Puzzles for learning Triton
hailo-ai/hailo_model_zoo
The Hailo Model Zoo includes pre-trained models and a full building and evaluation environment
open-mmlab/mmdeploy
OpenMMLab Model Deployment Framework
hyperai/tvm-cn
TVM Documentation in Chinese Simplified / TVM 中文文档