LukasHedegaard/pytorch-benchmark
Easily benchmark PyTorch model FLOPs, latency, throughput, allocated GPU memory, and energy consumption
This tool helps machine learning engineers and researchers compare the efficiency of different PyTorch models. Given your model and a sample input, it measures key performance metrics: floating-point operations (FLOPs), inference speed (latency and throughput), allocated GPU memory, and energy consumption. This helps you understand how well your models will perform in a production environment or on resource-constrained devices.
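The latency and throughput figures come from timing repeated forward passes after a warm-up phase. Below is a minimal, dependency-free sketch of that kind of measurement loop; the `measure` function and its parameters are illustrative stand-ins, not the library's actual API, and a plain Python callable takes the place of a PyTorch model:

```python
import time
import statistics

def measure(model_fn, sample, warmup=10, num_runs=100):
    """Time repeated calls to model_fn(sample); report latency and throughput."""
    # Warm-up runs let caches and allocators settle before timing begins.
    for _ in range(warmup):
        model_fn(sample)
    # Timed runs: record per-call wall-clock latency.
    latencies = []
    for _ in range(num_runs):
        start = time.perf_counter()
        model_fn(sample)
        latencies.append(time.perf_counter() - start)
    mean_latency = statistics.mean(latencies)
    return {
        "mean_latency_s": mean_latency,
        "throughput_per_s": 1.0 / mean_latency,  # samples/sec at batch size 1
    }

# Example: a trivial "model" that squares its input.
stats = measure(lambda x: x * x, 3.0)
print(sorted(stats))
```

The real library additionally counts FLOPs and samples GPU memory and energy around these timed runs, but the warm-up-then-time pattern above is the core of any latency benchmark.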
109 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.
Use this if you need to objectively compare multiple PyTorch models based on their computational cost and speed, or optimize an existing model for better performance.
Not ideal if you're looking for deep profiling tools that identify specific bottlenecks within your model's code, or if you are not working with PyTorch models.
Stars: 109
Forks: 11
Language: Python
License: Apache-2.0
Category: ml-frameworks
Last pushed: Aug 25, 2023
Commits (30d): 0
Dependencies: 8
Reverse dependents: 1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/LukasHedegaard/pytorch-benchmark"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
gpu-mode/Triton-Puzzles
Puzzles for learning Triton
hailo-ai/hailo_model_zoo
The Hailo Model Zoo includes pre-trained models and a full building and evaluation environment
open-mmlab/mmdeploy
OpenMMLab Model Deployment Framework
hyperai/tvm-cn
TVM documentation in Simplified Chinese / TVM 中文文档