TristanBilot/mlx-benchmark
Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.
This tool helps machine learning engineers and researchers understand the performance of MLX operations across Apple Silicon chips (M1–M4) and compare them against PyTorch on Apple's MPS backend, the CPU, and NVIDIA CUDA GPUs. Given your hardware and MLX/PyTorch versions, it outputs detailed or averaged runtime benchmarks for a range of machine learning operations. It is well suited to anyone optimizing machine learning models for Apple hardware.
Use this if you are developing machine learning applications and need to compare the speed and efficiency of different ML frameworks and hardware configurations for specific operations.
Not ideal if you are looking for a high-level application performance monitor or a tool to benchmark entire machine learning model training workflows rather than individual operations.
Stars
217
Forks
30
Language
Python
License
MIT
Category
ML frameworks
Last pushed
Mar 08, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/TristanBilot/mlx-benchmark"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
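The same data can be fetched programmatically. Below is a minimal Python sketch built around the endpoint shown in the curl example; the shape of the JSON response is an assumption (the API's actual schema is not documented here), so the example simply prints whatever payload comes back.

```python
import json
import urllib.request

# Base path taken from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Build the API URL for a repository's quality data."""
    return f"{API_BASE}/{category}/{owner}/{repo}"


def fetch_repo_data(category: str, owner: str, repo: str) -> dict:
    """Fetch and decode the JSON payload (makes a network call).

    The response schema is assumed to be a JSON object; adjust
    parsing to match whatever the API actually returns.
    """
    with urllib.request.urlopen(build_url(category, owner, repo)) as resp:
        return json.load(resp)


if __name__ == "__main__":
    data = fetch_repo_data("ml-frameworks", "TristanBilot", "mlx-benchmark")
    print(json.dumps(data, indent=2))
```

Keeping URL construction in its own function makes it easy to reuse for other repositories in the same category without repeating the base path.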
Related frameworks
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit...
mlcommons/inference
Reference implementations of MLPerf® inference benchmarks
mlcommons/training
Reference implementations of MLPerf® training benchmarks
datamade/usaddress
:us: a python library for parsing unstructured United States address strings into address components
GRAAL-Research/deepparse
Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning