alibaba/BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

/ 100

Established

Machine learning models, especially those with varying input sizes (dynamic shapes), can run slowly on standard setups. This project compiles your existing TensorFlow or PyTorch models so that they run faster on GPUs and CPUs. It's designed for machine learning engineers and MLOps professionals who want to deploy high-performing models in production.

919 stars. No commits in the last 6 months.

Use this if you need to speed up the inference or training of your TensorFlow or PyTorch machine learning models, particularly those with dynamic input shapes, on hardware such as NVIDIA, AMD, or Hygon GPUs and x86 or AArch64 CPUs.

Not ideal if your machine learning workloads always use static input shapes and are already well-optimized with existing static compilers, or if you are not working with TensorFlow or PyTorch.

machine-learning-deployment model-optimization deep-learning-inference AI-infrastructure MLOps

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 24 / 25

Stars 919

Forks 169

Language C++

License Apache-2.0

Related frameworks

apache/tvm

Open Machine Learning Compiler Framework

uxlfoundation/oneDNN

oneAPI Deep Neural Network Library (oneDNN)

Tencent/ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

OpenMined/TenSEAL

A library for doing homomorphic encryption operations on tensors

iree-org/iree-turbine

IREE's PyTorch Frontend, based on Torch Dynamo.
