athrva98/polyinfer
Unified deployment pipeline
This tool helps machine learning engineers and researchers quickly deploy and run trained models on a range of hardware, from NVIDIA GPUs to Intel CPUs. It takes a trained model file (such as an ONNX file), runs it efficiently on the fastest available backend without complex setup, and returns the model's predictions. It is aimed at anyone who needs the best performance from their AI models in real-world applications, regardless of the underlying hardware.
Available on PyPI.
Use this if you need to run AI models as fast as possible across different kinds of hardware without spending much time on configuration and tuning.
Not ideal if you are developing new models and primarily need a training framework rather than a way to deploy existing models for performance.
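polyinfer's own API surface isn't documented on this page, so as a point of reference, the sketch below shows the kind of backend-selection boilerplate such a tool aims to abstract away, written directly against onnxruntime. The provider preference order and the "model.onnx" path are illustrative assumptions, not part of polyinfer.

import numpy as np
import onnxruntime as ort

# Ask onnxruntime which execution providers this build actually supports,
# e.g. ["CUDAExecutionProvider", "CPUExecutionProvider"].
available = ort.get_available_providers()

# Prefer the fastest backend that is present, falling back to plain CPU.
preferred = ["TensorrtExecutionProvider", "CUDAExecutionProvider", "CPUExecutionProvider"]
providers = [p for p in preferred if p in available] or ["CPUExecutionProvider"]

# "model.onnx" is a placeholder path; any exported ONNX model works here.
session = ort.InferenceSession("model.onnx", providers=providers)

# Build a dummy input matching the model's first declared input,
# resolving any dynamic dimensions (strings/None) to 1.
first_input = session.get_inputs()[0]
shape = [d if isinstance(d, int) else 1 for d in first_input.shape]
dummy = np.random.rand(*shape).astype(np.float32)

outputs = session.run(None, {first_input.name: dummy})
print([o.shape for o in outputs])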
Stars: 9
Forks: —
Language: Python
License: Apache-2.0
Category:
Last pushed: Dec 26, 2025
Commits (30d): 0
Dependencies: 2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/athrva98/polyinfer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
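If you prefer Python over curl, a minimal sketch of the same call using requests follows; the response is printed raw because the JSON schema isn't documented here, and the timeout value is an arbitrary choice.

import requests

URL = "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/athrva98/polyinfer"

# No key is needed for up to 100 requests/day per the note above.
resp = requests.get(URL, timeout=10)
resp.raise_for_status()
print(resp.json())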
Higher-rated alternatives
microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
onnx/onnx
Open standard for machine learning interoperability
PINTO0309/onnx2tf
Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC).
NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs.
onnx/onnxmltools
ONNXMLTools enables conversion of models to ONNX