NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
NVIDIA TensorRT is a toolkit for developers who need to optimize and deploy deep learning models on NVIDIA GPUs for faster performance. It takes trained models, typically exported from frameworks such as PyTorch or TensorFlow (often via the ONNX format), and applies optimizations like layer fusion, reduced-precision (FP16/INT8) execution, and kernel auto-tuning so they run far more efficiently at inference time. This helps bring AI applications to users with minimal delay, making things like real-time image analysis or recommendation systems more responsive.
12,784 stars. Used by 2 other packages. Actively maintained with 1 commit in the last 30 days. Available on PyPI.
Use this if you are a deep learning engineer or MLOps specialist looking to significantly speed up the inference performance of your AI models on NVIDIA hardware.
Not ideal if you are an end user without deep learning development experience, or if you need to train models: TensorRT focuses solely on optimizing already-trained models for deployment.
Stars: 12,784
Forks: 2,321
Language: C++
License: Apache-2.0
Category: ml-frameworks
Last pushed: Mar 09, 2026
Commits (30d): 1
Dependencies: 1
Reverse dependents: 2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/NVIDIA/TensorRT"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000/day.
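The same endpoint can be called from Python instead of curl. This is a minimal sketch: the URL pattern comes from the example above, but the shape of the JSON response (field names such as "stars") is an assumption, not a documented schema.

```python
# Sketch of fetching this package's quality data via the API shown above.
# Assumption: the response is a JSON object; its field names are not documented here.
import json
import urllib.request

BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the API URL for a package, mirroring the curl example."""
    return f"{BASE}/{category}/{owner}/{repo}"

def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """GET the endpoint and decode the JSON body (anonymous: 100 requests/day)."""
    with urllib.request.urlopen(quality_url(category, owner, repo)) as resp:
        return json.load(resp)

# Example: URL for this repository (matches the curl command above).
print(quality_url("ml-frameworks", "NVIDIA", "TensorRT"))
```

`fetch_quality` performs a live network request, so call it sparingly to stay within the anonymous rate limit.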
Related frameworks
microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
onnx/onnx
Open standard for machine learning interoperability
PINTO0309/onnx2tf
Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The...
onnx/onnxmltools
ONNXMLTools enables conversion of models to ONNX
microsoft/onnxconverter-common
Common utilities for ONNX converters