Is this page helpful?

Installing TensorRT#

This guide provides step-by-step instructions for installing TensorRT using various methods. Choose the installation method that best fits your development environment and deployment needs.

Before You Begin: Ensure you have reviewed the Prerequisites to confirm your system meets all requirements.

What Is TensorRT, and What Does Installing It Give You?#

NVIDIA TensorRT is an SDK for high-performance deep learning inference (running a trained model on new inputs to produce predictions) on NVIDIA GPUs. It compiles a trained model into a hardware-specific binary called an engine (also referred to as a plan file), then runs that engine inside your application.

Installing TensorRT puts the following on your system:

The TensorRT libraries (libnvinfer, libnvonnxparser, and friends), which your C++ or Python application links against to load and execute engines.
Python bindings (the tensorrt Python package), so you can build, save, and run engines from Python without writing C++.
The trtexec command-line tool, which builds an engine from an ONNX file, benchmarks it, and is the fastest way to confirm a fresh install actually works.
C++ headers (NvInfer.h and friends), included with every method except pip, for applications that build against the C++ API.

Once installation finishes, you should be able to run trtexec --help and import tensorrt from Python. At that point you’re ready to follow Build Your First Engine.

If you only need to run engines built by someone else (for example, a deployment server), the Lean or Dispatch runtimes described below are smaller alternatives to the Full Runtime.

Installation Method Comparison#

Quick Comparison Table:

Method	Best For	Requires Root	C++ Headers	Multi-Version	Installation Time
pip (Python)	Python development	No	No	Yes (venv)	⚡ Fastest (~2 min)
Debian/RPM	System-wide install	Yes	Yes	No	🔵 Fast (~5 min)
Tar/Zip	Multiple versions	No	Yes	Yes	🟡 Moderate (~10 min)
Container (NGC)	Isolated environments	No (Docker)	Yes	Yes	⚡ Fastest (~5 min)

Choosing Your Installation Method#

Use the Installation Method Comparison table above to pick pip, Debian/RPM, Tar/Zip, or Container. Follow the matching page under Installation Methods for step-by-step instructions.

Understanding TensorRT Runtime Options#

TensorRT ships Full, Lean, and Dispatch runtime packages with different capabilities and footprint sizes. For package names, size ranges, and a development-to-production workflow, refer to TensorRT Installation Modes in the prerequisites.

Downloading TensorRT#

Before installing with Debian (local repo), RPM (local repo), Tar, or Zip methods, you must download TensorRT packages.

Tip

For pip installation: Skip this section. The pip method downloads packages automatically from PyPI.

Go to https://developer.nvidia.com/tensorrt/download/11x.
Accept the license agreement.
Select TensorRT version 11.1.0 (or your target version).
Download the package for your platform:
- Linux x86-64: Debian local repo (.deb), RPM local repo (.rpm), or Tar (.tar.gz)
- Linux ARM SBSA and JetPack: Debian local repo (.deb) or Tar (.tar.gz)
- Windows x64: Zip (.zip)