Additional Resources#

Additional Resources#

Resource

Description

Layer builder API documentation for manual TensorRT engine construction: Creating a Network Definition in C++ and Creating a Network Definition in Python

The manual layer builder API is useful when you need the maximum flexibility possible in building a TensorRT engine. The Layer Builder API lets you construct a network by hand in TensorRT and gives you tools to load in weights from your model. When using the layer builder API, your goal is to essentially build an identical network to your training model using TensorRT layer by layer and then load in the weights from your model.

ONNX-TensorRT GitHub

The ONNX-TensorRT integration is a simple high-level interface for ONNX conversion with a Python runtime. It is useful for early prototyping of TensorRT workflows using the ONNX path.

Torch-TensorRT GitHub

Torch-TensorRT brings the power of TensorRT to PyTorch. In just one line of code, you can accelerate inference latency by up to 5x compared to eager execution.

TensorRT is integrated with NVIDIA’s profiling tool, NVIDIA Nsight Systems.

This is a great next step for further optimizing and debugging models you are working on productionizing.

TensorRT product documentation

Product documentation page for the ONNX, layer builder, C++, and legacy APIs.

TensorRT OSS GitHub

Contains OSS TensorRT components, sample applications, and plug-in examples.

TensorRT developer page

Contains downloads, posts, and quick reference code samples.