ONNX Conversion Guide#
TensorRT-RTX uses the Open Neural Network Exchange (ONNX) format as its primary model input. Before you can build a TensorRT-RTX engine, your model must be exported to an .onnx file. This guide covers how to export models from common training frameworks and how to build ONNX models programmatically.
Note
TensorRT-RTX supports ONNX opsets 9–22 (inclusive). Not all operators and precisions within these opsets are supported. For a complete list of supported operators, refer to the Operator Support reference.
Exporting from a Training Framework#
Most users train models in PyTorch, TensorFlow, or Hugging Face Transformers and then export them to ONNX. Choose the method that matches your framework.
PyTorch
Use torch.onnx.export() to convert a PyTorch model to ONNX:
import torch

model = ...  # Your trained PyTorch model
dummy_input = torch.randn(1, 3, 224, 224)

torch.onnx.export(model, dummy_input, "model.onnx",
                  input_names=["input"],
                  output_names=["output"],
                  dynamic_axes={"input": {0: "batch_size"},
                                "output": {0: "batch_size"}})
For detailed options and troubleshooting, refer to the PyTorch ONNX export documentation and tutorial.
TensorFlow
TensorFlow does not include built-in ONNX support. Use the open-source tf2onnx tool:
pip install tf2onnx
python -m tf2onnx.convert --saved-model ./saved_model --output model.onnx
Hugging Face Transformers
Use the Optimum library to export Hugging Face model checkpoints to ONNX:
pip install optimum[onnxruntime]
optimum-cli export onnx --model bert-base-uncased ./bert_onnx/
Building ONNX Models Programmatically#
If your framework does not support ONNX export, you can construct ONNX models directly using the ONNX Python API. This approach defines the model graph, operators, and weights using protocol buffers.
Example: Simple multilayer perceptron
import onnx
import onnx.helper as helper
import onnx.numpy_helper as numpy_helper
from onnx import TensorProto
nb_inputs, nb_hidden, nb_outputs = 10, 20, 1
# Create input and output tensors
input = helper.make_tensor_value_info('input', TensorProto.FLOAT, ['batch', nb_inputs])
output = helper.make_tensor_value_info('output', TensorProto.FLOAT, ['batch', nb_outputs])
# Assume weights and biases are NumPy arrays; with transB=1 below,
# W1 is [nb_hidden, nb_inputs] and W2 is [nb_outputs, nb_hidden]
W1, B1, W2, B2 = get_weights_biases(nb_inputs, nb_hidden, nb_outputs)
init_W1 = numpy_helper.from_array(W1, name='W1')
init_B1 = numpy_helper.from_array(B1, name='B1')
init_W2 = numpy_helper.from_array(W2, name='W2')
init_B2 = numpy_helper.from_array(B2, name='B2')
# Define operator nodes
gemm1 = helper.make_node('Gemm', ['input','W1','B1'], ['hidden1'], alpha=1.0, beta=1.0, transB=1)
relu1 = helper.make_node('Relu', ['hidden1'], ['relu1'])
gemm2 = helper.make_node('Gemm', ['relu1','W2','B2'], ['output'], alpha=1.0, beta=1.0, transB=1)
# Assemble graph
graph = helper.make_graph(nodes=[gemm1, relu1, gemm2],
                          name='MLP',
                          inputs=[input],
                          outputs=[output],
                          initializer=[init_W1, init_B1, init_W2, init_B2])
# Pin the opset to a version inside TensorRT-RTX's supported 9-22 range
model = helper.make_model(graph, producer_name='simple_mlp',
                          opset_imports=[helper.make_opsetid('', 17)])
# Validate and save
onnx.checker.check_model(model)
onnx.save(model, 'simple_mlp.onnx')
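The example above leaves `get_weights_biases` undefined. A minimal stand-in that generates random weights with the shapes the Gemm nodes expect (hypothetical; in a real workflow these arrays come from your trained checkpoint):

```python
import numpy as np

def get_weights_biases(nb_inputs, nb_hidden, nb_outputs):
    # Hypothetical helper: random float32 weights in place of trained ones.
    # With transB=1, Gemm weights are shaped [out_features, in_features].
    rng = np.random.default_rng(seed=0)
    W1 = rng.standard_normal((nb_hidden, nb_inputs)).astype(np.float32)
    B1 = rng.standard_normal(nb_hidden).astype(np.float32)
    W2 = rng.standard_normal((nb_outputs, nb_hidden)).astype(np.float32)
    B2 = rng.standard_normal(nb_outputs).astype(np.float32)
    return W1, B1, W2, B2
```

Keeping the initializers as float32 matters: ONNX Gemm requires floating-point inputs, and mismatched dtypes between the graph inputs and initializers will fail the checker.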
Next Steps#
After you have an .onnx file, proceed to build and run a TensorRT-RTX engine:
Deploy Your First Model — End-to-end walkthrough from ONNX to inference
Architecture Overview: Model Specification — ONNX vs. native API paths