Installing TensorRT-Cloud CLI
Prerequisites
Install Linux, Windows, or MacOS.
Install Python 3.8 or above.
The
tensorrtortensorrt-leanPython package must be installed with the version matching the TensorRT engine for refit support through the CLI. Thetensorrt-leanpackage is required for refitting version-compatible engines (those built with the--vctrtexec flag), andtensorrtis required for refitting non-version-compatible engines.python3 -m pip install tensorrt==10.0.1 tensorrt-cu12==10.0.1 python3 -m pip install tensorrt-lean==10.0.1 tensorrt_lean-cu12==10.0.1
For more information, refer to the NVIDIA TensorRT Installation Guide.
If you would like to refit a TensorRT-LLM engine, install it locally. For more information, refer to the NVIDIA TensorRT-LLM Quick Start Guide.
Note
refit is not supported on MacOS as TensorRT does not support MacOS.
Steps
The TensorRT-Cloud CLI tool is distributed as a Python wheel packaged in a tar file. To access the tar, refer to Getting TensorRT-Cloud Access.
Use
pipto install.python3 -m pip install trt-cloud
Verify the TensorRT-Cloud CLI tool was installed successfully.
trt-cloud --version
Get an overview of the CLI usage.
trt-cloud --help