Building a Custom TensorRT-LLM Container | NVIDIA Dynamo Documentation

For the prebuilt container, see the TensorRT-LLM Quick Start.

Building a Custom Container

If you need to build a container from source (e.g., for custom modifications or a different CUDA version):

$ # TensorRT-LLM uses git-lfs, which needs to be installed in advance.
$ apt-get update && apt-get -y install git git-lfs
$ 
$ # On an x86 machine:
$ python container/render.py --framework=trtllm --target=runtime --output-short-filename --cuda-version=13.1
$ docker build -t dynamo:trtllm-latest -f container/rendered.Dockerfile .
$ 
$ # On an ARM machine:
$ python container/render.py --framework=trtllm --target=runtime --platform=arm64 --output-short-filename --cuda-version=13.1
$ docker build -t dynamo:trtllm-latest -f container/rendered.Dockerfile .

Run the custom container:

$ ./container/run.sh --framework trtllm -it