Configure The NIM at Runtime#
GPU Selection#
On a system with a single NVIDIA GPU, you can use the --gpus device=0 option in docker command to start the service.
In heterogeneous environments with a combination of GPUs, such as L40S and a GeForce display GPU, you should only run workload on the L40S GPU. Use the --gpus device=<GPU ID> option in docker command to start the service.
Use the nvidia-smi -L command to get the device ID(s) to use as input(s). This command should return information similar to the following:
GPU 0: NVIDIA GeForce RTX 3080
GPU 1: L40S
For more information, see NVIDIA Container Toolkit documentation.
Environment Variables#
You can pass the following environment variables into a NIM by adding the -e flag to the docker run command.
ENV |
Required? |
Default |
Notes |
|---|---|---|---|
NGC_API_KEY |
Yes |
None |
Your NGC API key with read access to the model registry for the model profile you are using. |
NV_SCRATCH_SPACE |
Yes |
None |
You should provide the same mount path as used in the service launch command. NIM uses this space to store intermediate artifacts. |
NIM_LOG_LEVEL |
Optional |
None |
You can pass
|
NIM_TRITON_LOG_VERBOSE |
Optional |
None |
You can pass verbosity level [1,2,3,4,5] to enable extra logging from the triton server backend. |
S3_ENDPOINT_URL |
Optional |
None |
S3 endpoint URL required to establish connection. |
S3_TEAM_NAME |
Optional |
None |
S3 Team name required to establish connection. |
S3_ACCESS_KEY |
Optional |
None |
S3 access key required to establish connection. |
S3_REGION_NAME |
Optional |
None |
S3 region name required to establish connection. |