Appendix A: Helm Chart Parameters
Refer to the Kubernetes API reference for details on customizing values in the Helm chart.
The values shown are the default values.
Deployment parameters
Refer to the Deployment API reference for details.
Name |
Description |
Value |
---|---|---|
|
[default: {}] Affinity settings for deployment. |
|
|
Sets privilege and access control settings for container (Only affects the main container, not pod-level). |
|
|
Overrides command line options sent to the NIM with the array listed here. |
|
|
Overrides command line arguments of the NIM container with the array listed here. |
|
|
Adds arbitrary environment variables to the main container using key-value pairs, for example NAME: value. |
|
|
Adds arbitrary additional volumes to the deployment set definition. |
|
|
Specify volume mounts to the main container from |
|
|
NIM Image Repository. |
|
|
Image tag or version. |
|
|
Image pull policy. |
|
|
Specify list of secret names that are needed for the main container and any init containers. |
|
|
Specify init containers, if needed. |
|
|
Sets node selectors for the NIM – for example |
|
|
Sets additional annotations on the main deployment pods. |
|
|
Specify privilege and access control settings for pod. |
|
|
Specify user UID for pod. |
|
|
Specify group ID for pod. |
|
|
Specify file system owner group id. |
|
|
Specify static replica count for deployment. |
|
|
Specify resources limits and requests for the running service. |
|
|
Specify number of GPUs to present to the running service. |
|
|
Specifies whether a service account should be created. |
|
|
Sets annotations to be added to the service account. |
|
|
Specifies the name of the service account to use. If it is not set and create is |
|
|
Enables |
|
|
Specify tolerations for pod assignment. Allows the scheduler to schedule pods with matching taints. |
Autoscaling parameters
Values used for creating a Horizontal Pod Autoscaler
. If autoscaling is not enabled, the rest are ignored.
NVIDIA recommends that you use the custom metrics API, commonly implemented with the prometheus-adapter.
Standard metrics of CPU and memory are of limited use in scaling NIM.
Refer to the HorizontalPodAutoscaler API reference for details.
Name |
Description |
Value |
---|---|---|
|
Enables horizontal pod autoscaler. |
|
|
Specify minimum replicas for autoscaling. |
|
|
Specify maximum replicas for autoscaling. |
|
|
Array of metrics for autoscaling. |
|
Ingress parameters
Refer to the Ingress API reference for details.
Name |
Description |
Value |
---|---|---|
|
Enables ingress. |
|
|
Specify class name for Ingress. |
|
|
Specify additional annotations for ingress. |
|
|
Specify list of hosts each containing lists of paths. |
|
|
Specify name of host. |
|
|
Specify ingress path. |
|
|
Specify path type. |
|
|
Specify list of pairs of TLS |
|
Probe parameters
Refer to the Pod Probe API reference for details.
Name |
Description |
Value |
---|---|---|
|
Enables |
|
|
|
|
|
Initial delay seconds for |
|
|
Timeout seconds for |
|
|
Period seconds for |
|
|
Success threshold for |
|
|
Failure threshold for |
|
|
Enables |
|
|
|
|
|
Initial delay seconds for |
|
|
Timeout seconds for |
|
|
Period seconds for |
|
|
Success threshold for |
|
|
Failure threshold for |
|
|
Enables |
|
|
|
|
|
Initial delay seconds for |
|
|
Timeout seconds for |
|
|
Period seconds for |
|
|
Success threshold for |
|
|
Failure threshold for |
|
Metrics parameters
Refer to the ServiceMonitor API reference for details.
Name |
Description |
Value |
---|---|---|
|
For NIMs with a separate metrics port, this opens that port on the container |
|
|
Options for |
|
|
Enables |
|
|
Specify additional labels for ServiceMonitor. |
|
NIM parameters
Name |
Description |
Value |
---|---|---|
|
Path to mount writeable storage or pre-filled model cache for the NIM |
|
|
Optionally specifies the name of the model in the API. This can be used in helm tests. |
|
|
Name of pre-existing secret with a key named |
|
|
NGC API key literal to use as the API secret and image pull secret when set |
|
|
Specify Open AI Port, for NIM. |
|
|
Specify HTTP Port. |
|
|
Specify GRPC Port. |
|
|
Specify extra labels to be added to the deployed pods. |
|
|
Whether to enable JSON lines logging. Defaults to true. |
|
|
Log level of NIM service. Possible values of the variable are TRACE, DEBUG, INFO, WARNING, ERROR, CRITICAL. |
|
Storage parameters
Refer to the PersistentVolumeClaim API reference and the PersistentVolume API reference for details.
Name |
Description |
Value |
---|---|---|
|
Specify settings to modify the path |
|
|
Enables the use of persistent volumes. |
|
|
Specifies an existing persistent volume claim. If using |
|
|
Specifies the persistent volume storage class. If set to |
|
|
Specify |
|
|
Specifies persistent volume claim retention policy when deleted. Only used with Stateful Set volume templates. |
|
|
Specifies persistent volume claim retention policy when scaled. Only used with Stateful Set volume templates. |
|
|
Specifies the size of the persistent volume claim (for example 40Gi). |
|
|
Adds annotations to the persistent volume claim. |
|
|
Configures model cache on local disk on the nodes using |
|
|
Enable |
|
|
Specifies path on the node used as a |
|
|
Configures the model cache to sit on shared direct-mounted NFS. NOTE: you cannot set mount options using direct NFS mount to pods without a node-intalled nfsmount.conf. An NFS-based |
|
|
Enable direct pod NFS mount |
|
|
Specify path on NFS server to mount |
|
|
Specify NFS server address |
|
|
Set to true to mount as read-only |
|
Service parameters
Refer to the Service API reference for details.
Name |
Description |
Value |
---|---|---|
|
Specifies the service type for the deployment. |
|
|
Overrides the default service name |
|
|
Specifies the OpenAI API Port for the service. |
|
|
Specifies the HTTP Port for the service. |
|
|
Specifies the GRPC Port for the service. |
|
|
Specifies the metrics port on the main service object. Some NIMs do not use a separate port. |
|
|
Specify additional annotations to be added to service. |
|
|
Specifies additional labels to be added to service. |
|