NeMo Guardrails Microservice Helm Chart#
For deployment guide, see Admin Setup in the NeMo Microservices documentation.
Values#
Key |
Type |
Default |
Description |
---|---|---|---|
affinity |
object |
|
Specifies affinity settings for the deployment. To learn more, refer to the Affinity and Anti-Affinity in the Kubernetes documentation. |
autoscaling |
object |
|
Specifies autoscaling configurations for the deployment. |
autoscaling.enabled |
bool |
|
Whether to enable horizontal pod autoscaler. |
autoscaling.maxReplicas |
int |
|
The maximum number of replicas for the deployment. |
autoscaling.minReplicas |
int |
|
The minimum number of replicas for the deployment. |
autoscaling.targetCPUUtilizationPercentage |
int |
|
The target CPU utilization percentage. |
configStore.nfs.enabled |
bool |
|
Whether to enable the use of an NFS persistent volume for the configuration store. |
configStore.nfs.mountPath |
string |
|
The path where the NFS volume will be mounted inside the container. |
configStore.nfs.path |
string |
|
The path to the root of the Configuration Store folder. |
configStore.nfs.server |
string |
|
The address of the NFS server. |
configStore.nfs.storageClass |
string |
|
The storage class for the PV and PVC. |
env |
object |
|
Environment variables for the container. |
fsGroup |
string |
|
Specifies the file system owner group id. |
fullnameOverride |
string |
|
Overrides the full chart name. |
guardrails |
object |
|
Kubernetes secret containing NVIDIA_API_KEY for Guardrails to use Nemoguard NIMS on NVCF |
image.pullPolicy |
string |
|
The image pull policy for the NeMo Guardrails container image. |
image.repository |
string |
|
The repository location of the NeMo Guardrails container image. |
image.tag |
string |
|
The tag of the NeMo Guardrails container image. |
imagePullSecrets |
list |
|
Specifies the list of secret names that are needed for the main container and any init containers. |
ingress.annotations |
object |
|
Additional annotations for the ingress resource. |
ingress.className |
string |
|
The class name for the ingress resource. |
ingress.enabled |
bool |
|
Whether to enable the ingress resource. |
ingress.hosts |
list |
[] |
A list of maps, each containing the keys |
ingress.tls |
list |
|
TLS configuration for the ingress resource. |
livenessProbe.httpGet |
object |
|
The HTTP GET request to use for the liveness probe. |
livenessProbe.initialDelaySeconds |
int |
|
The initial delay seconds for the liveness probe. |
livenessProbe.timeoutSeconds |
int |
|
The timeout in seconds for the liveness probe. |
nameOverride |
string |
|
Overrides the chart name. |
nodeSelector |
object |
|
Specifies labels to ensure that the NeMo Guardrails microservice is deployed only on certain nodes. To learn more, refer to the Node Selector in the Kubernetes documentation. |
podAnnotations |
object |
|
Specifies additional annotations to the main deployment pods. |
podLabels |
object |
|
Specifies additional labels to the main deployment pods. |
podSecurityContext |
object |
|
Specifies privilege and access control settings for the pod. |
readinessProbe.httpGet |
object |
|
The HTTP GET request to use for the readiness probe. |
readinessProbe.initialDelaySeconds |
int |
|
The initial delay seconds for the readiness probe. |
readinessProbe.timeoutSeconds |
int |
|
The timeout in seconds for the readiness probe. |
replicaCount |
int |
|
The number of replicas for the NeMo Guardrails microservice deployment. |
resources |
object |
|
Specifies resource configurations for the deployment. |
runAsGroup |
string |
|
Specifies the group ID for the pod. |
securityContext |
object |
|
Specifies security context for the container. |
service |
object |
|
Specifies the service type and the port for the deployment. |
serviceAccount.annotations |
object |
|
Annotations to be added to the service account. |
serviceAccount.automount |
bool |
|
Whether to automatically mount the service account token. |
serviceAccount.create |
bool |
|
Whether to create a service account for the NeMo Guardrails microservice. |
serviceAccount.name |
string |
|
The name of the service account to use. |
tolerations |
list |
|
Specifies tolerations for pod assignment. To learn more, refer to the Taint and Toleration in the Kubernetes documentation. |