nemo_microservices.types.v2.inference.k8s_nim_operator_config#
Module Contents#
Classes#
API#
- class nemo_microservices.types.v2.inference.k8s_nim_operator_config.K8sNIMOperatorConfig(/, **data: typing.Any)#
Bases:
nemo_microservices._models.BaseModel- node_selector: Optional[Dict[str, str]]#
None
Kubernetes node selector for pod placement.
Example: {‘node-type’: ‘gpu-node’, ‘zone’: ‘us-west1-a’}
- resources: Optional[Dict[str, object]]#
None
Kubernetes resource requirements including requests and limits.
Example: {‘requests’: {‘cpu’: ‘2’, ‘memory’: ‘8Gi’}, ‘limits’: {‘memory’: ‘16Gi’}}
- startup_probe_grace_seconds: Optional[int]#
None
Grace period in seconds for NIM startup.
Determines how long Kubernetes will wait for the NIM to become ready before restarting it. Example: 600 (10 minutes). Must be a positive integer.
- tolerations: Optional[List[Dict[str, object]]]#
None
Kubernetes tolerations for pod scheduling.
Example: [{‘key’: ‘nvidia.com/gpu’, ‘operator’: ‘Exists’, ‘effect’: ‘NoSchedule’}]