⚠️ Important: This documentation is automatically generated from source code. Do not edit this file directly.
API Reference#
Packages#
nvidia.com/v1alpha1#
Package v1alpha1 contains API Schema definitions for the nvidia.com v1alpha1 API group.
Resource Types#
Autoscaling#
Appears in:
Field |
Description |
Default |
Validation |
|---|---|---|---|
|
|||
|
|||
|
|||
|
|||
|
DynamoComponentDeployment#
DynamoComponentDeployment is the Schema for the dynamocomponentdeployments API
Field |
Description |
Default |
Validation |
|---|---|---|---|
|
|
||
|
|
||
|
Refer to Kubernetes API documentation for fields of |
||
Spec defines the desired state for this Dynamo component deployment. |
DynamoComponentDeploymentSpec#
DynamoComponentDeploymentSpec defines the desired state of DynamoComponentDeployment
Appears in:
Field |
Description |
Default |
Validation |
|---|---|---|---|
|
DynamoComponent selects the Dynamo component from the archive to deploy. |
||
|
contains the tag of the DynamoComponent: for example, “my_package:MyService” |
||
|
BackendFramework specifies the backend framework (e.g., “sglang”, “vllm”, “trtllm”) |
Enum: [sglang vllm trtllm] |
|
|
Annotations to add to generated Kubernetes resources for this component |
||
|
Labels to add to generated Kubernetes resources for this component. |
||
|
The name of the component |
||
|
ComponentType indicates the role of this component (for example, “main”). |
||
|
Dynamo namespace of the service (allows to override the Dynamo namespace of the service defined in annotations inside the Dynamo archive) |
||
|
Resources requested and limits for this component, including CPU, memory, |
||
|
Autoscaling config for this component (replica range, target utilization, etc.). |
||
|
Envs defines additional environment variables to inject into the component containers. |
||
|
EnvFromSecret references a Secret whose key/value pairs will be exposed as |
||
|
PVC config describing volumes to be mounted by the component. |
||
|
Ingress config to expose the component outside the cluster (or through a service mesh). |
||
|
SharedMemory controls the tmpfs mounted at /dev/shm (enable/disable and size). |
||
|
ExtraPodMetadata adds labels/annotations to the created Pods. |
||
|
ExtraPodSpec allows to override the main pod spec configuration. |
||
|
LivenessProbe to detect and restart unhealthy containers. |
||
|
ReadinessProbe to signal when the container is ready to receive traffic. |
||
|
Replicas is the desired number of Pods for this component when autoscaling is not used. |
||
|
Multinode is the configuration for multinode components. |
DynamoGraphDeployment#
DynamoGraphDeployment is the Schema for the dynamographdeployments API.
Field |
Description |
Default |
Validation |
|---|---|---|---|
|
|
||
|
|
||
|
Refer to Kubernetes API documentation for fields of |
||
Spec defines the desired state for this graph deployment. |
|||
|
Status reflects the current observed state of this graph deployment. |
DynamoGraphDeploymentSpec#
DynamoGraphDeploymentSpec defines the desired state of DynamoGraphDeployment.
Appears in:
Field |
Description |
Default |
Validation |
|---|---|---|---|
|
DynamoGraph selects the graph (workflow/topology) to deploy. This must match |
||
|
Envs are environment variables applied to all services in the graph unless |
Optional: {} |
|
|
BackendFramework specifies the backend framework (e.g., “sglang”, “vllm”, “trtllm”). |
Enum: [sglang vllm trtllm] |
DynamoGraphDeploymentStatus#
DynamoGraphDeploymentStatus defines the observed state of DynamoGraphDeployment.
Appears in:
Field |
Description |
Default |
Validation |
|---|---|---|---|
|
State is a high-level textual status of the graph deployment lifecycle. |
||
|
Conditions contains the latest observed conditions of the graph deployment. |
IngressSpec#
Appears in:
Field |
Description |
Default |
Validation |
|---|---|---|---|
|
Enabled exposes the component through an ingress or virtual service when true. |
||
|
Host is the base host name to route external traffic to this component. |
||
|
UseVirtualService indicates whether to configure a service-mesh VirtualService instead of a standard Ingress. |
||
|
VirtualServiceGateway optionally specifies the gateway name to attach the VirtualService to. |
||
|
HostPrefix is an optional prefix added before the host. |
||
|
Annotations to set on the generated Ingress/VirtualService resources. |
||
|
Labels to set on the generated Ingress/VirtualService resources. |
||
|
TLS holds the TLS configuration used by the Ingress/VirtualService. |
||
|
HostSuffix is an optional suffix appended after the host. |
||
|
IngressControllerClassName selects the ingress controller class (e.g., “nginx”). |
IngressTLSSpec#
Appears in:
Field |
Description |
Default |
Validation |
|---|---|---|---|
|
SecretName is the name of a Kubernetes Secret containing the TLS certificate and key. |
MultinodeSpec#
Appears in:
Field |
Description |
Default |
Validation |
|---|---|---|---|
|
Indicates the number of nodes to deploy for multinode components. |
2 |
Minimum: 2 |
PVC#
Appears in:
Field |
Description |
Default |
Validation |
|---|---|---|---|
|
Create indicates to create a new PVC |
||
|
Name is the name of the PVC |
||
|
StorageClass to be used for PVC creation. Leave it as empty if the PVC is already created. |
||
|
Size of the NIM cache in Gi, used during PVC creation |
||
|
VolumeAccessMode is the volume access mode of the PVC |
||
|