Google Kubernetes Engine (GKE)
Google Kubernetes Engine (GKE)
Google Kubernetes Engine (GKE)
https://cloud.google.com/sdk/docs/install
Note: Please make sure GitHub branch/commit version matches with Dynamo platform and VLLM container.
After installation, verify the installation:
Expected output
We will deploy a LLM model to the Dynamo platform. Here we use Qwen/Qwen3-0.6B model with VLLM and disaggregated deployment as an example.
In the deployment yaml file, some adjustments have to/ could be made:
LD_LIBRARY_PATH and PATH of decoder container, to enable GKE find the correct GPU driverMore configurations please refer to https://github.com/ai-dynamo/dynamo/tree/v1.2.0/examples/deployments/GKE/vllm
Please note that LD_LIBRARY_PATH needs to be set properly in GKE as per Run GPUs in GKE
The following snippet needs to be present in the args field of the deployment yaml file:
For example, refer to the following from examples/deployments/GKE/vllm/disagg.yaml
Expected output after successful deployment