Create ModelDeploymentConfig
Create a new ModelDeploymentConfig (version 1).
Path parameters
Request
Name of the deployment configuration. Allowed characters: letters (a-z, A-Z), digits (0-9), underscores, hyphens, and dots.
Inference engine selecting the compiler path (nim/vllm/generic)
What model to serve and how — independent of the executor it runs on
Compute + container settings for the executor the deployment runs on
Response
Name of the entity. Name/workspace combo must be unique across all entities. Allowed characters: letters (a-z, A-Z), digits (0-9), underscores, hyphens, and dots.
The workspace of the entity. Allowed characters: letters (a-z, A-Z), digits (0-9), underscores, hyphens, and dots.
Inference engine selecting the compiler path (nim/vllm/generic)
What model to serve and how — independent of the executor it runs on
Compute + container settings for the executor the deployment runs on