Dynamo CLI#

The Dynamo CLI is a powerful tool for serving, containerizing, and deploying Dynamo applications. It leverages core pieces of the BentoML deployment stack and provides a range of commands to manage your Dynamo services.

Overview At a high level, the Dynamo CLI allows you to:

  • run - quickly chat with a model

  • serve - run a set of services locally (via depends() or .link())

  • build - create an archive of your services (called a bento)

Commands#

run#

The run command allows you to quickly chat with a model. Under the hood - it is running the dynamo-run Rust binary. You can find the arguments that it takes here: dynamo-run docs

Example

dynamo run deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

serve#

The serve command lets you run a defined inference graph locally. You must point toward your file and intended class using file:Class syntax

Usage

dynamo serve [SERVICE]

Arguments

  • SERVICE - The service to start. You use file:Class syntax to specify the service.

Flags

  • --file/-f - Path to optional YAML configuration file. An example of the YAML file can be found in the configuration section of the SDK docs

  • --dry-run - Print out the dependency graph and values without starting any services.

  • --service-name - Only serve the specified service name. The rest of the discoverable components in the graph are not started.

  • --working-dir - Specify the directory to find the Service instance

  • Any additional flags that follow Class.key=value will be passed to the service constructor for the target service and parsed. Please see the configuration section of the SDK docs for more details.

Example

cd examples
# Spin up Frontend, Middle, and Backend components
dynamo serve hello_world:Frontend

# Spin up only the Middle component in the graph that is discoverable from the Frontend service
dynamo serve  --service-name Middle hello_world:Frontend

build#

The build commmand allows you to package up your inference graph and its dependancies and create an archive of it. This is commonly paired with the --containerize flag to create a single docker container that runs your inference graph. As with serve, you point toward the first service in your dependency graph.

Usage

dynamo build [SERVICE]

Arguments

  • SERVICE - The service to build. You use file:Class syntax to specify the service.

Flags

  • --working-dir - Specify the directory to find the Service instance

  • --containerize - Whether to containerize the Bento after building

Example

cd examples/hello_world
dynamo build hello_world:Frontend