Dynamo CLI#
The Dynamo CLI is a powerful tool for serving, containerizing, and deploying Dynamo applications. It leverages core pieces of the BentoML deployment stack and provides a range of commands to manage your Dynamo services.
Overview At a high level, the Dynamo CLI allows you to:
run
- quickly chat with a modelserve
- run a set of services locally (viadepends()
or.link()
)build
- create an archive of your services (called abento
)
Commands#
run
#
The run
command allows you to quickly chat with a model. Under the hood - it is running the dynamo-run
Rust binary. You can find the arguments that it takes here: dynamo-run docs
Example
dynamo run deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
serve
#
The serve
command lets you run a defined inference graph locally. You must point toward your file and intended class using file:Class syntax
Usage
dynamo serve [SERVICE]
Arguments
SERVICE
- The service to start. You use file:Class syntax to specify the service.
Flags
--file
/-f
- Path to optional YAML configuration file. An example of the YAML file can be found in the configuration section of the SDK docs--dry-run
- Print out the dependency graph and values without starting any services.--service-name
- Only serve the specified service name. The rest of the discoverable components in the graph are not started.--working-dir
- Specify the directory to find the Service instanceAny additional flags that follow Class.key=value will be passed to the service constructor for the target service and parsed. Please see the configuration section of the SDK docs for more details.
Example
cd examples
# Spin up Frontend, Middle, and Backend components
dynamo serve hello_world:Frontend
# Spin up only the Middle component in the graph that is discoverable from the Frontend service
dynamo serve --service-name Middle hello_world:Frontend
build
#
The build
commmand allows you to package up your inference graph and its dependancies and create an archive of it. This is commonly paired with the --containerize
flag to create a single docker container that runs your inference graph. As with serve
, you point toward the first service in your dependency graph.
Usage
dynamo build [SERVICE]
Arguments
SERVICE
- The service to build. You use file:Class syntax to specify the service.
Flags
--working-dir
- Specify the directory to find the Service instance--containerize
- Whether to containerize the Bento after building
Example
cd examples/hello_world
dynamo build hello_world:Frontend