nemo_automodel.components.launcher.skypilot.utils
Module Contents
Functions
Data
API
Return a sky cloud object for the given cloud name string.
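As an illustration only, a name-to-cloud lookup of this kind could be sketched as below. The table `_CLOUD_CLASS_NAMES` and the helpers `normalize_cloud_name` and `resolve_cloud` are hypothetical names introduced here, not the module's actual code, and the set of supported names is an assumption. The `sky` import is deferred so that the name validation works without SkyPilot installed.

```python
# Hypothetical table: lower-case cloud name -> class name under sky.clouds.
# The real module may support a different set of clouds.
_CLOUD_CLASS_NAMES = {
    "aws": "AWS",
    "gcp": "GCP",
    "azure": "Azure",
    "kubernetes": "Kubernetes",
}


def normalize_cloud_name(name: str) -> str:
    """Validate a cloud name string and return its lower-case key."""
    key = name.strip().lower()
    if key not in _CLOUD_CLASS_NAMES:
        supported = ", ".join(sorted(_CLOUD_CLASS_NAMES))
        raise ValueError(f"Unknown cloud {name!r}; expected one of: {supported}")
    return key


def resolve_cloud(name: str):
    """Return a sky cloud object for the given name (requires SkyPilot)."""
    key = normalize_cloud_name(name)
    import sky  # imported lazily so validation works without SkyPilot

    return getattr(sky.clouds, _CLOUD_CLASS_NAMES[key])()
```

Validating the name before importing `sky` means a misspelled cloud name fails fast with a readable error, rather than an attribute lookup error from inside SkyPilot.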
Launch a training job on a cloud VM via SkyPilot.
The local job config written to job_dir/job_config.yaml is uploaded to REMOTE_CONFIG_PATH on the remote VM. The code in the current working directory is synced to ~/sky_workdir via SkyPilot’s workdir mechanism.
Parameters:
config: Populated SkyPilotConfig (including the training command).
job_dir: Local directory holding the job artifacts.
Returns: int
0 on successful submission.
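The upload and sync behaviour described above maps onto SkyPilot's file mounts and `workdir` options. A minimal sketch, assuming a placeholder value for `REMOTE_CONFIG_PATH` (the real constant is defined in this module and may differ) and hypothetical helper names `build_file_mounts` and `launch_sketch`:

```python
from pathlib import Path

# Placeholder value for illustration; the module defines the real constant.
REMOTE_CONFIG_PATH = "/tmp/job_config.yaml"


def build_file_mounts(job_dir, remote_config_path=REMOTE_CONFIG_PATH):
    """Map the remote config path to the local job_dir/job_config.yaml."""
    local_config = Path(job_dir) / "job_config.yaml"
    # SkyPilot file mounts are keyed by remote path, with the local path as value.
    return {remote_config_path: str(local_config)}


def launch_sketch(command, job_dir, cluster_name="automodel-demo"):
    """Submit `command` as a SkyPilot task (requires SkyPilot and cloud credentials)."""
    import sky  # imported lazily so the helper above works without SkyPilot

    # workdir="." asks SkyPilot to sync the current directory to ~/sky_workdir.
    task = sky.Task(run=command, workdir=".")
    task.set_file_mounts(build_file_mounts(job_dir))
    sky.launch(task, cluster_name=cluster_name)
    return 0  # mirrors the documented return value on successful submission
```

The sketch omits resource selection (cloud, accelerators, node count), which the documented function would presumably take from the `SkyPilotConfig` object.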