Guides#

Welcome to the NeMo-Run guides! This section provides comprehensive documentation on how to use NeMo-Run effectively for your machine learning experiments.

Get Started#

If you’re new to NeMo-Run, follow the guides in this order:

Why Use NeMo-Run? — Understand the benefits and philosophy.
Quickstart — Get something running in 5 minutes.
Configuration — Learn how to configure tasks and experiments.
Execution — Understand executors, packagers, and launchers.
Executors — Per-executor guides from local to cloud.
Management — Track, inspect, and reproduce past experiments.

Advanced Topics#

CLI Reference — Automate experiment management from the command line.
Ray Integration — Distributed Ray workloads on Kubernetes, Slurm, and Lepton.
Architecture — Internals for contributors and power users.

Core Concepts#

NeMo-Run is built around three core responsibilities:

Configuration — Define ML experiments using a flexible, Pythonic configuration system.
Execution — Run experiments seamlessly across local machines, Slurm clusters, cloud providers, and more.
Management — Track, reproduce, and organize experiments with built-in experiment management.