Local Executor#

The Local executor runs evaluations on your machine using Docker. It provides a fast way to iterate if you have Docker installed, evaluating existing endpoints.

See common concepts and commands in Executors.

Prerequisites#

Docker
Python environment with the NeMo Evaluator Launcher CLI available (install the launcher by following Installation Guide)

Quick Start#

For detailed step-by-step instructions on evaluating existing endpoints, refer to the NeMo Evaluator Launcher guide, which covers:

Choosing models and tasks
Setting up API keys (for NVIDIA APIs, see Setting up API Keys)
Creating configuration files
Running evaluations

Here’s a quick overview for the Local executor:

Run evaluation for existing endpoint#

# Run evaluation
nemo-evaluator-launcher run --config packages/nemo-evaluator-launcher/examples/local_basic.yaml \
  -o target.api_endpoint.api_key_name=NGC_API_KEY

Environment Variables and Secrets#

Environment variables use the unified prefix syntax ($host:, $lit:, $runtime:) described in Environment Variables. Declare them at the top-level env_vars: section, at evaluation.env_vars, or per-task. Secret values are stored in a .secrets.env file alongside the generated run.sh and sourced at runtime — they never appear in the script itself.

env_vars:
  HF_TOKEN: $host:HF_TOKEN
evaluation:
  tasks:
    - name: my_task
      env_vars:
        CUSTOM_VAR: $host:MY_CUSTOM_VAR

Mounting and Storage#

The Local executor uses Docker volume mounts for data persistence:

Docker Volumes#

Results Mount: Each task’s artifacts directory mounts as /results in evaluation containers
Custom Mounts: Use to extra_docker_args field to define custom volume mounts (see Advanced configuration )

Advanced configuration#

You can customize your local executor by specifying extra_docker_args. This parameter allows you to pass any flag to the docker run command that is executed by the NeMo Evaluator Launcher. You can use it to mount additional volumes, set environment variables or customize your network settings.

For example, if you would like your job to use a specific docker network, you can specify:

execution:
  extra_docker_args: "--network my-custom-network"

Replace my-custom-network with host to access the host network.

To mount additional custom volumes, do:

execution:
  extra_docker_args: "--volume /my/local/path:/my/container/path"

Rerunning Evaluations#

The Local executor generates reusable scripts for rerunning evaluations:

Script Generation#

The Local executor automatically generates scripts:

run_all.sequential.sh: Script to run all evaluation tasks sequentially (in output directory)
run.sh: Individual scripts for each task (in each task subdirectory)
Reproducible: Scripts contain all necessary commands and configurations

Manual Rerun#

# Rerun all tasks
cd /path/to/output_dir/2024-01-15-10-30-45-abc12345/
bash run_all.sequential.sh

# Rerun specific task
cd /path/to/output_dir/2024-01-15-10-30-45-abc12345/task1/
bash run.sh

Key Features#

Docker-based execution: Isolated, reproducible runs
Script generation: Reusable scripts for rerunning evaluations
Real-time logs: Status tracking via log files

Monitoring and Job Management#

For monitoring jobs, checking status, and managing evaluations, see Executors.