NVIDIA STREAM Benchmark#

x86 package folder structure#

stream-gpu-test.sh script in the root directory of the package to invoke the stream_test executable for NVIDIA GPUs.

NVIDIA STREAM in the folder ./stream-gpu-linux-x86_64

  • stream_test executable. NVIDIA STREAM benchmark for GPU with double precision elements

  • stream_test_fp32 executable. NVIDIA STREAM benchmark for GPU with single precision elements

aarch64 package folder structure#

stream-test-cpu.sh script in the root directory of the package to invoke the NVIDIA STREAM executable NVIDIA Grace CPU.

stream-test-gpu.sh script in the root directory of the package to invoke the NVIDIA STREAM executable for NVIDIA Grace Hopper and NVIDIA Grace Blackwell.

  • NVIDIA STREAM in the folder ./stream-gpu-linux-aarch64

    • stream_test executable. NVIDIA STREAM benchmark for GPU with double precision elements

    • stream_test_fp32 executable. NVIDIA STREAM benchmark for GPU with single precision elements

  • NVIDIA STREAM in the folder ./stream-cpu-linux-aarch64

    • stream_test executable. NVIDIA STREAM benchmark for NVIDIA Grace CPU with double precision elements

Running the NVIDIA STREAM Benchmarks on x86_64 with NVIDIA GPUs, NVIDIA Grace Hopper, and NVIDIA Grace Blackwell systems#

The script stream-gpu-test.sh can be invoked on a command line or through a Slurm batch script to launch the NVIDIA STREAM benchmark.

The script stream-gpu-test.sh accepts the following optional parameters:

  • --d <int> device number

  • --n <int> number of elements in the arrays

  • --dt fp32 enable fp32 stream test

  • --t <string> tests which will be executed, can be any combination of:

    • C - COPY test

    • S - SCALE test

    • A - ADD test

    • T - TRAID test

    for example, value --t CST means that COPY, SCALE and TRIAD tests will be executed

    Default value CSAT

Running the NVIDIA STREAM Benchmarks on NVIDIA Grace CPU only systems#

The script stream-cpu-test.sh can be invoked on a command line or through a Slurm batch script to launch the NVIDIA STREAM benchmark.

The script stream-cpu-test.sh accepts the following optional parameters:

  • --n <int> number of elements in the arrays

  • --t <int> number of threads