NVIDIA STREAM Benchmark#
x86 package folder structure#
stream-gpu-test.sh script in the root directory of the package to invoke the stream_test executable for NVIDIA GPUs.
NVIDIA STREAM in the folder ./stream-gpu-linux-x86_64
stream_test executable. NVIDIA STREAM benchmark for GPU with double precision elements
stream_test_fp32 executable. NVIDIA STREAM benchmark for GPU with single precision elements
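The layout above can be checked before running anything. The following is a minimal sketch, not part of the package: the function name check_stream_x86_layout is hypothetical, and it only tests that the three files listed in this section exist under a given package root.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not shipped with the package): report which of the
# x86 package files listed above are missing under a given package root.
check_stream_x86_layout() {
    local root="${1:-.}" missing=0 path
    for path in \
        stream-gpu-test.sh \
        stream-gpu-linux-x86_64/stream_test \
        stream-gpu-linux-x86_64/stream_test_fp32
    do
        if [ ! -e "$root/$path" ]; then
            echo "missing: $path"
            missing=1
        fi
    done
    # Return 0 when every file was found, 1 otherwise.
    return "$missing"
}

# Check the current directory, assuming it is the package root.
check_stream_x86_layout . || echo "package layout is incomplete"
```

Pass the directory where the package was unpacked as the first argument if it is not the current directory.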
aarch64 package folder structure#
stream-test-cpu.sh script in the root directory of the package to invoke the NVIDIA STREAM executable for NVIDIA Grace CPU.
stream-test-gpu.sh script in the root directory of the package to invoke the NVIDIA STREAM executable for NVIDIA Grace Hopper and NVIDIA Grace Blackwell.
NVIDIA STREAM in the folder ./stream-gpu-linux-aarch64
stream_test executable. NVIDIA STREAM benchmark for GPU with double precision elements
stream_test_fp32 executable. NVIDIA STREAM benchmark for GPU with single precision elements
NVIDIA STREAM in the folder ./stream-cpu-linux-aarch64
stream_test executable. NVIDIA STREAM benchmark for NVIDIA Grace CPU with double precision elements
Running the NVIDIA STREAM Benchmarks on x86_64 with NVIDIA GPUs, NVIDIA Grace Hopper, and NVIDIA Grace Blackwell systems#
The script stream-gpu-test.sh can be invoked on a command line or through a Slurm batch script to launch the NVIDIA STREAM benchmark.
The script stream-gpu-test.sh accepts the following optional parameters:
--d <int>: device number
--n <int>: number of elements in the arrays
--dt fp32: enable the fp32 (single precision) STREAM test
--t <string>: tests which will be executed, can be any combination of:
C - COPY test
S - SCALE test
A - ADD test
T - TRIAD test
For example, the value --t CST means that the COPY, SCALE and TRIAD tests will be executed.
Default value: CSAT
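The options above can be combined on one command line. The following is a minimal sketch under stated assumptions: the helper name stream_gpu_cmd is hypothetical, the device number and element count are placeholder values, and it is assumed that the options may be given together in this order. The helper only prints the command so that it can be inspected before being run.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the package): assemble a
# stream-gpu-test.sh command line from the options documented above.
stream_gpu_cmd() {
    local device="$1" elements="$2" tests="$3" dtype="${4:-}"

    # --t accepts any combination of the letters C, S, A and T.
    case "$tests" in
        ""|*[!CSAT]*)
            echo "invalid test selection: $tests" >&2
            return 1
            ;;
    esac

    local cmd="./stream-gpu-test.sh --d $device --n $elements --t $tests"
    # --dt fp32 selects the single precision test.
    if [ "$dtype" = "fp32" ]; then
        cmd="$cmd --dt fp32"
    fi
    echo "$cmd"
}

# COPY, SCALE and TRIAD on device 0; 67108864 elements is a placeholder size.
stream_gpu_cmd 0 67108864 CST
```

The printed line can be pasted into a terminal or placed in a Slurm batch script. Passing fp32 as a fourth argument appends --dt fp32.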
Running the NVIDIA STREAM Benchmarks on NVIDIA Grace CPU only systems#
The script stream-cpu-test.sh can be invoked on a command line or through a Slurm batch script to launch the NVIDIA STREAM benchmark.
The script stream-cpu-test.sh accepts the following optional parameters:
--n <int>: number of elements in the arrays
--t <int>: number of threads
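A Slurm batch script for the CPU benchmark might look like the sketch below. Everything except the script name and its two options is an assumption: the #SBATCH values, the STREAM_ELEMENTS variable, the element count, and the fallback of 72 threads (chosen to match the core count of a single NVIDIA Grace CPU) are placeholders to adapt to the target system.

```shell
#!/bin/bash
#SBATCH --job-name=stream-cpu
#SBATCH --nodes=1
#SBATCH --ntasks=1
#SBATCH --cpus-per-task=72
#SBATCH --time=00:10:00

# Thread count follows the Slurm allocation; 72 is an assumed fallback
# for runs outside Slurm.
threads="${SLURM_CPUS_PER_TASK:-72}"
# Placeholder array size; STREAM_ELEMENTS is a hypothetical override.
elements="${STREAM_ELEMENTS:-100000000}"

# Script name as used in this section; run from the package root.
script="./stream-cpu-test.sh"

echo "Running: $script --n $elements --t $threads"
if [ -x "$script" ]; then
    "$script" --n "$elements" --t "$threads"
else
    echo "$script not found in $(pwd); nothing was run" >&2
fi
```

Submit the file with sbatch from the root directory of the package, or run it directly in a shell to use the fallback values.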