Environment Variables¶
NVSHMEM provides a set of environment variables that allows users to configure the NVSHMEM implementation and receive information about the implementation.
-
NVSHMEM_VERSION
¶
Value: Any
Print the library version at start-up
-
NVSHMEM_INFO
¶
Value: Any
Print helpful text about all these environment variables
-
NVSHMEM_SYMMETRIC_SIZE
¶
Value: Non-negative integer or floating point value with an optional character suffix
Specifies the size (in bytes) of the symmetric heap memory per PE. The resulting size is implementation-defined and must be least as large as the integer ceiling of the product of the numeric prefix and the scaling factor. The allowed character suffixes for the scaling factor are as follows:
- k or K multiplies by (kibibytes)
- m or M multiplies by (mebibytes)
- g or G multiplies by (gibibytes)
- t or T multiplies by (tebibytes)
For example, string “20m” is equivalent to the integer value 20971520, or 20 mebibytes. Similarly the string “3.1M” is equivalent to the integer value 3250586. Only one multiplier is recognized and any characters following the multiplier are ignored, so “20kk” will not produce the same result as “20m”. Usage of string “.5m” will yield the same result as the string “0.5m”.
An invalid value for NVSHMEM_SYMMETRIC_SIZE
is an error, which the
NVSHMEM library shall report by either returning a nonzero value from
nvshmem_init_thread
or causing program termination.
-
NVSHMEM_DEBUG
¶
Value: WARN, INFO, TRACE, or any other value to enable basic debug output
Controls the debug information that is displayed from NVSHMEM.
-
NVSHMEM_DEBUG_FILE
¶
Value: filename
Sets the filename where the debug output is written. The filename may contain %h for the hostname and %p for the PID.
-
NVSHMEM_ENABLE_NIC_PE_MAPPING
¶
Value: Boolean
When this variable is not set or is set to 0, a PE is assigned the NIC on the node that is closest
to it. When set to 1, NVSHMEM assigns NICs to PEs on a
round-robin basis or uses NVSHMEM_HCA_PE_MAPPING
or NVSHMEM_HCA_LIST
when
they are specified.
-
NVSHMEM_HCA_PE_MAPPING
¶
Value: HCA Mapping
Specifies the mapping of HCAs to PEs as a comma-separated list so that the
GPU that corresponds to the PE uses the given HCA for all transfers. Each entry in
the comma separated list is in the hca_name:port:count
format. For example,
mlx5_0:1:2,mlx5_0:2:2
means that PEs 0 and 1 can be mapped to port 1 of
mlx5_0
, and PEs 2 and 3 can be mapped to port 2 of mlx5_0
.
For this variable to be effective, NVSHMEM_ENABLE_NIC_PE_MAPPING
must be set to 1.
-
NVSHMEM_HCA_LIST
¶
Value: List of HCAs
Specifies a comma-separated list of HCAs to use in the NVSHMEM application.
This variable is useful to skip disabled HCAs or HCAs that operate in a different mode from
InfiniBand. Each entry in the comma separated list is in the
hca_name:port
format. For example, mlx5_1:1,mlx5_2:2
specifies using port 1
of mlx5_1
and port 2 of mlx5_2
in the application. A ^ before the list
indicates an exclusion list. For example, ^mlx5_1:1,mlx5_1:2
means that NVSHMEM should not use
mlx5_1
port 1 and mlx5_1
port 2.
For this variable to be effective, NVSHMEM_ENABLE_NIC_PE_MAPPING
must be set to 1.
-
NVSHMEM_MPI_LIB_NAME
¶
Value: String
Name of the MPI library that is used for bootstrapping NVSHMEM. By default,
NVSHMEM looks for libraries with the name libmpi.so
.
-
NVSHMEM_SHMEM_LIB_NAME
¶
Value: String
Name of the OpenSHMEM library that is used for bootstrapping NVSHMEM. By
default, NVSHMEM looks for libraries with the name liboshmem.so
.
-
NVSHMEM_BARRIER_DISSEM_KVAL
¶
Value: Integer radix
Radix of the dissemination algorithm that is used for nvshmem_barrier
and
nvshmem_barrier_all
collectives. By default, NVSHMEM uses radix 2.
-
NVSHMEM_BARRIER_TG_DISSEM_KVAL
¶
Value: Integer radix
Radix of the dissemination algorithm used for nvshmem_barrier_warp
,
nvshmem_barrier_group
, nvshmem_barrier_all_warp
, and
nvshmem_barrier_all_group
collectives. By default, NVSHMEM uses radix 2.
-
NVSHMEM_BOOTSTRAP_PMI
¶
Value: One of ‘PMIX’, ‘PMI-2’, ‘PMI2’ (Alias of PMI-2), or ‘PMI’
Specifies the desired PMI version to use when bootstrapping NVSHMEM applications. NVSHMEM_BOOTSTRAP_PMI overrides the default PMI version set at NVSHMEM compilation time. This variable is ignored by applications being bootstrapped by an external MPI or SHMEM library. Otherwise, setting this environment variable to anything other than the values listed above results in an NVSHMEM initialization error.
-
NVSHMEM_REMOTE_TRANSPORT
¶
Value: One of ‘ucx’, ‘ibrc’, or ‘none’
Sets the desired remote transport for NVSHMEM. Setting a value of none disables all remote transports and forces NVSHMEM into local-only mode. Note: In order to properly set the transport to UCX, NVSHMEM must first be compiled with UCX support. Please see the installation guide for more information on compiling NVSHMEM.