Updates in 2021.2

General

  • Added support for the CUDA toolkit 11.4.

  • Added support for OptiX version 7.3.

  • Added support for profiling on NVIDIA virtual GPUs (vGPUs) on an upcoming GRID/vGPU release.

  • Added a new Python-based report interface for interacting with report files from Python scripts.

  • Added a new rule to warn users when sampling metrics were selected, but no sampling data was collected.

  • Renamed SOL to Throughput in the Speed of Light section.

  • Renamed several memory_* metrics used on the Source page, to better reflect the measured value. See the Source page documentation for more details.

NVIDIA Nsight Compute

  • Added support for opening cubin files in a Standalone Source Viewer without profiling the application.

  • Moved the output of all rules so that it is visible even if a section’s body is collapsed. Visibility of the rules’ output can be toggled by a new button in the report header.

  • The profiler report header now shows the report name for each baseline when ambiguous.

  • Rules can define Focused Metrics that were most important for triggering their result output. Metrics are provided per result message which additional information, such as the underlying conditions and thresholds.

  • Memory tables show tooltips for cells with derived metric calculations.

  • Added a knowledge base service to show more comprehensive background information on metric names and descriptions in their tooltips.

  • Following a link in the Source Counters hot spot tables automatically selects the corresponding metric in the Source page.

  • Added new columns for visualizing register dependencies in the SASS view of the Source page.

  • Functions in the SASS view are now sorted by name.

  • Added support for OptiX 7.x resource tracking in the interactive profile activity. The Resources tool window will show information on instantiated optixDeviceContexts, optixModules, optixProgramGroups, optixPipelines and optixDenoiser objects.

  • Added support for new CUDA graph memory allocation APIs.

  • Improved consistency between command line parameters and the Next Trigger filter in the API Stream window for handling of regex inputs. The Next Trigger filter now considers kernel/API name as a regular expression only if string has regex: as prefix.

  • Added ability to select font settings in the options dialog.

  • Added ability to configure the metrics shown on the summary page via the options dialog.

  • The selected heatmap color scale now also applies to the Memory chart.

  • The ncu-ui script now checks for missing library dependencies, such as OpenGL or Qt.

NVIDIA Nsight Compute CLI

Resolved Issues

  • Fixed a performance issue in the NVIDIA Nsight Compute CLI when using --page raw --csv --units auto.

  • Fixed that the SSH passphase key is no longer persisted in the project file.

  • Fixed state of restore button in connection dialog. The button now supports restoring the default settings, if current setting differ from the default.

  • Fixed that the complete GPU name can be shown in the NVLINK topology diagram on macOS.

  • Fixed that collapsing the Source view reset the selected metrics.

  • Fixed that correlated lines could differ between filtered and unfiltered views of the executed functions.

  • Fixed that two application icons were shown in the macOS dock.

  • Improved HiDPI awareness.