Updates in 2020.1.1
General
Added support for the NVIDIA GA100/SM 8.x GPU architecture
Metrics passed to
--metrics
on the NVIDIA Nsight Compute CLI or in the respective Profile activity option are automatically expanded to all first-level sub-metrics if required. See the documentation on--metrics
for more details.Added new rules for detecting inefficiencies of using the sparse data compression on the NVIDIA Ampere architecture.
The version of the NVIDIA Nsight Compute target collecting the results is shown in the Session page.
Added new
launch__grid_dim_[x,y,z]
andlaunch__block_dim_[x,y,z]
metrics.
NVIDIA Nsight Compute
The Break on API Error functionality has been improved when auto profiling.
NVIDIA Nsight Compute CLI
The full path to the report output file is printed after profiling.
Added and corrected metrics in the nvprof Metric Comparison table.
Resolved Issues
Documented the breakdown: metrics prefix.
Fixed handling of escaped domain delimiters in NVTX filter expressions.
Fixed issues with the occupancy charts for small block sizes.
Fixed an issue when choosing a default report page in the options dialog.
Fixed that the scroll bar could overlap the content when exporting the report page as an image.