cuDNN Release Notes v7.6.2

Key Features and Enhancements

The following features and enhancements have been added to this release:

  • Enhanced the performance of 3D deconvolution using cudnnConvolutionBackwardData(), for the following configuration:
    • 2x2x2 filter and 2x2x2 convolution stride.
    • Tensor Core operations with FP16 for data input, data output, and accumulation.
    • Tensor Core operations with FP32 for data input, data output, and accumulation.
  • Enhanced the performance of 3D convolution using cudnnConvolutionForward(), for the following configuration:
  • Enhanced the functionality of the data type cudnnFusedOps_t by adding the following three enum values:
    • CUDNN_FUSED_CONV_SCALE_BIAS_ADD_ACTIVATION
    • CUDNN_FUSED_SCALE_BIAS_ADD_ACTIVATION_GEN_BITMASK
    • CUDNN_FUSED_DACTIVATION_FORK_DBATCHNORM
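
To illustrate the 3D deconvolution configuration named above (2x2x2 filter, 2x2x2 stride), the following minimal sketch works out the spatial extents involved. It does not call cuDNN. The helper names, the zero padding, the unit dilation, and the sample extents are assumptions made for illustration only. The forward formula is the one documented for cudnnGetConvolutionNdForwardOutputDim(); the deconvolution helper simply inverts it to give the smallest matching extent.

```python
def conv_forward_dim(in_dim, pad, kernel, stride, dilation=1):
    """Forward convolution output extent along one axis.

    Follows the formula documented for cudnnGetConvolutionNdForwardOutputDim():
    1 + (in + 2*pad - ((kernel - 1)*dilation + 1)) / stride
    """
    return 1 + (in_dim + 2 * pad - ((kernel - 1) * dilation + 1)) // stride


def deconv_output_dim(dy_dim, pad, kernel, stride, dilation=1):
    """Smallest dx extent whose forward convolution yields dy_dim.

    Hypothetical helper: the algebraic inverse of conv_forward_dim,
    i.e. the extent of the tensor produced by a backward-data pass.
    """
    return (dy_dim - 1) * stride - 2 * pad + dilation * (kernel - 1) + 1


# Assumed example values: a depth/height/width of 8x16x16 for dy, no padding.
dy_shape = (8, 16, 16)
dx_shape = tuple(deconv_output_dim(d, pad=0, kernel=2, stride=2) for d in dy_shape)
print(dx_shape)  # (16, 32, 32)

# Round trip: the forward convolution of dx gives back the dy extents.
round_trip = tuple(conv_forward_dim(d, pad=0, kernel=2, stride=2) for d in dx_shape)
print(round_trip)  # (8, 16, 16)
```

With a filter size equal to the stride and no padding, each spatial extent exactly doubles, and the filter windows tile the output without overlap.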

Fixed Issues

The following issues have been fixed in this release: