Key Features and Enhancements
The following features and enhancements have been added to this release:
- Enhanced the performance of 3D deconvolution using cudnnConvolutionBackwardData() for the
following configuration:
  - 2x2x2 filter and 2x2x2 convolution stride.
  - FP16 for data input and output, and for accumulation.
  - FP32 for data input and output, and for accumulation.
- Enhanced the performance of 3D convolution using cudnnConvolutionForward(), for the following
configuration:
- Extended the enumerated type cudnnFusedOps_t with the following three enumerators:
  - CUDNN_FUSED_CONV_SCALE_BIAS_ADD_ACTIVATION
  - CUDNN_FUSED_SCALE_BIAS_ADD_ACTIVATION_GEN_BITMASK
  - CUDNN_FUSED_DACTIVATION_FORK_DBATCHNORM
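
To make the 3D deconvolution configuration above concrete, the following sketch computes the output extents that a 2x2x2 filter with a 2x2x2 stride produces. It uses the standard transposed-convolution shape arithmetic and does not call cuDNN. Zero padding and unit dilation are assumptions (the notes above do not state them), and the helper names deconv_out_dim and deconv3d_2x2x2_out_dims are hypothetical.

```c
#include <assert.h>

/* Output extent of a transposed convolution (deconvolution) along one axis:
 *   out = (in - 1) * stride - 2 * pad + dilation * (filter - 1) + 1
 * Hypothetical helper for illustration; not part of the cuDNN API. */
static int deconv_out_dim(int in, int filter, int stride, int pad, int dilation)
{
    assert(in > 0 && filter > 0 && stride > 0 && pad >= 0 && dilation > 0);
    return (in - 1) * stride - 2 * pad + dilation * (filter - 1) + 1;
}

/* Applies the 2x2x2 filter / 2x2x2 stride case to depth, height and width.
 * ASSUMPTION: zero padding and dilation 1 on every axis. */
static void deconv3d_2x2x2_out_dims(const int in[3], int out[3])
{
    int i;
    for (i = 0; i < 3; ++i) {
        out[i] = deconv_out_dim(in[i], 2, 2, 0, 1);
    }
}
```

Under these assumptions every spatial extent doubles: an input of 8x16x16 (depth, height, width) maps to 16x32x32.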
Fixed Issues
The following issues have been fixed in this release: