Deep Learning Profiler 20.12 Release Notes

Deep Learning Profiler 20.12 Release Notes (PDF)

Description

DLProf release for 20.12, available in the NVIDIA TensorFlow 1.x, TensorFlow 2.x, and PyTorch NGC containers, and as a Python Wheel on the NVIDIA PY Index.

Driver Requirements

Release 20.12 is based on NVIDIA CUDA 11.1.1, which requires NVIDIA Driver release 455 or later. However, if you are running on Tesla (for example, T4 or any other Tesla board), you may use NVIDIA driver release 418.xx, 440.30, or 450.xx. The CUDA driver's compatibility package only supports particular drivers. For a complete list of supported drivers, see the CUDA Application Compatibility topic. For more information, see CUDA Compatibility and Upgrades.

New Features

The key features of DLProf v0.18.0 / r20.12 are:

Known Issues

  • This software is accessible in the NGC TensorFlow and PyTorch containers and as a separate PIP wheel.
  • This software is only supported for TensorFlow 1.15, TensorFlow 2.3, PyTorch 1.8, TensorBoard 1.15, and TensorBoard 2.3.

Resolved Issues

  • PyTorch AMP operations that use Tensor Cores now report the correct datatype.
© Copyright 2024, NVIDIA. Last updated on Mar 27, 2024.