PyProf 20.08 Release Notes

Description

The PyProf 20.08 release is available in the NVIDIA PyTorch NGC containers and directly from the PyProf GitHub page.

New Features

The key features of PyProf v3.3.0 / r20.08 are:

  • The latest PyProf version supports PyTorch 1.5.0 (PyTorch 1.6.0 with DLProf) and Nsight Systems 2020.3.2.
  • The latest PyProf version is compatible with DLProf v0.14.0 / r20.08.
  • Captures PyTorch API information and data loading configuration.
  • Added CUTLASS to the list of GEMM kernels.
  • Added optional function stack tracing to NVTX markers. Enable with pyprof.init(enable_function_stack=True).
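
As a minimal sketch of how the function stack option might be used, the example below wraps one forward/backward pass in NVTX annotation. The helper name run_profiled_step and its model and batch arguments are hypothetical and not part of PyProf; the sketch assumes a CUDA-capable machine with PyTorch 1.5 and PyProf installed, and that the model and batch already live on the GPU.

```python
def run_profiled_step(model, batch):
    """Run one forward/backward pass with PyProf NVTX markers enabled.

    `run_profiled_step`, `model`, and `batch` are illustrative names only.
    """
    # Imported lazily so this module still loads where PyTorch or PyProf
    # are not installed; the call itself needs both, plus a CUDA device.
    import torch
    import torch.cuda.profiler as profiler
    import pyprof

    # Enable PyProf annotation with the optional function stack tracing
    # described in these release notes.
    pyprof.init(enable_function_stack=True)

    # emit_nvtx() makes autograd operations emit NVTX ranges; start()/stop()
    # bound the region that the external profiler captures.
    with torch.autograd.profiler.emit_nvtx():
        profiler.start()
        loss = model(batch).sum()
        loss.backward()
        profiler.stop()
    return loss
```

The script that calls this helper would typically be launched under Nsight Systems so that the NVTX markers are recorded; the exact profiler command line depends on the installed version and is not shown here.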

Known Issues

  • This software only supports PyTorch 1.5.
  • Forward-backward kernel correlation heuristics do not work correctly with PyTorch 1.6. Recommended workarounds include:
    • Use with PyTorch 1.5.
    • Use the 20.03-py3 PyTorch NGC container:

      docker pull nvcr.io/nvidia/pytorch:20.03-py3

    • Use DLProf in the 20.08 NGC PyTorch container.
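
One way to guard against the PyTorch 1.6 limitation is to check the installed version before enabling PyProf. The sketch below is only an illustration: the helper name is_supported_pytorch is hypothetical, and it assumes version strings of the form major.minor.patch, optionally followed by a local suffix such as +cu101.

```python
def is_supported_pytorch(version: str) -> bool:
    """Return True when `version` names a PyTorch 1.5.x release.

    `is_supported_pytorch` is a hypothetical helper, not a PyProf API.
    """
    parts = version.split(".")
    if len(parts) < 2:
        return False
    try:
        # Only major and minor matter; the patch field may carry a suffix.
        major, minor = int(parts[0]), int(parts[1])
    except ValueError:
        return False
    return (major, minor) == (1, 5)
```

A training script could call is_supported_pytorch(torch.__version__) and fall back to one of the workarounds above when it returns False.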

Resolved Issues

  • None
© Copyright 2024, NVIDIA. Last updated on Apr 26, 2021.