PyProf 20.08 Release Notes
Description
The PyProf 20.08 release is available in the NVIDIA PyTorch NGC containers and directly from the PyProf GitHub page.
New Features
The key features of PyProf v3.3.0 / r20.08 are:
- The latest PyProf version supports PyTorch 1.5.0 (PyTorch 1.6.0 with DLProf) and Nsight Systems 2020.3.2.
- The latest PyProf version is compatible with DLProf v0.14.0 / r20.08.
- Added capture of PyTorch API information and data loading configuration.
- Added CUTLASS to the list of GEMM kernels.
- Added optional function stack tracing to NVTX markers. Enable with pyprof.init(enable_function_stack=True).
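As a minimal sketch of the function stack option, the snippet below wraps the pyprof.init(enable_function_stack=True) call from these notes in a small helper. The helper name enable_pyprof and its fallback behavior when PyProf is not installed are assumptions made for illustration, not part of PyProf.

```python
import importlib


def enable_pyprof(enable_function_stack: bool = True) -> bool:
    """Initialize PyProf if it is installed.

    Hypothetical helper: returns True when pyprof.init() was called,
    False when the pyprof package could not be imported.
    """
    try:
        pyprof = importlib.import_module("pyprof")
    except ImportError:
        # PyProf is not available in this environment; skip profiling setup.
        return False

    # The keyword argument is the one named in these release notes.
    pyprof.init(enable_function_stack=enable_function_stack)
    return True


if __name__ == "__main__":
    print("PyProf initialized:", enable_pyprof())
```

In a training script, a call like this would normally come before the model is built, with the profiled region run under torch.autograd.profiler.emit_nvtx() so that the NVTX markers reach Nsight Systems.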
Known Issues
- This software only supports PyTorch 1.5.
- Forward-Backward kernel correlation heuristics do not work correctly with PyTorch 1.6.
Recommended workarounds include:
- Use with PyTorch 1.5.
- Use the 20.03-py3 PyTorch NGC container:
docker pull nvcr.io/nvidia/pytorch:20.03-py3
- Use DLProf in the 20.08 NGC PyTorch container.
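One way to guard against the PyTorch 1.6 limitation is to check the installed version string before enabling PyProf. The sketch below is an assumption about how such a check could look; is_supported_pytorch is a hypothetical helper, and it takes the version as a plain string (for example the value of torch.__version__) so that it does not depend on PyTorch being importable.

```python
def is_supported_pytorch(version: str) -> bool:
    """Return True if the version string denotes PyTorch 1.5.x.

    Hypothetical helper: strips a local build suffix such as "+cu101"
    and compares only the major and minor components.
    """
    core = version.split("+", 1)[0]
    parts = core.split(".")
    try:
        major, minor = int(parts[0]), int(parts[1])
    except (IndexError, ValueError):
        # Unparseable version strings are treated as unsupported.
        return False
    return (major, minor) == (1, 5)
```

A script could call is_supported_pytorch(torch.__version__) at startup and fall back to one of the workarounds listed above when it returns False.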
Resolved Issues
- None