Description
PyProf release for 20.08, is available in the NVIDIA PyTorch NGC containers and directly from the PyProf GitHub page.
New Features
The key features of PyProf v3.3.0 / r20.08 are:
- Latest PyProf version supports PyTorch 1.5.0 (PyTorch 1.6.0 with DLProf) and Nsight Systems 2020.3.2.
- Latest PyProf version compatible with DLProf v0.14.0 / r20.08.
- Capture PyTorch API information and data loading configuration.
- Added CUTLASS to the list of GEMM kernels.
- Added optional function stack tracing to NVTX markers. Enable with
pyprof.init(enable_function_stack=True)
.
Known Issues
- This software only supports PyTorch 1.5.
- Forward-Backward kernel correlation heuristics do not work correctly with PyTorch 1.6.
Recommended work arounds include:
- Use with PyTorch 1.5.
- Use the 20.03-py3 PyTorch NGC container:
docker pull nvcr.io/nvidia/pytorch:20.03-py3
- Use DLProf in the 20.08 NGC PyTorch container.
Resolved Issues
- None