.. include:: /content/common.rsts

Release Notes |ndash| Release 0.8.0 (BETA)
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

Key Features and Enhancements
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@

Added experimental support for TensorFlow (single GPU only for now).

Added C++ API for FP8 fused attention from cuDNN.

Optimized performance in some cases when using ``FlashAttention``.

Added an option to train without biases in ``LayerNormMLP``.

Added support for zero-centered gamma in ``LayerNorm`` in JAX.

Added an option to perform reduction of amax asynchronously when training with FP8.

Fixed Issues
@@@@@@@@@@@@

Fixed multiple issues with exporting the model to ONNX.

Known Issues in This Release
@@@@@@@@@@@@@@@@@@@@@@@@@@@@

There are no known issues in this release.

Breaking Changes in This Release
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@

There are no breaking changes in this release.

Deprecated Features
@@@@@@@@@@@@@@@@@@@

The ``TransformerLayer`` arguments `attention_softmax_in_fp32` and `apply_query_key_layer_scaling` are deprecated, and will be removed in a future release. The default behavior is as if those arguments were set to ``True``.