.. include:: /content/common.rsts

Release 0.1.0 (BETA)
!!!!!!!!!!!!!!!!!!!!

Using Transformer Engine 0.1.0
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@

To upgrade to Transformer Engine 0.1.0 from an older version of
Transformer Engine, follow the installation and usage information in the
`NVIDIA Transformer Engine User Guide <../../user-guide/index.html>`_.

.. note::
   The internal Transformer Engine C++ API used to implement operators is not yet officially supported and may change in future releases without advance notice.


Key Features and Enhancements
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@

* Easy-to-use PyTorch modules for building Transformer layers
  with FP8 support on H100 GPUs
* Optimizations (e.g., fused kernels) for Transformer models
  across all precisions and NVIDIA GPU architectures
* Support for parallel execution via data parallelism, tensor parallelism
  and sequence parallelism


Fixed Issues
@@@@@@@@@@@@

There are no fixed issues in this release.


Known Issues in This Release
@@@@@@@@@@@@@@@@@@@@@@@@@@@@

The following issues are known to exist in this release:

* For some model configurations, the default fusion pattern
  in the ``LayerNormMLP`` and ``TransformerLayer`` modules does not give the best performance. In those cases, setting the environment variable ``NVTE_BIAS_GELU_NVFUSION`` to ``1`` can improve performance.
* Running the model in higher precision
  and then switching to FP8 precision currently results in an assertion failure.
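
As a minimal sketch of the first workaround, the variable can be set from Python before the model is built. This example assumes that Transformer Engine reads ``NVTE_BIAS_GELU_NVFUSION`` from the process environment when it is imported or when the modules are constructed; these notes do not say when the variable is read, so setting it before the import is the conservative choice. The import is left commented out so that the snippet runs without Transformer Engine installed.

.. code-block:: python

   import os

   # Assumption: the variable must be in the environment before
   # Transformer Engine is imported or its modules are constructed.
   os.environ["NVTE_BIAS_GELU_NVFUSION"] = "1"

   # Import Transformer Engine only after the variable is set, for example:
   # import transformer_engine.pytorch as te

   print(os.environ["NVTE_BIAS_GELU_NVFUSION"])

Exporting the variable in the shell that launches the training script achieves the same effect.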

Breaking Changes in This Release
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@

There are no breaking changes in this release.


Deprecated Features
@@@@@@@@@@@@@@@@@@@

There are no deprecated features in this release.