Intro to the Transformer Engine API

Transformer models are the backbone of language models, from BERT to GPT-3, and require enormous compute resources. Transformer Engine (TE) is a library for accelerating Transformer models on NVIDIA's newest generation of GPUs. In this lab, we will work through a variety of examples showing how TE has been integrated with PyTorch. TE will be integrated with more deep learning libraries in the future.

The TE APIs allow existing training pipelines to benefit from these significant speedups with minimal changes to the current code.
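As a sketch of what "minimal changes" can look like, the example below swaps torch.nn.Linear for its TE counterpart and wraps the forward pass in an FP8 autocast. It assumes a Hopper-class GPU and the transformer_engine package are available; the layer sizes are illustrative only.

```python
import torch
import transformer_engine.pytorch as te
from transformer_engine.common import recipe

# Drop-in replacement: te.Linear takes the same basic arguments as torch.nn.Linear
model = te.Linear(768, 3072, bias=True).cuda()
inp = torch.randn(32, 768, device="cuda")

# The delayed-scaling recipe controls how FP8 scaling factors are computed
fp8_recipe = recipe.DelayedScaling(margin=0, fp8_format=recipe.Format.E4M3)

# Only work inside this context runs in FP8; the rest of the pipeline is unchanged
with te.fp8_autocast(enabled=True, fp8_recipe=fp8_recipe):
    out = model(inp)

out.sum().backward()  # backward pass works as usual
```

Because the TE modules mirror their PyTorch equivalents, the rest of the training loop (optimizer, data loading, checkpointing) stays as-is.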

To get started with building and extending Transformer Engine API support for PyTorch, follow the steps below.

  1. Open the Jupyter Notebook using the Jupyter Notebook link in the left-hand navigation pane.

  2. Run through the notebook fp8_primer.ipynb to learn how to use the new FP8 datatype with TE.
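Before opening the primer, it can help to see numerically what FP8 rounding does. The pure-Python sketch below is an illustration, not TE's implementation: it rounds a float to the nearest value representable in the E4M3 format (1 sign bit, 4 exponent bits, 3 mantissa bits, maximum representable magnitude 448).

```python
import math

def fp8_e4m3_round(x: float) -> float:
    """Round x to the nearest FP8 E4M3-representable value (illustrative sketch).

    E4M3: 1 sign bit, 4 exponent bits (bias 7), 3 mantissa bits.
    Largest magnitude is 448; smallest normal exponent is -6.
    """
    if x == 0.0:
        return 0.0
    sign = -1.0 if x < 0 else 1.0
    a = min(abs(x), 448.0)                   # saturate at the E4M3 maximum
    exp = max(math.floor(math.log2(a)), -6)  # flooring at -6 yields subnormals
    step = 2.0 ** exp / 8.0                  # 3 mantissa bits -> 8 steps per binade
    return sign * min(round(a / step) * step, 448.0)

# Only a few hundred distinct values exist, so nearby floats collapse together:
print(fp8_e4m3_round(0.1))     # 0.1015625 -- nearest E4M3 neighbour of 0.1
print(fp8_e4m3_round(1000.0))  # 448.0 -- out-of-range values saturate
```

The primer notebook covers how TE manages the scaling factors that keep tensors inside this narrow dynamic range.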

  3. Run through the notebook Transformer-engine.ipynb to see TE in action in a variety of examples.
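As a preview of the kind of usage that notebook demonstrates, a complete Transformer block can be built from a single TE module. This sketch again assumes a Hopper-class GPU and the transformer_engine package; the dimensions are illustrative.

```python
import torch
import transformer_engine.pytorch as te

# One TE module bundles attention, the MLP, and layernorms for a Transformer block
layer = te.TransformerLayer(
    hidden_size=1024,
    ffn_hidden_size=4096,
    num_attention_heads=16,
).cuda()

# Default input layout is [sequence, batch, hidden]
x = torch.randn(128, 4, 1024, device="cuda")

with te.fp8_autocast(enabled=True):
    y = layer(x)

print(y.shape)  # torch.Size([128, 4, 1024]) -- hidden shape is preserved
```

Fused modules like this are where TE's FP8 kernels deliver the largest speedups, since the whole block runs through TE-managed compute paths.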

Note

To run a cell in the Jupyter Notebook, click on the cell you want to run and press Shift + Enter. Linux bash commands can be run inside the Jupyter Notebook by adding a bang symbol (!) before the command inside the Jupyter Notebook cell.

© Copyright 2022-2023, NVIDIA. Last updated on May 25, 2023.