Optimize and inference of model using Inplace Optimize#

In this example, we show how to Model Navigator Inplace Optimize to run optimized models in place of the PyTorch models in the original pipeline. Depending on the mode, the optimize.py script can run PyTorch automatic speech recognition (ASR) Whisper pipeline or optimize and run the Whisper model in TensorRT without any changes to the original pipeline.

We recommend running this example in NVIDIA NGC PyTorch container. The Python script optimize.py wraps the Python model using Inplace Optimize and then runs profiling.

To run the optimization and profiling, run the script:

./optimize.py