Runners

Module: polygraphy.tools.args

class TrtRunnerArgs

Bases: BaseRunnerArgs

TensorRT Inference: running inference with TensorRT.

Depends on:

  • TrtLoadEngineArgs

parse_impl(args)

Parses command-line arguments and populates the following attributes:

optimization_profile

The index of the optimization profile to initialize the runner with.

Type: int

allocation_strategy

The way activation memory is allocated.

Type: str

weight_streaming_budget

The weight streaming budget in bytes.

Type: int

weight_streaming_percent

The percentage of weights streamed.

Type: float
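
As a rough illustration of what "parses command-line arguments and populates the following attributes" could look like, the sketch below uses only the standard library's argparse. It is not the Polygraphy implementation: the class name SketchRunnerArgs, the helper build_parser, and the command-line flag spellings are all assumptions chosen to mirror the four attribute names and types listed above.

```python
import argparse


def build_parser():
    """Build a parser with one option per documented attribute.

    The flag spellings are assumptions derived from the attribute names;
    they are not taken from the Polygraphy source.
    """
    parser = argparse.ArgumentParser(description="Sketch of runner options")
    parser.add_argument("--optimization-profile", type=int, default=None)
    parser.add_argument("--allocation-strategy", type=str, default=None)
    parser.add_argument("--weight-streaming-budget", type=int, default=None)
    parser.add_argument("--weight-streaming-percent", type=float, default=None)
    return parser


class SketchRunnerArgs:
    """Hypothetical stand-in for an argument group (not TrtRunnerArgs)."""

    def parse_impl(self, args):
        # Copy each parsed value onto the instance. getattr with a default
        # keeps the sketch tolerant of namespaces that lack an option.
        self.optimization_profile = getattr(args, "optimization_profile", None)
        self.allocation_strategy = getattr(args, "allocation_strategy", None)
        self.weight_streaming_budget = getattr(args, "weight_streaming_budget", None)
        self.weight_streaming_percent = getattr(args, "weight_streaming_percent", None)


if __name__ == "__main__":
    namespace = build_parser().parse_args(
        ["--optimization-profile", "1", "--weight-streaming-percent", "50"]
    )
    group = SketchRunnerArgs()
    group.parse_impl(namespace)
    print(group.optimization_profile, group.weight_streaming_percent)
```

Options that are not supplied stay at None in this sketch, so a caller can tell "not set" apart from an explicit value such as profile index 0.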