Builder¶
-
class
tensorrt.
Builder
(self: tensorrt.tensorrt.Builder, logger: tensorrt.tensorrt.ILogger) → None¶ Builds an
ICudaEngine
from aINetworkDefinition
.Variables: - max_batch_size –
int
The maximum batch size which can be used at execution time, and also the batch size for which theICudaEngine
will be optimized. - max_workspace_size –
int
The maximum GPU temporary memory which theICudaEngine
can use at execution time. - debug_sync –
bool
Whether theBuilder
should use debug synchronization. If this is true, theBuilder
will synchronize after timing each layer, and report the layer name. It can be useful when diagnosing issues at build time. - min_find_iterations –
int
The number of minimization iterations used when timing layers. When timing layers, theBuilder
minimizes over a set of average times for layer execution. This parameter controls the number of iterations used in minimization. - average_find_iterations –
int
The number of averaging iterations used when timing layers. When timing layers, theBuilder
minimizes over a set of average times for layer execution. This parameter controls the number of iterations used in averaging. - platform_has_fast_fp16 –
bool
Whether the platform has fast native fp16. - platform_has_fast_int8 –
bool
Whether the platform has fast native int8. - int8_mode –
bool
Whether Int8 mode is used. - int8_calibrator –
IInt8Calibrator
The Int8 Calibration interface. - fp16_mode –
bool
Whether or not 16-bit kernels are permitted. DuringICudaEngine
build fp16 kernels will also be tried when this mode is enabled.
Parameters: logger – The logger to use. This should be a derived class of
ILogger
.-
build_cuda_engine
(self: tensorrt.tensorrt.Builder, network: tensorrt.tensorrt.INetworkDefinition) → tensorrt.tensorrt.ICudaEngine¶ Builds an
ICudaEngine
from aINetworkDefinition
.Parameters: network – The TensorRT INetworkDefinition
.Returns: A new ICudaEngine
.
-
create_network
(self: tensorrt.tensorrt.Builder) → tensorrt.tensorrt.INetworkDefinition¶ Create a
INetworkDefinition
object.Returns: An empty TensorRT INetworkDefinition
.
- max_batch_size –