Removed Python APIs and Replacements#

Warning

The APIs listed below have been removed in TensorRT 11.x and will cause runtime errors if called. Review each entry for its replacement before upgrading.

The following Python APIs have been removed. Each entry shows the removed API and its replacement or migration path.

BuilderFlag.FP16

Strong typing with ModelOpt AutoCast

BuilderFlag.INT8

Explicit quantization with Q/DQ nodes

BuilderFlag.FP8

Explicit quantization with Q/DQ nodes

BuilderFlag.BF16

Strong typing with ModelOpt AutoCast

BuilderFlag.INT4

Explicit quantization with Q/DQ nodes

BuilderFlag.FP4

Explicit quantization with Q/DQ nodes

BuilderFlag.OBEY_PRECISION_CONSTRAINTS

Strong typing (always enforced)

BuilderFlag.PREFER_PRECISION_CONSTRAINTS

Strong typing (always enforced)

BuilderFlag.DIRECT_IO

Removed (unneeded)

IBuilderConfig.int8_calibrator

Explicit quantization with Q/DQ nodes

IBuilderConfig.set_calibration_profile()

Explicit quantization with Q/DQ nodes

IBuilderConfig.get_calibration_profile()

Explicit quantization with Q/DQ nodes

IBuilderConfig.set_quantization_flag()

Explicit quantization with Q/DQ nodes

IBuilderConfig.get_quantization_flag()

Explicit quantization with Q/DQ nodes

IBuilderConfig.quantization_flags

Explicit quantization with Q/DQ nodes

IBuilderConfig.clear_quantization_flag()

Explicit quantization with Q/DQ nodes

ICudaEngine.device_memory_size

ICudaEngine.device_memory_size_v2

ICudaEngine.device_memory_size_for_profile()

ICudaEngine.device_memory_size_for_profile_v2()

ICudaEngine.has_implicit_batch_dimension

Removed (always False)

ICudaEngine.weight_streaming_budget

ICudaEngine.weight_streaming_budget_v2

ICudaEngine.minimum_weight_streaming_budget

Compute from streamable_weights_size and available memory

ICudaEngine.create_execution_context_without_device_memory()

ICudaEngine.create_execution_context()

ICudaEngine.get_profile_tensor_values()

ICudaEngine.get_profile_tensor_values_v2()

IExecutionContext.all_input_shapes_specified

Removed (always True)

IExecutionContext.device_memory

IExecutionContext.device_memory_v2

IInt8Calibrator (all subclasses)

Explicit quantization with Q/DQ nodes

ILayer.precision

Strong typing (set types on tensors directly)

ILayer.precision_is_set

Removed

ILayer.reset_precision()

Removed

ILayer.set_output_type()

Strong typing (set types on tensors directly)

ILayer.output_type_is_set()

Removed

ILayer.reset_output_type()

Removed

INormalizationLayer.compute_precision

Removed (use strong typing)

INetworkDefinition.add_plugin_v2()

INetworkDefinition.add_plugin_v3()

INetworkDefinition.add_normalization()

INetworkDefinition.add_normalization_v2()

IPluginV2DynamicExt

IPluginV3

IPluginCreator

IPluginCreatorV3One

IPluginRegistry.register_creator() (old overload)

IPluginRegistry.register_creator() (accepts IPluginCreatorInterface)

IPluginRegistry.plugin_creator_list

IPluginRegistry.all_creators

IPluginRegistry.get_plugin_creator()

IPluginRegistry.get_creator()

IRefitter.set_dynamic_range()

Explicit quantization with Q/DQ nodes

IRefitter.get_dynamic_range_min()

Explicit quantization with Q/DQ nodes

IRefitter.get_dynamic_range_max()

Explicit quantization with Q/DQ nodes

IRefitter.get_tensors_with_dynamic_range()

Explicit quantization with Q/DQ nodes

ITensor.set_type()

Strong typing (type determined by network construction)

ITensor.dynamic_range

Explicit quantization with Q/DQ nodes

ITensor.is_dynamic_range_set

Removed

ITensor.reset_dynamic_range()

Removed

TacticSource.CUBLAS

Removed

TacticSource.CUBLAS_LT

Removed

TacticSource.CUDNN

Removed