Removed Python APIs and Replacements#
Warning
The APIs listed below have been removed in TensorRT 11.x and will cause runtime errors if called. Review each entry for its replacement before upgrading.
The following Python APIs have been removed. Each entry shows the removed API and its replacement or migration path.
BuilderFlag.FP16Strong typing with ModelOpt AutoCast
BuilderFlag.INT8Explicit quantization with Q/DQ nodes
BuilderFlag.FP8Explicit quantization with Q/DQ nodes
BuilderFlag.BF16Strong typing with ModelOpt AutoCast
BuilderFlag.INT4Explicit quantization with Q/DQ nodes
BuilderFlag.FP4Explicit quantization with Q/DQ nodes
BuilderFlag.OBEY_PRECISION_CONSTRAINTSStrong typing (always enforced)
BuilderFlag.PREFER_PRECISION_CONSTRAINTSStrong typing (always enforced)
BuilderFlag.DIRECT_IORemoved (unneeded)
IBuilderConfig.int8_calibratorExplicit quantization with Q/DQ nodes
IBuilderConfig.set_calibration_profile()Explicit quantization with Q/DQ nodes
IBuilderConfig.get_calibration_profile()Explicit quantization with Q/DQ nodes
IBuilderConfig.set_quantization_flag()Explicit quantization with Q/DQ nodes
IBuilderConfig.get_quantization_flag()Explicit quantization with Q/DQ nodes
IBuilderConfig.quantization_flagsExplicit quantization with Q/DQ nodes
IBuilderConfig.clear_quantization_flag()Explicit quantization with Q/DQ nodes
ICudaEngine.device_memory_sizeICudaEngine.device_memory_size_v2ICudaEngine.device_memory_size_for_profile()ICudaEngine.device_memory_size_for_profile_v2()ICudaEngine.has_implicit_batch_dimensionRemoved (always
False)ICudaEngine.weight_streaming_budgetICudaEngine.weight_streaming_budget_v2ICudaEngine.minimum_weight_streaming_budgetCompute from
streamable_weights_sizeand available memoryICudaEngine.create_execution_context_without_device_memory()ICudaEngine.create_execution_context()ICudaEngine.get_profile_tensor_values()ICudaEngine.get_profile_tensor_values_v2()IExecutionContext.all_input_shapes_specifiedRemoved (always
True)IExecutionContext.device_memoryIExecutionContext.device_memory_v2IInt8Calibrator(all subclasses)Explicit quantization with Q/DQ nodes
ILayer.precisionStrong typing (set types on tensors directly)
ILayer.precision_is_setRemoved
ILayer.reset_precision()Removed
ILayer.set_output_type()Strong typing (set types on tensors directly)
ILayer.output_type_is_set()Removed
ILayer.reset_output_type()Removed
INormalizationLayer.compute_precisionRemoved (use strong typing)
INetworkDefinition.add_plugin_v2()INetworkDefinition.add_plugin_v3()INetworkDefinition.add_normalization()INetworkDefinition.add_normalization_v2()IPluginV2DynamicExtIPluginV3IPluginCreatorIPluginCreatorV3OneIPluginRegistry.register_creator()(old overload)IPluginRegistry.register_creator()(acceptsIPluginCreatorInterface)IPluginRegistry.plugin_creator_listIPluginRegistry.all_creatorsIPluginRegistry.get_plugin_creator()IPluginRegistry.get_creator()IRefitter.set_dynamic_range()Explicit quantization with Q/DQ nodes
IRefitter.get_dynamic_range_min()Explicit quantization with Q/DQ nodes
IRefitter.get_dynamic_range_max()Explicit quantization with Q/DQ nodes
IRefitter.get_tensors_with_dynamic_range()Explicit quantization with Q/DQ nodes
ITensor.set_type()Strong typing (type determined by network construction)
ITensor.dynamic_rangeExplicit quantization with Q/DQ nodes
ITensor.is_dynamic_range_setRemoved
ITensor.reset_dynamic_range()Removed
TacticSource.CUBLASRemoved
TacticSource.CUBLAS_LTRemoved
TacticSource.CUDNNRemoved