Removed C++ APIs and Replacements#
Warning
The APIs listed below have been removed in TensorRT 11.x and will cause compile-time errors if used. Review each entry for its replacement before upgrading.
BuilderFlag::kFP16Strong typing with ModelOpt AutoCast
BuilderFlag::kINT8Explicit quantization with Q/DQ nodes
BuilderFlag::kFP8Explicit quantization with Q/DQ nodes
BuilderFlag::kBF16Strong typing with ModelOpt AutoCast
BuilderFlag::kINT4Explicit quantization with Q/DQ nodes
BuilderFlag::kFP4Explicit quantization with Q/DQ nodes
BuilderFlag::kOBEY_PRECISION_CONSTRAINTSStrong typing (always enforced)
BuilderFlag::kPREFER_PRECISION_CONSTRAINTSStrong typing (always enforced)
BuilderFlag::kDIRECT_IORemoved (not needed in 11.x)
IAlgorithm,IAlgorithmContext,IAlgorithmIOInfo,IAlgorithmSelector,IAlgorithmVariantUse editable mode in
ITimingCacheinstead.IBuilderConfig::setInt8Calibrator(IInt8Calibrator*)Explicit quantization with Q/DQ nodes
IBuilder::platformHasFastFp16()Removed; use strongly typed networks instead of querying platform FP16 support. Third-party code that links against TensorRT 11.x — including the ONNX Runtime TensorRT Execution Provider — must migrate away from this API. ONNX Runtime 1.27+ supports TensorRT 11.x. Refer to the TensorRT 11.0.0 release notes (Known Issues) and TensorRT 11.1.0 release notes (Fixed Issues).
IBuilder::platformHasFastInt8()Removed; use explicit quantization with Q/DQ nodes instead of querying platform INT8 support.
IBuilderConfig::getInt8Calibrator()Removed
IBuilderConfig::setCalibrationProfile(IOptimizationProfile const*)Removed
IBuilderConfig::getCalibrationProfile()Removed
IBuilderConfig::setQuantizationFlags(QuantizationFlags)Removed
IBuilderConfig::getQuantizationFlags()Removed
IBuilderConfig::clearQuantizationFlag(QuantizationFlag)Removed
IBuilderConfig::setQuantizationFlag(QuantizationFlag)Removed
IBuilderConfig::getQuantizationFlag(QuantizationFlag)Removed
ICudaEngine::createExecutionContextWithoutDeviceMemory()ICudaEngine::createExecutionContext()ICudaEngine::getDeviceMemorySize()ICudaEngine::getDeviceMemorySizeV2()ICudaEngine::getDeviceMemorySizeForProfile(int32_t)ICudaEngine::getDeviceMemorySizeForProfileV2(int32_t)ICudaEngine::getMinimumWeightStreamingBudget()Compute from
getStreamableWeightsSize()ICudaEngine::getProfileTensorValues(char const*, int32_t, OptProfileSelector)ICudaEngine::getProfileTensorValuesV2()ICudaEngine::getWeightStreamingBudget()ICudaEngine::getWeightStreamingBudgetV2()ICudaEngine::hasImplicitBatchDimension()Removed (always
false)ICudaEngine::setWeightStreamingBudget(int64_t)ICudaEngine::setWeightStreamingBudgetV2(int64_t)IExecutionContext::allInputShapesSpecified()Removed (always
true)IExecutionContext::setDeviceMemory(void*)IExecutionContext::setDeviceMemoryV2(void*, int64_t)IGpuAllocator::allocate(uint64_t, uint64_t, AllocatorFlags)IGpuAllocator::allocateAsync(uint64_t, uint64_t, AllocatorFlags, cudaStream_t)IGpuAllocator::deallocate(void*)IGpuAllocator::deallocateAsync(void*, cudaStream_t)IInt8Calibrator(all subclasses)Explicit quantization with Q/DQ nodes
ILayer::setPrecision(DataType)Strong typing (set types on tensors directly)
ILayer::getPrecision()Removed
ILayer::precisionIsSet()Removed
ILayer::resetPrecision()Removed
ILayer::setOutputType(int32_t, DataType)Strong typing (set types on tensors directly)
ILayer::outputTypeIsSet(int32_t)Removed
ILayer::resetOutputType(int32_t)Removed
INetworkDefinition::addAttention(..., bool)INetworkDefinition::addAttentionV2(..., CausalMaskKind)INetworkDefinition::addNMS(ITensor&, ITensor&, ITensor&)INetworkDefinition::addNMS(..., DataType)(4-arg version)INetworkDefinition::addNonZero(ITensor&)INetworkDefinition::addNonZero(ITensor&, DataType)INetworkDefinition::addNormalization(...)INetworkDefinition::addNormalizationV2(...)INetworkDefinition::addPluginV2(ITensor* const*, int32_t, IPluginV2&)INetworkDefinition::addPluginV3(...)INetworkDefinition::addTopK(ITensor&, TopKOperation, int32_t, uint32_t)INetworkDefinition::addTopK(..., DataType)(5-arg version)INormalizationLayer::setComputePrecision(DataType)Removed (use strong typing)
IOutputAllocator::reallocateOutput(char const*, void*, uint64_t, uint64_t)IOutputAllocator::reallocateOutputAsync(..., cudaStream_t)IPluginCreatorIPluginCreatorV3OneIPluginRegistry::deregisterCreator(IPluginCreator const&)IPluginRegistry::deregisterCreator(IPluginCreatorInterface const&)IPluginRegistry::getPluginCreator(...)IPluginRegistry::getCreator(...)IPluginRegistry::getPluginCreatorList(int32_t*)IPluginRegistry::getAllCreators(int32_t*)IPluginRegistry::registerCreator(IPluginCreator&, ...)IPluginRegistry::registerCreator(IPluginCreatorInterface&, ...)IPluginV2DynamicExtIPluginV3IPluginV2ExtIPluginV3IPluginV2IOExtIPluginV3IPluginV2LayerIPluginV3LayerIRefitter::setDynamicRange(char const*, float, float)Explicit quantization with Q/DQ nodes
IRefitter::getDynamicRangeMin(char const*)Removed
IRefitter::getDynamicRangeMax(char const*)Removed
IRefitter::getTensorsWithDynamicRange()Removed
IRuntime::deserializeCudaEngine(IStreamReader&)IRuntime::deserializeCudaEngine(IStreamReaderV2&)ITensor::setType(DataType)Strong typing (type determined by network construction)
ITensor::setDynamicRange(float, float)Explicit quantization with Q/DQ nodes
ITensor::dynamicRangeIsSet()Removed
ITensor::resetDynamicRange()Removed
ITensor::getDynamicRangeMin()Removed
ITensor::getDynamicRangeMax()Removed
ITensor::setBroadcastAcrossBatch(bool)Removed (implicit batch not supported)
ITensor::getBroadcastAcrossBatch()Removed (implicit batch not supported)
TacticSource::kCUBLASRemoved
TacticSource::kCUBLAS_LTRemoved
TacticSource::kCUDNNRemoved
DetectionOutputParametersRemoved
NMSParametersRemoved
CodeTypeSSDRemoved
Removed C++ Plugins and Replacements#
Warning
The plugins listed below have been removed in TensorRT 11.x. Using them will cause compilation or linker errors. Review each entry for its replacement before upgrading.
BatchedNMS_TRTUse
INetworkDefinition::addNMS()BatchedNMSDynamic_TRTUse
INetworkDefinition::addNMS()BatchTilePlugin_TRTImplement with standard TensorRT layers
Clip_TRTUse
INetworkDefinition::addActivation()withkCLIPCoordConvACImplement with standard TensorRT layers (concatenate coordinate channels with
IConcatenationLayer, then apply convolution)CustomGeluPluginDynamicUse
INetworkDefinition::addActivation()withkGELU_ERForkGELU_TANHEfficientNMS_ONNX_TRTUse
INetworkDefinition::addNMS()LReLU_TRTUse
INetworkDefinition::addActivation()withkLEAKY_RELUNMS_TRTUse
INetworkDefinition::addNMS()NMSDynamic_TRTUse
INetworkDefinition::addNMS()Normalize_TRTUse
INetworkDefinition::addNormalizationV2()ProposalImplement with standard TensorRT layers
SingleStepLSTMPluginUse
INetworkDefinition::addLoop()or standard RNN decompositionSpecialSlice_TRTUse
INetworkDefinition::addSlice()SplitUse
INetworkDefinition::addSlice()
Deprecated BERT Plugins#
The following OSS BERT plugin classes are deprecated in 11.0.0 and scheduled for removal in a future release. Migrate to the listed replacements before upgrading beyond 11.x.
bertQKVToContextPlugin/CustomQKVToContextPluginDynamicRefer to Migrate to IAttention for more information.