TensorRT 10.4.0
|
The TensorRT API version 1 namespace. More...
Namespaces | |
namespace | anonymous_namespace{NvInfer.h} |
namespace | anonymous_namespace{NvInferRuntime.h} |
namespace | apiv |
namespace | consistency |
namespace | impl |
namespace | plugin |
namespace | safe |
The safety subset of TensorRT's API version 1 namespace. | |
namespace | serialize |
namespace | v_1_0 |
Forward declare IErrorRecorder for use in other interfaces. | |
namespace | v_2_0 |
Classes | |
class | Dims2 |
Descriptor for two-dimensional data. More... | |
class | Dims3 |
Descriptor for three-dimensional data. More... | |
class | Dims4 |
Descriptor for four-dimensional data. More... | |
class | Dims64 |
class | DimsExprs |
Analog of class Dims with expressions instead of constants for the dimensions. More... | |
class | DimsHW |
Descriptor for two-dimensional spatial data. More... | |
struct | DynamicPluginTensorDesc |
Summarizes tensors that a plugin might see for an input or output. More... | |
class | IActivationLayer |
An Activation layer in a network definition. More... | |
class | IAlgorithm |
Describes a variation of execution of a layer. An algorithm is represented by IAlgorithmVariant and the IAlgorithmIOInfo for each of its inputs and outputs. An algorithm can be selected or reproduced using AlgorithmSelector::selectAlgorithms(). More... | |
class | IAlgorithmContext |
Describes the context and requirements, that could be fulfilled by one or more instances of IAlgorithm. More... | |
class | IAlgorithmIOInfo |
Carries information about input or output of the algorithm. IAlgorithmIOInfo for all the input and output along with IAlgorithmVariant denotes the variation of algorithm and can be used to select or reproduce an algorithm using IAlgorithmSelector::selectAlgorithms(). More... | |
class | IAlgorithmVariant |
provides a unique 128-bit identifier, which along with the input and output information denotes the variation of algorithm and can be used to select or reproduce an algorithm, using IAlgorithmSelector::selectAlgorithms() More... | |
class | IAssertionLayer |
An assertion layer in a network. More... | |
class | IBuilder |
Builds an engine from a network definition. More... | |
class | IBuilderConfig |
Holds properties for configuring a builder to produce an engine. More... | |
class | ICastLayer |
A cast layer in a network. More... | |
class | IConcatenationLayer |
A concatenation layer in a network definition. More... | |
class | IConditionLayer |
This layer represents a condition input to an IIfConditional. More... | |
class | IConstantLayer |
Layer that represents a constant value. More... | |
class | IConvolutionLayer |
A convolution layer in a network definition. More... | |
class | ICudaEngine |
An engine for executing inference on a built network, with functionally unsafe features. More... | |
class | IDeconvolutionLayer |
A deconvolution layer in a network definition. More... | |
class | IDequantizeLayer |
A Dequantize layer in a network definition. More... | |
class | IDimensionExpr |
An IDimensionExpr represents an integer expression constructed from constants, input dimensions, and binary operations. These expressions are can be used in overrides of IPluginV2DynamicExt::getOutputDimensions or IPluginV3OneBuild::getOutputShapes() to define output dimensions in terms of input dimensions. More... | |
class | IEinsumLayer |
An Einsum layer in a network. More... | |
class | IElementWiseLayer |
A elementwise layer in a network definition. More... | |
class | IEngineInspector |
An engine inspector which prints out the layer information of an engine or an execution context. More... | |
class | IExecutionContext |
Context for executing inference using an engine, with functionally unsafe features. More... | |
class | IExprBuilder |
Object for constructing IDimensionExpr. More... | |
class | IFillLayer |
Generate a tensor according to a specified mode. More... | |
class | IGatherLayer |
A Gather layer in a network definition. Supports several kinds of gathering. More... | |
class | IGridSampleLayer |
A GridSample layer in a network definition. More... | |
class | IHostMemory |
Class to handle library allocated memory that is accessible to the user. More... | |
class | IIdentityLayer |
A layer that represents the identity function. More... | |
class | IIfConditional |
Helper for constructing conditionally-executed subgraphs. More... | |
class | IIfConditionalBoundaryLayer |
This is a base class for Conditional boundary layers. More... | |
class | IIfConditionalInputLayer |
This layer represents an input to an IIfConditional. More... | |
class | IIfConditionalOutputLayer |
This layer represents an output of an IIfConditional. More... | |
class | IInt8Calibrator |
Application-implemented interface for calibration. More... | |
class | IIteratorLayer |
A layer to do iterations. More... | |
class | ILayer |
Base class for all layer classes in a network definition. More... | |
class | ILogger |
Application-implemented logging interface for the builder, refitter and runtime. More... | |
class | ILoggerFinder |
A virtual base class to find a logger. Allows a plugin to find an instance of a logger if it needs to emit a log message. A pointer to an instance of this class is passed to a plugin shared library on initialization when that plugin is serialized as part of a version-compatible plan. See the plugin chapter in the developer guide for details. More... | |
class | ILoop |
Helper for creating a recurrent subgraph. More... | |
class | ILoopBoundaryLayer |
This is a base class for Loop boundary layers. More... | |
class | ILoopOutputLayer |
An ILoopOutputLayer is the sole way to get output from a loop. More... | |
class | ILRNLayer |
A LRN layer in a network definition. More... | |
class | IMatrixMultiplyLayer |
Layer that represents a Matrix Multiplication. More... | |
class | INetworkDefinition |
A network definition for input to the builder. More... | |
class | INMSLayer |
A non-maximum suppression layer in a network definition. More... | |
class | INoCopy |
Forward declaration of IEngineInspector for use by other interfaces. More... | |
class | INonZeroLayer |
class | INormalizationLayer |
A normalization layer in a network definition. More... | |
class | InterfaceInfo |
Version information associated with a TRT interface. More... | |
class | IOneHotLayer |
A OneHot layer in a network definition. More... | |
class | IOptimizationProfile |
Optimization profile for dynamic input dimensions and shape tensors. More... | |
class | IPaddingLayer |
Layer that represents a padding operation. More... | |
class | IParametricReLULayer |
Layer that represents a parametric ReLU operation. More... | |
class | IPluginRegistry |
Single registration point for all plugins in an application. It is used to find plugin implementations during engine deserialization. Internally, the plugin registry is considered to be a singleton so all plugins in an application are part of the same global registry. Note that the plugin registry is only supported for plugins of type IPluginV2 and should also have a corresponding IPluginCreator implementation. More... | |
class | IPluginResourceContext |
Interface for plugins to access per context resources provided by TensorRT. More... | |
class | IPluginV2 |
Plugin class for user-implemented layers. More... | |
class | IPluginV2DynamicExt |
Similar to IPluginV2Ext, but with support for dynamic shapes. More... | |
class | IPluginV2Ext |
Plugin class for user-implemented layers. More... | |
class | IPluginV2IOExt |
Plugin class for user-implemented layers. More... | |
class | IPluginV2Layer |
Layer type for pluginV2. More... | |
class | IPluginV3Layer |
Layer type for V3 plugins. More... | |
class | IPoolingLayer |
A Pooling layer in a network definition. More... | |
class | IQuantizeLayer |
A Quantize layer in a network definition. More... | |
class | IRaggedSoftMaxLayer |
A RaggedSoftmax layer in a network definition. More... | |
class | IRecurrenceLayer |
A recurrence layer in a network definition. More... | |
class | IReduceLayer |
Layer that represents a reduction across a non-bool tensor. More... | |
class | IRefitter |
Updates weights in an engine. More... | |
class | IResizeLayer |
A resize layer in a network definition. More... | |
class | IReverseSequenceLayer |
A ReverseSequence layer in a network definition. More... | |
class | IRuntime |
Allows a serialized functionally unsafe engine to be deserialized. More... | |
class | IScaleLayer |
A Scale layer in a network definition. More... | |
class | IScatterLayer |
A scatter layer in a network definition. Supports several kinds of scattering. More... | |
class | ISelectLayer |
Select elements from two data tensors based on a condition tensor. More... | |
class | ISerializationConfig |
Holds properties for configuring an engine to serialize the binary. More... | |
class | IShapeLayer |
Layer type for getting shape of a tensor. More... | |
class | IShuffleLayer |
Layer type for shuffling data. More... | |
class | ISliceLayer |
Slices an input tensor into an output tensor based on the offset and strides. More... | |
class | ISoftMaxLayer |
A Softmax layer in a network definition. More... | |
class | ITensor |
A tensor in a network definition. More... | |
class | ITimingCache |
Class to handle tactic timing info collected from builder. More... | |
class | ITopKLayer |
Layer that represents a TopK reduction. More... | |
class | ITripLimitLayer |
A layer that represents a trip-count limiter. More... | |
class | IUnaryLayer |
Layer that represents an unary operation. More... | |
class | IVersionedInterface |
An Interface class for version control. More... | |
struct | Permutation |
Represents a permutation of dimensions. More... | |
class | PluginField |
Structure containing plugin attribute field names and associated data This information can be parsed to decode necessary plugin metadata. More... | |
struct | PluginFieldCollection |
Plugin field collection struct. More... | |
class | PluginRegistrar |
Register the plugin creator to the registry The static registry object will be instantiated when the plugin library is loaded. This static object will register all creators available in the library to the registry. More... | |
struct | PluginTensorDesc |
Fields that a plugin might see for an input or output. More... | |
class | Weights |
An array of weights used as a layer parameter. More... | |
Enumerations | |
enum class | LayerType : int32_t { kCONVOLUTION = 0 , kCAST = 1 , kACTIVATION = 2 , kPOOLING = 3 , kLRN = 4 , kSCALE = 5 , kSOFTMAX = 6 , kDECONVOLUTION = 7 , kCONCATENATION = 8 , kELEMENTWISE = 9 , kPLUGIN = 10 , kUNARY = 11 , kPADDING = 12 , kSHUFFLE = 13 , kREDUCE = 14 , kTOPK = 15 , kGATHER = 16 , kMATRIX_MULTIPLY = 17 , kRAGGED_SOFTMAX = 18 , kCONSTANT = 19 , kIDENTITY = 20 , kPLUGIN_V2 = 21 , kSLICE = 22 , kSHAPE = 23 , kPARAMETRIC_RELU = 24 , kRESIZE = 25 , kTRIP_LIMIT = 26 , kRECURRENCE = 27 , kITERATOR = 28 , kLOOP_OUTPUT = 29 , kSELECT = 30 , kFILL = 31 , kQUANTIZE = 32 , kDEQUANTIZE = 33 , kCONDITION = 34 , kCONDITIONAL_INPUT = 35 , kCONDITIONAL_OUTPUT = 36 , kSCATTER = 37 , kEINSUM = 38 , kASSERTION = 39 , kONE_HOT = 40 , kNON_ZERO = 41 , kGRID_SAMPLE = 42 , kNMS = 43 , kREVERSE_SEQUENCE = 44 , kNORMALIZATION = 45 , kPLUGIN_V3 = 46 } |
The type values of layer classes. More... | |
enum class | ActivationType : int32_t { kRELU = 0 , kSIGMOID = 1 , kTANH = 2 , kLEAKY_RELU = 3 , kELU = 4 , kSELU = 5 , kSOFTSIGN = 6 , kSOFTPLUS = 7 , kCLIP = 8 , kHARD_SIGMOID = 9 , kSCALED_TANH = 10 , kTHRESHOLDED_RELU = 11 , kGELU_ERF = 12 , kGELU_TANH = 13 } |
Enumerates the types of activation to perform in an activation layer. More... | |
enum class | PaddingMode : int32_t { kEXPLICIT_ROUND_DOWN = 0 , kEXPLICIT_ROUND_UP = 1 , kSAME_UPPER = 2 , kSAME_LOWER = 3 } |
Enumerates the modes of padding to perform in convolution, deconvolution and pooling layer, padding mode takes precedence if setPaddingMode() and setPrePadding() are also used. More... | |
enum class | PoolingType : int32_t { kMAX = 0 , kAVERAGE = 1 , kMAX_AVERAGE_BLEND = 2 } |
The type of pooling to perform in a pooling layer. More... | |
enum class | ScaleMode : int32_t { kUNIFORM = 0 , kCHANNEL = 1 , kELEMENTWISE = 2 } |
Controls how shift, scale and power are applied in a Scale layer. More... | |
enum class | ElementWiseOperation : int32_t { kSUM = 0 , kPROD = 1 , kMAX = 2 , kMIN = 3 , kSUB = 4 , kDIV = 5 , kPOW = 6 , kFLOOR_DIV = 7 , kAND = 8 , kOR = 9 , kXOR = 10 , kEQUAL = 11 , kGREATER = 12 , kLESS = 13 } |
Enumerates the binary operations that may be performed by an ElementWise layer. More... | |
enum class | GatherMode : int32_t { kDEFAULT = 0 , kELEMENT = 1 , kND = 2 } |
Control form of IGatherLayer. More... | |
enum class | UnaryOperation : int32_t { kEXP = 0 , kLOG = 1 , kSQRT = 2 , kRECIP = 3 , kABS = 4 , kNEG = 5 , kSIN = 6 , kCOS = 7 , kTAN = 8 , kSINH = 9 , kCOSH = 10 , kASIN = 11 , kACOS = 12 , kATAN = 13 , kASINH = 14 , kACOSH = 15 , kATANH = 16 , kCEIL = 17 , kFLOOR = 18 , kERF = 19 , kNOT = 20 , kSIGN = 21 , kROUND = 22 , kISINF = 23 , kISNAN = 24 } |
Enumerates the unary operations that may be performed by a Unary layer. More... | |
enum class | ReduceOperation : int32_t { kSUM = 0 , kPROD = 1 , kMAX = 2 , kMIN = 3 , kAVG = 4 } |
Enumerates the reduce operations that may be performed by a Reduce layer. More... | |
enum class | SampleMode : int32_t { kSTRICT_BOUNDS = 0 , kWRAP = 1 , kCLAMP = 2 , kFILL = 3 , kREFLECT = 4 } |
Controls how ISliceLayer and IGridSample handle out-of-bounds coordinates. More... | |
enum class | TopKOperation : int32_t { kMAX = 0 , kMIN = 1 } |
Enumerates the operations that may be performed by a TopK layer. More... | |
enum class | MatrixOperation : int32_t { kNONE = 0 , kTRANSPOSE = 1 , kVECTOR = 2 } |
Enumerates the operations that may be performed on a tensor by IMatrixMultiplyLayer before multiplication. More... | |
enum class | InterpolationMode : int32_t { kNEAREST = 0 , kLINEAR = 1 , kCUBIC = 2 } |
Enumerates various modes of interpolation. More... | |
enum class | ResizeCoordinateTransformation : int32_t { kALIGN_CORNERS = 0 , kASYMMETRIC = 1 , kHALF_PIXEL = 2 } |
The resize coordinate transformation function. More... | |
enum class | ResizeSelector : int32_t { kFORMULA = 0 , kUPPER = 1 } |
The coordinate selector when resize to single pixel output. More... | |
enum class | ResizeRoundMode : int32_t { kHALF_UP = 0 , kHALF_DOWN = 1 , kFLOOR = 2 , kCEIL = 3 } |
The rounding mode for nearest neighbor resize. More... | |
enum class | LoopOutput : int32_t { kLAST_VALUE = 0 , kCONCATENATE = 1 , kREVERSE = 2 } |
enum class | TripLimit : int32_t { kCOUNT = 0 , kWHILE = 1 } |
enum class | FillOperation : int32_t { kLINSPACE = 0 , kRANDOM_UNIFORM = 1 , kRANDOM_NORMAL = 2 } |
Enumerates the tensor fill operations that may performed by a fill layer. More... | |
enum class | ScatterMode : int32_t { kELEMENT = 0 , kND = 1 } |
Control form of IScatterLayer. More... | |
enum class | BoundingBoxFormat : int32_t { kCORNER_PAIRS = 0 , kCENTER_SIZES = 1 } |
Representation of bounding box data used for the Boxes input tensor in INMSLayer. More... | |
enum class | CalibrationAlgoType : int32_t { kLEGACY_CALIBRATION = 0 , kENTROPY_CALIBRATION = 1 , kENTROPY_CALIBRATION_2 = 2 , kMINMAX_CALIBRATION = 3 } |
Version of calibration algorithm to use. More... | |
enum class | QuantizationFlag : int32_t { kCALIBRATE_BEFORE_FUSION = 0 } |
List of valid flags for quantizing the network to int8. More... | |
enum class | RuntimePlatform : int32_t { kSAME_AS_BUILD = 0 , kWINDOWS_AMD64 = 1 } |
Describes the intended runtime platform (operating system and CPU architecture) for the execution of the TensorRT engine. TensorRT provides support for cross-platform engine compatibility when the target runtime platform is different from the build platform. More... | |
enum class | BuilderFlag : int32_t { kFP16 = 0 , kINT8 = 1 , kDEBUG = 2 , kGPU_FALLBACK = 3 , kREFIT = 4 , kDISABLE_TIMING_CACHE = 5 , kTF32 = 6 , kSPARSE_WEIGHTS = 7 , kSAFETY_SCOPE = 8 , kOBEY_PRECISION_CONSTRAINTS = 9 , kPREFER_PRECISION_CONSTRAINTS = 10 , kDIRECT_IO = 11 , kREJECT_EMPTY_ALGORITHMS = 12 , kVERSION_COMPATIBLE = 13 , kEXCLUDE_LEAN_RUNTIME = 14 , kFP8 = 15 , kERROR_ON_TIMING_CACHE_MISS = 16 , kBF16 = 17 , kDISABLE_COMPILATION_CACHE = 18 , kSTRIP_PLAN = 19 , kWEIGHTLESS = kSTRIP_PLAN , kREFIT_IDENTICAL = 20 , kWEIGHT_STREAMING = 21 , kINT4 = 22 , kREFIT_INDIVIDUAL = 23 } |
List of valid modes that the builder can enable when creating an engine from a network definition. More... | |
enum class | MemoryPoolType : int32_t { kWORKSPACE = 0 , kDLA_MANAGED_SRAM = 1 , kDLA_LOCAL_DRAM = 2 , kDLA_GLOBAL_DRAM = 3 , kTACTIC_DRAM = 4 , kTACTIC_SHARED_MEMORY = 5 } |
The type for memory pools used by TensorRT. More... | |
enum class | PreviewFeature : int32_t { kPROFILE_SHARING_0806 = 0 , kALIASED_PLUGIN_IO_10_03 = 1 } |
Define preview features. More... | |
enum class | HardwareCompatibilityLevel : int32_t { kNONE = 0 , kAMPERE_PLUS = 1 } |
Describes requirements of compatibility with GPU architectures other than that of the GPU on which the engine was built. More... | |
enum class | NetworkDefinitionCreationFlag : int32_t { kEXPLICIT_BATCH = 0 , kSTRONGLY_TYPED = 1 } |
List of immutable network properties expressed at network creation time. NetworkDefinitionCreationFlag is used with createNetworkV2() to specify immutable properties of the network. More... | |
enum class | EngineCapability : int32_t { kSTANDARD = 0 , kSAFETY = 1 , kDLA_STANDALONE = 2 } |
List of supported engine capability flows. More... | |
enum class | DimensionOperation : int32_t { kSUM = 0 , kPROD = 1 , kMAX = 2 , kMIN = 3 , kSUB = 4 , kEQUAL = 5 , kLESS = 6 , kFLOOR_DIV = 7 , kCEIL_DIV = 8 } |
An operation on two IDimensionExpr, which represent integer expressions used in dimension computations. More... | |
enum class | TensorLocation : int32_t { kDEVICE = 0 , kHOST = 1 } |
The location for tensor data storage, device or host. More... | |
enum class | WeightsRole : int32_t { kKERNEL = 0 , kBIAS = 1 , kSHIFT = 2 , kSCALE = 3 , kCONSTANT = 4 , kANY = 5 } |
How a layer uses particular Weights. More... | |
enum class | DeviceType : int32_t { kGPU = 0 , kDLA = 1 } |
The device that this layer/network will execute on. More... | |
enum class | TempfileControlFlag : int32_t { kALLOW_IN_MEMORY_FILES = 0 , kALLOW_TEMPORARY_FILES = 1 } |
Flags used to control TensorRT's behavior when creating executable temporary files. More... | |
enum class | OptProfileSelector : int32_t { kMIN = 0 , kOPT = 1 , kMAX = 2 } |
When setting or querying optimization profile parameters (such as shape tensor inputs or dynamic dimensions), select whether we are interested in the minimum, optimum, or maximum values for these parameters. The minimum and maximum specify the permitted range that is supported at runtime, while the optimum value is used for the kernel selection. This should be the "typical" value that is expected to occur at runtime. More... | |
enum class | TacticSource : int32_t { kCUBLAS = 0 , kCUBLAS_LT = 1 , kCUDNN = 2 , kEDGE_MASK_CONVOLUTIONS = 3 , kJIT_CONVOLUTIONS = 4 } |
List of tactic sources for TensorRT. More... | |
enum class | ProfilingVerbosity : int32_t { kLAYER_NAMES_ONLY = 0 , kNONE = 1 , kDETAILED = 2 } |
List of verbosity levels of layer information exposed in NVTX annotations and in IEngineInspector. More... | |
enum class | SerializationFlag : int32_t { kEXCLUDE_WEIGHTS = 0 , kEXCLUDE_LEAN_RUNTIME = 1 } |
List of valid flags that the engine can enable when serializing the bytes. More... | |
enum class | ExecutionContextAllocationStrategy : int32_t { kSTATIC = 0 , kON_PROFILE_CHANGE = 1 , kUSER_MANAGED = 2 } |
Different memory allocation behaviors for IExecutionContext. More... | |
enum class | LayerInformationFormat : int32_t { kONELINE = 0 , kJSON = 1 } |
The format in which the IEngineInspector prints the layer information. More... | |
enum class | DataType : int32_t { kFLOAT = 0 , kHALF = 1 , kINT8 = 2 , kINT32 = 3 , kBOOL = 4 , kUINT8 = 5 , kFP8 = 6 , kBF16 = 7 , kINT64 = 8 , kINT4 = 9 } |
The type of weights and tensors. More... | |
enum class | TensorFormat : int32_t { kLINEAR = 0 , kCHW2 = 1 , kHWC8 = 2 , kCHW4 = 3 , kCHW16 = 4 , kCHW32 = 5 , kDHWC8 = 6 , kCDHW32 = 7 , kHWC = 8 , kDLA_LINEAR = 9 , kDLA_HWC4 = 10 , kHWC16 = 11 , kDHWC = 12 } |
Format of the input/output tensors. More... | |
enum class | APILanguage : int32_t { kCPP = 0 , kPYTHON = 1 } |
Programming language used in the implementation of a TRT interface. More... | |
enum class | AllocatorFlag : int32_t { kRESIZABLE = 0 } |
Allowed type of memory allocation. More... | |
enum class | ErrorCode : int32_t { kSUCCESS = 0 , kUNSPECIFIED_ERROR = 1 , kINTERNAL_ERROR = 2 , kINVALID_ARGUMENT = 3 , kINVALID_CONFIG = 4 , kFAILED_ALLOCATION = 5 , kFAILED_INITIALIZATION = 6 , kFAILED_EXECUTION = 7 , kFAILED_COMPUTATION = 8 , kINVALID_STATE = 9 , kUNSUPPORTED_STATE = 10 } |
Error codes that can be returned by TensorRT during execution. More... | |
enum class | TensorIOMode : int32_t { kNONE = 0 , kINPUT = 1 , kOUTPUT = 2 } |
Definition of tensor IO Mode. More... | |
enum class | PluginVersion : uint8_t { kV2 = 0 , kV2_EXT = 1 , kV2_IOEXT = 2 , kV2_DYNAMICEXT = 3 , kV2_DYNAMICEXT_PYTHON = kPLUGIN_VERSION_PYTHON_BIT | 3 } |
enum class | PluginCreatorVersion : int32_t { kV1 = 0 , kV1_PYTHON = kPLUGIN_VERSION_PYTHON_BIT } |
Enum to identify version of the plugin creator. More... | |
enum class | PluginFieldType : int32_t { kFLOAT16 = 0 , kFLOAT32 = 1 , kFLOAT64 = 2 , kINT8 = 3 , kINT16 = 4 , kINT32 = 5 , kCHAR = 6 , kDIMS = 7 , kUNKNOWN = 8 , kBF16 = 9 , kINT64 = 10 , kFP8 = 11 , kINT4 = 12 } |
The possible field types for custom layer. More... | |
enum class | PluginCapabilityType : int32_t { kCORE = 0 , kBUILD = 1 , kRUNTIME = 2 } |
Enumerates the different capability types a IPluginV3 object may have. More... | |
enum class | TensorRTPhase : int32_t { kBUILD = 0 , kRUNTIME = 1 } |
Indicates a phase of operation of TensorRT. More... | |
Functions | |
template<> | |
constexpr int32_t | EnumMax< LayerType > () noexcept |
template<> | |
constexpr int32_t | EnumMax< ScaleMode > () noexcept |
template<> | |
constexpr int32_t | EnumMax< GatherMode > () noexcept |
template<> | |
constexpr int32_t | EnumMax< UnaryOperation > () noexcept |
template<> | |
constexpr int32_t | EnumMax< ReduceOperation > () noexcept |
template<> | |
constexpr int32_t | EnumMax< SampleMode > () noexcept |
template<> | |
constexpr int32_t | EnumMax< TopKOperation > () noexcept |
template<> | |
constexpr int32_t | EnumMax< MatrixOperation > () noexcept |
template<> | |
constexpr int32_t | EnumMax< LoopOutput > () noexcept |
template<> | |
constexpr int32_t | EnumMax< TripLimit > () noexcept |
template<> | |
constexpr int32_t | EnumMax< FillOperation > () noexcept |
template<> | |
constexpr int32_t | EnumMax< ScatterMode > () noexcept |
template<> | |
constexpr int32_t | EnumMax< BoundingBoxFormat > () noexcept |
template<> | |
constexpr int32_t | EnumMax< CalibrationAlgoType > () noexcept |
template<> | |
constexpr int32_t | EnumMax< QuantizationFlag > () noexcept |
template<> | |
constexpr int32_t | EnumMax< BuilderFlag > () noexcept |
template<> | |
constexpr int32_t | EnumMax< MemoryPoolType > () noexcept |
template<> | |
constexpr int32_t | EnumMax< NetworkDefinitionCreationFlag > () noexcept |
nvinfer1::IPluginRegistry * | getBuilderPluginRegistry (nvinfer1::EngineCapability capability) noexcept |
Return the plugin registry for building a Standard engine, or nullptr if no registry exists. More... | |
nvinfer1::safe::IPluginRegistry * | getBuilderSafePluginRegistry (nvinfer1::EngineCapability capability) noexcept |
Return the plugin registry for building a Safety engine, or nullptr if no registry exists. More... | |
template<> | |
constexpr int32_t | EnumMax< DimensionOperation > () noexcept |
Maximum number of elements in DimensionOperation enum. More... | |
template<> | |
constexpr int32_t | EnumMax< WeightsRole > () noexcept |
Maximum number of elements in WeightsRole enum. More... | |
template<> | |
constexpr int32_t | EnumMax< DeviceType > () noexcept |
Maximum number of elements in DeviceType enum. More... | |
template<> | |
constexpr int32_t | EnumMax< TempfileControlFlag > () noexcept |
Maximum number of elements in TempfileControlFlag enum. More... | |
template<> | |
constexpr int32_t | EnumMax< OptProfileSelector > () noexcept |
Number of different values of OptProfileSelector enum. More... | |
template<> | |
constexpr int32_t | EnumMax< TacticSource > () noexcept |
Maximum number of tactic sources in TacticSource enum. More... | |
template<> | |
constexpr int32_t | EnumMax< ProfilingVerbosity > () noexcept |
Maximum number of profile verbosity levels in ProfilingVerbosity enum. More... | |
template<> | |
constexpr int32_t | EnumMax< SerializationFlag > () noexcept |
Maximum number of serialization flags in SerializationFlag enum. More... | |
template<> | |
constexpr int32_t | EnumMax< ExecutionContextAllocationStrategy > () noexcept |
Maximum number of memory allocation strategies in ExecutionContextAllocationStrategy enum. More... | |
template<> | |
constexpr int32_t | EnumMax< LayerInformationFormat > () noexcept |
template<typename T > | |
constexpr int32_t | EnumMax () noexcept |
Maximum number of elements in an enumeration type. More... | |
The TensorRT API version 1 namespace.
using nvinfer1::AllocatorFlags = typedef uint32_t |
using nvinfer1::AsciiChar = typedef char_t |
AsciiChar is the type used by TensorRT to represent valid ASCII characters. This type is widely used in automotive safety context.
using nvinfer1::BuilderFlags = typedef uint32_t |
Represents one or more BuilderFlag values using binary OR operations, e.g., 1U << BuilderFlag::kFP16 | 1U << BuilderFlag::kDEBUG.
using nvinfer1::char_t = typedef char |
char_t is the type used by TensorRT to represent all valid characters.
using nvinfer1::Dims = typedef Dims64 |
Alias for Dims64.
using nvinfer1::IAlgorithmSelector = typedef v_1_0::IAlgorithmSelector |
using nvinfer1::IDebugListener = typedef v_1_0::IDebugListener |
using nvinfer1::IGpuAllocator = typedef v_1_0::IGpuAllocator |
using nvinfer1::IGpuAsyncAllocator = typedef v_1_0::IGpuAsyncAllocator |
using nvinfer1::IInt8EntropyCalibrator = typedef v_1_0::IInt8EntropyCalibrator |
using nvinfer1::IInt8EntropyCalibrator2 = typedef v_1_0::IInt8EntropyCalibrator2 |
using nvinfer1::IInt8LegacyCalibrator = typedef v_1_0::IInt8LegacyCalibrator |
using nvinfer1::IInt8MinMaxCalibrator = typedef v_1_0::IInt8MinMaxCalibrator |
using nvinfer1::InterfaceKind = typedef char const* |
using nvinfer1::IOutputAllocator = typedef v_1_0::IOutputAllocator |
using nvinfer1::IPluginCapability = typedef v_1_0::IPluginCapability |
using nvinfer1::IPluginCreator = typedef v_1_0::IPluginCreator |
using nvinfer1::IPluginCreatorInterface = typedef v_1_0::IPluginCreatorInterface |
using nvinfer1::IPluginCreatorV3One = typedef v_1_0::IPluginCreatorV3One |
using nvinfer1::IPluginResource = typedef v_1_0::IPluginResource |
using nvinfer1::IPluginV3 = typedef v_1_0::IPluginV3 |
using nvinfer1::IPluginV3OneBuild = typedef v_1_0::IPluginV3OneBuild |
using nvinfer1::IPluginV3OneBuildV2 = typedef v_2_0::IPluginV3OneBuild |
using nvinfer1::IPluginV3OneCore = typedef v_1_0::IPluginV3OneCore |
using nvinfer1::IPluginV3OneRuntime = typedef v_1_0::IPluginV3OneRuntime |
using nvinfer1::IProfiler = typedef v_1_0::IProfiler |
using nvinfer1::IProgressMonitor = typedef v_1_0::IProgressMonitor |
using nvinfer1::IStreamReader = typedef v_1_0::IStreamReader |
using nvinfer1::NetworkDefinitionCreationFlags = typedef uint32_t |
Represents one or more NetworkDefinitionCreationFlag flags using binary OR operations. e.g., 1U << NetworkDefinitionCreationFlag::kSTRONGLY_TYPED.
using nvinfer1::PluginFormat = typedef TensorFormat |
PluginFormat is reserved for backward compatibility.
using nvinfer1::QuantizationFlags = typedef uint32_t |
Represents one or more QuantizationFlag values using binary OR operations.
using nvinfer1::SerializationFlags = typedef uint32_t |
Represents one or more SerializationFlag values using binary OR operations, e.g., 1U << SerializationFlag::kEXCLUDE_LEAN_RUNTIME.
using nvinfer1::TacticSources = typedef uint32_t |
Represents a collection of one or more TacticSource values combine using bitwise-OR operations.
using nvinfer1::TempfileControlFlags = typedef uint32_t |
Represents a collection of one or more TempfileControlFlag values combined using bitwise-OR operations.
using nvinfer1::TensorFormats = typedef uint32_t |
It is capable of representing one or more TensorFormat by binary OR operations, e.g., 1U << TensorFormat::kCHW4 | 1U << TensorFormat::kCHW32.
|
strong |
Enumerates the types of activation to perform in an activation layer.
|
strong |
|
strong |
|
strong |
Representation of bounding box data used for the Boxes input tensor in INMSLayer.
|
strong |
List of valid modes that the builder can enable when creating an engine from a network definition.
Enumerator | |
---|---|
kFP16 | Enable FP16 layer selection, with FP32 fallback. |
kINT8 | Enable Int8 layer selection, with FP32 fallback with FP16 fallback if kFP16 also specified. |
kDEBUG | Enable debugging of layers via synchronizing after every layer. |
kGPU_FALLBACK | Enable layers marked to execute on GPU if layer cannot execute on DLA. |
kREFIT | Enable building a refittable engine. |
kDISABLE_TIMING_CACHE | Disable reuse of timing information across identical layers. |
kTF32 | Allow (but not require) computations on tensors of type DataType::kFLOAT to use TF32. TF32 computes inner products by rounding the inputs to 10-bit mantissas before multiplying, but accumulates the sum using 23-bit mantissas. Enabled by default. |
kSPARSE_WEIGHTS | Allow the builder to examine weights and use optimized functions when weights have suitable sparsity. |
kSAFETY_SCOPE | Change the allowed parameters in the EngineCapability::kSTANDARD flow to match the restrictions that EngineCapability::kSAFETY check against for DeviceType::kGPU and EngineCapability::kDLA_STANDALONE check against the DeviceType::kDLA case. This flag is forced to true if EngineCapability::kSAFETY at build time if it is unset. This flag is only supported in NVIDIA Drive(R) products. |
kOBEY_PRECISION_CONSTRAINTS | Require that layers execute in specified precisions. Build fails otherwise. |
kPREFER_PRECISION_CONSTRAINTS | Prefer that layers execute in specified precisions. Fall back (with warning) to another precision if build would otherwise fail. |
kDIRECT_IO | Require that no reformats be inserted between a layer and a network I/O tensor for which ITensor::setAllowedFormats was called. Build fails if a reformat is required for functional correctness. |
kREJECT_EMPTY_ALGORITHMS | Fail if IAlgorithmSelector::selectAlgorithms returns an empty set of algorithms. |
kVERSION_COMPATIBLE | Restrict to lean runtime operators to provide version forward compatibility for the plan. This flag is only supported by NVIDIA Volta and later GPUs. This flag is not supported in NVIDIA Drive(R) products. |
kEXCLUDE_LEAN_RUNTIME | Exclude lean runtime from the plan when version forward compatability is enabled. By default, this flag is unset, so the lean runtime will be included in the plan. If BuilderFlag::kVERSION_COMPATIBLE is not set then the value of this flag will be ignored. |
kFP8 | Enable plugins with FP8 input/output. This flag is not supported with hardware-compatibility mode. \see HardwareCompatibilityLevel |
kERROR_ON_TIMING_CACHE_MISS | Emit error when a tactic being timed is not present in the timing cache. This flag has an effect only when IBuilderConfig has an associated ITimingCache. |
kBF16 | Enable DataType::kBF16 layer selection, with FP32 fallback. This flag is only supported by NVIDIA Ampere and later GPUs. |
kDISABLE_COMPILATION_CACHE | Disable caching of JIT-compilation results during engine build. By default, JIT-compiled code will be serialized as part of the timing cache, which may significantly increase the cache size. Setting this flag prevents the code from being serialized. This flag has an effect only when BuilderFlag::DISABLE_TIMING_CACHE is not set. |
kSTRIP_PLAN | Strip the refittable weights from the engine plan file. |
kWEIGHTLESS |
|
kREFIT_IDENTICAL | Create a refittable engine under the assumption that the refit weights will be identical to those provided at build time. The resulting engine will have the same performance as a non-refittable one. All refittable weights can be refitted through the refit API, but if the refit weights are not identical to the build-time weights, behavior is undefined. When used alongside 'kSTRIP_PLAN', this flag will result in a small plan file for which weights are later supplied via refitting. This enables use of a single set of weights with different inference backends, or with TensorRT plans for multiple GPU architectures. |
kWEIGHT_STREAMING | Enable weight streaming for the current engine. Weight streaming from the host enables execution of models that do not fit in GPU memory by allowing TensorRT to intelligently stream network weights from the CPU DRAM. Please see ICudaEngine::getMinimumWeightStreamingBudget for the default memory budget when this flag is enabled. Enabling this feature changes the behavior of IRuntime::deserializeCudaEngine to allocate the entire network’s weights on the CPU DRAM instead of GPU memory. Then, ICudaEngine::createExecutionContext will determine the optimal split of weights between the CPU and GPU and place weights accordingly. Future TensorRT versions may enable this flag by default.
|
kINT4 | Enable plugins with INT4 input/output. |
kREFIT_INDIVIDUAL | Enable building a refittable engine and provide fine-grained control. This allows control over which weights are refittable or not using INetworkDefinition::markWeightsRefittable and INetworkDefinition::unmarkWeightsRefittable. By default, all weights are non-refittable when this flag is enabled. This flag cannot be used together with kREFIT or kREFIT_IDENTICAL. |
|
strong |
Version of calibration algorithm to use.
Enumerator | |
---|---|
kLEGACY_CALIBRATION | Legacy calibration. |
kENTROPY_CALIBRATION | Legacy entropy calibration. |
kENTROPY_CALIBRATION_2 | Entropy calibration. |
kMINMAX_CALIBRATION | Minmax calibration. |
|
strong |
The type of weights and tensors.
|
strong |
|
strong |
An operation on two IDimensionExpr, which represent integer expressions used in dimension computations.
For example, given two IDimensionExpr x and y and an IExprBuilder& eb, eb.operation(DimensionOperation::kSUM, x, y) creates a representation of x+y.
|
strong |
Enumerates the binary operations that may be performed by an ElementWise layer.
Operations kAND, kOR, and kXOR must have inputs of DataType::kBOOL.
Operation kPOW must have inputs of floating-point type or DataType::kINT8.
All other operations must have inputs of floating-point type, DataType::kINT8, DataType::kINT32, or DataType::kINT64.
|
strong |
List of supported engine capability flows.
The EngineCapability determines the restrictions of a network during build time and what runtime it targets. When BuilderFlag::kSAFETY_SCOPE is not set (by default), EngineCapability::kSTANDARD does not provide any restrictions on functionality and the resulting serialized engine can be executed with TensorRT's standard runtime APIs in the nvinfer1 namespace. EngineCapability::kSAFETY provides a restricted subset of network operations that are safety certified and the resulting serialized engine can be executed with TensorRT's safe runtime APIs in the nvinfer1::safe namespace. EngineCapability::kDLA_STANDALONE provides a restricted subset of network operations that are DLA compatible and the resulting serialized engine can be executed using standalone DLA runtime APIs. See sampleCudla for an example of integrating cuDLA APIs with TensorRT APIs.
Enumerator | |
---|---|
kSTANDARD | Standard: TensorRT flow without targeting the safety runtime. This flow supports both DeviceType::kGPU and DeviceType::kDLA. |
kSAFETY | Safety: TensorRT flow with restrictions targeting the safety runtime. See safety documentation for list of supported layers and formats. This flow supports only DeviceType::kGPU. This flag is only supported in NVIDIA Drive(R) products. |
kDLA_STANDALONE | DLA Standalone: TensorRT flow with restrictions targeting external, to TensorRT, DLA runtimes. See DLA documentation for list of supported layers and formats. This flow supports only DeviceType::kDLA. |
|
strong |
Error codes that can be returned by TensorRT during execution.
|
strong |
Different memory allocation behaviors for IExecutionContext.
IExecutionContext requires a block of device memory for internal activation tensors during inference. The user can either let the execution context manage the memory in various ways or allocate the memory themselves.
|
strong |
Enumerates the tensor fill operations that may performed by a fill layer.
Enumerator | |
---|---|
kLINSPACE | Compute each value via an affine function of its indices. For example, suppose the parameters for the IFillLayer are:
Element [i,j] of the output is Alpha + Beta[0]*i + Beta[1]*j. Thus the output matrix is: 1 11 21 31 101 111 121 131 201 211 221 231 A static beta b is implicitly a 1D tensor, i.e. Beta = [b]. |
kRANDOM_UNIFORM | Randomly draw values from a uniform distribution. |
kRANDOM_NORMAL | Randomly draw values from a normal distribution. |
|
strong |
Control form of IGatherLayer.
Enumerator | |
---|---|
kDEFAULT | Similar to ONNX Gather. |
kELEMENT | Similar to ONNX GatherElements. |
kND | Similar to ONNX GatherND. |
|
strong |
Describes requirements of compatibility with GPU architectures other than that of the GPU on which the engine was built.
Levels except kNONE are only supported for engines built on NVIDIA Ampere and later GPUs.
|
strong |
|
strong |
The format in which the IEngineInspector prints the layer information.
Enumerator | |
---|---|
kONELINE | Print layer information in one line per layer. |
kJSON | Print layer information in JSON format. |
|
strong |
The type values of layer classes.
|
strong |
|
strong |
Enumerates the operations that may be performed on a tensor by IMatrixMultiplyLayer before multiplication.
Enumerator | |
---|---|
kNONE | Treat x as a matrix if it has two dimensions, or as a collection of matrices if x has more than two dimensions, where the last two dimensions are the matrix dimensions. x must have at least two dimensions. |
kTRANSPOSE | Like kNONE, but transpose the matrix dimensions. |
kVECTOR | Treat x as a vector if it has one dimension, or as a collection of vectors if x has more than one dimension. x must have at least one dimension. The first input tensor with dimensions [M,K] used with MatrixOperation::kVECTOR is equivalent to a tensor with dimensions [M, 1, K] with MatrixOperation::kNONE, i.e. is treated as M row vectors of length K, or dimensions [M, K, 1] with MatrixOperation::kTRANSPOSE. The second input tensor with dimensions [M,K] used with MatrixOperation::kVECTOR is equivalent to a tensor with dimensions [M, K, 1] with MatrixOperation::kNONE, i.e. is treated as M column vectors of length K, or dimensions [M, 1, K] with MatrixOperation::kTRANSPOSE. |
|
strong |
The type for memory pools used by TensorRT.
Enumerator | |
---|---|
kWORKSPACE | kWORKSPACE is used by TensorRT to store intermediate buffers within an operation. This defaults to max device memory. Set to a smaller value to restrict tactics that use over the threshold en masse. For more targeted removal of tactics use the IAlgorithmSelector interface. |
kDLA_MANAGED_SRAM | kDLA_MANAGED_SRAM is a fast software managed RAM used by DLA to communicate within a layer. The size of this pool must be at least 4 KiB and must be a power of 2. This defaults to 1 MiB. Orin has capacity of 1 MiB per core. |
kDLA_LOCAL_DRAM | kDLA_LOCAL_DRAM is host RAM used by DLA to share intermediate tensor data across operations. The size of this pool must be at least 4 KiB and must be a power of 2. This defaults to 1 GiB. |
kDLA_GLOBAL_DRAM | kDLA_GLOBAL_DRAM is host RAM used by DLA to store weights and metadata for execution. The size of this pool must be at least 4 KiB and must be a power of 2. This defaults to 512 MiB. |
kTACTIC_DRAM | kTACTIC_DRAM is the device DRAM used by the optimizer to run tactics. On embedded devices, where host and device memory are unified, this includes all host memory required by TensorRT to build the network up to the point of each memory allocation. This defaults to 75% of totalGlobalMem as reported by cudaGetDeviceProperties when cudaGetDeviceProperties.embedded is true, and 100% otherwise. |
kTACTIC_SHARED_MEMORY | kTACTIC_SHARED_MEMORY defines the maximum sum of shared memory reserved by the driver and used for executing CUDA kernels. Adjust this value to restrict tactics that exceed the specified threshold en masse. The default value is device max capability. This value must be less than 1GiB. The driver reserved shared memory can be queried from cuDeviceGetAttribute(&reservedShmem, CU_DEVICE_ATTRIBUTE_RESERVED_SHARED_MEMORY_PER_BLOCK). Updating this flag will override the shared memory limit set by HardwareCompatibilityLevel, which defaults to 48KiB - reservedShmem. |
|
strong |
List of immutable network properties expressed at network creation time. NetworkDefinitionCreationFlag is used with createNetworkV2() to specify immutable properties of the network.
|
strong |
When setting or querying optimization profile parameters (such as shape tensor inputs or dynamic dimensions), select whether we are interested in the minimum, optimum, or maximum values for these parameters. The minimum and maximum specify the permitted range that is supported at runtime, while the optimum value is used for the kernel selection. This should be the "typical" value that is expected to occur at runtime.
|
strong |
Enumerates the modes of padding to perform in convolution, deconvolution and pooling layer, padding mode takes precedence if setPaddingMode() and setPrePadding() are also used.
There are two padding styles, EXPLICIT and SAME with each style having two variants. The EXPLICIT style determine if the final sampling location is used or not. The SAME style determine if the asymmetry in the padding is on the pre or post padding.
Formulas for Convolution:
Formulas for Deconvolution:
Formulas for Pooling:
Pooling Example 1:
Pooling Example 2:
|
strong |
Enumerates the different capability types a IPluginV3 object may have.
Enumerator | |
---|---|
kCORE | Core capability. Every IPluginV3 object must have this. |
kBUILD | Build capability. IPluginV3 objects provided to TensorRT build phase must have this. |
kRUNTIME | Runtime capability. IPluginV3 objects provided to TensorRT build and execution phases must have this. |
|
strong |
|
strong |
The possible field types for custom layer.
Enumerator | |
---|---|
kFLOAT16 | FP16 field type. |
kFLOAT32 | FP32 field type. |
kFLOAT64 | FP64 field type. |
kINT8 | INT8 field type. |
kINT16 | INT16 field type. |
kINT32 | INT32 field type. |
kCHAR | char field type. |
kDIMS | nvinfer1::Dims field type. |
kUNKNOWN | Unknown field type. |
kBF16 | BF16 field type. |
kINT64 | INT64 field type. |
kFP8 | FP8 field type. |
kINT4 | INT4 field type. |
|
strong |
|
strong |
|
strong |
Define preview features.
Preview Features have been fully tested but are not yet as stable as other features in TensorRT. They are provided as opt-in features for at least one release.
Enumerator | |
---|---|
kPROFILE_SHARING_0806 | Allows optimization profiles to be shared across execution contexts.
|
kALIASED_PLUGIN_IO_10_03 | Allows plugin I/O to be aliased when using IPluginV3OneBuildV2 |
|
strong |
List of verbosity levels of layer information exposed in NVTX annotations and in IEngineInspector.
|
strong |
List of valid flags for quantizing the network to int8.
Enumerator | |
---|---|
kCALIBRATE_BEFORE_FUSION | Run int8 calibration pass before layer fusion. Only valid for IInt8LegacyCalibrator and IInt8EntropyCalibrator. The builder always runs the int8 calibration pass before layer fusion for IInt8MinMaxCalibrator and IInt8EntropyCalibrator2. Disabled by default. |
|
strong |
Enumerates the reduce operations that may be performed by a Reduce layer.
The table shows the result of reducing across an empty volume of a given type.
Operation | kFLOAT and kHALF | kINT32 | kINT8 |
---|---|---|---|
kSUM | 0 | 0 | 0 |
kPROD | 1 | 1 | 1 |
kMAX | negative infinity | INT_MIN | -128 |
kMIN | positive infinity | INT_MAX | 127 |
kAVG | NaN | 0 | -128 |
The current version of TensorRT usually performs reduction for kINT8 via kFLOAT or kHALF. The kINT8 values show the quantized representations of the floating-point values.
Enumerator | |
---|---|
kSUM | |
kPROD | |
kMAX | |
kMIN | |
kAVG |
|
strong |
The resize coordinate transformation function.
|
strong |
|
strong |
|
strong |
Describes the intended runtime platform (operating system and CPU architecture) for the execution of the TensorRT engine. TensorRT provides support for cross-platform engine compatibility when the target runtime platform is different from the build platform.
|
strong |
Controls how ISliceLayer and IGridSample handle out-of-bounds coordinates.
|
strong |
Controls how shift, scale and power are applied in a Scale layer.
Enumerator | |
---|---|
kUNIFORM | Identical coefficients across all elements of the tensor. |
kCHANNEL | Per-channel coefficients. |
kELEMENTWISE | Elementwise coefficients. |
|
strong |
Control form of IScatterLayer.
Enumerator | |
---|---|
kELEMENT | Similar to ONNX ScatterElements. |
kND | Similar to ONNX ScatterND. |
|
strong |
|
strong |
List of tactic sources for TensorRT.
Enumerator | |
---|---|
kCUBLAS | cuBLAS tactics. Disabled by default.
|
kCUBLAS_LT | cuBLAS LT tactics. Disabled by default.
|
kCUDNN | cuDNN tactics. Disabled by default.
|
kEDGE_MASK_CONVOLUTIONS | Enables convolution tactics implemented with edge mask tables. These tactics tradeoff memory for performance by consuming additional memory space proportional to the input size. Enabled by default. |
kJIT_CONVOLUTIONS | Enables convolution tactics implemented with source-code JIT fusion. The engine building time may increase when this is enabled. Enabled by default. |
|
strong |
Flags used to control TensorRT's behavior when creating executable temporary files.
On some platforms the TensorRT runtime may need to create files in a temporary directory or use platform-specific APIs to create files in-memory to load temporary DLLs that implement runtime code. These flags allow the application to explicitly control TensorRT's use of these files. This will preclude the use of certain TensorRT APIs for deserializing and loading lean runtimes.
|
strong |
Format of the input/output tensors.
This enum is used by both plugins and network I/O tensors.
Many of the formats are vector-major or vector-minor. These formats specify a vector dimension and scalars per vector. For example, suppose that the tensor has has dimensions [M,N,C,H,W], the vector dimension is C and there are V scalars per vector.
In interfaces that refer to "components per element", that's the value of V above.
For more information about data formats, see the topic "Data Format Description" located in the TensorRT Developer Guide. https://docs.nvidia.com/deeplearning/tensorrt/developer-guide/index.html#data-format-desc
|
strong |
|
strong |
|
strong |
|
strong |
|
strong |
|
strong |
Enumerates the unary operations that may be performed by a Unary layer.
Operations kNOT must have inputs of DataType::kBOOL.
Operation kSIGN and kABS must have inputs of floating-point type, DataType::kINT8, DataType::kINT32 or DataType::kINT64.
Operation kISINF must have inputs of floating-point type.
All other operations must have inputs of floating-point type.
|
strong |
How a layer uses particular Weights.
The power weights of an IScaleLayer are omitted. Refitting those is not supported.
Enumerator | |
---|---|
kKERNEL | kernel for IConvolutionLayer or IDeconvolutionLayer |
kBIAS | bias for IConvolutionLayer or IDeconvolutionLayer |
kSHIFT | shift part of IScaleLayer |
kSCALE | scale part of IScaleLayer |
kCONSTANT | weights for IConstantLayer |
kANY | Any other weights role. |
|
constexprnoexcept |
Maximum number of elements in an enumeration type.
|
inlineconstexprnoexcept |
Maximum number of elements in BoundingBoxFormat enum.
|
inlineconstexprnoexcept |
Maximum number of builder flags in BuilderFlag enum.
|
inlineconstexprnoexcept |
Maximum number of elements in CalibrationAlgoType enum.
|
inlineconstexprnoexcept |
Maximum number of elements in DeviceType enum.
|
inlineconstexprnoexcept |
Maximum number of elements in DimensionOperation enum.
|
inlineconstexprnoexcept |
Maximum number of memory allocation strategies in ExecutionContextAllocationStrategy enum.
|
inlineconstexprnoexcept |
Maximum number of elements in FillOperation enum.
|
inlineconstexprnoexcept |
Maximum number of elements in GatherMode enum.
|
inlineconstexprnoexcept |
Maximum number of layer information formats in LayerInformationFormat enum.
|
inlineconstexprnoexcept |
Maximum number of elements in LayerType enum.
|
inlineconstexprnoexcept |
Maximum number of elements in LoopOutput enum.
|
inlineconstexprnoexcept |
Maximum number of elements in MatrixOperation enum.
|
inlineconstexprnoexcept |
Maximum number of memory pool types in the MemoryPoolType enum.
|
inlineconstexprnoexcept |
Maximum number of elements in NetworkDefinitionCreationFlag enum.
|
inlineconstexprnoexcept |
Number of different values of OptProfileSelector enum.
|
inlineconstexprnoexcept |
Maximum number of profile verbosity levels in ProfilingVerbosity enum.
|
inlineconstexprnoexcept |
Maximum number of quantization flags in QuantizationFlag enum.
|
inlineconstexprnoexcept |
Maximum number of elements in ReduceOperation enum.
|
inlineconstexprnoexcept |
Maximum number of elements in SampleMode enum.
|
inlineconstexprnoexcept |
Maximum number of elements in ScaleMode enum.
|
inlineconstexprnoexcept |
Maximum number of elements in ScatterMode enum.
|
inlineconstexprnoexcept |
Maximum number of serialization flags in SerializationFlag enum.
|
inlineconstexprnoexcept |
Maximum number of tactic sources in TacticSource enum.
|
inlineconstexprnoexcept |
Maximum number of elements in TempfileControlFlag enum.
|
inlineconstexprnoexcept |
Maximum number of elements in TopKOperation enum.
|
inlineconstexprnoexcept |
Maximum number of elements in TripLimit enum.
|
inlineconstexprnoexcept |
Maximum number of elements in UnaryOperation enum.
|
inlineconstexprnoexcept |
Maximum number of elements in WeightsRole enum.
|
noexcept |
Return the plugin registry for building a Standard engine, or nullptr if no registry exists.
Also return nullptr if the input argument is not EngineCapability::kSTANDARD. Engine capabilities EngineCapability::kSTANDARD and EngineCapability::kSAFETY have distinct plugin registries. When building a Safety engine, use nvinfer1::getBuilderSafePluginRegistry(). Use IPluginRegistry::registerCreator from the registry to register plugins. Plugins registered in a registry associated with a specific engine capability are only available when building engines with that engine capability.
There is no plugin registry for EngineCapability::kDLA_STANDALONE.
|
noexcept |
Return the plugin registry for building a Safety engine, or nullptr if no registry exists.
Also return nullptr if the input argument is not EngineCapability::kSAFETY. When building a Standard engine, use nvinfer1::getBuilderPluginRegistry(). Use safe::IPluginRegistry::registerCreator from the registry to register plugins.
Copyright © 2024 NVIDIA Corporation
Privacy Policy |
Manage My Privacy |
Do Not Sell or Share My Data |
Terms of Service |
Accessibility |
Corporate Policies |
Product Security |
Contact