TensorRT-RTX 1.1.1
nvinfer1::IAttentionInputLayer Class Reference

This layer represents an input to an attention subgraph. More...

#include <NvInfer.h>

Inheritance diagram for nvinfer1::IAttentionInputLayer:
nvinfer1::IAttentionBoundaryLayer nvinfer1::ILayer nvinfer1::INoCopy

Public Member Functions

void setInput (int32_t index, ITensor &tensor) noexcept
 Append or replace an input of this layer with a specific tensor. More...
 
- Public Member Functions inherited from nvinfer1::IAttentionBoundaryLayer
IAttentiongetAttention () const noexcept
 Get a pointer to the IAttention associated with this boundary layer. More...
 
- Public Member Functions inherited from nvinfer1::ILayer
LayerType getType () const noexcept
 Return the type of a layer. More...
 
void setName (char const *name) noexcept
 Set the name of a layer. More...
 
char const * getName () const noexcept
 Return the name of a layer. More...
 
int32_t getNbInputs () const noexcept
 Get the number of inputs of a layer. More...
 
ITensorgetInput (int32_t index) const noexcept
 Get the layer input corresponding to the given index. More...
 
int32_t getNbOutputs () const noexcept
 Get the number of outputs of a layer. More...
 
ITensorgetOutput (int32_t index) const noexcept
 Get the layer output corresponding to the given index. More...
 
void setInput (int32_t index, ITensor &tensor) noexcept
 Replace an input of this layer with a specific tensor. More...
 
TRT_DEPRECATED void setPrecision (DataType dataType) noexcept
 Set the preferred or required computational precision of this layer in a weakly-typed network. More...
 
DataType getPrecision () const noexcept
 get the computational precision of this layer More...
 
TRT_DEPRECATED bool precisionIsSet () const noexcept
 whether the computational precision has been set for this layer More...
 
TRT_DEPRECATED void resetPrecision () noexcept
 reset the computational precision for this layer More...
 
TRT_DEPRECATED void setOutputType (int32_t index, DataType dataType) noexcept
 Set the output type of this layer in a weakly-typed network. More...
 
DataType getOutputType (int32_t index) const noexcept
 get the output type of this layer More...
 
TRT_DEPRECATED bool outputTypeIsSet (int32_t index) const noexcept
 whether the output type has been set for this layer More...
 
TRT_DEPRECATED void resetOutputType (int32_t index) noexcept
 reset the output type for this layer More...
 
void setMetadata (char const *metadata) noexcept
 Set the metadata for this layer. More...
 
char const * getMetadata () const noexcept
 Get the metadata of the layer. More...
 

Protected Member Functions

virtual ~IAttentionInputLayer () noexcept=default
 
- Protected Member Functions inherited from nvinfer1::IAttentionBoundaryLayer
virtual ~IAttentionBoundaryLayer () noexcept=default
 
- Protected Member Functions inherited from nvinfer1::ILayer
virtual ~ILayer () noexcept=default
 
- Protected Member Functions inherited from nvinfer1::INoCopy
 INoCopy ()=default
 
virtual ~INoCopy ()=default
 
 INoCopy (INoCopy const &other)=delete
 
INoCopyoperator= (INoCopy const &other)=delete
 
 INoCopy (INoCopy &&other)=delete
 
INoCopyoperator= (INoCopy &&other)=delete
 

Protected Attributes

apiv::VAttentionInputLayer * mImpl
 
- Protected Attributes inherited from nvinfer1::IAttentionBoundaryLayer
apiv::VAttentionBoundaryLayer * mBoundary
 
- Protected Attributes inherited from nvinfer1::ILayer
apiv::VLayer * mLayer
 

Detailed Description

This layer represents an input to an attention subgraph.

This layer is automatically created when an IAttention is created. Clients typically do not deal with the layer directly, but instead specify its input via addAttention or IAttention::setInput.

An IAttentionInputLayer has three to four inputs and one output.

Constructor & Destructor Documentation

◆ ~IAttentionInputLayer()

virtual nvinfer1::IAttentionInputLayer::~IAttentionInputLayer ( )
protectedvirtualdefaultnoexcept

Member Function Documentation

◆ setInput()

void nvinfer1::ILayer::setInput ( int32_t  index,
ITensor tensor 
)
inlinenoexcept

Append or replace an input of this layer with a specific tensor.

Parameters
indexthe index of the input to modify.
tensorthe new input tensor

The indices are as follows:

Input 0 is the input query tensor. Input 1 is the input key tensor. Input 2 is the input value tensor. Input 3 is the optional mask tensor. setMask should be used instead of setInput

Member Data Documentation

◆ mImpl

apiv::VAttentionInputLayer* nvinfer1::IAttentionInputLayer::mImpl
protected

The documentation for this class was generated from the following file:

  Copyright © 2024 NVIDIA Corporation
  Privacy Policy | Manage My Privacy | Do Not Sell or Share My Data | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact