TensorRT-RTX 1.1.1
|
This layer represents an input to an attention subgraph. More...
#include <NvInfer.h>
Public Member Functions | |
void | setInput (int32_t index, ITensor &tensor) noexcept |
Append or replace an input of this layer with a specific tensor. More... | |
![]() | |
IAttention * | getAttention () const noexcept |
Get a pointer to the IAttention associated with this boundary layer. More... | |
![]() | |
LayerType | getType () const noexcept |
Return the type of a layer. More... | |
void | setName (char const *name) noexcept |
Set the name of a layer. More... | |
char const * | getName () const noexcept |
Return the name of a layer. More... | |
int32_t | getNbInputs () const noexcept |
Get the number of inputs of a layer. More... | |
ITensor * | getInput (int32_t index) const noexcept |
Get the layer input corresponding to the given index. More... | |
int32_t | getNbOutputs () const noexcept |
Get the number of outputs of a layer. More... | |
ITensor * | getOutput (int32_t index) const noexcept |
Get the layer output corresponding to the given index. More... | |
void | setInput (int32_t index, ITensor &tensor) noexcept |
Replace an input of this layer with a specific tensor. More... | |
TRT_DEPRECATED void | setPrecision (DataType dataType) noexcept |
Set the preferred or required computational precision of this layer in a weakly-typed network. More... | |
DataType | getPrecision () const noexcept |
get the computational precision of this layer More... | |
TRT_DEPRECATED bool | precisionIsSet () const noexcept |
whether the computational precision has been set for this layer More... | |
TRT_DEPRECATED void | resetPrecision () noexcept |
reset the computational precision for this layer More... | |
TRT_DEPRECATED void | setOutputType (int32_t index, DataType dataType) noexcept |
Set the output type of this layer in a weakly-typed network. More... | |
DataType | getOutputType (int32_t index) const noexcept |
get the output type of this layer More... | |
TRT_DEPRECATED bool | outputTypeIsSet (int32_t index) const noexcept |
whether the output type has been set for this layer More... | |
TRT_DEPRECATED void | resetOutputType (int32_t index) noexcept |
reset the output type for this layer More... | |
void | setMetadata (char const *metadata) noexcept |
Set the metadata for this layer. More... | |
char const * | getMetadata () const noexcept |
Get the metadata of the layer. More... | |
Protected Member Functions | |
virtual | ~IAttentionInputLayer () noexcept=default |
![]() | |
virtual | ~IAttentionBoundaryLayer () noexcept=default |
![]() | |
virtual | ~ILayer () noexcept=default |
![]() | |
INoCopy ()=default | |
virtual | ~INoCopy ()=default |
INoCopy (INoCopy const &other)=delete | |
INoCopy & | operator= (INoCopy const &other)=delete |
INoCopy (INoCopy &&other)=delete | |
INoCopy & | operator= (INoCopy &&other)=delete |
Protected Attributes | |
apiv::VAttentionInputLayer * | mImpl |
![]() | |
apiv::VAttentionBoundaryLayer * | mBoundary |
![]() | |
apiv::VLayer * | mLayer |
This layer represents an input to an attention subgraph.
This layer is automatically created when an IAttention
is created. Clients typically do not deal with the layer directly, but instead specify its input via addAttention
or IAttention::setInput
.
An IAttentionInputLayer has three to four inputs and one output.
|
protectedvirtualdefaultnoexcept |
|
inlinenoexcept |
Append or replace an input of this layer with a specific tensor.
index | the index of the input to modify. |
tensor | the new input tensor |
The indices are as follows:
Input 0 is the input query tensor. Input 1 is the input key tensor. Input 2 is the input value tensor. Input 3 is the optional mask tensor. setMask should be used instead of setInput
|
protected |
Copyright © 2024 NVIDIA Corporation
Privacy Policy |
Manage My Privacy |
Do Not Sell or Share My Data |
Terms of Service |
Accessibility |
Corporate Policies |
Product Security |
Contact