CUpti_ActivityCdpKernel Struct Reference
[CUPTI Activity API]

The activity record for a CDP (CUDA Dynamic Parallelism) kernel.


Data Fields

int32_t  blockX
int32_t  blockY
int32_t  blockZ
uint64_t  completed
uint32_t  contextId
uint32_t  correlationId
uint32_t  deviceId
int32_t  dynamicSharedMemory
uint64_t  end
int64_t  gridId
int32_t  gridX
int32_t  gridY
int32_t  gridZ
CUpti_ActivityKind  kind
uint32_t  localMemoryPerThread
uint32_t  localMemoryTotal
const char *  name
uint32_t  parentBlockX
uint32_t  parentBlockY
uint32_t  parentBlockZ
int64_t  parentGridId
uint64_t  queued
uint16_t  registersPerThread
uint8_t  sharedMemoryConfig
uint64_t  start
int32_t  staticSharedMemory
uint32_t  streamId
uint64_t  submitted
uint8_t  executed:4
uint8_t  requested:4

Detailed Description

This activity record represents a CDP kernel execution.

Field Documentation

int32_t blockX

The X-dimension block size for the kernel.

int32_t blockY

The Y-dimension block size for the kernel.

int32_t blockZ

The Z-dimension block size for the kernel.

uint64_t completed

The timestamp when the kernel is marked as completed, in ns. A value of CUPTI_TIMESTAMP_UNKNOWN indicates that the completion time is unknown.

uint32_t contextId

The ID of the context where the kernel is executing.

uint32_t correlationId

The correlation ID of the kernel. Each kernel execution is assigned a unique correlation ID that is identical to the correlation ID in the driver API activity record that launched the kernel.

uint32_t deviceId

The ID of the device where the kernel is executing.

int32_t dynamicSharedMemory

The dynamic shared memory reserved for the kernel, in bytes.

uint64_t end

The end timestamp for the kernel execution, in ns. A value of 0 for both the start and end timestamps indicates that timestamp information could not be collected for the kernel.

uint8_t executed:4

The cache configuration used for the kernel. The value is one of the CUfunc_cache enumeration values from cuda.h.

int64_t gridId

The grid ID of the kernel. Each kernel execution is assigned a unique grid ID.

int32_t gridX

The X-dimension grid size for the kernel.

int32_t gridY

The Y-dimension grid size for the kernel.

int32_t gridZ

The Z-dimension grid size for the kernel.

CUpti_ActivityKind kind

The activity record kind; must be CUPTI_ACTIVITY_KIND_CDP_KERNEL.

uint32_t localMemoryPerThread

The amount of local memory reserved for each thread, in bytes.

uint32_t localMemoryTotal

The total amount of local memory reserved for the kernel, in bytes.

const char * name

The name of the kernel. This name is shared across all activity records representing the same kernel, and so should not be modified.

uint32_t parentBlockX

The X-dimension of the parent block.

uint32_t parentBlockY

The Y-dimension of the parent block.

uint32_t parentBlockZ

The Z-dimension of the parent block.

int64_t parentGridId

The grid ID of the parent kernel.

uint64_t queued

The timestamp when the kernel is queued, in ns. A value of CUPTI_TIMESTAMP_UNKNOWN indicates that the queued time is unknown.

uint16_t registersPerThread

The number of registers required for each thread executing the kernel.

uint8_t requested:4

The cache configuration requested by the kernel. The value is one of the CUfunc_cache enumeration values from cuda.h.

uint8_t sharedMemoryConfig

The shared memory configuration used for the kernel. The value is one of the CUsharedconfig enumeration values from cuda.h.

uint64_t start

The start timestamp for the kernel execution, in ns. A value of 0 for both the start and end timestamps indicates that timestamp information could not be collected for the kernel.

int32_t staticSharedMemory

The static shared memory allocated for the kernel, in bytes.

uint32_t streamId

The ID of the stream where the kernel is executing.

uint64_t submitted

The timestamp when the kernel is submitted to the GPU, in ns. A value of CUPTI_TIMESTAMP_UNKNOWN indicates that the submission time is unknown.
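
Example: interpreting the timestamp fields

The timestamp fields documented above use sentinel values, so a consumer has to check them before subtracting. The following is a minimal sketch, not CUPTI code: CdpKernelTimes is a hypothetical stand-in that holds only the timestamp fields of CUpti_ActivityCdpKernel so that the example compiles without the CUPTI headers, TS_UNKNOWN_PLACEHOLDER is an assumed placeholder for CUPTI_TIMESTAMP_UNKNOWN, and both helper function names are illustrative.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed placeholder for CUPTI_TIMESTAMP_UNKNOWN. Real code should use
 * the macro from cupti_activity.h instead of this value. */
#define TS_UNKNOWN_PLACEHOLDER 0ULL

/* Hypothetical stand-in with only the timestamp fields of
 * CUpti_ActivityCdpKernel; all values are in nanoseconds. */
typedef struct {
    uint64_t queued;
    uint64_t submitted;
    uint64_t start;
    uint64_t end;
    uint64_t completed;
} CdpKernelTimes;

/* Stores end - start in *out_ns and returns true. Returns false when
 * start and end are both 0 (no timing collected) or end precedes start. */
bool cdp_kernel_duration_ns(const CdpKernelTimes *t, uint64_t *out_ns)
{
    assert(t != NULL && out_ns != NULL);
    if (t->start == 0 && t->end == 0)
        return false;
    if (t->end < t->start)
        return false;
    *out_ns = t->end - t->start;
    return true;
}

/* Stores start - submitted in *out_ns and returns true. Returns false
 * when either timestamp is unavailable or the pair is inconsistent. */
bool cdp_kernel_submit_to_start_ns(const CdpKernelTimes *t, uint64_t *out_ns)
{
    assert(t != NULL && out_ns != NULL);
    if (t->submitted == TS_UNKNOWN_PLACEHOLDER)
        return false;
    if (t->start == 0 && t->end == 0)
        return false;
    if (t->start < t->submitted)
        return false;
    *out_ns = t->start - t->submitted;
    return true;
}
```

In a real tool the same checks would be applied directly to a CUpti_ActivityCdpKernel record once its kind has been confirmed as CUPTI_ACTIVITY_KIND_CDP_KERNEL.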

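Example: comparing the requested and executed cache configurations

The requested and executed fields are 4-bit fields holding CUfunc_cache values, so comparing them shows whether the kernel ran with the configuration it asked for. The sketch below is illustrative only: CdpCacheConfig is a hypothetical stand-in for those two fields (declared with unsigned int bit-fields for strict ISO C portability, whereas the record documents them as uint8_t), the CACHE_* constants are local mirrors of the CUfunc_cache enumerators in cuda.h, and the function names are assumptions.

```c
#include <assert.h>
#include <stddef.h>

/* Local mirrors of the CUfunc_cache enumerators from cuda.h
 * (CU_FUNC_CACHE_PREFER_NONE, _SHARED, _L1, _EQUAL). Real code should
 * include cuda.h and use the enumerators directly. */
enum {
    CACHE_PREFER_NONE   = 0,
    CACHE_PREFER_SHARED = 1,
    CACHE_PREFER_L1     = 2,
    CACHE_PREFER_EQUAL  = 3
};

/* Hypothetical stand-in for the two 4-bit cache configuration fields. */
typedef struct {
    unsigned int requested : 4;
    unsigned int executed  : 4;
} CdpCacheConfig;

/* Maps a cache configuration value to a short label for reports. */
const char *cache_config_name(unsigned int value)
{
    switch (value) {
    case CACHE_PREFER_NONE:   return "PREFER_NONE";
    case CACHE_PREFER_SHARED: return "PREFER_SHARED";
    case CACHE_PREFER_L1:     return "PREFER_L1";
    case CACHE_PREFER_EQUAL:  return "PREFER_EQUAL";
    default:                  return "unknown";
    }
}

/* Returns 1 when the configuration used differs from the one requested. */
int cache_config_overridden(const CdpCacheConfig *c)
{
    assert(c != NULL);
    return c->executed != c->requested;
}
```
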

Generated on Tue Jul 21 01:14:13 2020 for Cupti by  doxygen 1.5.8