2. CUPTI Python API Reference

2.1. Documentation Issues

The CUPTI Python API Reference section of the document is automatically generated and has some issues:

  • All the CUPTI Python enumerations, functions and classes are listed together in a single section.

  • The members of the python classes are not adequately documented. To get more information for a member, please refer the CUPTI C documentation.

  • The kind member of python classes has type int, instead of cupti.cupti.ActivityKind. While using the kind member, please use cupti.cupti.ActivityKind to get the enum value.

  • The “See also” section for each CUPTI Python API function shows the corresponding CUPTI C API name. But it does not provide a link to the CUPTI C documentation.

  • The “See <data type name>” references in the generated documentation point to the original C data structures instead of their Python equivalents.

2.2. API Reference

exception cupti.cupti.cuptiError(status: int)

Bases: Exception

class cupti.cupti.ActivityAPI(size=1)

Bases: object

cbid

cbid : int

correlation_id

correlation_id : int

end

end : int

kind

kind : int

process_id

process_id : int

ptr

Get the pointer address to the data as Python :py:`int`.

return_value

return_value : int

start

start : int

thread_id

thread_id : int

class cupti.cupti.ActivityAttribute(value)

Bases: IntEnum

See CUpti_ActivityAttribute.

DEVICE_BUFFER_FORCE_INT = 2147483647
DEVICE_BUFFER_POOL_LIMIT = 2
DEVICE_BUFFER_PRE_ALLOCATE_VALUE = 6
DEVICE_BUFFER_SIZE = 0
DEVICE_BUFFER_SIZE_CDP = 1
MEM_ALLOCATION_TYPE_HOST_PINNED = 8
PER_THREAD_ACTIVITY_BUFFER = 9
PROFILING_SEMAPHORE_POOL_LIMIT = 4
PROFILING_SEMAPHORE_POOL_SIZE = 3
PROFILING_SEMAPHORE_PRE_ALLOCATE_VALUE = 7
ZEROED_OUT_ACTIVITY_BUFFER = 5
class cupti.cupti.ActivityAutoBoostState(size=1)

Bases: object

enabled

enabled : int

pid

pid : int

ptr

Get the pointer address to the data as Python :py:`int`.

class cupti.cupti.ActivityCdpKernel(size=1)

Bases: object

block_x

block_x : int

block_y

block_y : int

block_z

block_z : int

cache_config

cache_config : cupti.cupti.CacheConfig

completed

completed : int

context_id

context_id : int

correlation_id

correlation_id : int

device_id

device_id : int

dynamic_shared_memory

dynamic_shared_memory : int

end

end : int

grid_id

grid_id : int

grid_x

grid_x : int

grid_y

grid_y : int

grid_z

grid_z : int

kind

kind : int

local_memory_per_thread

local_memory_per_thread : int

local_memory_total

local_memory_total : int

name

name : str

pad

pad : int

parent_block_x

parent_block_x : int

parent_block_y

parent_block_y : int

parent_block_z

parent_block_z : int

parent_grid_id

parent_grid_id : int

ptr

Get the pointer address to the data as Python :py:`int`.

queued

queued : int

registers_per_thread

registers_per_thread : int

shared_memory_config

shared_memory_config : int

start

start : int

static_shared_memory

static_shared_memory : int

stream_id

stream_id : int

submitted

submitted : int

class cupti.cupti.ActivityComputeApiKind(value)

Bases: IntEnum

See CUpti_ActivityComputeApiKind.

CUDA = 1
CUDA_MPS = 2
FORCE_INT = 2147483647
UNKNOWN = 0
class cupti.cupti.ActivityContext3(size=1)

Bases: object

cig_mode

cig_mode : int

compute_api_kind

compute_api_kind : int

context_id

context_id : int

device_id

device_id : int

is_green_context

is_green_context : int

kind

kind : int

null_stream_id

null_stream_id : int

num_multiprocessors

num_multiprocessors : int

padding

padding : int

padding2

padding2 : int

parent_context_id

parent_context_id : int

ptr

Get the pointer address to the data as Python :py:`int`.

class cupti.cupti.ActivityCudaEvent2(size=1)

Bases: object

context_id

context_id : int

correlation_id

correlation_id : int

cuda_event_sync_id

cuda_event_sync_id : int

device_id

device_id : int

device_timestamp

device_timestamp : int

event_id

event_id : int

kind

kind : int

pad

pad : int

pad2

pad2 : int

ptr

Get the pointer address to the data as Python :py:`int`.

reserved0

reserved0 : int

stream_id

stream_id : int

class cupti.cupti.ActivityDevice5(size=1)

Bases: object

compute_capability_major

compute_capability_major (~_numpy.uint32):

compute_capability_minor

compute_capability_minor (~_numpy.uint32):

compute_instance_id

compute_instance_id (~_numpy.uint32):

constant_memory_size

constant_memory_size (~_numpy.uint32):

core_clock_rate

core_clock_rate (~_numpy.uint32):

ecc_enabled

ecc_enabled (~_numpy.uint32):

flags_

flags_ (~_numpy.int32):

global_memory_bandwidth

global_memory_bandwidth (~_numpy.uint64):

global_memory_size

global_memory_size (~_numpy.uint64):

gpu_instance_id

gpu_instance_id (~_numpy.uint32):

id

id (~_numpy.uint32):

is_cuda_visible

is_cuda_visible (~_numpy.uint8):

is_mig_enabled

is_mig_enabled (~_numpy.uint8):

is_numa_node

is_numa_node (~_numpy.uint32):

kind

kind (~_numpy.int32):

l2cache_size

l2cache_size (~_numpy.uint32):

max_block_dim_x

max_block_dim_x (~_numpy.uint32):

max_block_dim_y

max_block_dim_y (~_numpy.uint32):

max_block_dim_z

max_block_dim_z (~_numpy.uint32):

max_blocks_per_multiprocessor

max_blocks_per_multiprocessor (~_numpy.uint32):

max_grid_dim_x

max_grid_dim_x (~_numpy.uint32):

max_grid_dim_y

max_grid_dim_y (~_numpy.uint32):

max_grid_dim_z

max_grid_dim_z (~_numpy.uint32):

max_ipc

max_ipc (~_numpy.uint32):

max_registers_per_block

max_registers_per_block (~_numpy.uint32):

max_registers_per_multiprocessor

max_registers_per_multiprocessor (~_numpy.uint32):

max_shared_memory_per_block

max_shared_memory_per_block (~_numpy.uint32):

max_shared_memory_per_multiprocessor

max_shared_memory_per_multiprocessor (~_numpy.uint32):

max_threads_per_block

max_threads_per_block (~_numpy.uint32):

max_warps_per_multiprocessor

max_warps_per_multiprocessor (~_numpy.uint32):

mig_uuid
name

name (~_numpy.intp):

num_memcpy_engines

num_memcpy_engines (~_numpy.uint32):

num_multiprocessors

num_multiprocessors (~_numpy.uint32):

num_threads_per_warp

num_threads_per_warp (~_numpy.uint32):

numa_id

numa_id (~_numpy.uint32):

ptr

Get the pointer address to the data as Python :py:`int`.

uuid
class cupti.cupti.ActivityDeviceAttribute(size=1)

Bases: object

attribute

attribute : cupti.cupti.Attribute

device_id

device_id : int

flags_

flags_ : int

kind

kind : int

ptr

Get the pointer address to the data as Python :py:`int`.

value

value : cupti.cupti.Value

class cupti.cupti.ActivityDeviceGraphTrace(size=1)

Bases: object

context_id

context_id : int

device_id

device_id : int

device_launch_mode

device_launch_mode : int

end

end : int

graph_id

graph_id : int

kind

kind : int

launcher_graph_id

launcher_graph_id : int

ptr

Get the pointer address to the data as Python :py:`int`.

start

start : int

stream_id

stream_id : int

class cupti.cupti.ActivityEnvironment(size=1)

Bases: object

data

data : cupti.cupti.Data

device_id

device_id : int

environment_kind

environment_kind : int

kind

kind : int

ptr

Get the pointer address to the data as Python :py:`int`.

timestamp

timestamp : int

class cupti.cupti.ActivityEnvironmentKind(value)

Bases: IntEnum

See CUpti_ActivityEnvironmentKind.

COOLING = 4
COUNT = 5
KIND_FORCE_INT = 2147483647
POWER = 3
SPEED = 1
TEMPERATURE = 2
UNKNOWN = 0
class cupti.cupti.ActivityExternalCorrelation(size=1)

Bases: object

correlation_id

correlation_id : int

external_id

external_id : int

external_kind

external_kind : int

kind

kind : int

ptr

Get the pointer address to the data as Python :py:`int`.

class cupti.cupti.ActivityFlag(value)

Bases: IntEnum

See CUpti_ActivityFlag.

DEVICE_ATTRIBUTE_CUDEVICE = 1
DEVICE_CONCURRENT_KERNELS = 1
FLUSH_FORCED = 1
FORCE_INT = 2147483647
GLOBAL_ACCESS_KIND_CACHED = 512
GLOBAL_ACCESS_KIND_LOAD = 256
GLOBAL_ACCESS_KIND_SIZE_MASK = 255
INSTRUCTION_CLASS_MASK = 510
INSTRUCTION_VALUE_INVALID = 1
MARKER_COLOR_ARGB = 2
MARKER_COLOR_NONE = 1
MARKER_INSTANTANEOUS = 1
MARKER_START = 2
MARKER_SYNC_ACQUIRE = 8
MARKER_SYNC_ACQUIRE_FAILED = 32
MARKER_SYNC_ACQUIRE_SUCCESS = 16
MARKER_SYNC_RELEASE = 64
MEMCPY_ASYNC = 1
MEMSET_ASYNC = 1
METRIC_OVERFLOWED = 1
METRIC_VALUE_INVALID = 2
NONE = 0
SHARED_ACCESS_KIND_LOAD = 256
SHARED_ACCESS_KIND_SIZE_MASK = 255
THRASHING_IN_CPU = 1
THROTTLING_IN_CPU = 1
class cupti.cupti.ActivityFunction(size=1)

Bases: object

context_id

context_id : int

function_ind_ex

function_ind_ex : int

id

id : int

kind

kind : int

module_id

module_id : int

name

name : str

pad

pad : int

ptr

Get the pointer address to the data as Python :py:`int`.

class cupti.cupti.ActivityGraphTrace2(size=1)

Bases: object

context_id

context_id : int

correlation_id

correlation_id : int

device_id

device_id : int

end

end : int

end_context_id

end_context_id : int

end_device_id

end_device_id : int

graph_id

graph_id : int

kind

kind : int

ptr

Get the pointer address to the data as Python :py:`int`.

start

start : int

stream_id

stream_id : int

class cupti.cupti.ActivityInstructionClass(value)

Bases: IntEnum

See CUpti_ActivityInstructionClass.

BARRIER = 17
BIT_CONVERSION = 4
CONSTANT = 11
CONTROL_FLOW = 5
FP_16 = 19
FP_32 = 1
FP_64 = 2
GENERIC = 9
GLOBAL = 6
GLOBAL_ATOMIC = 13
INTEGER = 3
INTER_THREAD_COMMUNICATION = 16
KIND_FORCE_INT = 2147483647
LOCAL = 8
MISCELLANEOUS = 18
SHARED = 7
SHARED_ATOMIC = 14
SURFACE = 10
SURFACE_ATOMIC = 15
TEXTURE = 12
UNIFORM = 20
UNKNOWN = 0
class cupti.cupti.ActivityJit2(size=1)

Bases: object

cache_path

cache_path : str

cache_size

cache_size : int

correlation_id

correlation_id : int

device_id

device_id : int

end

end : int

jit_entry_type

jit_entry_type : int

jit_operation_correlation_id

jit_operation_correlation_id : int

jit_operation_type

jit_operation_type : int

kind

kind : int

padding

padding : int

process_id

process_id : int

ptr

Get the pointer address to the data as Python :py:`int`.

start

start : int

thread_id

thread_id : int

class cupti.cupti.ActivityJitEntryType(value)

Bases: IntEnum

See CUpti_ActivityJitEntryType.

INVALID = 0
NVVM_IR_TO_PTX = 2
PTX_TO_CUBIN = 1
TYPE_FORCE_INT = 2147483647
class cupti.cupti.ActivityJitOperationType(value)

Bases: IntEnum

See CUpti_ActivityJitOperationType.

CACHE_LOAD = 1
CACHE_STORE = 2
COMPILE = 3
INVALID = 0
TYPE_FORCE_INT = 2147483647
class cupti.cupti.ActivityKernel9(size=1)

Bases: object

block_x

block_x : int

block_y

block_y : int

block_z

block_z : int

cache_config

cache_config : cupti.cupti.CacheConfig

channel_id

channel_id : int

channel_type

channel_type : int

cluster_scheduling_policy

cluster_scheduling_policy : int

cluster_x

cluster_x : int

cluster_y

cluster_y : int

cluster_z

cluster_z : int

completed

completed : int

context_id

context_id : int

correlation_id

correlation_id : int

device_id

device_id : int

dynamic_shared_memory

dynamic_shared_memory : int

end

end : int

graph_id

graph_id : int

graph_node_id

graph_node_id : int

grid_id

grid_id : int

grid_x

grid_x : int

grid_y

grid_y : int

grid_z

grid_z : int

is_shared_memory_carveout_requested

is_shared_memory_carveout_requested : int

kind

kind : int

launch_type

launch_type : int

local_memory_per_thread

local_memory_per_thread : int

local_memory_total

local_memory_total : int

local_memory_total_v2

local_memory_total_v2 : int

max_active_clusters

max_active_clusters : int

max_potential_cluster_size

max_potential_cluster_size : int

name

name : str

p_access_policy_window

p_access_policy_window : int

padding

padding : int

partitioned_global_cache_executed

partitioned_global_cache_executed : int

partitioned_global_cache_requested

partitioned_global_cache_requested : int

ptr

Get the pointer address to the data as Python :py:`int`.

queued

queued : int

registers_per_thread

registers_per_thread : int

reserved0

reserved0 : int

shared_memory_carveout_requested

shared_memory_carveout_requested : int

shared_memory_config

shared_memory_config : int

shared_memory_executed

shared_memory_executed : int

shmem_limit_config

shmem_limit_config : int

start

start : int

static_shared_memory

static_shared_memory : int

stream_id

stream_id : int

submitted

submitted : int

class cupti.cupti.ActivityKind(value)

Bases: IntEnum

See CUpti_ActivityKind.

BRANCH = 16
CDP_KERNEL = 18
CONCURRENT_KERNEL = 10
CONFIDENTIAL_COMPUTE_ROTATION = 53
CONTEXT = 9
COUNT = 56
CUDA_EVENT = 36
DEVICE = 8
DEVICE_ATTRIBUTE = 28
DEVICE_GRAPH_TRACE = 54
DRIVER = 4
ENVIRONMENT = 20
EVENT = 6
EVENT_INSTANCE = 21
EXTERNAL_CORRELATION = 39
FORCE_INT = 2147483647
FUNCTION = 26
GLOBAL_ACCESS = 15
GRAPH_TRACE = 51
INSTANTANEOUS_EVENT = 41
INSTANTANEOUS_EVENT_INSTANCE = 42
INSTANTANEOUS_METRIC = 43
INSTANTANEOUS_METRIC_INSTANCE = 44
INSTRUCTION_CORRELATION = 32
INSTRUCTION_EXECUTION = 24
INTERNAL_LAUNCH_API = 48
INVALID = 0
JIT = 52
KERNEL = 3
MARKER = 12
MARKER_DATA = 13
MEMCPY = 1
MEMCPY2 = 22
MEMORY = 45
MEMORY2 = 49
MEMORY_POOL = 50
MEMSET = 2
MEM_DECOMPRESS = 55
METRIC = 7
METRIC_INSTANCE = 23
MODULE = 27
NAME = 11
OPENACC_DATA = 33
OPENACC_LAUNCH = 34
OPENACC_OTHER = 35
OPENMP = 47
OVERHEAD = 17
PCIE = 46
PC_SAMPLING = 30
PC_SAMPLING_RECORD_INFO = 31
PREEMPTION = 19
RUNTIME = 5
SHARED_ACCESS = 29
SOURCE_LOCATOR = 14
STREAM = 37
SYNCHRONIZATION = 38
UNIFIED_MEMORY_COUNTER = 25
class cupti.cupti.ActivityLaunchType(value)

Bases: IntEnum

See CUpti_ActivityLaunchType.

CBL_COMMANDLIST = 3
COOPERATIVE_MULTI_DEVICE = 2
COOPERATIVE_SINGLE_DEVICE = 1
REGULAR = 0
class cupti.cupti.ActivityMarker2(size=1)

Bases: object

domain

domain : str

flags_

flags_ : int

id

id : int

kind

kind : int

name

name : str

object_id

object_id : cupti.cupti.ActivityObjectKindId

object_kind

object_kind : int

pad

pad : int

ptr

Get the pointer address to the data as Python :py:`int`.

timestamp

timestamp : int

class cupti.cupti.ActivityMarkerData(size=1)

Bases: object

category

category : int

color

color : int

flags_

flags_ : int

id

id : int

kind

kind : int

payload

payload : cupti.cupti.MetricValue

payload_kind

payload_kind : int

ptr

Get the pointer address to the data as Python :py:`int`.

class cupti.cupti.ActivityMemDecompress(size=1)

Bases: object

channel_id

channel_id : int

channel_type

channel_type : int

context_id

context_id : int

correlation_id

correlation_id : int

device_id

device_id : int

end

end : int

kind

kind : int

number_of_operations

number_of_operations : int

ptr

Get the pointer address to the data as Python :py:`int`.

reserved0

reserved0 : int

source_bytes

source_bytes : int

start

start : int

stream_id

stream_id : int

class cupti.cupti.ActivityMemcpy6(size=1)

Bases: object

bytes

bytes : int

channel_id

channel_id : int

channel_type

channel_type : int

context_id

context_id : int

copy_count

copy_count : int

copy_kind

copy_kind : int

correlation_id

correlation_id : int

device_id

device_id : int

dst_kind

dst_kind : int

end

end : int

flags_

flags_ : int

graph_id

graph_id : int

graph_node_id

graph_node_id : int

kind

kind : int

pad

pad : int

pad2

pad2 : int

ptr

Get the pointer address to the data as Python :py:`int`.

reserved0

reserved0 : int

runtime_correlation_id

runtime_correlation_id : int

src_kind

src_kind : int

start

start : int

stream_id

stream_id : int

class cupti.cupti.ActivityMemcpyKind(value)

Bases: IntEnum

See CUpti_ActivityMemcpyKind.

ATOA = 5
ATOD = 6
ATOH = 4
DTOA = 7
DTOD = 8
DTOH = 2
FORCE_INT = 2147483647
HTOA = 3
HTOD = 1
HTOH = 9
PTOP = 10
UNKNOWN = 0
class cupti.cupti.ActivityMemcpyPtoP4(size=1)

Bases: object

bytes

bytes : int

channel_id

channel_id : int

channel_type

channel_type : int

context_id

context_id : int

copy_kind

copy_kind : int

correlation_id

correlation_id : int

device_id

device_id : int

dst_context_id

dst_context_id : int

dst_device_id

dst_device_id : int

dst_kind

dst_kind : int

end

end : int

flags_

flags_ : int

graph_id

graph_id : int

graph_node_id

graph_node_id : int

kind

kind : int

ptr

Get the pointer address to the data as Python :py:`int`.

reserved0

reserved0 : int

src_context_id

src_context_id : int

src_device_id

src_device_id : int

src_kind

src_kind : int

start

start : int

stream_id

stream_id : int

class cupti.cupti.ActivityMemory(size=1)

Bases: object

address

address : int

alloc_pc

alloc_pc : int

bytes

bytes : int

context_id

context_id : int

device_id

device_id : int

end

end : int

free_pc

free_pc : int

kind

kind : int

memory_kind

memory_kind : int

name

name : str

pad

pad : int

process_id

process_id : int

ptr

Get the pointer address to the data as Python :py:`int`.

start

start : int

class cupti.cupti.ActivityMemory4(size=1)

Bases: object

address

address : int

bytes

bytes : int

context_id

context_id : int

correlation_id

correlation_id : int

device_id

device_id : int

is_async

is_async : int

kind

kind : int

memory_kind

memory_kind : int

memory_operation_type

memory_operation_type : int

memory_pool_config

memory_pool_config : cupti.cupti.MemoryPoolConfig

name

name : str

pad1

pad1 : int

pc

pc : int

process_id

process_id : int

ptr

Get the pointer address to the data as Python :py:`int`.

source

source : str

stream_id

stream_id : int

timestamp

timestamp : int

class cupti.cupti.ActivityMemoryKind(value)

Bases: IntEnum

See CUpti_ActivityMemoryKind.

ARRAY = 4
DEVICE = 3
DEVICE_STATIC = 6
FORCE_INT = 2147483647
MANAGED = 5
MANAGED_STATIC = 7
PAGEABLE = 1
PINNED = 2
UNKNOWN = 0
class cupti.cupti.ActivityMemoryOperationType(value)

Bases: IntEnum

See CUpti_ActivityMemoryOperationType.

ALLOCATION = 1
FORCE_INT = 2147483647
INVALID = 0
RELEASE = 2
class cupti.cupti.ActivityMemoryPool2(size=1)

Bases: object

address

address : int

correlation_id

correlation_id : int

device_id

device_id : int

kind

kind : int

memory_pool_operation_type

memory_pool_operation_type : int

memory_pool_type

memory_pool_type : int

min_bytes_to_keep

min_bytes_to_keep : int

process_id

process_id : int

ptr

Get the pointer address to the data as Python :py:`int`.

release_threshold

release_threshold : int

size_

size_ : int

timestamp

timestamp : int

utilized_size

utilized_size : int

class cupti.cupti.ActivityMemoryPoolOperationType(value)

Bases: IntEnum

See CUpti_ActivityMemoryPoolOperationType.

CREATED = 1
DESTROYED = 2
FORCE_INT = 2147483647
INVALID = 0
TRIMMED = 3
class cupti.cupti.ActivityMemoryPoolType(value)

Bases: IntEnum

See CUpti_ActivityMemoryPoolType.

FORCE_INT = 2147483647
IMPORTED = 2
INVALID = 0
LOCAL = 1
class cupti.cupti.ActivityMemset4(size=1)

Bases: object

bytes

bytes : int

channel_id

channel_id : int

channel_type

channel_type : int

context_id

context_id : int

correlation_id

correlation_id : int

device_id

device_id : int

end

end : int

flags_

flags_ : int

graph_id

graph_id : int

graph_node_id

graph_node_id : int

kind

kind : int

memory_kind

memory_kind : int

pad

pad : int

pad2

pad2 : int

ptr

Get the pointer address to the data as Python :py:`int`.

reserved0

reserved0 : int

start

start : int

stream_id

stream_id : int

value

value : int

class cupti.cupti.ActivityModule(size=1)

Bases: object

context_id

context_id : int

cubin

cubin : int

cubin_size

cubin_size : int

id

id : int

kind

kind : int

ptr

Get the pointer address to the data as Python :py:`int`.

class cupti.cupti.ActivityName(size=1)

Bases: object

kind

kind : int

name

name : str

object_id

object_id : cupti.cupti.ActivityObjectKindId

object_kind

object_kind : int

pad

pad : int

ptr

Get the pointer address to the data as Python :py:`int`.

class cupti.cupti.ActivityObjectKind(value)

Bases: IntEnum

See CUpti_ActivityObjectKind.

CONTEXT = 4
DEVICE = 3
FORCE_INT = 2147483647
PROCESS = 1
STREAM = 5
THREAD = 2
UNKNOWN = 0
class cupti.cupti.ActivityObjectKindId

Bases: object

contextId

contextId: ‘uint32_t’

deviceId

deviceId: ‘uint32_t’

processId

processId: ‘uint32_t’

streamId

streamId: ‘uint32_t’

threadId

threadId: ‘uint32_t’

class cupti.cupti.ActivityOpenAccData(size=1)

Bases: object

async

async : int

async_map

async_map : int

bytes

bytes : int

cu_context_id

cu_context_id : int

cu_device_id

cu_device_id : int

cu_process_id

cu_process_id : int

cu_stream_id

cu_stream_id : int

cu_thread_id

cu_thread_id : int

device_number

device_number : int

device_ptr

device_ptr : int

device_type

device_type : int

end

end : int

end_line_no

end_line_no : int

event_kind

event_kind : int

external_id

external_id : int

func_end_line_no

func_end_line_no : int

func_line_no

func_line_no : int

func_name

func_name : str

host_ptr

host_ptr : int

implicit

implicit : int

kind

kind : int

line_no

line_no : int

parent_construct

parent_construct : int

ptr

Get the pointer address to the data as Python :py:`int`.

src_file

src_file : str

start

start : int

thread_id

thread_id : int

var_name

var_name : str

version

version : int

class cupti.cupti.ActivityOpenAccLaunch(size=1)

Bases: object

async

async : int

async_map

async_map : int

cu_context_id

cu_context_id : int

cu_device_id

cu_device_id : int

cu_process_id

cu_process_id : int

cu_stream_id

cu_stream_id : int

cu_thread_id

cu_thread_id : int

device_number

device_number : int

device_type

device_type : int

end

end : int

end_line_no

end_line_no : int

event_kind

event_kind : int

external_id

external_id : int

func_end_line_no

func_end_line_no : int

func_line_no

func_line_no : int

func_name

func_name : str

implicit

implicit : int

kernel_name

kernel_name : str

kind

kind : int

line_no

line_no : int

num_gangs

num_gangs : int

num_workers

num_workers : int

parent_construct

parent_construct : int

ptr

Get the pointer address to the data as Python :py:`int`.

src_file

src_file : str

start

start : int

thread_id

thread_id : int

vector_length

vector_length : int

version

version : int

class cupti.cupti.ActivityOpenAccOther(size=1)

Bases: object

async

async : int

async_map

async_map : int

cu_context_id

cu_context_id : int

cu_device_id

cu_device_id : int

cu_process_id

cu_process_id : int

cu_stream_id

cu_stream_id : int

cu_thread_id

cu_thread_id : int

device_number

device_number : int

device_type

device_type : int

end

end : int

end_line_no

end_line_no : int

event_kind

event_kind : int

external_id

external_id : int

func_end_line_no

func_end_line_no : int

func_line_no

func_line_no : int

func_name

func_name : str

implicit

implicit : int

kind

kind : int

line_no

line_no : int

parent_construct

parent_construct : int

ptr

Get the pointer address to the data as Python :py:`int`.

src_file

src_file : str

start

start : int

thread_id

thread_id : int

version

version : int

class cupti.cupti.ActivityOpenMp(size=1)

Bases: object

cu_process_id

cu_process_id : int

cu_thread_id

cu_thread_id : int

end

end : int

event_kind

event_kind : int

kind

kind : int

ptr

Get the pointer address to the data as Python :py:`int`.

start

start : int

thread_id

thread_id : int

version

version : int

class cupti.cupti.ActivityOverhead3(size=1)

Bases: object

correlation_id

correlation_id : int

end

end : int

kind

kind : int

object_id

object_id: cupti.cupti.ActivityObjectKindId

object_kind

object_kind : int

overhead_data

overhead_data : int

overhead_kind

overhead_kind : int

ptr

Get the pointer address to the data as Python :py:`int`.

reserved0

reserved0 : int

start

start : int

class cupti.cupti.ActivityOverheadKind(value)

Bases: IntEnum

See CUpti_ActivityOverheadKind.

ACTIVITY_BUFFER_REQUEST = 458752
COMMAND_BUFFER_FULL = 393216
CUPTI_BUFFER_FLUSH = 65536
CUPTI_INSTRUMENTATION = 131072
CUPTI_RESOURCE = 196608
DRIVER_COMPILER = 1
FORCE_INT = 2147483647
LAZY_FUNCTION_LOADING = 327680
RUNTIME_TRIGGERED_MODULE_LOADING = 262144
UNKNOWN = 0
UVM_ACTIVITY_INIT = 524288
class cupti.cupti.ActivityPCSamplingPeriod(value)

Bases: IntEnum

See CUpti_ActivityPCSamplingPeriod.

FORCE_INT = 2147483647
HIGH = 4
INVALID = 0
LOW = 2
MAX = 5
MID = 3
MIN = 1
class cupti.cupti.ActivityPCSamplingStallReason(value)

Bases: IntEnum

See CUpti_ActivityPCSamplingStallReason.

CONSTANT_MEMORY_DEPENDENCY = 7
EXEC_DEPENDENCY = 3
FORCE_INT = 2147483647
INST_FETCH = 2
INVALID = 0
MEMORY_DEPENDENCY = 4
MEMORY_THROTTLE = 9
NONE = 1
NOT_SELECTED = 10
OTHER = 11
PIPE_BUSY = 8
SLEEPING = 12
SYNC = 6
TEXTURE = 5
class cupti.cupti.ActivityPartitionedGlobalCacheConfig(value)

Bases: IntEnum

See CUpti_ActivityPartitionedGlobalCacheConfig.

FORCE_INT = 2147483647
NOT_SUPPORTED = 1
OFF = 2
ON = 3
UNKNOWN = 0
class cupti.cupti.ActivityPreemption(size=1)

Bases: object

block_x

block_x : int

block_y

block_y : int

block_z

block_z : int

grid_id

grid_id : int

kind

kind : int

pad

pad : int

preemption_kind

preemption_kind : int

ptr

Get the pointer address to the data as Python :py:`int`.

timestamp

timestamp : int

class cupti.cupti.ActivityPreemptionKind(value)

Bases: IntEnum

See CUpti_ActivityPreemptionKind.

FORCE_INT = 2147483647
RESTORE = 2
SAVE = 1
UNKNOWN = 0
class cupti.cupti.ActivityStream(size=1)

Bases: object

context_id

context_id : int

correlation_id

correlation_id : int

flag

flag : int

kind

kind : int

priority

priority : int

ptr

Get the pointer address to the data as Python :py:`int`.

stream_id

stream_id : int

class cupti.cupti.ActivityStreamFlag(value)

Bases: IntEnum

See CUpti_ActivityStreamFlag.

FLAG_DEFAULT = 1
FLAG_FORCE_INT = 2147483647
FLAG_NON_BLOCKING = 2
FLAG_NULL = 3
FLAG_UNKNOWN = 0
MASK = 65535
class cupti.cupti.ActivitySynchronization2(size=1)

Bases: object

context_id

context_id : int

correlation_id

correlation_id : int

cuda_event_id

cuda_event_id : int

cuda_event_sync_id

cuda_event_sync_id : int

end

end : int

kind

kind : int

pad

pad : int

ptr

Get the pointer address to the data as Python :py:`int`.

return_value

return_value : int

start

start : int

stream_id

stream_id : int

type

type : int

class cupti.cupti.ActivitySynchronizationType(value)

Bases: IntEnum

See CUpti_ActivitySynchronizationType.

CONTEXT_SYNCHRONIZE = 4
EVENT_SYNCHRONIZE = 1
FORCE_INT = 2147483647
STREAM_SYNCHRONIZE = 3
STREAM_WAIT_EVENT = 2
UNKNOWN = 0
class cupti.cupti.ActivityThreadIdType(value)

Bases: IntEnum

See CUpti_ActivityThreadIdType.

DEFAULT = 0
FORCE_INT = 2147483647
SIZE = 2
SYSTEM = 1
class cupti.cupti.ActivityUnifiedMemoryAccessType(value)

Bases: IntEnum

See CUpti_ActivityUnifiedMemoryAccessType.

ATOMIC = 3
PREFETCH = 4
READ = 1
UNKNOWN = 0
WRITE = 2
class cupti.cupti.ActivityUnifiedMemoryCounter3(size=1)

Bases: object

address

address : int

counter_kind

counter_kind : int

dst_id

dst_id : int

end

end : int

flags_

flags_ : int

kind

kind : int

pad

pad : int

process_id

process_id : int

processors

processors: list[int]

ptr

Get the pointer address to the data as Python :py:`int`.

src_id

src_id : int

start

start : int

stream_id

stream_id : int

value

value : int

class cupti.cupti.ActivityUnifiedMemoryCounterConfig(size=1)

Bases: object

device_id

device_id : int

enable

enable : int

kind

kind : int

ptr

Get the pointer address to the data as Python :py:`int`.

scope

scope : int

class cupti.cupti.ActivityUnifiedMemoryCounterKind(value)

Bases: IntEnum

See CUpti_ActivityUnifiedMemoryCounterKind.

BYTES_TRANSFER_DTOD = 8
BYTES_TRANSFER_DTOH = 2
BYTES_TRANSFER_HTOD = 1
COUNT = 9
CPU_PAGE_FAULT_COUNT = 3
FORCE_INT = 2147483647
GPU_PAGE_FAULT = 4
REMOTE_MAP = 7
THRASHING = 5
THROTTLING = 6
UNKNOWN = 0
class cupti.cupti.ActivityUnifiedMemoryCounterScope(value)

Bases: IntEnum

See CUpti_ActivityUnifiedMemoryCounterScope.

COUNT = 3
FORCE_INT = 2147483647
PROCESS_ALL_DEVICES = 2
PROCESS_SINGLE_DEVICE = 1
UNKNOWN = 0
class cupti.cupti.ActivityUnifiedMemoryMigrationCause(value)

Bases: IntEnum

See CUpti_ActivityUnifiedMemoryMigrationCause.

ACCESS_COUNTERS = 5
COHERENCE = 2
EVICTION = 4
PREFETCH = 3
UNKNOWN = 0
USER = 1
class cupti.cupti.ActivityUnifiedMemoryRemoteMapCause(value)

Bases: IntEnum

See CUpti_ActivityUnifiedMemoryRemoteMapCause.

COHERENCE = 1
EVICTION = 5
OUT_OF_MEMORY = 4
POLICY = 3
THRASHING = 2
UNKNOWN = 0
class cupti.cupti.ApiCallbackSite(value)

Bases: IntEnum

See CUpti_ApiCallbackSite.

API_CBSITE_FORCE_INT = 2147483647
API_ENTER = 0
API_EXIT = 1
class cupti.cupti.Attribute

Bases: object

cu

cu: ‘int32_t’

cupti

cupti: ‘int32_t’

class cupti.cupti.CacheConfig

Bases: object

both

both: ‘uint8_t’

executed

executed: ‘uint8_t’

requested

requested: ‘uint8_t’

class cupti.cupti.CallbackData(size=1)

Bases: object

callback_site

callback_site : int

cbid
context

context : int

context_uid

context_uid : int

correlation_data

correlation_data : int

correlation_id

correlation_id : int

domain
function_name

function_name : str

function_params

function_params : int

function_params_py_obj
function_return_value

function_return_value : int

ptr

Get the pointer address to the data as Python :py:`int`.

symbol_name

symbol_name : str

class cupti.cupti.CallbackDomain(value)

Bases: IntEnum

See CUpti_CallbackDomain.

DRIVER_API = 1
FORCE_INT = 2147483647
INVALID = 0
NVTX = 5
RESOURCE = 3
RUNTIME_API = 2
SIZE = 7
STATE = 6
SYNCHRONIZE = 4
class cupti.cupti.CallbackIdResource(value)

Bases: IntEnum

See CUpti_CallbackIdResource.

CONTEXT_CREATED = 1
CONTEXT_DESTROY_STARTING = 2
CU_INIT_FINISHED = 5
FORCE_INT = 2147483647
GRAPHEXEC_CREATED = 18
GRAPHEXEC_CREATE_STARTING = 17
GRAPHEXEC_DESTROY_STARTING = 19
GRAPHNODE_CLONED = 20
GRAPHNODE_CREATED = 13
GRAPHNODE_CREATE_STARTING = 12
GRAPHNODE_DEPENDENCY_CREATED = 15
GRAPHNODE_DEPENDENCY_DESTROY_STARTING = 16
GRAPHNODE_DESTROY_STARTING = 14
GRAPH_CLONED = 11
GRAPH_CREATED = 9
GRAPH_DESTROY_STARTING = 10
INVALID = 0
MODULE_LOADED = 6
MODULE_PROFILED = 8
MODULE_UNLOAD_STARTING = 7
SIZE = 22
STREAM_ATTRIBUTE_CHANGED = 21
STREAM_CREATED = 3
STREAM_DESTROY_STARTING = 4
class cupti.cupti.CallbackIdState(value)

Bases: IntEnum

See CUpti_CallbackIdState.

ERROR = 2
FATAL_ERROR = 1
FORCE_INT = 2147483647
INVALID = 0
SIZE = 4
WARNING = 3
class cupti.cupti.CallbackIdSync(value)

Bases: IntEnum

See CUpti_CallbackIdSync.

CONTEXT_SYNCHRONIZED = 2
FORCE_INT = 2147483647
INVALID = 0
SIZE = 3
STREAM_SYNCHRONIZED = 1
class cupti.cupti.ChannelType(value)

Bases: IntEnum

See CUpti_ChannelType.

ASYNC_MEMCPY = 2
COMPUTE = 1
DECOMP = 3
FORCE_INT = 2147483647
INVALID = 0
class cupti.cupti.ContextCigMode(value)

Bases: IntEnum

See CUpti_ContextCigMode.

CIG = 1
CIG_FALLBACK = 2
FORCE_INT = 2147483647
NONE = 0
class cupti.cupti.Data

Bases: object

clocksThrottleReasons

clocksThrottleReasons: ‘int32_t’

fanSpeed

fanSpeed: ‘uint32_t’

gpuTemperature

gpuTemperature: ‘uint32_t’

memoryClock

memoryClock: ‘uint32_t’

pcieLinkGen

pcieLinkGen: ‘uint32_t’

pcieLinkWidth

pcieLinkWidth: ‘uint32_t’

power

power: ‘uint32_t’

powerLimit

powerLimit: ‘uint32_t’

smClock

smClock: ‘uint32_t’

class cupti.cupti.DevType(value)

Bases: IntEnum

See CUpti_DevType.

FORCE_INT = 2147483647
GPU = 1
INVALID = 0
NPU = 2
class cupti.cupti.DeviceAttribute(value)

Bases: IntEnum

See CUpti_DeviceAttribute.

DEVICE_CLASS = 10
FLOP_DP_PER_CYCLE = 12
FLOP_HP_PER_CYCLE = 17
FLOP_SP_PER_CYCLE = 11
FORCE_INT = 2147483647
GLOBAL_MEMORY_BANDWIDTH = 3
INSTRUCTION_PER_CYCLE = 4
INSTRUCTION_THROUGHPUT_SINGLE_PRECISION = 5
MAX_EVENT_DOMAIN_ID = 2
MAX_EVENT_ID = 1
MAX_FRAME_BUFFERS = 6
MAX_L2_UNITS = 13
MAX_SHARED_MEMORY_CACHE_CONFIG_PREFER_EQUAL = 16
MAX_SHARED_MEMORY_CACHE_CONFIG_PREFER_L1 = 15
MAX_SHARED_MEMORY_CACHE_CONFIG_PREFER_SHARED = 14
NVSWITCH_PRESENT = 20
PCIE_GEN = 9
class cupti.cupti.DeviceVirtualizationMode(value)

Bases: IntEnum

See CUpti_DeviceVirtualizationMode.

FORCE_INT = 2147483647
NONE = 0
PASS_THROUGH = 1
VIRTUAL_GPU = 2
class cupti.cupti.EnvironmentClocksThrottleReason(value)

Bases: IntEnum

See CUpti_EnvironmentClocksThrottleReason.

FORCE_INT = 2147483647
GPU_IDLE = 1
HW_SLOWDOWN = 8
NONE = 0
SW_POWER_CAP = 4
UNKNOWN = 2147483648
UNSUPPORTED = 1073741824
USER_DEFINED_CLOCKS = 2
class cupti.cupti.ExternalCorrelationKind(value)

Bases: IntEnum

See CUpti_ExternalCorrelationKind.

CUSTOM0 = 3
CUSTOM1 = 4
CUSTOM2 = 5
FORCE_INT = 2147483647
INVALID = 0
OPENACC = 2
SIZE = 6
UNKNOWN = 1
class cupti.cupti.FuncShmemLimitConfig(value)

Bases: IntEnum

See CUpti_FuncShmemLimitConfig.

DEFAULT = 0
FORCE_INT = 2147483647
OPTIN = 1
class cupti.cupti.GraphData(size=1)

Bases: object

dependency

dependency : int

graph

graph : int

graph_exec

graph_exec : int

node

node : int

node_type

node_type : int

original_graph

original_graph : int

original_node

original_node : int

ptr

Get the pointer address to the data as Python :py:`int`.

class cupti.cupti.MemoryPoolConfig

Bases: object

address

address: ‘uint64_t’

memory_pool_type

memory_pool_type: ‘int32_t’

process_id

process_id: ‘uint64_t’

release_threshold

release_threshold: ‘uint64_t’

size

size: ‘uint64_t’

utilized_size

utilized_size: ‘uint64_t’

class cupti.cupti.MetricValue

Bases: object

metricValueDouble

metricValueDouble: ‘double’

metricValueInt64

metricValueInt64: ‘int64_t’

metricValuePercent

metricValuePercent: ‘double’

metricValueThroughput

metricValueThroughput: ‘uint64_t’

metricValueUint64

metricValueUint64: ‘uint64_t’

metricValueUtilizationLevel

metricValueUtilizationLevel: ‘int32_t’

class cupti.cupti.MetricValueKind(value)

Bases: IntEnum

See CUpti_MetricValueKind.

DOUBLE = 0
FORCE_INT = 2147483647
INT64 = 4
PERCENT = 2
THROUGHPUT = 3
UINT64 = 1
UTILIZATION_LEVEL = 5
class cupti.cupti.MetricValueUtilizationLevel(value)

Bases: IntEnum

See CUpti_MetricValueUtilizationLevel.

FORCE_INT = 2147483647
HIGH = 8
IDLE = 0
LOW = 2
MAX = 10
MID = 5
class cupti.cupti.ModuleResourceData(size=1)

Bases: object

cubin_size

cubin_size : int

module_id

module_id : int

p_cubin

p_cubin : str

ptr

Get the pointer address to the data as Python :py:`int`.

class cupti.cupti.Notification

Bases: object

message

message: unicode

result

result: ‘int32_t’

class cupti.cupti.OpenAccConstructKind(value)

Bases: IntEnum

See CUpti_OpenAccConstructKind.

ATOMIC = 8
DATA = 4
DECLARE = 9
ENTER_DATA = 5
EXIT_DATA = 6
FORCE_INT = 2147483647
HOST_DATA = 7
INIT = 10
KERNELS = 2
LOOP = 3
PARALLEL = 1
ROUTINE = 14
RUNTIME_API = 16
SET = 12
SHUTDOWN = 11
UNKNOWN = 0
UPDATE = 13
WAIT = 15
class cupti.cupti.OpenAccEventKind(value)

Bases: IntEnum

See CUpti_OpenAccEventKind.

ALLOC = 15
COMPUTE_CONSTRUCT = 9
CREATE = 13
DELETE = 14
DEVICE_INIT = 1
DEVICE_SHUTDOWN = 2
ENQUEUE_DOWNLOAD = 6
ENQUEUE_LAUNCH = 4
ENQUEUE_UPLOAD = 5
ENTER_DATA = 11
EXIT_DATA = 12
FORCE_INT = 2147483647
FREE = 16
IMPLICIT_WAIT = 8
INVALID = 0
RUNTIME_SHUTDOWN = 3
UPDATE = 10
WAIT = 7
class cupti.cupti.OpenMpEventKind(value)

Bases: IntEnum

See CUpti_OpenMpEventKind.

FORCE_INT = 2147483647
IDLE = 4
INVALID = 0
PARALLEL = 1
TASK = 2
THREAD = 3
WAIT_BARRIER = 5
WAIT_TASKWAIT = 6
class cupti.cupti.PcieDeviceType(value)

Bases: IntEnum

See CUpti_PcieDeviceType.

BRIDGE = 1
FORCE_INT = 2147483647
GPU = 0
class cupti.cupti.ResourceData(size=1)

Bases: object

context

context : int

ptr

Get the pointer address to the data as Python :py:`int`.

resource_descriptor

resource_descriptor : int

class cupti.cupti.ResourceHandle

Bases: object

stream

stream: ‘intptr_t’

class cupti.cupti.Result(value)

Bases: IntEnum

See CUptiResult.

ERROR_API_NOT_IMPLEMENTED = 11
ERROR_CDP_TRACING_NOT_SUPPORTED = 32
ERROR_CMP_DEVICE_NOT_SUPPORTED = 42
ERROR_CONFIDENTIAL_COMPUTING_NOT_SUPPORTED = 41
ERROR_CUDA_COMPILER_NOT_COMPATIBLE = 34
ERROR_DISABLED = 23
ERROR_FORCE_INT = 2147483647
ERROR_HARDWARE = 9
ERROR_HARDWARE_BUSY = 26
ERROR_INSUFFICIENT_PRIVILEGES = 35
ERROR_INVALID_CONTEXT = 3
ERROR_INVALID_DEVICE = 2
ERROR_INVALID_EVENT_DOMAIN_ID = 4
ERROR_INVALID_EVENT_ID = 5
ERROR_INVALID_EVENT_NAME = 6
ERROR_INVALID_EVENT_VALUE = 22
ERROR_INVALID_HANDLE = 19
ERROR_INVALID_KIND = 21
ERROR_INVALID_METRIC_ID = 16
ERROR_INVALID_METRIC_NAME = 17
ERROR_INVALID_METRIC_VALUE = 25
ERROR_INVALID_MODULE = 24
ERROR_INVALID_OPERATION = 7
ERROR_INVALID_PARAMETER = 1
ERROR_INVALID_STREAM = 20
ERROR_LEGACY_PROFILER_NOT_SUPPORTED = 38
ERROR_MAX_LIMIT_REACHED = 12
ERROR_MIG_DEVICE_NOT_SUPPORTED = 43
ERROR_MULTIPLE_SUBSCRIBERS_NOT_SUPPORTED = 39
ERROR_NOT_COMPATIBLE = 14
ERROR_NOT_INITIALIZED = 15
ERROR_NOT_READY = 13
ERROR_NOT_SUPPORTED = 27
ERROR_OLD_PROFILER_API_INITIALIZED = 36
ERROR_OPENACC_UNDEFINED_ROUTINE = 37
ERROR_OUT_OF_MEMORY = 8
ERROR_PARAMETER_SIZE_NOT_SUFFICIENT = 10
ERROR_QUEUE_EMPTY = 18
ERROR_SLI_DEVICE_NOT_SUPPORTED = 44
ERROR_UM_PROFILING_NOT_SUPPORTED = 28
ERROR_UM_PROFILING_NOT_SUPPORTED_ON_DEVICE = 29
ERROR_UM_PROFILING_NOT_SUPPORTED_ON_NON_P2P_DEVICES = 30
ERROR_UM_PROFILING_NOT_SUPPORTED_WITH_MPS = 31
ERROR_UNKNOWN = 999
ERROR_VIRTUALIZED_DEVICE_INSUFFICIENT_PRIVILEGES = 40
ERROR_VIRTUALIZED_DEVICE_NOT_SUPPORTED = 33
ERROR_WSL_DEVICE_NOT_SUPPORTED = 45
SUCCESS = 0
class cupti.cupti.StateData(size=1)

Bases: object

notification

notification : cupti.cupti.Notification

ptr

Get the pointer address to the data as Python :py:`int`.

class cupti.cupti.StreamAttrData(size=1)

Bases: object

attr

attr : int

ptr

Get the pointer address to the data as Python :py:`int`.

stream

stream : int

value

value : int

class cupti.cupti.SynchronizeData(size=1)

Bases: object

context

context : int

ptr

Get the pointer address to the data as Python :py:`int`.

stream

stream : int

class cupti.cupti.Value

Bases: object

vDouble

vDouble: ‘double’

vInt32

vInt32: ‘int32_t’

vInt64

vInt64: ‘int64_t’

vUint32

vUint32: ‘uint32_t’

vUint64

vUint64: ‘uint64_t’

class cupti.cupti.driver_api_trace_cbid(value)

Bases: IntEnum

See CUpti_driver_api_trace_cbid.

FORCE_INT = 2147483647
INVALID = 0
SIZE = 782
cu64Array3DCreate = 230
cu64Array3DGetDescriptor = 231
cu64ArrayCreate = 228
cu64ArrayGetDescriptor = 229
cu64D3D10ResourceGetMappedPitch = 200
cu64D3D10ResourceGetMappedPointer = 198
cu64D3D10ResourceGetMappedSize = 199
cu64D3D10ResourceGetSurfaceDimensions = 201
cu64D3D9MapVertexBuffer = 206
cu64D3D9ResourceGetMappedPitch = 205
cu64D3D9ResourceGetMappedPointer = 203
cu64D3D9ResourceGetMappedSize = 204
cu64D3D9ResourceGetSurfaceDimensions = 202
cu64DeviceTotalMem = 197
cu64GLMapBufferObject = 207
cu64GLMapBufferObjectAsync = 208
cu64GraphicsResourceGetMappedPointer = 131
cu64MemAlloc = 30
cu64MemAllocPitch = 32
cu64MemFree = 34
cu64MemGetAddressRange = 36
cu64MemGetInfo = 28
cu64MemHostAlloc = 215
cu64MemHostGetDevicePointer = 41
cu64Memcpy2D = 232
cu64Memcpy2DAsync = 234
cu64Memcpy2DUnaligned = 233
cu64Memcpy3D = 59
cu64Memcpy3DAsync = 70
cu64MemcpyAtoD = 52
cu64MemcpyDtoA = 50
cu64MemcpyDtoD = 48
cu64MemcpyDtoDAsync = 65
cu64MemcpyDtoH = 46
cu64MemcpyDtoHAsync = 63
cu64MemcpyHtoD = 44
cu64MemcpyHtoDAsync = 61
cu64MemsetD16 = 74
cu64MemsetD16Async = 219
cu64MemsetD2D16 = 80
cu64MemsetD2D16Async = 225
cu64MemsetD2D32 = 82
cu64MemsetD2D32Async = 227
cu64MemsetD2D8 = 78
cu64MemsetD2D8Async = 223
cu64MemsetD32 = 76
cu64MemsetD32Async = 221
cu64MemsetD8 = 72
cu64MemsetD8Async = 217
cu64ModuleGetGlobal = 25
cu64TexRefGetAddress = 104
cu64TexRefSetAddress = 96
cu64TexRefSetAddress2D = 98
cuArray3DCreate = 90
cuArray3DCreate_v2 = 274
cuArray3DGetDescriptor = 91
cuArray3DGetDescriptor_v2 = 275
cuArrayCreate = 87
cuArrayCreate_v2 = 272
cuArrayDestroy = 89
cuArrayGetDescriptor = 88
cuArrayGetDescriptor_v2 = 273
cuArrayGetMemoryRequirements = 654
cuArrayGetPlane = 597
cuArrayGetSparseProperties = 582
cuBinaryFree = 376
cuCheckpointProcessCheckpoint = 771
cuCheckpointProcessGetRestoreThreadId = 768
cuCheckpointProcessGetState = 769
cuCheckpointProcessLock = 770
cuCheckpointProcessRestore = 772
cuCheckpointProcessUnlock = 773
cuCompilePtx = 375
cuCoredumpGetAttribute = 701
cuCoredumpGetAttributeGlobal = 702
cuCoredumpSetAttribute = 703
cuCoredumpSetAttributeGlobal = 704
cuCtxAttach = 12
cuCtxCreate = 10
cuCtxCreate_v2 = 235
cuCtxCreate_v3 = 645
cuCtxCreate_v4 = 757
cuCtxDestroy = 11
cuCtxDestroy_v2 = 322
cuCtxDetach = 13
cuCtxDisablePeerAccess = 314
cuCtxEnablePeerAccess = 313
cuCtxFromGreenCtx = 753
cuCtxGetApiVersion = 296
cuCtxGetCacheConfig = 299
cuCtxGetCurrent = 304
cuCtxGetDevResource = 746
cuCtxGetDevice = 16
cuCtxGetExecAffinity = 646
cuCtxGetFlags = 391
cuCtxGetId = 695
cuCtxGetLimit = 137
cuCtxGetSharedMemConfig = 337
cuCtxGetStreamPriorityRange = 370
cuCtxPopCurrent = 15
cuCtxPopCurrent_v2 = 324
cuCtxPushCurrent = 14
cuCtxPushCurrent_v2 = 323
cuCtxRecordEvent = 755
cuCtxResetPersistingL2Cache = 568
cuCtxSetCacheConfig = 300
cuCtxSetCurrent = 303
cuCtxSetFlags = 705
cuCtxSetLimit = 136
cuCtxSetSharedMemConfig = 336
cuCtxSynchronize = 17
cuCtxWaitEvent = 756
cuD3D10CtxCreate = 139
cuD3D10CtxCreateOnDevice = 212
cuD3D10CtxCreate_v2 = 236
cuD3D10GetDevice = 138
cuD3D10GetDevices = 211
cuD3D10GetDirect3DDevice = 297
cuD3D10MapResources = 143
cuD3D10RegisterResource = 141
cuD3D10ResourceGetMappedArray = 146
cuD3D10ResourceGetMappedPitch = 149
cuD3D10ResourceGetMappedPitch_v2 = 262
cuD3D10ResourceGetMappedPointer = 147
cuD3D10ResourceGetMappedPointer_v2 = 260
cuD3D10ResourceGetMappedSize = 148
cuD3D10ResourceGetMappedSize_v2 = 261
cuD3D10ResourceGetSurfaceDimensions = 150
cuD3D10ResourceGetSurfaceDimensions_v2 = 263
cuD3D10ResourceSetMapFlags = 145
cuD3D10UnmapResources = 144
cuD3D10UnregisterResource = 142
cuD3D11CtxCreate = 152
cuD3D11CtxCreateOnDevice = 210
cuD3D11CtxCreate_v2 = 237
cuD3D11GetDevice = 151
cuD3D11GetDevices = 209
cuD3D11GetDirect3DDevice = 298
cuD3D9Begin = 168
cuD3D9CtxCreate = 155
cuD3D9CtxCreateOnDevice = 214
cuD3D9CtxCreate_v2 = 238
cuD3D9End = 169
cuD3D9GetDevice = 154
cuD3D9GetDevices = 213
cuD3D9GetDirect3DDevice = 157
cuD3D9MapResources = 160
cuD3D9MapVertexBuffer = 171
cuD3D9MapVertexBuffer_v2 = 268
cuD3D9RegisterResource = 158
cuD3D9RegisterVertexBuffer = 170
cuD3D9ResourceGetMappedArray = 164
cuD3D9ResourceGetMappedPitch = 167
cuD3D9ResourceGetMappedPitch_v2 = 267
cuD3D9ResourceGetMappedPointer = 165
cuD3D9ResourceGetMappedPointer_v2 = 265
cuD3D9ResourceGetMappedSize = 166
cuD3D9ResourceGetMappedSize_v2 = 266
cuD3D9ResourceGetSurfaceDimensions = 163
cuD3D9ResourceGetSurfaceDimensions_v2 = 264
cuD3D9ResourceSetMapFlags = 162
cuD3D9UnmapResources = 161
cuD3D9UnmapVertexBuffer = 172
cuD3D9UnregisterResource = 159
cuD3D9UnregisterVertexBuffer = 173
cuDestroyExternalMemory = 488
cuDestroyExternalSemaphore = 494
cuDevResourceGenerateDesc = 748
cuDevSmResourceSplitByCount = 751
cuDeviceCanAccessPeer = 312
cuDeviceComputeCapability = 6
cuDeviceGet = 3
cuDeviceGetAttribute = 9
cuDeviceGetByPCIBusId = 331
cuDeviceGetCount = 4
cuDeviceGetDefaultMemPool = 606
cuDeviceGetDevResource = 745
cuDeviceGetExecAffinitySupport = 644
cuDeviceGetGraphMemAttribute = 641
cuDeviceGetLuid = 532
cuDeviceGetMemPool = 610
cuDeviceGetName = 5
cuDeviceGetNvSciSyncAttributes = 542
cuDeviceGetP2PAttribute = 454
cuDeviceGetPCIBusId = 332
cuDeviceGetProperties = 8
cuDeviceGetTexture1DLinearMaxWidth = 579
cuDeviceGetUuid = 482
cuDeviceGetUuid_v2 = 647
cuDeviceGraphMemTrim = 640
cuDevicePrimaryCtxGetState = 392
cuDevicePrimaryCtxRelease = 387
cuDevicePrimaryCtxRelease_v2 = 544
cuDevicePrimaryCtxReset = 389
cuDevicePrimaryCtxReset_v2 = 545
cuDevicePrimaryCtxRetain = 386
cuDevicePrimaryCtxSetFlags = 388
cuDevicePrimaryCtxSetFlags_v2 = 546
cuDeviceRegisterAsyncNotification = 735
cuDeviceSetGraphMemAttribute = 642
cuDeviceSetMemPool = 609
cuDeviceTotalMem = 7
cuDeviceTotalMem_v2 = 259
cuDeviceUnregisterAsyncNotification = 736
cuDriverGetVersion = 2
cuEGLStreamConsumerAcquireFrame = 395
cuEGLStreamConsumerConnect = 393
cuEGLStreamConsumerConnectWithFlags = 470
cuEGLStreamConsumerDisconnect = 394
cuEGLStreamConsumerReleaseFrame = 396
cuEGLStreamProducerConnect = 446
cuEGLStreamProducerDisconnect = 447
cuEGLStreamProducerPresentFrame = 448
cuEGLStreamProducerReturnFrame = 453
cuEventCreate = 118
cuEventCreateFromEGLSync = 479
cuEventCreateFromNVNSync = 469
cuEventDestroy = 122
cuEventDestroy_v2 = 325
cuEventElapsedTime = 123
cuEventElapsedTime_v2 = 780
cuEventQuery = 120
cuEventRecord = 119
cuEventRecordWithFlags = 587
cuEventRecordWithFlags_ptsz = 588
cuEventRecord_ptsz = 441
cuEventSynchronize = 121
cuExternalMemoryGetMappedBuffer = 486
cuExternalMemoryGetMappedMipmappedArray = 487
cuFlushGPUDirectRDMAWrites = 627
cuFuncGetAttribute = 85
cuFuncGetModule = 566
cuFuncGetName = 718
cuFuncGetParamInfo = 733
cuFuncIsLoaded = 741
cuFuncLoad = 742
cuFuncSetAttribute = 481
cuFuncSetBlockShape = 83
cuFuncSetCacheConfig = 86
cuFuncSetSharedMemConfig = 338
cuFuncSetSharedSize = 84
cuGLCtxCreate = 174
cuGLCtxCreate_v2 = 239
cuGLGetDevices = 333
cuGLGetDevices_v2 = 385
cuGLInit = 178
cuGLMapBufferObject = 180
cuGLMapBufferObjectAsync = 184
cuGLMapBufferObjectAsync_v2 = 270
cuGLMapBufferObjectAsync_v2_ptsz = 445
cuGLMapBufferObject_v2 = 269
cuGLMapBufferObject_v2_ptds = 417
cuGLRegisterBufferObject = 179
cuGLSetBufferObjectMapFlags = 183
cuGLUnmapBufferObject = 181
cuGLUnmapBufferObjectAsync = 185
cuGLUnregisterBufferObject = 182
cuGetErrorName = 373
cuGetErrorString = 372
cuGetExportTable = 135
cuGetProcAddress = 626
cuGetProcAddress_v2 = 677
cuGraphAddBatchMemOpNode = 669
cuGraphAddChildGraphNode = 525
cuGraphAddDependencies = 518
cuGraphAddDependencies_v2 = 727
cuGraphAddEmptyNode = 526
cuGraphAddEventRecordNode = 589
cuGraphAddEventWaitNode = 590
cuGraphAddExternalSemaphoresSignalNode = 618
cuGraphAddExternalSemaphoresWaitNode = 621
cuGraphAddHostNode = 530
cuGraphAddKernelNode = 502
cuGraphAddKernelNode_v2 = 689
cuGraphAddMemAllocNode = 638
cuGraphAddMemFreeNode = 639
cuGraphAddMemcpyNode = 504
cuGraphAddMemsetNode = 506
cuGraphAddNode = 712
cuGraphAddNode_v2 = 723
cuGraphBatchMemOpNodeGetParams = 670
cuGraphBatchMemOpNodeSetParams = 671
cuGraphChildGraphNodeGetGraph = 529
cuGraphClone = 523
cuGraphConditionalHandleCreate = 722
cuGraphCreate = 501
cuGraphDebugDotPrint = 628
cuGraphDestroy = 517
cuGraphDestroyNode = 522
cuGraphEventRecordNodeGetEvent = 591
cuGraphEventRecordNodeSetEvent = 593
cuGraphEventWaitNodeGetEvent = 592
cuGraphEventWaitNodeSetEvent = 594
cuGraphExecBatchMemOpNodeSetParams = 672
cuGraphExecChildGraphNodeSetParams = 586
cuGraphExecDestroy = 516
cuGraphExecEventRecordNodeSetEvent = 595
cuGraphExecEventWaitNodeSetEvent = 596
cuGraphExecExternalSemaphoresSignalNodeSetParams = 624
cuGraphExecExternalSemaphoresWaitNodeSetParams = 625
cuGraphExecGetFlags = 658
cuGraphExecHostNodeSetParams = 564
cuGraphExecKernelNodeSetParams = 538
cuGraphExecKernelNodeSetParams_v2 = 692
cuGraphExecMemcpyNodeSetParams = 562
cuGraphExecMemsetNodeSetParams = 563
cuGraphExecNodeSetParams = 714
cuGraphExecUpdate = 561
cuGraphExecUpdate_v2 = 696
cuGraphExternalSemaphoresSignalNodeGetParams = 619
cuGraphExternalSemaphoresSignalNodeSetParams = 620
cuGraphExternalSemaphoresWaitNodeGetParams = 622
cuGraphExternalSemaphoresWaitNodeSetParams = 623
cuGraphGetEdges = 535
cuGraphGetEdges_v2 = 724
cuGraphGetNodes = 534
cuGraphGetRootNodes = 510
cuGraphHostNodeGetParams = 531
cuGraphHostNodeSetParams = 533
cuGraphInstantiate = 513
cuGraphInstantiateWithFlags = 643
cuGraphInstantiateWithParams = 656
cuGraphInstantiateWithParams_ptsz = 657
cuGraphInstantiate_v2 = 578
cuGraphKernelNodeCopyAttributes = 569
cuGraphKernelNodeGetAttribute = 570
cuGraphKernelNodeGetParams = 503
cuGraphKernelNodeGetParams_v2 = 690
cuGraphKernelNodeSetAttribute = 571
cuGraphKernelNodeSetParams = 521
cuGraphKernelNodeSetParams_v2 = 691
cuGraphLaunch = 514
cuGraphLaunch_ptsz = 515
cuGraphMemAllocNodeGetParams = 648
cuGraphMemFreeNodeGetParams = 649
cuGraphMemcpyNodeGetParams = 505
cuGraphMemcpyNodeSetParams = 520
cuGraphMemsetNodeGetParams = 507
cuGraphMemsetNodeSetParams = 508
cuGraphNodeFindInClone = 524
cuGraphNodeGetDependencies = 511
cuGraphNodeGetDependencies_v2 = 725
cuGraphNodeGetDependentNodes = 512
cuGraphNodeGetDependentNodes_v2 = 726
cuGraphNodeGetEnabled = 651
cuGraphNodeGetType = 509
cuGraphNodeSetEnabled = 650
cuGraphNodeSetParams = 713
cuGraphReleaseUserObject = 637
cuGraphRemoveDependencies = 519
cuGraphRemoveDependencies_v2 = 728
cuGraphRetainUserObject = 636
cuGraphUpload = 580
cuGraphUpload_ptsz = 581
cuGraphicsD3D10RegisterResource = 140
cuGraphicsD3D11RegisterResource = 153
cuGraphicsD3D9RegisterResource = 156
cuGraphicsEGLRegisterImage = 390
cuGraphicsGLRegisterBuffer = 175
cuGraphicsGLRegisterImage = 176
cuGraphicsMapResources = 133
cuGraphicsMapResources_ptsz = 443
cuGraphicsResourceGetMappedEglFrame = 449
cuGraphicsResourceGetMappedMipmappedArray = 360
cuGraphicsResourceGetMappedPointer = 130
cuGraphicsResourceGetMappedPointer_v2 = 258
cuGraphicsResourceSetMapFlags = 132
cuGraphicsResourceSetMapFlags_v2 = 380
cuGraphicsSubResourceGetMappedArray = 129
cuGraphicsUnmapResources = 134
cuGraphicsUnmapResources_ptsz = 444
cuGraphicsUnregisterResource = 128
cuGraphicsVDPAURegisterOutputSurface = 189
cuGraphicsVDPAURegisterVideoSurface = 188
cuGreenCtxCreate = 743
cuGreenCtxDestroy = 744
cuGreenCtxGetDevResource = 747
cuGreenCtxRecordEvent = 749
cuGreenCtxStreamCreate = 758
cuGreenCtxWaitEvent = 750
cuImportExternalMemory = 485
cuImportExternalSemaphore = 489
cuInit = 1
cuIpcCloseMemHandle = 330
cuIpcGetEventHandle = 334
cuIpcGetMemHandle = 328
cuIpcOpenEventHandle = 335
cuIpcOpenMemHandle = 329
cuIpcOpenMemHandle_v2 = 567
cuKernelGetAttribute = 686
cuKernelGetFunction = 683
cuKernelGetLibrary = 754
cuKernelGetName = 719
cuKernelGetParamInfo = 734
cuKernelSetAttribute = 687
cuKernelSetCacheConfig = 688
cuLaunch = 115
cuLaunchCooperativeKernel = 477
cuLaunchCooperativeKernelMultiDevice = 480
cuLaunchCooperativeKernel_ptsz = 478
cuLaunchGrid = 116
cuLaunchGridAsync = 117
cuLaunchHostFunc = 527
cuLaunchHostFunc_ptsz = 528
cuLaunchKernel = 307
cuLaunchKernelEx = 652
cuLaunchKernelEx_ptsz = 653
cuLaunchKernel_ptsz = 442
cuLibraryEnumerateKernels = 740
cuLibraryGetGlobal = 684
cuLibraryGetKernel = 681
cuLibraryGetKernelCount = 739
cuLibraryGetManaged = 685
cuLibraryGetModule = 682
cuLibraryGetUnifiedFunction = 700
cuLibraryLoadData = 678
cuLibraryLoadFromFile = 679
cuLibraryUnload = 680
cuLinkAddData = 363
cuLinkAddData_v2 = 382
cuLinkAddFile = 364
cuLinkAddFile_v2 = 383
cuLinkComplete = 365
cuLinkCreate = 362
cuLinkCreate_v2 = 381
cuLinkDestroy = 366
cuLogsCurrent = 765
cuLogsDumpToFile = 766
cuLogsDumpToMemory = 767
cuLogsRegisterCallback = 763
cuLogsUnregisterCallback = 764
cuMemAddressFree = 548
cuMemAddressReserve = 547
cuMemAdvise = 457
cuMemAdvise_v2 = 715
cuMemAlloc = 29
cuMemAllocAsync = 598
cuMemAllocAsync_ptsz = 599
cuMemAllocFromPoolAsync = 611
cuMemAllocFromPoolAsync_ptsz = 612
cuMemAllocHost = 37
cuMemAllocHost_v2 = 294
cuMemAllocManaged = 371
cuMemAllocPitch = 31
cuMemAllocPitch_v2 = 244
cuMemAlloc_v2 = 243
cuMemBatchDecompressAsync = 761
cuMemBatchDecompressAsync_ptsz = 762
cuMemCreate = 549
cuMemExportToShareableHandle = 554
cuMemFree = 33
cuMemFreeAsync = 600
cuMemFreeAsync_ptsz = 601
cuMemFreeHost = 38
cuMemFree_v2 = 245
cuMemGetAccess = 558
cuMemGetAddressRange = 35
cuMemGetAddressRange_v2 = 246
cuMemGetAllocationGranularity = 556
cuMemGetAllocationPropertiesFromHandle = 557
cuMemGetHandleForAddressRange = 674
cuMemGetInfo = 27
cuMemGetInfo_v2 = 242
cuMemHostAlloc = 39
cuMemHostAlloc_v2 = 271
cuMemHostGetDevicePointer = 40
cuMemHostGetDevicePointer_v2 = 247
cuMemHostGetFlags = 42
cuMemHostRegister = 301
cuMemHostRegister_v2 = 379
cuMemHostUnregister = 302
cuMemImportFromShareableHandle = 555
cuMemMap = 551
cuMemMapArrayAsync = 584
cuMemMapArrayAsync_ptsz = 585
cuMemPeerGetDevicePointer = 317
cuMemPeerRegister = 315
cuMemPeerUnregister = 316
cuMemPoolCreate = 607
cuMemPoolDestroy = 608
cuMemPoolExportPointer = 615
cuMemPoolExportToShareableHandle = 613
cuMemPoolGetAccess = 617
cuMemPoolGetAttribute = 604
cuMemPoolImportFromShareableHandle = 614
cuMemPoolImportPointer = 616
cuMemPoolSetAccess = 605
cuMemPoolSetAttribute = 603
cuMemPoolTrimTo = 602
cuMemPrefetchAsync = 467
cuMemPrefetchAsync_ptsz = 468
cuMemPrefetchAsync_v2 = 716
cuMemPrefetchAsync_v2_ptsz = 717
cuMemRangeGetAttribute = 471
cuMemRangeGetAttributes = 472
cuMemRelease = 550
cuMemRetainAllocationHandle = 565
cuMemSetAccess = 553
cuMemUnmap = 552
cuMemcpy = 305
cuMemcpy2D = 56
cuMemcpy2DAsync = 68
cuMemcpy2DAsync_v2 = 289
cuMemcpy2DAsync_v2_ptsz = 424
cuMemcpy2DUnaligned = 57
cuMemcpy2DUnaligned_v2 = 288
cuMemcpy2DUnaligned_v2_ptds = 406
cuMemcpy2D_v2 = 287
cuMemcpy2D_v2_ptds = 405
cuMemcpy3D = 58
cuMemcpy3DAsync = 69
cuMemcpy3DAsync_v2 = 291
cuMemcpy3DAsync_v2_ptsz = 425
cuMemcpy3DBatchAsync = 778
cuMemcpy3DBatchAsync_ptsz = 779
cuMemcpy3DPeer = 320
cuMemcpy3DPeerAsync = 321
cuMemcpy3DPeerAsync_ptsz = 427
cuMemcpy3DPeer_ptds = 410
cuMemcpy3D_v2 = 290
cuMemcpy3D_v2_ptds = 407
cuMemcpyAsync = 306
cuMemcpyAsync_ptsz = 418
cuMemcpyAtoA = 55
cuMemcpyAtoA_v2 = 286
cuMemcpyAtoA_v2_ptds = 404
cuMemcpyAtoD = 51
cuMemcpyAtoD_v2 = 284
cuMemcpyAtoD_v2_ptds = 401
cuMemcpyAtoH = 54
cuMemcpyAtoHAsync = 67
cuMemcpyAtoHAsync_v2 = 283
cuMemcpyAtoHAsync_v2_ptsz = 420
cuMemcpyAtoH_v2 = 282
cuMemcpyAtoH_v2_ptds = 403
cuMemcpyBatchAsync = 776
cuMemcpyBatchAsync_ptsz = 777
cuMemcpyDtoA = 49
cuMemcpyDtoA_v2 = 285
cuMemcpyDtoA_v2_ptds = 400
cuMemcpyDtoD = 47
cuMemcpyDtoDAsync = 64
cuMemcpyDtoDAsync_v2 = 281
cuMemcpyDtoDAsync_v2_ptsz = 423
cuMemcpyDtoD_v2 = 280
cuMemcpyDtoD_v2_ptds = 399
cuMemcpyDtoH = 45
cuMemcpyDtoHAsync = 62
cuMemcpyDtoHAsync_v2 = 279
cuMemcpyDtoHAsync_v2_ptsz = 422
cuMemcpyDtoH_v2 = 278
cuMemcpyDtoH_v2_ptds = 398
cuMemcpyHtoA = 53
cuMemcpyHtoAAsync = 66
cuMemcpyHtoAAsync_v2 = 293
cuMemcpyHtoAAsync_v2_ptsz = 419
cuMemcpyHtoA_v2 = 292
cuMemcpyHtoA_v2_ptds = 402
cuMemcpyHtoD = 43
cuMemcpyHtoDAsync = 60
cuMemcpyHtoDAsync_v2 = 277
cuMemcpyHtoDAsync_v2_ptsz = 421
cuMemcpyHtoD_v2 = 276
cuMemcpyHtoD_v2_ptds = 397
cuMemcpyPeer = 318
cuMemcpyPeerAsync = 319
cuMemcpyPeerAsync_ptsz = 426
cuMemcpyPeer_ptds = 409
cuMemcpy_ptds = 408
cuMemcpy_v2 = 248
cuMemsetD16 = 73
cuMemsetD16Async = 218
cuMemsetD16Async_ptsz = 429
cuMemsetD16_v2 = 250
cuMemsetD16_v2_ptds = 412
cuMemsetD2D16 = 79
cuMemsetD2D16Async = 224
cuMemsetD2D16Async_ptsz = 432
cuMemsetD2D16_v2 = 253
cuMemsetD2D16_v2_ptds = 415
cuMemsetD2D32 = 81
cuMemsetD2D32Async = 226
cuMemsetD2D32Async_ptsz = 433
cuMemsetD2D32_v2 = 254
cuMemsetD2D32_v2_ptds = 416
cuMemsetD2D8 = 77
cuMemsetD2D8Async = 222
cuMemsetD2D8Async_ptsz = 431
cuMemsetD2D8_v2 = 252
cuMemsetD2D8_v2_ptds = 414
cuMemsetD32 = 75
cuMemsetD32Async = 220
cuMemsetD32Async_ptsz = 430
cuMemsetD32_v2 = 251
cuMemsetD32_v2_ptds = 413
cuMemsetD8 = 71
cuMemsetD8Async = 216
cuMemsetD8Async_ptsz = 428
cuMemsetD8_v2 = 249
cuMemsetD8_v2_ptds = 411
cuMipmappedArrayCreate = 347
cuMipmappedArrayDestroy = 349
cuMipmappedArrayGetLevel = 348
cuMipmappedArrayGetMemoryRequirements = 655
cuMipmappedArrayGetSparseProperties = 583
cuModuleEnumerateFunctions = 738
cuModuleGetFunction = 23
cuModuleGetFunctionCount = 737
cuModuleGetGlobal = 24
cuModuleGetGlobal_v2 = 241
cuModuleGetLoadingMode = 673
cuModuleGetSurfRef = 190
cuModuleGetTexRef = 26
cuModuleLoad = 18
cuModuleLoadData = 19
cuModuleLoadDataEx = 20
cuModuleLoadFatBinary = 21
cuModuleUnload = 22
cuMulticastAddDevice = 707
cuMulticastBindAddr = 709
cuMulticastBindMem = 708
cuMulticastCreate = 706
cuMulticastGetGranularity = 711
cuMulticastUnbind = 710
cuNNSetAllocator = 466
cuNVNbufferGetPointer = 464
cuNVNtextureGetArray = 465
cuOccupancyAvailableDynamicSMemPerBlock = 543
cuOccupancyMaxActiveBlocksPerMultiprocessor = 374
cuOccupancyMaxActiveBlocksPerMultiprocessorWithFlags = 451
cuOccupancyMaxActiveClusters = 676
cuOccupancyMaxPotentialBlockSize = 384
cuOccupancyMaxPotentialBlockSizeWithFlags = 452
cuOccupancyMaxPotentialClusterSize = 675
cuParamSetSize = 110
cuParamSetTexRef = 114
cuParamSetf = 112
cuParamSeti = 111
cuParamSetv = 113
cuPointerGetAttribute = 310
cuPointerGetAttributes = 450
cuPointerSetAttribute = 378
cuProfilerInitialize = 311
cuProfilerStart = 308
cuProfilerStop = 309
cuSignalExternalSemaphoresAsync = 490
cuSignalExternalSemaphoresAsync_ptsz = 491
cuStreamAddCallback = 346
cuStreamAddCallback_ptsz = 437
cuStreamAttachMemAsync = 377
cuStreamAttachMemAsync_ptsz = 438
cuStreamBatchMemOp = 462
cuStreamBatchMemOp_ptsz = 463
cuStreamBatchMemOp_v2 = 667
cuStreamBatchMemOp_v2_ptsz = 668
cuStreamBeginCapture = 495
cuStreamBeginCaptureToGraph = 720
cuStreamBeginCaptureToGraph_ptsz = 721
cuStreamBeginCapture_ptsz = 496
cuStreamBeginCapture_v2 = 539
cuStreamBeginCapture_v2_ptsz = 540
cuStreamCopyAttributes = 572
cuStreamCopyAttributes_ptsz = 573
cuStreamCreate = 124
cuStreamCreateWithPriority = 367
cuStreamDestroy = 127
cuStreamDestroy_v2 = 326
cuStreamEndCapture = 497
cuStreamEndCapture_ptsz = 498
cuStreamGetAttribute = 574
cuStreamGetAttribute_ptsz = 575
cuStreamGetCaptureInfo = 536
cuStreamGetCaptureInfo_ptsz = 537
cuStreamGetCaptureInfo_v2 = 629
cuStreamGetCaptureInfo_v2_ptsz = 630
cuStreamGetCaptureInfo_v3 = 729
cuStreamGetCaptureInfo_v3_ptsz = 730
cuStreamGetCtx = 483
cuStreamGetCtx_ptsz = 484
cuStreamGetCtx_v2 = 759
cuStreamGetCtx_v2_ptsz = 760
cuStreamGetDevice = 774
cuStreamGetDevice_ptsz = 775
cuStreamGetFlags = 369
cuStreamGetFlags_ptsz = 435
cuStreamGetGreenCtx = 752
cuStreamGetId = 693
cuStreamGetId_ptsz = 694
cuStreamGetPriority = 368
cuStreamGetPriority_ptsz = 434
cuStreamIsCapturing = 499
cuStreamIsCapturing_ptsz = 500
cuStreamQuery = 125
cuStreamQuery_ptsz = 439
cuStreamSetAttribute = 576
cuStreamSetAttribute_ptsz = 577
cuStreamSetFlags = 559
cuStreamSetFlags_ptsz = 560
cuStreamSynchronize = 126
cuStreamSynchronize_ptsz = 440
cuStreamUpdateCaptureDependencies = 631
cuStreamUpdateCaptureDependencies_ptsz = 632
cuStreamUpdateCaptureDependencies_v2 = 731
cuStreamUpdateCaptureDependencies_v2_ptsz = 732
cuStreamWaitEvent = 295
cuStreamWaitEvent_ptsz = 436
cuStreamWaitValue32 = 458
cuStreamWaitValue32_ptsz = 459
cuStreamWaitValue32_v2 = 659
cuStreamWaitValue32_v2_ptsz = 660
cuStreamWaitValue64 = 473
cuStreamWaitValue64_ptsz = 474
cuStreamWaitValue64_v2 = 661
cuStreamWaitValue64_v2_ptsz = 662
cuStreamWriteValue32 = 460
cuStreamWriteValue32_ptsz = 461
cuStreamWriteValue32_v2 = 663
cuStreamWriteValue32_v2_ptsz = 664
cuStreamWriteValue64 = 475
cuStreamWriteValue64_ptsz = 476
cuStreamWriteValue64_v2 = 665
cuStreamWriteValue64_v2_ptsz = 666
cuSurfObjectCreate = 343
cuSurfObjectDestroy = 344
cuSurfObjectGetResourceDesc = 345
cuSurfRefCreate = 191
cuSurfRefDestroy = 192
cuSurfRefGetArray = 196
cuSurfRefGetFormat = 195
cuSurfRefSetArray = 194
cuSurfRefSetFormat = 193
cuTensorMapEncodeIm2col = 698
cuTensorMapEncodeIm2colWide = 781
cuTensorMapEncodeTiled = 697
cuTensorMapReplaceAddress = 699
cuTexObjectCreate = 339
cuTexObjectDestroy = 340
cuTexObjectGetResourceDesc = 341
cuTexObjectGetResourceViewDesc = 361
cuTexObjectGetTextureDesc = 342
cuTexRefCreate = 92
cuTexRefDestroy = 93
cuTexRefGetAddress = 103
cuTexRefGetAddressMode = 106
cuTexRefGetAddress_v2 = 257
cuTexRefGetArray = 105
cuTexRefGetBorderColor = 456
cuTexRefGetFilterMode = 107
cuTexRefGetFlags = 109
cuTexRefGetFormat = 108
cuTexRefGetMaxAnisotropy = 359
cuTexRefGetMipmapFilterMode = 356
cuTexRefGetMipmapLevelBias = 357
cuTexRefGetMipmapLevelClamp = 358
cuTexRefGetMipmappedArray = 355
cuTexRefSetAddress = 95
cuTexRefSetAddress2D = 97
cuTexRefSetAddress2D_v2 = 256
cuTexRefSetAddress2D_v3 = 327
cuTexRefSetAddressMode = 100
cuTexRefSetAddress_v2 = 255
cuTexRefSetArray = 94
cuTexRefSetBorderColor = 455
cuTexRefSetFilterMode = 101
cuTexRefSetFlags = 102
cuTexRefSetFormat = 99
cuTexRefSetMaxAnisotropy = 354
cuTexRefSetMipmapFilterMode = 351
cuTexRefSetMipmapLevelBias = 352
cuTexRefSetMipmapLevelClamp = 353
cuTexRefSetMipmappedArray = 350
cuThreadExchangeStreamCaptureMode = 541
cuUserObjectCreate = 633
cuUserObjectRelease = 635
cuUserObjectRetain = 634
cuVDPAUCtxCreate = 187
cuVDPAUCtxCreate_v2 = 240
cuVDPAUGetDevice = 186
cuWGLGetDevice = 177
cuWaitExternalSemaphoresAsync = 492
cuWaitExternalSemaphoresAsync_ptsz = 493
class cupti.cupti.runtime_api_trace_cbid(value)

Bases: IntEnum

See CUpti_runtime_api_trace_cbid.

FORCE_INT = 2147483647
INVALID = 0
SIZE = 487
cuda470_v12060 = 470
cuda471_v12060 = 471
cuda472_v12060 = 472
cuda473_v12060 = 473
cuda474_v12060 = 474
cuda475_v12060 = 475
cuda476_v12060 = 476
cuda477_v12060 = 477
cuda478_v12060 = 478
cuda479_v12060 = 479
cudaArrayGetInfo_v4010 = 181
cudaArrayGetMemoryRequirements_v11060 = 428
cudaArrayGetPlane_v11020 = 381
cudaArrayGetSparseProperties_v11010 = 359
cudaBindSurfaceToArray_v3020 = 61
cudaBindTexture2D_v3020 = 56
cudaBindTextureToArray_v3020 = 57
cudaBindTextureToMipmappedArray_v5000 = 195
cudaBindTexture_v3020 = 55
cudaChooseDevice_v3020 = 5
cudaConfigureCall_v3020 = 8
cudaCreateChannelDesc_v3020 = 7
cudaCreateSurfaceObject_v5000 = 189
cudaCreateTextureObject_v2_v11080 = 434
cudaCreateTextureObject_v5000 = 185
cudaCtxResetPersistingL2Cache_v11000 = 337
cudaD3D10GetDevice_v3020 = 88
cudaD3D10GetDevices_v3020 = 89
cudaD3D10GetDirect3DDevice_v3020 = 149
cudaD3D10MapResources_v3020 = 94
cudaD3D10RegisterResource_v3020 = 92
cudaD3D10ResourceGetMappedArray_v3020 = 98
cudaD3D10ResourceGetMappedPitch_v3020 = 101
cudaD3D10ResourceGetMappedPointer_v3020 = 99
cudaD3D10ResourceGetMappedSize_v3020 = 100
cudaD3D10ResourceGetSurfaceDimensions_v3020 = 97
cudaD3D10ResourceSetMapFlags_v3020 = 96
cudaD3D10SetDirect3DDevice_v3020 = 90
cudaD3D10UnmapResources_v3020 = 95
cudaD3D10UnregisterResource_v3020 = 93
cudaD3D11GetDevice_v3020 = 84
cudaD3D11GetDevices_v3020 = 85
cudaD3D11GetDirect3DDevice_v3020 = 148
cudaD3D11SetDirect3DDevice_v3020 = 86
cudaD3D9Begin_v3020 = 117
cudaD3D9End_v3020 = 118
cudaD3D9GetDevice_v3020 = 102
cudaD3D9GetDevices_v3020 = 103
cudaD3D9GetDirect3DDevice_v3020 = 105
cudaD3D9MapResources_v3020 = 109
cudaD3D9MapVertexBuffer_v3020 = 121
cudaD3D9RegisterResource_v3020 = 107
cudaD3D9RegisterVertexBuffer_v3020 = 119
cudaD3D9ResourceGetMappedArray_v3020 = 113
cudaD3D9ResourceGetMappedPitch_v3020 = 116
cudaD3D9ResourceGetMappedPointer_v3020 = 114
cudaD3D9ResourceGetMappedSize_v3020 = 115
cudaD3D9ResourceGetSurfaceDimensions_v3020 = 112
cudaD3D9ResourceSetMapFlags_v3020 = 111
cudaD3D9SetDirect3DDevice_v3020 = 104
cudaD3D9UnmapResources_v3020 = 110
cudaD3D9UnmapVertexBuffer_v3020 = 122
cudaD3D9UnregisterResource_v3020 = 108
cudaD3D9UnregisterVertexBuffer_v3020 = 120
cudaDestroyExternalMemory_v10000 = 277
cudaDestroyExternalSemaphore_v10000 = 283
cudaDestroySurfaceObject_v5000 = 190
cudaDestroyTextureObject_v5000 = 186
cudaDeviceCanAccessPeer_v4000 = 154
cudaDeviceDisablePeerAccess_v4000 = 156
cudaDeviceEnablePeerAccess_v4000 = 155
cudaDeviceFlushGPUDirectRDMAWrites_v11030 = 405
cudaDeviceGetAttribute_v5000 = 200
cudaDeviceGetByPCIBusId_v4010 = 173
cudaDeviceGetCacheConfig_v3020 = 168
cudaDeviceGetDefaultMemPool_v11020 = 372
cudaDeviceGetGraphMemAttribute_v11040 = 424
cudaDeviceGetLimit_v3020 = 166
cudaDeviceGetMemPool_v11020 = 386
cudaDeviceGetNvSciSyncAttributes_v10020 = 328
cudaDeviceGetP2PAttribute_v8000 = 255
cudaDeviceGetPCIBusId_v4010 = 174
cudaDeviceGetSharedMemConfig_v4020 = 183
cudaDeviceGetStreamPriorityRange_v5050 = 205
cudaDeviceGetTexture1DLinearMaxWidth_v11010 = 347
cudaDeviceGraphMemTrim_v11040 = 423
cudaDeviceRegisterAsyncNotification_v12040 = 465
cudaDeviceReset_v3020 = 164
cudaDeviceSetCacheConfig_v3020 = 169
cudaDeviceSetGraphMemAttribute_v11040 = 425
cudaDeviceSetLimit_v3020 = 167
cudaDeviceSetMemPool_v11020 = 385
cudaDeviceSetSharedMemConfig_v4020 = 184
cudaDeviceSynchronize_v3020 = 165
cudaDeviceUnregisterAsyncNotification_v12040 = 466
cudaDriverGetVersion_v3020 = 1
cudaEGLStreamConsumerAcquireFrame_v7000 = 259
cudaEGLStreamConsumerConnectWithFlags_v7000 = 268
cudaEGLStreamConsumerConnect_v7000 = 257
cudaEGLStreamConsumerDisconnect_v7000 = 258
cudaEGLStreamConsumerReleaseFrame_v7000 = 260
cudaEGLStreamProducerConnect_v7000 = 261
cudaEGLStreamProducerDisconnect_v7000 = 262
cudaEGLStreamProducerPresentFrame_v7000 = 263
cudaEGLStreamProducerReturnFrame_v7000 = 264
cudaEventCreateFromEGLSync_v9000 = 271
cudaEventCreateWithFlags_v3020 = 134
cudaEventCreate_v3020 = 133
cudaEventDestroy_v3020 = 136
cudaEventElapsedTime_v2_v12080 = 486
cudaEventElapsedTime_v3020 = 139
cudaEventQuery_v3020 = 138
cudaEventRecordWithFlags_ptsz_v11010 = 371
cudaEventRecordWithFlags_v11010 = 370
cudaEventRecord_ptsz_v7000 = 242
cudaEventRecord_v3020 = 135
cudaEventSynchronize_v3020 = 137
cudaExternalMemoryGetMappedBuffer_v10000 = 275
cudaExternalMemoryGetMappedMipmappedArray_v10000 = 276
cudaFreeArray_v3020 = 24
cudaFreeAsync_ptsz_v11020 = 376
cudaFreeAsync_v11020 = 375
cudaFreeHost_v3020 = 26
cudaFreeMipmappedArray_v5000 = 194
cudaFree_v3020 = 22
cudaFuncGetAttributes_v3020 = 15
cudaFuncGetName_v12030 = 451
cudaFuncGetParamInfo_v12040 = 467
cudaFuncSetAttribute_v9000 = 273
cudaFuncSetCacheConfig_v3020 = 14
cudaFuncSetSharedMemConfig_v4020 = 182
cudaGLGetDevices_v4010 = 175
cudaGLMapBufferObjectAsync_v3020 = 69
cudaGLMapBufferObject_v3020 = 65
cudaGLRegisterBufferObject_v3020 = 64
cudaGLSetBufferObjectMapFlags_v3020 = 68
cudaGLSetGLDevice_v3020 = 63
cudaGLUnmapBufferObjectAsync_v3020 = 70
cudaGLUnmapBufferObject_v3020 = 66
cudaGLUnregisterBufferObject_v3020 = 67
cudaGetChannelDesc_v3020 = 6
cudaGetDeviceCount_v3020 = 3
cudaGetDeviceFlags_v7000 = 212
cudaGetDeviceProperties_v2_v12000 = 440
cudaGetDeviceProperties_v3020 = 4
cudaGetDevice_v3020 = 17
cudaGetDriverEntryPointByVersion_ptsz_v12050 = 469
cudaGetDriverEntryPointByVersion_v12050 = 468
cudaGetDriverEntryPoint_ptsz_v11030 = 407
cudaGetDriverEntryPoint_v11030 = 406
cudaGetErrorName_v6050 = 209
cudaGetErrorString_v3020 = 12
cudaGetFuncBySymbol_v11000 = 336
cudaGetKernel_v12000 = 439
cudaGetLastError_v3020 = 10
cudaGetMipmappedArrayLevel_v5000 = 193
cudaGetSurfaceObjectResourceDesc_v5000 = 191
cudaGetSurfaceReference_v3020 = 62
cudaGetSymbolAddress_v3020 = 53
cudaGetSymbolSize_v3020 = 54
cudaGetTextureAlignmentOffset_v3020 = 59
cudaGetTextureObjectResourceDesc_v5000 = 187
cudaGetTextureObjectResourceViewDesc_v5000 = 199
cudaGetTextureObjectTextureDesc_v2_v11080 = 435
cudaGetTextureObjectTextureDesc_v5000 = 188
cudaGetTextureReference_v3020 = 60
cudaGraphAddChildGraphNode_v10000 = 298
cudaGraphAddDependencies_v10000 = 307
cudaGraphAddDependencies_v2_v12030 = 458
cudaGraphAddEmptyNode_v10000 = 300
cudaGraphAddEventRecordNode_v11010 = 362
cudaGraphAddEventWaitNode_v11010 = 365
cudaGraphAddExternalSemaphoresSignalNode_v11020 = 397
cudaGraphAddExternalSemaphoresWaitNode_v11020 = 400
cudaGraphAddHostNode_v10000 = 296
cudaGraphAddKernelNode_v10000 = 289
cudaGraphAddMemAllocNode_v11040 = 419
cudaGraphAddMemFreeNode_v11040 = 421
cudaGraphAddMemcpyNode1D_v11010 = 352
cudaGraphAddMemcpyNodeFromSymbol_v11010 = 351
cudaGraphAddMemcpyNodeToSymbol_v11010 = 350
cudaGraphAddMemcpyNode_v10000 = 290
cudaGraphAddMemsetNode_v10000 = 293
cudaGraphAddNode_v12020 = 445
cudaGraphAddNode_v2_v12030 = 460
cudaGraphChildGraphNodeGetGraph_v10000 = 299
cudaGraphClone_v10000 = 301
cudaGraphConditionalHandleCreate_v12030 = 454
cudaGraphCreate_v10000 = 286
cudaGraphDebugDotPrint_v11030 = 408
cudaGraphDestroyNode_v10000 = 309
cudaGraphDestroy_v10000 = 314
cudaGraphEventRecordNodeGetEvent_v11010 = 363
cudaGraphEventRecordNodeSetEvent_v11010 = 364
cudaGraphEventWaitNodeGetEvent_v11010 = 366
cudaGraphEventWaitNodeSetEvent_v11010 = 367
cudaGraphExecChildGraphNodeSetParams_v11010 = 361
cudaGraphExecDestroy_v10000 = 313
cudaGraphExecEventRecordNodeSetEvent_v11010 = 368
cudaGraphExecEventWaitNodeSetEvent_v11010 = 369
cudaGraphExecExternalSemaphoresSignalNodeSetParams_v11020 = 403
cudaGraphExecExternalSemaphoresWaitNodeSetParams_v11020 = 404
cudaGraphExecGetFlags_v12000 = 438
cudaGraphExecHostNodeSetParams_v10020 = 334
cudaGraphExecKernelNodeSetParams_v10010 = 326
cudaGraphExecMemcpyNodeSetParams1D_v11010 = 358
cudaGraphExecMemcpyNodeSetParamsFromSymbol_v11010 = 357
cudaGraphExecMemcpyNodeSetParamsToSymbol_v11010 = 356
cudaGraphExecMemcpyNodeSetParams_v10020 = 332
cudaGraphExecMemsetNodeSetParams_v10020 = 333
cudaGraphExecNodeSetParams_v12020 = 447
cudaGraphExecUpdate_v10020 = 335
cudaGraphExternalSemaphoresSignalNodeGetParams_v11020 = 398
cudaGraphExternalSemaphoresSignalNodeSetParams_v11020 = 399
cudaGraphExternalSemaphoresWaitNodeGetParams_v11020 = 401
cudaGraphExternalSemaphoresWaitNodeSetParams_v11020 = 402
cudaGraphGetEdges_v10000 = 323
cudaGraphGetEdges_v2_v12030 = 455
cudaGraphGetNodes_v10000 = 322
cudaGraphGetRootNodes_v10000 = 304
cudaGraphHostNodeGetParams_v10000 = 297
cudaGraphHostNodeSetParams_v10000 = 321
cudaGraphInstantiateWithFlags_v11040 = 418
cudaGraphInstantiateWithParams_ptsz_v12000 = 437
cudaGraphInstantiateWithParams_v12000 = 436
cudaGraphInstantiate_v10000 = 310
cudaGraphInstantiate_v12000 = 443
cudaGraphKernelNodeCopyAttributes_v11000 = 338
cudaGraphKernelNodeGetAttribute_v11000 = 339
cudaGraphKernelNodeGetParams_v10000 = 287
cudaGraphKernelNodeSetAttribute_v11000 = 340
cudaGraphKernelNodeSetParams_v10000 = 288
cudaGraphLaunch_ptsz_v10000 = 312
cudaGraphLaunch_v10000 = 311
cudaGraphMemAllocNodeGetParams_v11040 = 420
cudaGraphMemFreeNodeGetParams_v11040 = 422
cudaGraphMemcpyNodeGetParams_v10000 = 291
cudaGraphMemcpyNodeSetParams1D_v11010 = 355
cudaGraphMemcpyNodeSetParamsFromSymbol_v11010 = 354
cudaGraphMemcpyNodeSetParamsToSymbol_v11010 = 353
cudaGraphMemcpyNodeSetParams_v10000 = 292
cudaGraphMemsetNodeGetParams_v10000 = 294
cudaGraphMemsetNodeSetParams_v10000 = 295
cudaGraphNodeFindInClone_v10000 = 302
cudaGraphNodeGetDependencies_v10000 = 305
cudaGraphNodeGetDependencies_v2_v12030 = 456
cudaGraphNodeGetDependentNodes_v10000 = 306
cudaGraphNodeGetDependentNodes_v2_v12030 = 457
cudaGraphNodeGetEnabled_v11060 = 427
cudaGraphNodeGetType_v10000 = 303
cudaGraphNodeSetEnabled_v11060 = 426
cudaGraphNodeSetParams_v12020 = 446
cudaGraphReleaseUserObject_v11030 = 417
cudaGraphRemoveDependencies_v10000 = 308
cudaGraphRemoveDependencies_v2_v12030 = 459
cudaGraphRetainUserObject_v11030 = 416
cudaGraphUpload_ptsz_v10000 = 349
cudaGraphUpload_v10000 = 348
cudaGraphicsD3D10RegisterResource_v3020 = 91
cudaGraphicsD3D11RegisterResource_v3020 = 87
cudaGraphicsD3D9RegisterResource_v3020 = 106
cudaGraphicsEGLRegisterImage_v7000 = 256
cudaGraphicsGLRegisterBuffer_v3020 = 73
cudaGraphicsGLRegisterImage_v3020 = 72
cudaGraphicsMapResources_v3020 = 76
cudaGraphicsResourceGetMappedEglFrame_v7000 = 265
cudaGraphicsResourceGetMappedMipmappedArray_v5000 = 196
cudaGraphicsResourceGetMappedPointer_v3020 = 78
cudaGraphicsResourceSetMapFlags_v3020 = 75
cudaGraphicsSubResourceGetMappedArray_v3020 = 79
cudaGraphicsUnmapResources_v3020 = 77
cudaGraphicsUnregisterResource_v3020 = 74
cudaGraphicsVDPAURegisterOutputSurface_v3020 = 83
cudaGraphicsVDPAURegisterVideoSurface_v3020 = 82
cudaHostAlloc_v3020 = 27
cudaHostGetDevicePointer_v3020 = 28
cudaHostGetFlags_v3020 = 29
cudaHostRegister_v4000 = 152
cudaHostUnregister_v4000 = 153
cudaImportExternalMemory_v10000 = 274
cudaImportExternalSemaphore_v10000 = 278
cudaInitDevice_v12000 = 444
cudaIpcCloseMemHandle_v4010 = 180
cudaIpcGetEventHandle_v4010 = 176
cudaIpcGetMemHandle_v4010 = 178
cudaIpcOpenEventHandle_v4010 = 177
cudaIpcOpenMemHandle_v4010 = 179
cudaLaunchCooperativeKernelMultiDevice_v9000 = 272
cudaLaunchCooperativeKernel_ptsz_v9000 = 270
cudaLaunchCooperativeKernel_v9000 = 269
cudaLaunchHostFunc_ptsz_v10000 = 285
cudaLaunchHostFunc_v10000 = 284
cudaLaunchKernelExC_ptsz_v11060 = 431
cudaLaunchKernelExC_v11060 = 430
cudaLaunchKernel_ptsz_v7000 = 214
cudaLaunchKernel_v7000 = 211
cudaLaunch_ptsz_v7000 = 213
cudaLaunch_v3020 = 13
cudaMalloc3DArray_v3020 = 141
cudaMalloc3D_v3020 = 140
cudaMallocArray_v3020 = 23
cudaMallocAsync_ptsz_v11020 = 374
cudaMallocAsync_v11020 = 373
cudaMallocFromPoolAsync_ptsz_v11020 = 392
cudaMallocFromPoolAsync_v11020 = 391
cudaMallocHost_v3020 = 25
cudaMallocManaged_v6000 = 206
cudaMallocMipmappedArray_v5000 = 192
cudaMallocPitch_v3020 = 21
cudaMalloc_v3020 = 20
cudaMemAdvise_v2_v12020 = 448
cudaMemAdvise_v8000 = 254
cudaMemGetInfo_v3020 = 30
cudaMemPoolCreate_v11020 = 383
cudaMemPoolDestroy_v11020 = 384
cudaMemPoolExportPointer_v11020 = 389
cudaMemPoolExportToShareableHandle_v11020 = 387
cudaMemPoolGetAccess_v11020 = 382
cudaMemPoolGetAttribute_v11020 = 379
cudaMemPoolImportFromShareableHandle_v11020 = 388
cudaMemPoolImportPointer_v11020 = 390
cudaMemPoolSetAccess_v11020 = 380
cudaMemPoolSetAttribute_v11020 = 378
cudaMemPoolTrimTo_v11020 = 377
cudaMemPrefetchAsync_ptsz_v8000 = 253
cudaMemPrefetchAsync_v2_ptsz_v12020 = 450
cudaMemPrefetchAsync_v2_v12020 = 449
cudaMemPrefetchAsync_v8000 = 252
cudaMemRangeGetAttribute_v8000 = 266
cudaMemRangeGetAttributes_v8000 = 267
cudaMemcpy2DArrayToArray_ptds_v7000 = 222
cudaMemcpy2DArrayToArray_v3020 = 38
cudaMemcpy2DAsync_ptsz_v7000 = 228
cudaMemcpy2DAsync_v3020 = 44
cudaMemcpy2DFromArrayAsync_ptsz_v7000 = 230
cudaMemcpy2DFromArrayAsync_v3020 = 46
cudaMemcpy2DFromArray_ptds_v7000 = 220
cudaMemcpy2DFromArray_v3020 = 36
cudaMemcpy2DToArrayAsync_ptsz_v7000 = 229
cudaMemcpy2DToArrayAsync_v3020 = 45
cudaMemcpy2DToArray_ptds_v7000 = 218
cudaMemcpy2DToArray_v3020 = 34
cudaMemcpy2D_ptds_v7000 = 216
cudaMemcpy2D_v3020 = 32
cudaMemcpy3DAsync_ptsz_v7000 = 246
cudaMemcpy3DAsync_v3020 = 145
cudaMemcpy3DBatchAsync_ptsz_v12080 = 485
cudaMemcpy3DBatchAsync_v12080 = 484
cudaMemcpy3DPeerAsync_ptsz_v7000 = 250
cudaMemcpy3DPeerAsync_v4000 = 163
cudaMemcpy3DPeer_ptds_v7000 = 249
cudaMemcpy3DPeer_v4000 = 162
cudaMemcpy3D_ptds_v7000 = 245
cudaMemcpy3D_v3020 = 144
cudaMemcpyArrayToArray_ptds_v7000 = 221
cudaMemcpyArrayToArray_v3020 = 37
cudaMemcpyAsync_ptsz_v7000 = 225
cudaMemcpyAsync_v3020 = 41
cudaMemcpyBatchAsync_ptsz_v12080 = 483
cudaMemcpyBatchAsync_v12080 = 482
cudaMemcpyFromArrayAsync_ptsz_v7000 = 227
cudaMemcpyFromArrayAsync_v3020 = 43
cudaMemcpyFromArray_ptds_v7000 = 219
cudaMemcpyFromArray_v3020 = 35
cudaMemcpyFromSymbolAsync_ptsz_v7000 = 232
cudaMemcpyFromSymbolAsync_v3020 = 48
cudaMemcpyFromSymbol_ptds_v7000 = 224
cudaMemcpyFromSymbol_v3020 = 40
cudaMemcpyPeerAsync_v4000 = 161
cudaMemcpyPeer_v4000 = 160
cudaMemcpyToArrayAsync_ptsz_v7000 = 226
cudaMemcpyToArrayAsync_v3020 = 42
cudaMemcpyToArray_ptds_v7000 = 217
cudaMemcpyToArray_v3020 = 33
cudaMemcpyToSymbolAsync_ptsz_v7000 = 231
cudaMemcpyToSymbolAsync_v3020 = 47
cudaMemcpyToSymbol_ptds_v7000 = 223
cudaMemcpyToSymbol_v3020 = 39
cudaMemcpy_ptds_v7000 = 215
cudaMemcpy_v3020 = 31
cudaMemset2DAsync_ptsz_v7000 = 236
cudaMemset2DAsync_v3020 = 52
cudaMemset2D_ptds_v7000 = 234
cudaMemset2D_v3020 = 50
cudaMemset3DAsync_ptsz_v7000 = 244
cudaMemset3DAsync_v3020 = 143
cudaMemset3D_ptds_v7000 = 243
cudaMemset3D_v3020 = 142
cudaMemsetAsync_ptsz_v7000 = 235
cudaMemsetAsync_v3020 = 51
cudaMemset_ptds_v7000 = 233
cudaMemset_v3020 = 49
cudaMipmappedArrayGetMemoryRequirements_v11060 = 429
cudaMipmappedArrayGetSparseProperties_v11010 = 360
cudaOccupancyAvailableDynamicSMemPerBlock_v10200 = 329
cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags_v7000 = 251
cudaOccupancyMaxActiveBlocksPerMultiprocessor_v6000 = 207
cudaOccupancyMaxActiveBlocksPerMultiprocessor_v6050 = 210
cudaOccupancyMaxActiveClusters_v11070 = 433
cudaOccupancyMaxPotentialClusterSize_v11070 = 432
cudaPeekAtLastError_v3020 = 11
cudaPeerGetDevicePointer_v4000 = 159
cudaPeerRegister_v4000 = 157
cudaPeerUnregister_v4000 = 158
cudaPointerGetAttributes_v4000 = 151
cudaProfilerInitialize_v4000 = 170
cudaProfilerStart_v4000 = 171
cudaProfilerStop_v4000 = 172
cudaRuntimeGetVersion_v3020 = 2
cudaSetDeviceFlags_v3020 = 19
cudaSetDevice_v3020 = 16
cudaSetDoubleForDevice_v3020 = 124
cudaSetDoubleForHost_v3020 = 125
cudaSetValidDevices_v3020 = 18
cudaSetupArgument_v3020 = 9
cudaSignalExternalSemaphoresAsync_ptsz_v10000 = 280
cudaSignalExternalSemaphoresAsync_v10000 = 279
cudaSignalExternalSemaphoresAsync_v2_ptsz_v11020 = 394
cudaSignalExternalSemaphoresAsync_v2_v11020 = 393
cudaStreamAddCallback_ptsz_v7000 = 248
cudaStreamAddCallback_v5000 = 197
cudaStreamAttachMemAsync_ptsz_v7000 = 241
cudaStreamAttachMemAsync_v6000 = 208
cudaStreamBeginCaptureToGraph_ptsz_v12030 = 453
cudaStreamBeginCaptureToGraph_v12030 = 452
cudaStreamBeginCapture_ptsz_v10000 = 316
cudaStreamBeginCapture_v10000 = 315
cudaStreamCopyAttributes_ptsz_v11000 = 342
cudaStreamCopyAttributes_v11000 = 341
cudaStreamCreateWithFlags_v5000 = 198
cudaStreamCreateWithPriority_v5050 = 202
cudaStreamCreate_v3020 = 129
cudaStreamDestroy_v3020 = 130
cudaStreamDestroy_v5050 = 201
cudaStreamEndCapture_ptsz_v10000 = 320
cudaStreamEndCapture_v10000 = 319
cudaStreamGetAttribute_ptsz_v11000 = 344
cudaStreamGetAttribute_v11000 = 343
cudaStreamGetCaptureInfo_ptsz_v10010 = 325
cudaStreamGetCaptureInfo_v10010 = 324
cudaStreamGetCaptureInfo_v2_ptsz_v11030 = 410
cudaStreamGetCaptureInfo_v2_v11030 = 409
cudaStreamGetCaptureInfo_v3_ptsz_v12030 = 462
cudaStreamGetCaptureInfo_v3_v12030 = 461
cudaStreamGetDevice_ptsz_v12080 = 481
cudaStreamGetDevice_v12080 = 480
cudaStreamGetFlags_ptsz_v7000 = 238
cudaStreamGetFlags_v5050 = 204
cudaStreamGetId_ptsz_v12000 = 442
cudaStreamGetId_v12000 = 441
cudaStreamGetPriority_ptsz_v7000 = 237
cudaStreamGetPriority_v5050 = 203
cudaStreamIsCapturing_ptsz_v10000 = 318
cudaStreamIsCapturing_v10000 = 317
cudaStreamQuery_ptsz_v7000 = 240
cudaStreamQuery_v3020 = 132
cudaStreamSetAttribute_ptsz_v11000 = 346
cudaStreamSetAttribute_v11000 = 345
cudaStreamSetFlags_ptsz_v10200 = 331
cudaStreamSetFlags_v10200 = 330
cudaStreamSynchronize_ptsz_v7000 = 239
cudaStreamSynchronize_v3020 = 131
cudaStreamUpdateCaptureDependencies_ptsz_v11030 = 412
cudaStreamUpdateCaptureDependencies_v11030 = 411
cudaStreamUpdateCaptureDependencies_v2_ptsz_v12030 = 464
cudaStreamUpdateCaptureDependencies_v2_v12030 = 463
cudaStreamWaitEvent_ptsz_v7000 = 247
cudaStreamWaitEvent_v3020 = 147
cudaThreadExchangeStreamCaptureMode_v10010 = 327
cudaThreadExit_v3020 = 123
cudaThreadGetCacheConfig_v3020 = 150
cudaThreadGetLimit_v3020 = 127
cudaThreadSetCacheConfig_v3020 = 146
cudaThreadSetLimit_v3020 = 128
cudaThreadSynchronize_v3020 = 126
cudaUnbindTexture_v3020 = 58
cudaUserObjectCreate_v11030 = 413
cudaUserObjectRelease_v11030 = 415
cudaUserObjectRetain_v11030 = 414
cudaVDPAUGetDevice_v3020 = 80
cudaVDPAUSetVDPAUDevice_v3020 = 81
cudaWGLGetDevice_v3020 = 71
cudaWaitExternalSemaphoresAsync_ptsz_v10000 = 282
cudaWaitExternalSemaphoresAsync_v10000 = 281
cudaWaitExternalSemaphoresAsync_v2_ptsz_v11020 = 396
cudaWaitExternalSemaphoresAsync_v2_v11020 = 395
cupti.cupti.activity_configure_unified_memory_counter(intptr_t config, uint32_t count)

Set Unified Memory Counter configuration.

Args:

config (intptr_t): A pointer to CUpti_ActivityUnifiedMemoryCounterConfig structures containing Unified Memory counter configuration. count (uint32_t): Number of Unified Memory counter configuration structures.

See also

cuptiActivityConfigureUnifiedMemoryCounter

cupti.cupti.activity_disable(int kind)

Disable collection of a specific kind of activity record.

Args:

kind (CUpti_ActivityKind): The kind of activity record to stop collecting.

See also

cuptiActivityDisable

cupti.cupti.activity_disable_context(intptr_t context, int kind)

Disable collection of a specific kind of activity record for a context.

Args:

context (intptr_t): The context for which activity is to be disabled. kind (CUpti_ActivityKind): The kind of activity record to stop collecting.

See also

cuptiActivityDisableContext

cupti.cupti.activity_enable(int kind)

Enable collection of a specific kind of activity record.

Args:

kind (CUpti_ActivityKind): The kind of activity record to collect.

See also

cuptiActivityEnable

cupti.cupti.activity_enable_allocation_source(uint8_t enable)

Enables tracking the source library for memory allocation requests.

Args:

enable (uint8_t): is a boolean, denoting whether the source library of the memory allocation request needs to be tracked.

See also

cuptiActivityEnableAllocationSource

cupti.cupti.activity_enable_and_dump(int kind)

Enable collection of a specific kind of activity record. For certain activity kinds it dumps existing records.

Args:

kind (CUpti_ActivityKind): The kind of activity record to collect.

See also

cuptiActivityEnableAndDump

cupti.cupti.activity_enable_context(intptr_t context, int kind)

Enable collection of a specific kind of activity record for a context.

Args:

context (intptr_t): The context for which activity is to be enabled. kind (CUpti_ActivityKind): The kind of activity record to collect.

See also

cuptiActivityEnableContext

cupti.cupti.activity_enable_device_graph(uint8_t enable)

Controls the collection of records for device launched graphs.

Args:

enable (uint8_t): is a boolean, denoting whether these records should be collected.

See also

cuptiActivityEnableDeviceGraph

cupti.cupti.activity_enable_driver_api(uint32_t cbid, uint8_t enable)

Controls the collection of activity records for specific CUDA Driver APIs.

Args:

cbid (uint32_t): callback id of the CUDA Driver API. This can be found in the header cupti_driver_cbid.h. enable (uint8_t): is a boolean, denoting whether to enable or disable the collection.

See also

cuptiActivityEnableDriverApi

cupti.cupti.activity_enable_hw_trace(uint8_t enable)

Enables the collection of CUDA kernel timestamps through HW events.

Args:

enable (uint8_t): is a boolean, denoting whether to enable or disable the collection through HW events.

See also

cuptiActivityEnableHWTrace

cupti.cupti.activity_enable_latency_timestamps(uint8_t enable)

Controls the collection of queued and submitted timestamps for kernels.

Args:

enable (uint8_t): is a boolean, denoting whether these timestamps should be collected.

See also

cuptiActivityEnableLatencyTimestamps

cupti.cupti.activity_enable_launch_attributes(uint8_t enable)

Controls the collection of launch attributes for kernels.

Args:

enable (uint8_t): is a boolean denoting whether these launch attributes should be collected.

See also

cuptiActivityEnableLaunchAttributes

cupti.cupti.activity_enable_runtime_api(uint32_t cbid, uint8_t enable)

Controls the collection of activity records for specific CUDA Runtime APIs.

Args:

cbid (uint32_t): callback id of the CUDA Runtime API. This can be found in the header cupti_runtime_cbid.h. enable (uint8_t): is a boolean, denoting whether to enable or disable the collection.

See also

cuptiActivityEnableRuntimeApi

cupti.cupti.activity_flush_all(uint32_t flag)

Request to deliver activity records via the buffer completion callback.

Args:

flag (uint32_t): The flag can be set to indicate a forced flush. See CUpti_ActivityFlag.

See also

cuptiActivityFlushAll

cupti.cupti.activity_flush_period(uint32_t time)

Sets the flush period for the worker thread.

Args:

time (uint32_t): flush period in milliseconds (ms).

See also

cuptiActivityFlushPeriod

cupti.cupti.activity_get_attribute(int attr, intptr_t value_size, intptr_t value)

Read an activity API attribute.

Args:

attr (CUpti_ActivityAttribute): The attribute to read. value_size (intptr_t): Size of buffer pointed by the value, and returns the number of bytes written to value. value (intptr_t): Returns the value of the attribute.

See also

cuptiActivityGetAttribute

cupti.cupti.activity_get_num_dropped_records(intptr_t context, uint32_t stream_id, intptr_t dropped)

Get the number of activity records that were dropped of insufficient buffer space.

Args:

context (intptr_t): The context, or NULL to get dropped count from global queue. stream_id (uint32_t): The stream ID. dropped (intptr_t): The number of records that were dropped since the last call to this function.

See also

cuptiActivityGetNumDroppedRecords

cupti.cupti.activity_pop_external_correlation_id(int kind) uint64_t

Pop an external correlation id for the calling thread.

Args:

kind (CUpti_ExternalCorrelationKind): The kind of external API activities should be correlated with.

Returns:

uint64_t: If the function returns successful, contains the last external correlation id for this kind, can be NULL.

See also

cuptiActivityPopExternalCorrelationId

cupti.cupti.activity_push_external_correlation_id(int kind, uint64_t id)

Push an external correlation id for the calling thread.

Args:

kind (CUpti_ExternalCorrelationKind): The kind of external API activities should be correlated with. id (uint64_t): External correlation id.

See also

cuptiActivityPushExternalCorrelationId

cupti.cupti.activity_register_callbacks(func_buffer_requested, func_buffer_completed)

Registers callback functions with CUPTI for activity buffer handling.

Args:

func_buffer_requested (function): callback which is invoked when an empty buffer is requested by CUPTI. func_buffer_completed (function): callback which is invoked when a buffer containing activity records is available from CUPTI.

See also

cuptiActivityRegisterCallbacks

cupti.cupti.activity_register_timestamp_callback(func_timestamp)

Registers callback function with CUPTI for providing timestamp.

Args:

func_timestamp (function): callback which is invoked when a timestamp is needed by CUPTI.

See also

cuptiActivityRegisterTimestampCallback

cupti.cupti.activity_set_attribute(int attr, intptr_t value_size, intptr_t value)

Write an activity API attribute.

Args:

attr (CUpti_ActivityAttribute): The attribute to write. value_size (intptr_t): The size, in bytes, of the value. value (intptr_t): The attribute value to write.

See also

cuptiActivitySetAttribute

cupti.cupti.compute_capability_supported(int major, int minor) int

Check support for a compute capability.

Args:

major (int): The major revision number of the compute capability. minor (int): The minor revision number of the compute capability.

Returns:

int: Pointer to an integer to return the support status.

See also

cuptiComputeCapabilitySupported

cupti.cupti.device_supported(int dev) int

Check support for a compute device.

Args:

dev (int): The device handle returned by CUDA Driver API cuDeviceGet.

Returns:

int: Pointer to an integer to return the support status.

See also

cuptiDeviceSupported

cupti.cupti.device_virtualization_mode(int dev) int

Query the virtualization mode of the device.

Args:

dev (int): The device handle returned by CUDA Driver API cuDeviceGet.

Returns:

int: Pointer to an CUpti_DeviceVirtualizationMode to return the virtualization mode.

See also

cuptiDeviceVirtualizationMode

cupti.cupti.enable_all_domains(uint32_t enable, intptr_t subscriber)

Enable or disable all callbacks in all domains.

Args:

enable (uint32_t): New enable state for all callbacks in all domain. Zero disables all callbacks, non-zero enables all callbacks. subscriber (intptr_t): Handle to callback subscription.

See also

cuptiEnableAllDomains

cupti.cupti.enable_callback(uint32_t enable, intptr_t subscriber, int domain, uint32_t cbid)

Enable or disabled callbacks for a specific domain and callback ID.

Args:

enable (uint32_t): New enable state for the callback. Zero disables the callback, non-zero enables the callback. subscriber (intptr_t): Handle to callback subscription. domain (CUpti_CallbackDomain): The domain of the callback. cbid (uint32_t): The ID of the callback.

See also

cuptiEnableCallback

cupti.cupti.enable_domain(uint32_t enable, intptr_t subscriber, int domain)

Enable or disabled all callbacks for a specific domain.

Args:

enable (uint32_t): New enable state for all callbacks in the domain. Zero disables all callbacks, non-zero enables all callbacks. subscriber (intptr_t): Handle to callback subscription. domain (CUpti_CallbackDomain): The domain of the callback.

See also

cuptiEnableDomain

cupti.cupti.finalize()

Detach CUPTI from the running process.

See also

cuptiFinalize

cupti.cupti.get_auto_boost_state(intptr_t context, intptr_t state)

Get auto boost state.

Args:

context (intptr_t): A valid CUcontext. state (intptr_t): A pointer to CUpti_ActivityAutoBoostState structure which contains the current state and the id of the process that has requested the current state.

See also

cuptiGetAutoBoostState

cupti.cupti.get_callback_name(int domain, uint32_t cbid)

Get the name of a callback for a specific domain and callback ID.

Args:

domain (CUpti_CallbackDomain): The domain of the callback. cbid (uint32_t): The ID of the callback.

Returns:

name (str): Returns name of the callback for the specified domain and callback ID

See also

cuptiGetCallbackName

cupti.cupti.get_callback_state(intptr_t subscriber, int domain, uint32_t cbid) uint32_t

Get the current enabled/disabled state of a callback for a specific domain and function ID.

Args:

subscriber (intptr_t): Handle to the initialize subscriber. domain (CUpti_CallbackDomain): The domain of the callback. cbid (uint32_t): The ID of the callback.

Returns:

uint32_t: Returns non-zero if callback enabled, zero if not enabled.

See also

cuptiGetCallbackState

cupti.cupti.get_context_id(intptr_t context) uint32_t

Get the ID of a context.

Args:

context (intptr_t): The context.

Returns:

uint32_t: Returns a process-unique ID for the context.

See also

cuptiGetContextId

cupti.cupti.get_device_id(intptr_t context) uint32_t

Get the ID of a device.

Args:

context (intptr_t): The context, or NULL to indicate the current context.

Returns:

uint32_t: Returns the ID of the device that is current for the calling thread.

See also

cuptiGetDeviceId

cupti.cupti.get_graph_exec_id(intptr_t graph_exec) uint32_t

Get the unique ID of executable graph.

Args:

graph_exec (intptr_t): The executable graph.

Returns:

uint32_t: Returns the unique ID of the executable graph.

See also

cuptiGetGraphExecId

cupti.cupti.get_graph_id(intptr_t graph) uint32_t

Get the unique ID of graph.

Args:

graph (intptr_t): The graph.

Returns:

uint32_t: Returns the unique ID of the graph.

See also

cuptiGetGraphId

cupti.cupti.get_graph_node_id(intptr_t node) uint64_t

Get the unique ID of a graph node.

Args:

node (intptr_t): The graph node.

Returns:

uint64_t: Returns the unique ID of the node.

See also

cuptiGetGraphNodeId

cupti.cupti.get_last_error()

Returns the last error from a cupti call or callback.

See also

cuptiGetLastError

cupti.cupti.get_stream_id_ex(intptr_t context, intptr_t stream, uint8_t per_thread_stream) uint32_t

Get the ID of a stream.

Args:

context (intptr_t): If non-NULL then the stream is checked to ensure that it belongs to this context. Typically this parameter should be null. stream (intptr_t): The stream. per_thread_stream (uint8_t): Flag to indicate if program is compiled for per-thread streams.

Returns:

uint32_t: Returns a context-unique ID for the stream.

See also

cuptiGetStreamIdEx

cupti.cupti.get_thread_id_type() int

Get the thread-id type.

Returns:

int: .

See also

cuptiGetThreadIdType

cupti.cupti.get_timestamp() uint64_t

Get the CUPTI timestamp.

Returns:

uint64_t: Returns the CUPTI timestamp.

See also

cuptiGetTimestamp

cupti.cupti.set_thread_id_type(int type)

Set the thread-id type.

Args:

type (CUpti_ActivityThreadIdType): .

See also

cuptiSetThreadIdType

cupti.cupti.subscribe(callback, userdata) intptr_t

Initialize a callback subscriber with a callback function and user data.

Args:

callback (CUpti_CallbackFunc): The callback function. userdata (intptr_t): A pointer to user data. This data will be passed to the callback function via the userdata parameter.

Returns:

intptr_t: Returns handle to initialize subscriber.

See also

cuptiSubscribe

cupti.cupti.supported_domains()

Get the available callback domains.

Returns:

list[cupti.cupti.CallbackDomain]: List of all available callback domains

See also

cuptiSupportedDomains

cupti.cupti.unsubscribe(intptr_t subscriber)

Unregister a callback subscriber.

Args:

subscriber (intptr_t): Handle to the initialize subscriber.

See also

cuptiUnsubscribe