2. CUPTI Python API Reference¶
2.1. Documentation Issues¶
The CUPTI Python API Reference section of the document is automatically generated and has some issues:
All the CUPTI Python enumerations, functions and classes are listed together in a single section.
The members of the python classes are not adequately documented. To get more information for a member, please refer the CUPTI C documentation.
The
kindmember of python classes has typeint, instead ofcupti.cupti.ActivityKind. While using thekindmember, please usecupti.cupti.ActivityKindto get the enum value.The “See also” section for each CUPTI Python API function shows the corresponding CUPTI C API name. But it does not provide a link to the CUPTI C documentation.
The “See <data type name>” references in the generated documentation point to the original C data structures instead of their Python equivalents.
2.2. API Reference¶
- exception cupti.cupti.cuptiError(status: int)¶
Bases:
Exception
- class cupti.cupti.ActivityAPI(size=1)¶
Bases:
object- cbid¶
cbid : int
- correlation_id¶
correlation_id : int
- end¶
end : int
- kind¶
kind : int
- process_id¶
process_id : int
- return_value¶
return_value : int
- start¶
start : int
- thread_id¶
thread_id : int
- class cupti.cupti.ActivityAttribute(value)¶
Bases:
IntEnumSee CUpti_ActivityAttribute.
- DEVICE_BUFFER_FORCE_INT = 2147483647¶
- DEVICE_BUFFER_POOL_LIMIT = 2¶
- DEVICE_BUFFER_PRE_ALLOCATE_VALUE = 6¶
- DEVICE_BUFFER_SIZE = 0¶
- DEVICE_BUFFER_SIZE_CDP = 1¶
- MEM_ALLOCATION_TYPE_HOST_PINNED = 8¶
- PER_THREAD_ACTIVITY_BUFFER = 9¶
- PROFILING_SEMAPHORE_POOL_LIMIT = 4¶
- PROFILING_SEMAPHORE_POOL_SIZE = 3¶
- PROFILING_SEMAPHORE_PRE_ALLOCATE_VALUE = 7¶
- ZEROED_OUT_ACTIVITY_BUFFER = 5¶
- class cupti.cupti.ActivityAutoBoostState(size=1)¶
Bases:
object- enabled¶
enabled : int
- pid¶
pid : int
- class cupti.cupti.ActivityCdpKernel(size=1)¶
Bases:
object- block_x¶
block_x : int
- block_y¶
block_y : int
- block_z¶
block_z : int
- cache_config¶
cache_config : cupti.cupti.CacheConfig
- completed¶
completed : int
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
dynamic_shared_memory : int
- end¶
end : int
- grid_id¶
grid_id : int
- grid_x¶
grid_x : int
- grid_y¶
grid_y : int
- grid_z¶
grid_z : int
- kind¶
kind : int
- local_memory_per_thread¶
local_memory_per_thread : int
- local_memory_total¶
local_memory_total : int
- name¶
name : str
- pad¶
pad : int
- parent_block_x¶
parent_block_x : int
- parent_block_y¶
parent_block_y : int
- parent_block_z¶
parent_block_z : int
- parent_grid_id¶
parent_grid_id : int
- queued¶
queued : int
- registers_per_thread¶
registers_per_thread : int
shared_memory_config : int
- start¶
start : int
static_shared_memory : int
- stream_id¶
stream_id : int
- submitted¶
submitted : int
- class cupti.cupti.ActivityComputeApiKind(value)¶
Bases:
IntEnumSee CUpti_ActivityComputeApiKind.
- CUDA = 1¶
- CUDA_MPS = 2¶
- FORCE_INT = 2147483647¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityContext3(size=1)¶
Bases:
object- cig_mode¶
cig_mode : int
- compute_api_kind¶
compute_api_kind : int
- context_id¶
context_id : int
- device_id¶
device_id : int
- is_green_context¶
is_green_context : int
- kind¶
kind : int
- null_stream_id¶
null_stream_id : int
- num_multiprocessors¶
num_multiprocessors : int
- padding¶
padding : int
- padding2¶
padding2 : int
- parent_context_id¶
parent_context_id : int
- class cupti.cupti.ActivityCudaEvent2(size=1)¶
Bases:
object- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- cuda_event_sync_id¶
cuda_event_sync_id : int
- device_id¶
device_id : int
- device_timestamp¶
device_timestamp : int
- event_id¶
event_id : int
- kind¶
kind : int
- pad¶
pad : int
- pad2¶
pad2 : int
- reserved0¶
reserved0 : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityDevice5(size=1)¶
Bases:
object- compute_capability_major¶
compute_capability_major (~_numpy.uint32):
- compute_capability_minor¶
compute_capability_minor (~_numpy.uint32):
- compute_instance_id¶
compute_instance_id (~_numpy.uint32):
- constant_memory_size¶
constant_memory_size (~_numpy.uint32):
- core_clock_rate¶
core_clock_rate (~_numpy.uint32):
- ecc_enabled¶
ecc_enabled (~_numpy.uint32):
- global_memory_bandwidth¶
global_memory_bandwidth (~_numpy.uint64):
- global_memory_size¶
global_memory_size (~_numpy.uint64):
- gpu_instance_id¶
gpu_instance_id (~_numpy.uint32):
- id¶
id (~_numpy.uint32):
- is_cuda_visible¶
is_cuda_visible (~_numpy.uint8):
- is_mig_enabled¶
is_mig_enabled (~_numpy.uint8):
- is_numa_node¶
is_numa_node (~_numpy.uint32):
- kind¶
kind (~_numpy.int32):
- l2cache_size¶
l2cache_size (~_numpy.uint32):
- max_block_dim_x¶
max_block_dim_x (~_numpy.uint32):
- max_block_dim_y¶
max_block_dim_y (~_numpy.uint32):
- max_block_dim_z¶
max_block_dim_z (~_numpy.uint32):
- max_blocks_per_multiprocessor¶
max_blocks_per_multiprocessor (~_numpy.uint32):
- max_grid_dim_x¶
max_grid_dim_x (~_numpy.uint32):
- max_grid_dim_y¶
max_grid_dim_y (~_numpy.uint32):
- max_grid_dim_z¶
max_grid_dim_z (~_numpy.uint32):
- max_ipc¶
max_ipc (~_numpy.uint32):
- max_registers_per_block¶
max_registers_per_block (~_numpy.uint32):
- max_registers_per_multiprocessor¶
max_registers_per_multiprocessor (~_numpy.uint32):
max_shared_memory_per_block (~_numpy.uint32):
max_shared_memory_per_multiprocessor (~_numpy.uint32):
- max_threads_per_block¶
max_threads_per_block (~_numpy.uint32):
- max_warps_per_multiprocessor¶
max_warps_per_multiprocessor (~_numpy.uint32):
- mig_uuid¶
- name¶
name (~_numpy.intp):
- num_memcpy_engines¶
num_memcpy_engines (~_numpy.uint32):
- num_multiprocessors¶
num_multiprocessors (~_numpy.uint32):
- num_threads_per_warp¶
num_threads_per_warp (~_numpy.uint32):
- numa_id¶
numa_id (~_numpy.uint32):
- uuid¶
- class cupti.cupti.ActivityDeviceAttribute(size=1)¶
Bases:
object- attribute¶
attribute : cupti.cupti.Attribute
- device_id¶
device_id : int
- kind¶
kind : int
- value¶
value : cupti.cupti.Value
- class cupti.cupti.ActivityDeviceGraphTrace(size=1)¶
Bases:
object- context_id¶
context_id : int
- device_id¶
device_id : int
- device_launch_mode¶
device_launch_mode : int
- end¶
end : int
- graph_id¶
graph_id : int
- kind¶
kind : int
- launcher_graph_id¶
launcher_graph_id : int
- start¶
start : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityEnvironment(size=1)¶
Bases:
object- data¶
data : cupti.cupti.Data
- device_id¶
device_id : int
- environment_kind¶
environment_kind : int
- kind¶
kind : int
- timestamp¶
timestamp : int
- class cupti.cupti.ActivityEnvironmentKind(value)¶
Bases:
IntEnumSee CUpti_ActivityEnvironmentKind.
- COOLING = 4¶
- COUNT = 5¶
- KIND_FORCE_INT = 2147483647¶
- POWER = 3¶
- SPEED = 1¶
- TEMPERATURE = 2¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityExternalCorrelation(size=1)¶
Bases:
object- correlation_id¶
correlation_id : int
- external_id¶
external_id : int
- external_kind¶
external_kind : int
- kind¶
kind : int
- class cupti.cupti.ActivityFlag(value)¶
Bases:
IntEnumSee CUpti_ActivityFlag.
- DEVICE_ATTRIBUTE_CUDEVICE = 1¶
- DEVICE_CONCURRENT_KERNELS = 1¶
- FLUSH_FORCED = 1¶
- FORCE_INT = 2147483647¶
- GLOBAL_ACCESS_KIND_CACHED = 512¶
- GLOBAL_ACCESS_KIND_LOAD = 256¶
- GLOBAL_ACCESS_KIND_SIZE_MASK = 255¶
- INSTRUCTION_CLASS_MASK = 510¶
- INSTRUCTION_VALUE_INVALID = 1¶
- MARKER_COLOR_ARGB = 2¶
- MARKER_COLOR_NONE = 1¶
- MARKER_INSTANTANEOUS = 1¶
- MARKER_START = 2¶
- MARKER_SYNC_ACQUIRE = 8¶
- MARKER_SYNC_ACQUIRE_FAILED = 32¶
- MARKER_SYNC_ACQUIRE_SUCCESS = 16¶
- MARKER_SYNC_RELEASE = 64¶
- MEMCPY_ASYNC = 1¶
- MEMSET_ASYNC = 1¶
- METRIC_OVERFLOWED = 1¶
- METRIC_VALUE_INVALID = 2¶
- NONE = 0¶
- SHARED_ACCESS_KIND_LOAD = 256¶
- SHARED_ACCESS_KIND_SIZE_MASK = 255¶
- THRASHING_IN_CPU = 1¶
- THROTTLING_IN_CPU = 1¶
- class cupti.cupti.ActivityFunction(size=1)¶
Bases:
object- context_id¶
context_id : int
- function_ind_ex¶
function_ind_ex : int
- id¶
id : int
- kind¶
kind : int
- module_id¶
module_id : int
- name¶
name : str
- pad¶
pad : int
- class cupti.cupti.ActivityGraphTrace2(size=1)¶
Bases:
object- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- end¶
end : int
- end_context_id¶
end_context_id : int
- end_device_id¶
end_device_id : int
- graph_id¶
graph_id : int
- kind¶
kind : int
- start¶
start : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityInstructionClass(value)¶
Bases:
IntEnumSee CUpti_ActivityInstructionClass.
- BARRIER = 17¶
- BIT_CONVERSION = 4¶
- CONSTANT = 11¶
- CONTROL_FLOW = 5¶
- FP_16 = 19¶
- FP_32 = 1¶
- FP_64 = 2¶
- GENERIC = 9¶
- GLOBAL = 6¶
- GLOBAL_ATOMIC = 13¶
- INTEGER = 3¶
- INTER_THREAD_COMMUNICATION = 16¶
- KIND_FORCE_INT = 2147483647¶
- LOCAL = 8¶
- MISCELLANEOUS = 18¶
- SHARED = 7¶
- SHARED_ATOMIC = 14¶
- SURFACE = 10¶
- SURFACE_ATOMIC = 15¶
- TEXTURE = 12¶
- UNIFORM = 20¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityJit2(size=1)¶
Bases:
object- cache_path¶
cache_path : str
- cache_size¶
cache_size : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- end¶
end : int
- jit_entry_type¶
jit_entry_type : int
- jit_operation_correlation_id¶
jit_operation_correlation_id : int
- jit_operation_type¶
jit_operation_type : int
- kind¶
kind : int
- padding¶
padding : int
- process_id¶
process_id : int
- start¶
start : int
- thread_id¶
thread_id : int
- class cupti.cupti.ActivityJitEntryType(value)¶
Bases:
IntEnumSee CUpti_ActivityJitEntryType.
- INVALID = 0¶
- NVVM_IR_TO_PTX = 2¶
- PTX_TO_CUBIN = 1¶
- TYPE_FORCE_INT = 2147483647¶
- class cupti.cupti.ActivityJitOperationType(value)¶
Bases:
IntEnumSee CUpti_ActivityJitOperationType.
- CACHE_LOAD = 1¶
- CACHE_STORE = 2¶
- COMPILE = 3¶
- INVALID = 0¶
- TYPE_FORCE_INT = 2147483647¶
- class cupti.cupti.ActivityKernel9(size=1)¶
Bases:
object- block_x¶
block_x : int
- block_y¶
block_y : int
- block_z¶
block_z : int
- cache_config¶
cache_config : cupti.cupti.CacheConfig
- channel_id¶
channel_id : int
- channel_type¶
channel_type : int
- cluster_scheduling_policy¶
cluster_scheduling_policy : int
- cluster_x¶
cluster_x : int
- cluster_y¶
cluster_y : int
- cluster_z¶
cluster_z : int
- completed¶
completed : int
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
dynamic_shared_memory : int
- end¶
end : int
- graph_id¶
graph_id : int
- graph_node_id¶
graph_node_id : int
- grid_id¶
grid_id : int
- grid_x¶
grid_x : int
- grid_y¶
grid_y : int
- grid_z¶
grid_z : int
is_shared_memory_carveout_requested : int
- kind¶
kind : int
- launch_type¶
launch_type : int
- local_memory_per_thread¶
local_memory_per_thread : int
- local_memory_total¶
local_memory_total : int
- local_memory_total_v2¶
local_memory_total_v2 : int
- max_active_clusters¶
max_active_clusters : int
- max_potential_cluster_size¶
max_potential_cluster_size : int
- name¶
name : str
- p_access_policy_window¶
p_access_policy_window : int
- padding¶
padding : int
- partitioned_global_cache_executed¶
partitioned_global_cache_executed : int
- partitioned_global_cache_requested¶
partitioned_global_cache_requested : int
- queued¶
queued : int
- registers_per_thread¶
registers_per_thread : int
- reserved0¶
reserved0 : int
shared_memory_carveout_requested : int
shared_memory_config : int
shared_memory_executed : int
- shmem_limit_config¶
shmem_limit_config : int
- start¶
start : int
static_shared_memory : int
- stream_id¶
stream_id : int
- submitted¶
submitted : int
- class cupti.cupti.ActivityKind(value)¶
Bases:
IntEnumSee CUpti_ActivityKind.
- BRANCH = 16¶
- CDP_KERNEL = 18¶
- CONCURRENT_KERNEL = 10¶
- CONFIDENTIAL_COMPUTE_ROTATION = 53¶
- CONTEXT = 9¶
- COUNT = 56¶
- CUDA_EVENT = 36¶
- DEVICE = 8¶
- DEVICE_ATTRIBUTE = 28¶
- DEVICE_GRAPH_TRACE = 54¶
- DRIVER = 4¶
- ENVIRONMENT = 20¶
- EVENT = 6¶
- EVENT_INSTANCE = 21¶
- EXTERNAL_CORRELATION = 39¶
- FORCE_INT = 2147483647¶
- FUNCTION = 26¶
- GLOBAL_ACCESS = 15¶
- GRAPH_TRACE = 51¶
- INSTANTANEOUS_EVENT = 41¶
- INSTANTANEOUS_EVENT_INSTANCE = 42¶
- INSTANTANEOUS_METRIC = 43¶
- INSTANTANEOUS_METRIC_INSTANCE = 44¶
- INSTRUCTION_CORRELATION = 32¶
- INSTRUCTION_EXECUTION = 24¶
- INTERNAL_LAUNCH_API = 48¶
- INVALID = 0¶
- JIT = 52¶
- KERNEL = 3¶
- MARKER = 12¶
- MARKER_DATA = 13¶
- MEMCPY = 1¶
- MEMCPY2 = 22¶
- MEMORY = 45¶
- MEMORY2 = 49¶
- MEMORY_POOL = 50¶
- MEMSET = 2¶
- MEM_DECOMPRESS = 55¶
- METRIC = 7¶
- METRIC_INSTANCE = 23¶
- MODULE = 27¶
- NAME = 11¶
- NVLINK = 40¶
- OPENACC_DATA = 33¶
- OPENACC_LAUNCH = 34¶
- OPENACC_OTHER = 35¶
- OPENMP = 47¶
- OVERHEAD = 17¶
- PCIE = 46¶
- PC_SAMPLING = 30¶
- PC_SAMPLING_RECORD_INFO = 31¶
- PREEMPTION = 19¶
- RUNTIME = 5¶
- SHARED_ACCESS = 29¶
- SOURCE_LOCATOR = 14¶
- STREAM = 37¶
- SYNCHRONIZATION = 38¶
- UNIFIED_MEMORY_COUNTER = 25¶
- class cupti.cupti.ActivityLaunchType(value)¶
Bases:
IntEnumSee CUpti_ActivityLaunchType.
- CBL_COMMANDLIST = 3¶
- COOPERATIVE_MULTI_DEVICE = 2¶
- COOPERATIVE_SINGLE_DEVICE = 1¶
- REGULAR = 0¶
- class cupti.cupti.ActivityMarker2(size=1)¶
Bases:
object- domain¶
domain : str
- id¶
id : int
- kind¶
kind : int
- name¶
name : str
- object_id¶
object_id : cupti.cupti.ActivityObjectKindId
- object_kind¶
object_kind : int
- pad¶
pad : int
- timestamp¶
timestamp : int
- class cupti.cupti.ActivityMarkerData(size=1)¶
Bases:
object- category¶
category : int
- color¶
color : int
- id¶
id : int
- kind¶
kind : int
- payload¶
payload : cupti.cupti.MetricValue
- payload_kind¶
payload_kind : int
- class cupti.cupti.ActivityMemDecompress(size=1)¶
Bases:
object- channel_id¶
channel_id : int
- channel_type¶
channel_type : int
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- end¶
end : int
- kind¶
kind : int
- number_of_operations¶
number_of_operations : int
- reserved0¶
reserved0 : int
- source_bytes¶
source_bytes : int
- start¶
start : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityMemcpy6(size=1)¶
Bases:
object- bytes¶
bytes : int
- channel_id¶
channel_id : int
- channel_type¶
channel_type : int
- context_id¶
context_id : int
- copy_count¶
copy_count : int
- copy_kind¶
copy_kind : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- dst_kind¶
dst_kind : int
- end¶
end : int
- graph_id¶
graph_id : int
- graph_node_id¶
graph_node_id : int
- kind¶
kind : int
- pad¶
pad : int
- pad2¶
pad2 : int
- reserved0¶
reserved0 : int
- runtime_correlation_id¶
runtime_correlation_id : int
- src_kind¶
src_kind : int
- start¶
start : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityMemcpyKind(value)¶
Bases:
IntEnumSee CUpti_ActivityMemcpyKind.
- ATOA = 5¶
- ATOD = 6¶
- ATOH = 4¶
- DTOA = 7¶
- DTOD = 8¶
- DTOH = 2¶
- FORCE_INT = 2147483647¶
- HTOA = 3¶
- HTOD = 1¶
- HTOH = 9¶
- PTOP = 10¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityMemcpyPtoP4(size=1)¶
Bases:
object- bytes¶
bytes : int
- channel_id¶
channel_id : int
- channel_type¶
channel_type : int
- context_id¶
context_id : int
- copy_kind¶
copy_kind : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- dst_context_id¶
dst_context_id : int
- dst_device_id¶
dst_device_id : int
- dst_kind¶
dst_kind : int
- end¶
end : int
- graph_id¶
graph_id : int
- graph_node_id¶
graph_node_id : int
- kind¶
kind : int
- reserved0¶
reserved0 : int
- src_context_id¶
src_context_id : int
- src_device_id¶
src_device_id : int
- src_kind¶
src_kind : int
- start¶
start : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityMemory(size=1)¶
Bases:
object- address¶
address : int
- alloc_pc¶
alloc_pc : int
- bytes¶
bytes : int
- context_id¶
context_id : int
- device_id¶
device_id : int
- end¶
end : int
- free_pc¶
free_pc : int
- kind¶
kind : int
- memory_kind¶
memory_kind : int
- name¶
name : str
- pad¶
pad : int
- process_id¶
process_id : int
- start¶
start : int
- class cupti.cupti.ActivityMemory4(size=1)¶
Bases:
object- address¶
address : int
- bytes¶
bytes : int
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- is_async¶
is_async : int
- kind¶
kind : int
- memory_kind¶
memory_kind : int
- memory_operation_type¶
memory_operation_type : int
- memory_pool_config¶
memory_pool_config : cupti.cupti.MemoryPoolConfig
- name¶
name : str
- pad1¶
pad1 : int
- pc¶
pc : int
- process_id¶
process_id : int
- source¶
source : str
- stream_id¶
stream_id : int
- timestamp¶
timestamp : int
- class cupti.cupti.ActivityMemoryKind(value)¶
Bases:
IntEnumSee CUpti_ActivityMemoryKind.
- ARRAY = 4¶
- DEVICE = 3¶
- DEVICE_STATIC = 6¶
- FORCE_INT = 2147483647¶
- MANAGED = 5¶
- MANAGED_STATIC = 7¶
- PAGEABLE = 1¶
- PINNED = 2¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityMemoryOperationType(value)¶
Bases:
IntEnumSee CUpti_ActivityMemoryOperationType.
- ALLOCATION = 1¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- RELEASE = 2¶
- class cupti.cupti.ActivityMemoryPool2(size=1)¶
Bases:
object- address¶
address : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- kind¶
kind : int
- memory_pool_operation_type¶
memory_pool_operation_type : int
- memory_pool_type¶
memory_pool_type : int
- min_bytes_to_keep¶
min_bytes_to_keep : int
- process_id¶
process_id : int
- release_threshold¶
release_threshold : int
- timestamp¶
timestamp : int
- utilized_size¶
utilized_size : int
- class cupti.cupti.ActivityMemoryPoolOperationType(value)¶
Bases:
IntEnumSee CUpti_ActivityMemoryPoolOperationType.
- CREATED = 1¶
- DESTROYED = 2¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- TRIMMED = 3¶
- class cupti.cupti.ActivityMemoryPoolType(value)¶
Bases:
IntEnumSee CUpti_ActivityMemoryPoolType.
- FORCE_INT = 2147483647¶
- IMPORTED = 2¶
- INVALID = 0¶
- LOCAL = 1¶
- class cupti.cupti.ActivityMemset4(size=1)¶
Bases:
object- bytes¶
bytes : int
- channel_id¶
channel_id : int
- channel_type¶
channel_type : int
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- end¶
end : int
- graph_id¶
graph_id : int
- graph_node_id¶
graph_node_id : int
- kind¶
kind : int
- memory_kind¶
memory_kind : int
- pad¶
pad : int
- pad2¶
pad2 : int
- reserved0¶
reserved0 : int
- start¶
start : int
- stream_id¶
stream_id : int
- value¶
value : int
- class cupti.cupti.ActivityModule(size=1)¶
Bases:
object- context_id¶
context_id : int
- cubin¶
cubin : int
- cubin_size¶
cubin_size : int
- id¶
id : int
- kind¶
kind : int
- class cupti.cupti.ActivityName(size=1)¶
Bases:
object- kind¶
kind : int
- name¶
name : str
- object_id¶
object_id : cupti.cupti.ActivityObjectKindId
- object_kind¶
object_kind : int
- pad¶
pad : int
- class cupti.cupti.ActivityObjectKind(value)¶
Bases:
IntEnumSee CUpti_ActivityObjectKind.
- CONTEXT = 4¶
- DEVICE = 3¶
- FORCE_INT = 2147483647¶
- PROCESS = 1¶
- STREAM = 5¶
- THREAD = 2¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityObjectKindId¶
Bases:
object- contextId¶
contextId: ‘uint32_t’
- deviceId¶
deviceId: ‘uint32_t’
- processId¶
processId: ‘uint32_t’
- streamId¶
streamId: ‘uint32_t’
- threadId¶
threadId: ‘uint32_t’
- class cupti.cupti.ActivityOpenAccData(size=1)¶
Bases:
object- async¶
async : int
- async_map¶
async_map : int
- bytes¶
bytes : int
- cu_context_id¶
cu_context_id : int
- cu_device_id¶
cu_device_id : int
- cu_process_id¶
cu_process_id : int
- cu_stream_id¶
cu_stream_id : int
- cu_thread_id¶
cu_thread_id : int
- device_number¶
device_number : int
- device_ptr¶
device_ptr : int
- device_type¶
device_type : int
- end¶
end : int
- end_line_no¶
end_line_no : int
- event_kind¶
event_kind : int
- external_id¶
external_id : int
- func_end_line_no¶
func_end_line_no : int
- func_line_no¶
func_line_no : int
- func_name¶
func_name : str
- host_ptr¶
host_ptr : int
- implicit¶
implicit : int
- kind¶
kind : int
- line_no¶
line_no : int
- parent_construct¶
parent_construct : int
- src_file¶
src_file : str
- start¶
start : int
- thread_id¶
thread_id : int
- var_name¶
var_name : str
- version¶
version : int
- class cupti.cupti.ActivityOpenAccLaunch(size=1)¶
Bases:
object- async¶
async : int
- async_map¶
async_map : int
- cu_context_id¶
cu_context_id : int
- cu_device_id¶
cu_device_id : int
- cu_process_id¶
cu_process_id : int
- cu_stream_id¶
cu_stream_id : int
- cu_thread_id¶
cu_thread_id : int
- device_number¶
device_number : int
- device_type¶
device_type : int
- end¶
end : int
- end_line_no¶
end_line_no : int
- event_kind¶
event_kind : int
- external_id¶
external_id : int
- func_end_line_no¶
func_end_line_no : int
- func_line_no¶
func_line_no : int
- func_name¶
func_name : str
- implicit¶
implicit : int
- kernel_name¶
kernel_name : str
- kind¶
kind : int
- line_no¶
line_no : int
- num_gangs¶
num_gangs : int
- num_workers¶
num_workers : int
- parent_construct¶
parent_construct : int
- src_file¶
src_file : str
- start¶
start : int
- thread_id¶
thread_id : int
- vector_length¶
vector_length : int
- version¶
version : int
- class cupti.cupti.ActivityOpenAccOther(size=1)¶
Bases:
object- async¶
async : int
- async_map¶
async_map : int
- cu_context_id¶
cu_context_id : int
- cu_device_id¶
cu_device_id : int
- cu_process_id¶
cu_process_id : int
- cu_stream_id¶
cu_stream_id : int
- cu_thread_id¶
cu_thread_id : int
- device_number¶
device_number : int
- device_type¶
device_type : int
- end¶
end : int
- end_line_no¶
end_line_no : int
- event_kind¶
event_kind : int
- external_id¶
external_id : int
- func_end_line_no¶
func_end_line_no : int
- func_line_no¶
func_line_no : int
- func_name¶
func_name : str
- implicit¶
implicit : int
- kind¶
kind : int
- line_no¶
line_no : int
- parent_construct¶
parent_construct : int
- src_file¶
src_file : str
- start¶
start : int
- thread_id¶
thread_id : int
- version¶
version : int
- class cupti.cupti.ActivityOpenMp(size=1)¶
Bases:
object- cu_process_id¶
cu_process_id : int
- cu_thread_id¶
cu_thread_id : int
- end¶
end : int
- event_kind¶
event_kind : int
- kind¶
kind : int
- start¶
start : int
- thread_id¶
thread_id : int
- version¶
version : int
- class cupti.cupti.ActivityOverhead3(size=1)¶
Bases:
object- correlation_id¶
correlation_id : int
- end¶
end : int
- kind¶
kind : int
- object_id¶
object_id: cupti.cupti.ActivityObjectKindId
- object_kind¶
object_kind : int
- overhead_data¶
overhead_data : int
- overhead_kind¶
overhead_kind : int
- reserved0¶
reserved0 : int
- start¶
start : int
- class cupti.cupti.ActivityOverheadKind(value)¶
Bases:
IntEnumSee CUpti_ActivityOverheadKind.
- ACTIVITY_BUFFER_REQUEST = 458752¶
- COMMAND_BUFFER_FULL = 393216¶
- CUPTI_BUFFER_FLUSH = 65536¶
- CUPTI_INSTRUMENTATION = 131072¶
- CUPTI_RESOURCE = 196608¶
- DRIVER_COMPILER = 1¶
- FORCE_INT = 2147483647¶
- LAZY_FUNCTION_LOADING = 327680¶
- RUNTIME_TRIGGERED_MODULE_LOADING = 262144¶
- UNKNOWN = 0¶
- UVM_ACTIVITY_INIT = 524288¶
- class cupti.cupti.ActivityPCSamplingPeriod(value)¶
Bases:
IntEnumSee CUpti_ActivityPCSamplingPeriod.
- FORCE_INT = 2147483647¶
- HIGH = 4¶
- INVALID = 0¶
- LOW = 2¶
- MAX = 5¶
- MID = 3¶
- MIN = 1¶
- class cupti.cupti.ActivityPCSamplingStallReason(value)¶
Bases:
IntEnumSee CUpti_ActivityPCSamplingStallReason.
- CONSTANT_MEMORY_DEPENDENCY = 7¶
- EXEC_DEPENDENCY = 3¶
- FORCE_INT = 2147483647¶
- INST_FETCH = 2¶
- INVALID = 0¶
- MEMORY_DEPENDENCY = 4¶
- MEMORY_THROTTLE = 9¶
- NONE = 1¶
- NOT_SELECTED = 10¶
- OTHER = 11¶
- PIPE_BUSY = 8¶
- SLEEPING = 12¶
- SYNC = 6¶
- TEXTURE = 5¶
- class cupti.cupti.ActivityPartitionedGlobalCacheConfig(value)¶
Bases:
IntEnumSee CUpti_ActivityPartitionedGlobalCacheConfig.
- FORCE_INT = 2147483647¶
- NOT_SUPPORTED = 1¶
- OFF = 2¶
- ON = 3¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityPreemption(size=1)¶
Bases:
object- block_x¶
block_x : int
- block_y¶
block_y : int
- block_z¶
block_z : int
- grid_id¶
grid_id : int
- kind¶
kind : int
- pad¶
pad : int
- preemption_kind¶
preemption_kind : int
- timestamp¶
timestamp : int
- class cupti.cupti.ActivityPreemptionKind(value)¶
Bases:
IntEnumSee CUpti_ActivityPreemptionKind.
- FORCE_INT = 2147483647¶
- RESTORE = 2¶
- SAVE = 1¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityStream(size=1)¶
Bases:
object- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- flag¶
flag : int
- kind¶
kind : int
- priority¶
priority : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityStreamFlag(value)¶
Bases:
IntEnumSee CUpti_ActivityStreamFlag.
- FLAG_DEFAULT = 1¶
- FLAG_FORCE_INT = 2147483647¶
- FLAG_NON_BLOCKING = 2¶
- FLAG_NULL = 3¶
- FLAG_UNKNOWN = 0¶
- MASK = 65535¶
- class cupti.cupti.ActivitySynchronization2(size=1)¶
Bases:
object- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- cuda_event_id¶
cuda_event_id : int
- cuda_event_sync_id¶
cuda_event_sync_id : int
- end¶
end : int
- kind¶
kind : int
- pad¶
pad : int
- return_value¶
return_value : int
- start¶
start : int
- stream_id¶
stream_id : int
- type¶
type : int
- class cupti.cupti.ActivitySynchronizationType(value)¶
Bases:
IntEnumSee CUpti_ActivitySynchronizationType.
- CONTEXT_SYNCHRONIZE = 4¶
- EVENT_SYNCHRONIZE = 1¶
- FORCE_INT = 2147483647¶
- STREAM_SYNCHRONIZE = 3¶
- STREAM_WAIT_EVENT = 2¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityThreadIdType(value)¶
Bases:
IntEnumSee CUpti_ActivityThreadIdType.
- DEFAULT = 0¶
- FORCE_INT = 2147483647¶
- SIZE = 2¶
- SYSTEM = 1¶
- class cupti.cupti.ActivityUnifiedMemoryAccessType(value)¶
Bases:
IntEnumSee CUpti_ActivityUnifiedMemoryAccessType.
- ATOMIC = 3¶
- PREFETCH = 4¶
- READ = 1¶
- UNKNOWN = 0¶
- WRITE = 2¶
- class cupti.cupti.ActivityUnifiedMemoryCounter3(size=1)¶
Bases:
object- address¶
address : int
- counter_kind¶
counter_kind : int
- dst_id¶
dst_id : int
- end¶
end : int
- kind¶
kind : int
- pad¶
pad : int
- process_id¶
process_id : int
- processors¶
processors: list[int]
- src_id¶
src_id : int
- start¶
start : int
- stream_id¶
stream_id : int
- value¶
value : int
- class cupti.cupti.ActivityUnifiedMemoryCounterConfig(size=1)¶
Bases:
object- device_id¶
device_id : int
- enable¶
enable : int
- kind¶
kind : int
- scope¶
scope : int
- class cupti.cupti.ActivityUnifiedMemoryCounterKind(value)¶
Bases:
IntEnumSee CUpti_ActivityUnifiedMemoryCounterKind.
- BYTES_TRANSFER_DTOD = 8¶
- BYTES_TRANSFER_DTOH = 2¶
- BYTES_TRANSFER_HTOD = 1¶
- COUNT = 9¶
- CPU_PAGE_FAULT_COUNT = 3¶
- FORCE_INT = 2147483647¶
- GPU_PAGE_FAULT = 4¶
- REMOTE_MAP = 7¶
- THRASHING = 5¶
- THROTTLING = 6¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityUnifiedMemoryCounterScope(value)¶
Bases:
IntEnumSee CUpti_ActivityUnifiedMemoryCounterScope.
- COUNT = 3¶
- FORCE_INT = 2147483647¶
- PROCESS_ALL_DEVICES = 2¶
- PROCESS_SINGLE_DEVICE = 1¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityUnifiedMemoryMigrationCause(value)¶
Bases:
IntEnumSee CUpti_ActivityUnifiedMemoryMigrationCause.
- ACCESS_COUNTERS = 5¶
- COHERENCE = 2¶
- EVICTION = 4¶
- PREFETCH = 3¶
- UNKNOWN = 0¶
- USER = 1¶
- class cupti.cupti.ActivityUnifiedMemoryRemoteMapCause(value)¶
Bases:
IntEnumSee CUpti_ActivityUnifiedMemoryRemoteMapCause.
- COHERENCE = 1¶
- EVICTION = 5¶
- OUT_OF_MEMORY = 4¶
- POLICY = 3¶
- THRASHING = 2¶
- UNKNOWN = 0¶
- class cupti.cupti.ApiCallbackSite(value)¶
Bases:
IntEnumSee CUpti_ApiCallbackSite.
- API_CBSITE_FORCE_INT = 2147483647¶
- API_ENTER = 0¶
- API_EXIT = 1¶
- class cupti.cupti.CacheConfig¶
Bases:
object- both¶
both: ‘uint8_t’
- executed¶
executed: ‘uint8_t’
- requested¶
requested: ‘uint8_t’
- class cupti.cupti.CallbackData(size=1)¶
Bases:
object- callback_site¶
callback_site : int
- cbid¶
- context¶
context : int
- context_uid¶
context_uid : int
- correlation_data¶
correlation_data : int
- correlation_id¶
correlation_id : int
- domain¶
- function_name¶
function_name : str
- function_params¶
function_params : int
- function_params_py_obj¶
- function_return_value¶
function_return_value : int
- symbol_name¶
symbol_name : str
- class cupti.cupti.CallbackDomain(value)¶
Bases:
IntEnumSee CUpti_CallbackDomain.
- DRIVER_API = 1¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- NVTX = 5¶
- RESOURCE = 3¶
- RUNTIME_API = 2¶
- SIZE = 7¶
- STATE = 6¶
- SYNCHRONIZE = 4¶
- class cupti.cupti.CallbackIdResource(value)¶
Bases:
IntEnumSee CUpti_CallbackIdResource.
- CONTEXT_CREATED = 1¶
- CONTEXT_DESTROY_STARTING = 2¶
- CU_INIT_FINISHED = 5¶
- FORCE_INT = 2147483647¶
- GRAPHEXEC_CREATED = 18¶
- GRAPHEXEC_CREATE_STARTING = 17¶
- GRAPHEXEC_DESTROY_STARTING = 19¶
- GRAPHNODE_CLONED = 20¶
- GRAPHNODE_CREATED = 13¶
- GRAPHNODE_CREATE_STARTING = 12¶
- GRAPHNODE_DEPENDENCY_CREATED = 15¶
- GRAPHNODE_DEPENDENCY_DESTROY_STARTING = 16¶
- GRAPHNODE_DESTROY_STARTING = 14¶
- GRAPH_CLONED = 11¶
- GRAPH_CREATED = 9¶
- GRAPH_DESTROY_STARTING = 10¶
- INVALID = 0¶
- MODULE_LOADED = 6¶
- MODULE_PROFILED = 8¶
- MODULE_UNLOAD_STARTING = 7¶
- SIZE = 22¶
- STREAM_ATTRIBUTE_CHANGED = 21¶
- STREAM_CREATED = 3¶
- STREAM_DESTROY_STARTING = 4¶
- class cupti.cupti.CallbackIdState(value)¶
Bases:
IntEnumSee CUpti_CallbackIdState.
- ERROR = 2¶
- FATAL_ERROR = 1¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- SIZE = 4¶
- WARNING = 3¶
- class cupti.cupti.CallbackIdSync(value)¶
Bases:
IntEnumSee CUpti_CallbackIdSync.
- CONTEXT_SYNCHRONIZED = 2¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- SIZE = 3¶
- STREAM_SYNCHRONIZED = 1¶
- class cupti.cupti.ChannelType(value)¶
Bases:
IntEnumSee CUpti_ChannelType.
- ASYNC_MEMCPY = 2¶
- COMPUTE = 1¶
- DECOMP = 3¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- class cupti.cupti.ContextCigMode(value)¶
Bases:
IntEnumSee CUpti_ContextCigMode.
- CIG = 1¶
- CIG_FALLBACK = 2¶
- FORCE_INT = 2147483647¶
- NONE = 0¶
- class cupti.cupti.Data¶
Bases:
object- clocksThrottleReasons¶
clocksThrottleReasons: ‘int32_t’
- fanSpeed¶
fanSpeed: ‘uint32_t’
- gpuTemperature¶
gpuTemperature: ‘uint32_t’
- memoryClock¶
memoryClock: ‘uint32_t’
- pcieLinkGen¶
pcieLinkGen: ‘uint32_t’
- pcieLinkWidth¶
pcieLinkWidth: ‘uint32_t’
- power¶
power: ‘uint32_t’
- powerLimit¶
powerLimit: ‘uint32_t’
- smClock¶
smClock: ‘uint32_t’
- class cupti.cupti.DevType(value)¶
Bases:
IntEnumSee CUpti_DevType.
- FORCE_INT = 2147483647¶
- GPU = 1¶
- INVALID = 0¶
- NPU = 2¶
- class cupti.cupti.DeviceAttribute(value)¶
Bases:
IntEnumSee CUpti_DeviceAttribute.
- DEVICE_CLASS = 10¶
- FLOP_DP_PER_CYCLE = 12¶
- FLOP_HP_PER_CYCLE = 17¶
- FLOP_SP_PER_CYCLE = 11¶
- FORCE_INT = 2147483647¶
- GLOBAL_MEMORY_BANDWIDTH = 3¶
- GPU_CPU_NVLINK_BW = 19¶
- INSTRUCTION_PER_CYCLE = 4¶
- INSTRUCTION_THROUGHPUT_SINGLE_PRECISION = 5¶
- MAX_EVENT_DOMAIN_ID = 2¶
- MAX_EVENT_ID = 1¶
- MAX_FRAME_BUFFERS = 6¶
- MAX_L2_UNITS = 13¶
- MAX_SHARED_MEMORY_CACHE_CONFIG_PREFER_EQUAL = 16¶
- MAX_SHARED_MEMORY_CACHE_CONFIG_PREFER_L1 = 15¶
- MAX_SHARED_MEMORY_CACHE_CONFIG_PREFER_SHARED = 14¶
- NVLINK_PRESENT = 18¶
- NVSWITCH_PRESENT = 20¶
- PCIE_GEN = 9¶
- PCIE_LINK_RATE = 7¶
- PCIE_LINK_WIDTH = 8¶
- class cupti.cupti.DeviceVirtualizationMode(value)¶
Bases:
IntEnumSee CUpti_DeviceVirtualizationMode.
- FORCE_INT = 2147483647¶
- NONE = 0¶
- PASS_THROUGH = 1¶
- VIRTUAL_GPU = 2¶
- class cupti.cupti.EnvironmentClocksThrottleReason(value)¶
Bases:
IntEnumSee CUpti_EnvironmentClocksThrottleReason.
- FORCE_INT = 2147483647¶
- GPU_IDLE = 1¶
- HW_SLOWDOWN = 8¶
- NONE = 0¶
- SW_POWER_CAP = 4¶
- UNKNOWN = 2147483648¶
- UNSUPPORTED = 1073741824¶
- USER_DEFINED_CLOCKS = 2¶
- class cupti.cupti.ExternalCorrelationKind(value)¶
Bases:
IntEnumSee CUpti_ExternalCorrelationKind.
- CUSTOM0 = 3¶
- CUSTOM1 = 4¶
- CUSTOM2 = 5¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- OPENACC = 2¶
- SIZE = 6¶
- UNKNOWN = 1¶
- class cupti.cupti.FuncShmemLimitConfig(value)¶
Bases:
IntEnumSee CUpti_FuncShmemLimitConfig.
- DEFAULT = 0¶
- FORCE_INT = 2147483647¶
- OPTIN = 1¶
- class cupti.cupti.GraphData(size=1)¶
Bases:
object- dependency¶
dependency : int
- graph¶
graph : int
- graph_exec¶
graph_exec : int
- node¶
node : int
- node_type¶
node_type : int
- original_graph¶
original_graph : int
- original_node¶
original_node : int
- class cupti.cupti.MemoryPoolConfig¶
Bases:
object- address¶
address: ‘uint64_t’
- memory_pool_type¶
memory_pool_type: ‘int32_t’
- process_id¶
process_id: ‘uint64_t’
- release_threshold¶
release_threshold: ‘uint64_t’
- size¶
size: ‘uint64_t’
- utilized_size¶
utilized_size: ‘uint64_t’
- class cupti.cupti.MetricValue¶
Bases:
object- metricValueDouble¶
metricValueDouble: ‘double’
- metricValueInt64¶
metricValueInt64: ‘int64_t’
- metricValuePercent¶
metricValuePercent: ‘double’
- metricValueThroughput¶
metricValueThroughput: ‘uint64_t’
- metricValueUint64¶
metricValueUint64: ‘uint64_t’
- metricValueUtilizationLevel¶
metricValueUtilizationLevel: ‘int32_t’
- class cupti.cupti.MetricValueKind(value)¶
Bases:
IntEnumSee CUpti_MetricValueKind.
- DOUBLE = 0¶
- FORCE_INT = 2147483647¶
- INT64 = 4¶
- PERCENT = 2¶
- THROUGHPUT = 3¶
- UINT64 = 1¶
- UTILIZATION_LEVEL = 5¶
- class cupti.cupti.MetricValueUtilizationLevel(value)¶
Bases:
IntEnumSee CUpti_MetricValueUtilizationLevel.
- FORCE_INT = 2147483647¶
- HIGH = 8¶
- IDLE = 0¶
- LOW = 2¶
- MAX = 10¶
- MID = 5¶
- class cupti.cupti.ModuleResourceData(size=1)¶
Bases:
object- cubin_size¶
cubin_size : int
- module_id¶
module_id : int
- p_cubin¶
p_cubin : str
- class cupti.cupti.OpenAccConstructKind(value)¶
Bases:
IntEnumSee CUpti_OpenAccConstructKind.
- ATOMIC = 8¶
- DATA = 4¶
- DECLARE = 9¶
- ENTER_DATA = 5¶
- EXIT_DATA = 6¶
- FORCE_INT = 2147483647¶
- HOST_DATA = 7¶
- INIT = 10¶
- KERNELS = 2¶
- LOOP = 3¶
- PARALLEL = 1¶
- ROUTINE = 14¶
- RUNTIME_API = 16¶
- SET = 12¶
- SHUTDOWN = 11¶
- UNKNOWN = 0¶
- UPDATE = 13¶
- WAIT = 15¶
- class cupti.cupti.OpenAccEventKind(value)¶
Bases:
IntEnumSee CUpti_OpenAccEventKind.
- ALLOC = 15¶
- COMPUTE_CONSTRUCT = 9¶
- CREATE = 13¶
- DELETE = 14¶
- DEVICE_INIT = 1¶
- DEVICE_SHUTDOWN = 2¶
- ENQUEUE_DOWNLOAD = 6¶
- ENQUEUE_LAUNCH = 4¶
- ENQUEUE_UPLOAD = 5¶
- ENTER_DATA = 11¶
- EXIT_DATA = 12¶
- FORCE_INT = 2147483647¶
- FREE = 16¶
- IMPLICIT_WAIT = 8¶
- INVALID = 0¶
- RUNTIME_SHUTDOWN = 3¶
- UPDATE = 10¶
- WAIT = 7¶
- class cupti.cupti.OpenMpEventKind(value)¶
Bases:
IntEnumSee CUpti_OpenMpEventKind.
- FORCE_INT = 2147483647¶
- IDLE = 4¶
- INVALID = 0¶
- PARALLEL = 1¶
- TASK = 2¶
- THREAD = 3¶
- WAIT_BARRIER = 5¶
- WAIT_TASKWAIT = 6¶
- class cupti.cupti.PcieDeviceType(value)¶
Bases:
IntEnumSee CUpti_PcieDeviceType.
- BRIDGE = 1¶
- FORCE_INT = 2147483647¶
- GPU = 0¶
- class cupti.cupti.ResourceData(size=1)¶
Bases:
object- context¶
context : int
- resource_descriptor¶
resource_descriptor : int
- class cupti.cupti.Result(value)¶
Bases:
IntEnumSee CUptiResult.
- ERROR_API_NOT_IMPLEMENTED = 11¶
- ERROR_CDP_TRACING_NOT_SUPPORTED = 32¶
- ERROR_CMP_DEVICE_NOT_SUPPORTED = 42¶
- ERROR_CONFIDENTIAL_COMPUTING_NOT_SUPPORTED = 41¶
- ERROR_CUDA_COMPILER_NOT_COMPATIBLE = 34¶
- ERROR_DISABLED = 23¶
- ERROR_FORCE_INT = 2147483647¶
- ERROR_HARDWARE = 9¶
- ERROR_HARDWARE_BUSY = 26¶
- ERROR_INSUFFICIENT_PRIVILEGES = 35¶
- ERROR_INVALID_CONTEXT = 3¶
- ERROR_INVALID_DEVICE = 2¶
- ERROR_INVALID_EVENT_DOMAIN_ID = 4¶
- ERROR_INVALID_EVENT_ID = 5¶
- ERROR_INVALID_EVENT_NAME = 6¶
- ERROR_INVALID_EVENT_VALUE = 22¶
- ERROR_INVALID_HANDLE = 19¶
- ERROR_INVALID_KIND = 21¶
- ERROR_INVALID_METRIC_ID = 16¶
- ERROR_INVALID_METRIC_NAME = 17¶
- ERROR_INVALID_METRIC_VALUE = 25¶
- ERROR_INVALID_MODULE = 24¶
- ERROR_INVALID_OPERATION = 7¶
- ERROR_INVALID_PARAMETER = 1¶
- ERROR_INVALID_STREAM = 20¶
- ERROR_LEGACY_PROFILER_NOT_SUPPORTED = 38¶
- ERROR_MAX_LIMIT_REACHED = 12¶
- ERROR_MIG_DEVICE_NOT_SUPPORTED = 43¶
- ERROR_MULTIPLE_SUBSCRIBERS_NOT_SUPPORTED = 39¶
- ERROR_NOT_COMPATIBLE = 14¶
- ERROR_NOT_INITIALIZED = 15¶
- ERROR_NOT_READY = 13¶
- ERROR_NOT_SUPPORTED = 27¶
- ERROR_OLD_PROFILER_API_INITIALIZED = 36¶
- ERROR_OPENACC_UNDEFINED_ROUTINE = 37¶
- ERROR_OUT_OF_MEMORY = 8¶
- ERROR_PARAMETER_SIZE_NOT_SUFFICIENT = 10¶
- ERROR_QUEUE_EMPTY = 18¶
- ERROR_SLI_DEVICE_NOT_SUPPORTED = 44¶
- ERROR_UM_PROFILING_NOT_SUPPORTED = 28¶
- ERROR_UM_PROFILING_NOT_SUPPORTED_ON_DEVICE = 29¶
- ERROR_UM_PROFILING_NOT_SUPPORTED_ON_NON_P2P_DEVICES = 30¶
- ERROR_UM_PROFILING_NOT_SUPPORTED_WITH_MPS = 31¶
- ERROR_UNKNOWN = 999¶
- ERROR_VIRTUALIZED_DEVICE_INSUFFICIENT_PRIVILEGES = 40¶
- ERROR_VIRTUALIZED_DEVICE_NOT_SUPPORTED = 33¶
- ERROR_WSL_DEVICE_NOT_SUPPORTED = 45¶
- SUCCESS = 0¶
- class cupti.cupti.StateData(size=1)¶
Bases:
object- notification¶
notification : cupti.cupti.Notification
- class cupti.cupti.StreamAttrData(size=1)¶
Bases:
object- attr¶
attr : int
- stream¶
stream : int
- value¶
value : int
- class cupti.cupti.SynchronizeData(size=1)¶
Bases:
object- context¶
context : int
- stream¶
stream : int
- class cupti.cupti.Value¶
Bases:
object- vDouble¶
vDouble: ‘double’
- vInt32¶
vInt32: ‘int32_t’
- vInt64¶
vInt64: ‘int64_t’
- vUint32¶
vUint32: ‘uint32_t’
- vUint64¶
vUint64: ‘uint64_t’
- class cupti.cupti.driver_api_trace_cbid(value)¶
Bases:
IntEnumSee CUpti_driver_api_trace_cbid.
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- SIZE = 782¶
- cu64Array3DCreate = 230¶
- cu64Array3DGetDescriptor = 231¶
- cu64ArrayCreate = 228¶
- cu64ArrayGetDescriptor = 229¶
- cu64D3D10ResourceGetMappedPitch = 200¶
- cu64D3D10ResourceGetMappedPointer = 198¶
- cu64D3D10ResourceGetMappedSize = 199¶
- cu64D3D10ResourceGetSurfaceDimensions = 201¶
- cu64D3D9MapVertexBuffer = 206¶
- cu64D3D9ResourceGetMappedPitch = 205¶
- cu64D3D9ResourceGetMappedPointer = 203¶
- cu64D3D9ResourceGetMappedSize = 204¶
- cu64D3D9ResourceGetSurfaceDimensions = 202¶
- cu64DeviceTotalMem = 197¶
- cu64GLMapBufferObject = 207¶
- cu64GLMapBufferObjectAsync = 208¶
- cu64GraphicsResourceGetMappedPointer = 131¶
- cu64MemAlloc = 30¶
- cu64MemAllocPitch = 32¶
- cu64MemFree = 34¶
- cu64MemGetAddressRange = 36¶
- cu64MemGetInfo = 28¶
- cu64MemHostAlloc = 215¶
- cu64MemHostGetDevicePointer = 41¶
- cu64Memcpy2D = 232¶
- cu64Memcpy2DAsync = 234¶
- cu64Memcpy2DUnaligned = 233¶
- cu64Memcpy3D = 59¶
- cu64Memcpy3DAsync = 70¶
- cu64MemcpyAtoD = 52¶
- cu64MemcpyDtoA = 50¶
- cu64MemcpyDtoD = 48¶
- cu64MemcpyDtoDAsync = 65¶
- cu64MemcpyDtoH = 46¶
- cu64MemcpyDtoHAsync = 63¶
- cu64MemcpyHtoD = 44¶
- cu64MemcpyHtoDAsync = 61¶
- cu64MemsetD16 = 74¶
- cu64MemsetD16Async = 219¶
- cu64MemsetD2D16 = 80¶
- cu64MemsetD2D16Async = 225¶
- cu64MemsetD2D32 = 82¶
- cu64MemsetD2D32Async = 227¶
- cu64MemsetD2D8 = 78¶
- cu64MemsetD2D8Async = 223¶
- cu64MemsetD32 = 76¶
- cu64MemsetD32Async = 221¶
- cu64MemsetD8 = 72¶
- cu64MemsetD8Async = 217¶
- cu64ModuleGetGlobal = 25¶
- cu64TexRefGetAddress = 104¶
- cu64TexRefSetAddress = 96¶
- cu64TexRefSetAddress2D = 98¶
- cuArray3DCreate = 90¶
- cuArray3DCreate_v2 = 274¶
- cuArray3DGetDescriptor = 91¶
- cuArray3DGetDescriptor_v2 = 275¶
- cuArrayCreate = 87¶
- cuArrayCreate_v2 = 272¶
- cuArrayDestroy = 89¶
- cuArrayGetDescriptor = 88¶
- cuArrayGetDescriptor_v2 = 273¶
- cuArrayGetMemoryRequirements = 654¶
- cuArrayGetPlane = 597¶
- cuArrayGetSparseProperties = 582¶
- cuBinaryFree = 376¶
- cuCheckpointProcessCheckpoint = 771¶
- cuCheckpointProcessGetRestoreThreadId = 768¶
- cuCheckpointProcessGetState = 769¶
- cuCheckpointProcessLock = 770¶
- cuCheckpointProcessRestore = 772¶
- cuCheckpointProcessUnlock = 773¶
- cuCompilePtx = 375¶
- cuCoredumpGetAttribute = 701¶
- cuCoredumpGetAttributeGlobal = 702¶
- cuCoredumpSetAttribute = 703¶
- cuCoredumpSetAttributeGlobal = 704¶
- cuCtxAttach = 12¶
- cuCtxCreate = 10¶
- cuCtxCreate_v2 = 235¶
- cuCtxCreate_v3 = 645¶
- cuCtxCreate_v4 = 757¶
- cuCtxDestroy = 11¶
- cuCtxDestroy_v2 = 322¶
- cuCtxDetach = 13¶
- cuCtxDisablePeerAccess = 314¶
- cuCtxEnablePeerAccess = 313¶
- cuCtxFromGreenCtx = 753¶
- cuCtxGetApiVersion = 296¶
- cuCtxGetCacheConfig = 299¶
- cuCtxGetCurrent = 304¶
- cuCtxGetDevResource = 746¶
- cuCtxGetDevice = 16¶
- cuCtxGetExecAffinity = 646¶
- cuCtxGetFlags = 391¶
- cuCtxGetId = 695¶
- cuCtxGetLimit = 137¶
- cuCtxGetStreamPriorityRange = 370¶
- cuCtxPopCurrent = 15¶
- cuCtxPopCurrent_v2 = 324¶
- cuCtxPushCurrent = 14¶
- cuCtxPushCurrent_v2 = 323¶
- cuCtxRecordEvent = 755¶
- cuCtxResetPersistingL2Cache = 568¶
- cuCtxSetCacheConfig = 300¶
- cuCtxSetCurrent = 303¶
- cuCtxSetFlags = 705¶
- cuCtxSetLimit = 136¶
- cuCtxSynchronize = 17¶
- cuCtxWaitEvent = 756¶
- cuD3D10CtxCreate = 139¶
- cuD3D10CtxCreateOnDevice = 212¶
- cuD3D10CtxCreate_v2 = 236¶
- cuD3D10GetDevice = 138¶
- cuD3D10GetDevices = 211¶
- cuD3D10GetDirect3DDevice = 297¶
- cuD3D10MapResources = 143¶
- cuD3D10RegisterResource = 141¶
- cuD3D10ResourceGetMappedArray = 146¶
- cuD3D10ResourceGetMappedPitch = 149¶
- cuD3D10ResourceGetMappedPitch_v2 = 262¶
- cuD3D10ResourceGetMappedPointer = 147¶
- cuD3D10ResourceGetMappedPointer_v2 = 260¶
- cuD3D10ResourceGetMappedSize = 148¶
- cuD3D10ResourceGetMappedSize_v2 = 261¶
- cuD3D10ResourceGetSurfaceDimensions = 150¶
- cuD3D10ResourceGetSurfaceDimensions_v2 = 263¶
- cuD3D10ResourceSetMapFlags = 145¶
- cuD3D10UnmapResources = 144¶
- cuD3D10UnregisterResource = 142¶
- cuD3D11CtxCreate = 152¶
- cuD3D11CtxCreateOnDevice = 210¶
- cuD3D11CtxCreate_v2 = 237¶
- cuD3D11GetDevice = 151¶
- cuD3D11GetDevices = 209¶
- cuD3D11GetDirect3DDevice = 298¶
- cuD3D9Begin = 168¶
- cuD3D9CtxCreate = 155¶
- cuD3D9CtxCreateOnDevice = 214¶
- cuD3D9CtxCreate_v2 = 238¶
- cuD3D9End = 169¶
- cuD3D9GetDevice = 154¶
- cuD3D9GetDevices = 213¶
- cuD3D9GetDirect3DDevice = 157¶
- cuD3D9MapResources = 160¶
- cuD3D9MapVertexBuffer = 171¶
- cuD3D9MapVertexBuffer_v2 = 268¶
- cuD3D9RegisterResource = 158¶
- cuD3D9RegisterVertexBuffer = 170¶
- cuD3D9ResourceGetMappedArray = 164¶
- cuD3D9ResourceGetMappedPitch = 167¶
- cuD3D9ResourceGetMappedPitch_v2 = 267¶
- cuD3D9ResourceGetMappedPointer = 165¶
- cuD3D9ResourceGetMappedPointer_v2 = 265¶
- cuD3D9ResourceGetMappedSize = 166¶
- cuD3D9ResourceGetMappedSize_v2 = 266¶
- cuD3D9ResourceGetSurfaceDimensions = 163¶
- cuD3D9ResourceGetSurfaceDimensions_v2 = 264¶
- cuD3D9ResourceSetMapFlags = 162¶
- cuD3D9UnmapResources = 161¶
- cuD3D9UnmapVertexBuffer = 172¶
- cuD3D9UnregisterResource = 159¶
- cuD3D9UnregisterVertexBuffer = 173¶
- cuDestroyExternalMemory = 488¶
- cuDestroyExternalSemaphore = 494¶
- cuDevResourceGenerateDesc = 748¶
- cuDevSmResourceSplitByCount = 751¶
- cuDeviceCanAccessPeer = 312¶
- cuDeviceComputeCapability = 6¶
- cuDeviceGet = 3¶
- cuDeviceGetAttribute = 9¶
- cuDeviceGetByPCIBusId = 331¶
- cuDeviceGetCount = 4¶
- cuDeviceGetDefaultMemPool = 606¶
- cuDeviceGetDevResource = 745¶
- cuDeviceGetExecAffinitySupport = 644¶
- cuDeviceGetGraphMemAttribute = 641¶
- cuDeviceGetLuid = 532¶
- cuDeviceGetMemPool = 610¶
- cuDeviceGetName = 5¶
- cuDeviceGetNvSciSyncAttributes = 542¶
- cuDeviceGetP2PAttribute = 454¶
- cuDeviceGetPCIBusId = 332¶
- cuDeviceGetProperties = 8¶
- cuDeviceGetTexture1DLinearMaxWidth = 579¶
- cuDeviceGetUuid = 482¶
- cuDeviceGetUuid_v2 = 647¶
- cuDeviceGraphMemTrim = 640¶
- cuDevicePrimaryCtxGetState = 392¶
- cuDevicePrimaryCtxRelease = 387¶
- cuDevicePrimaryCtxRelease_v2 = 544¶
- cuDevicePrimaryCtxReset = 389¶
- cuDevicePrimaryCtxReset_v2 = 545¶
- cuDevicePrimaryCtxRetain = 386¶
- cuDevicePrimaryCtxSetFlags = 388¶
- cuDevicePrimaryCtxSetFlags_v2 = 546¶
- cuDeviceRegisterAsyncNotification = 735¶
- cuDeviceSetGraphMemAttribute = 642¶
- cuDeviceSetMemPool = 609¶
- cuDeviceTotalMem = 7¶
- cuDeviceTotalMem_v2 = 259¶
- cuDeviceUnregisterAsyncNotification = 736¶
- cuDriverGetVersion = 2¶
- cuEGLStreamConsumerAcquireFrame = 395¶
- cuEGLStreamConsumerConnect = 393¶
- cuEGLStreamConsumerConnectWithFlags = 470¶
- cuEGLStreamConsumerDisconnect = 394¶
- cuEGLStreamConsumerReleaseFrame = 396¶
- cuEGLStreamProducerConnect = 446¶
- cuEGLStreamProducerDisconnect = 447¶
- cuEGLStreamProducerPresentFrame = 448¶
- cuEGLStreamProducerReturnFrame = 453¶
- cuEventCreate = 118¶
- cuEventCreateFromEGLSync = 479¶
- cuEventCreateFromNVNSync = 469¶
- cuEventDestroy = 122¶
- cuEventDestroy_v2 = 325¶
- cuEventElapsedTime = 123¶
- cuEventElapsedTime_v2 = 780¶
- cuEventQuery = 120¶
- cuEventRecord = 119¶
- cuEventRecordWithFlags = 587¶
- cuEventRecordWithFlags_ptsz = 588¶
- cuEventRecord_ptsz = 441¶
- cuEventSynchronize = 121¶
- cuExternalMemoryGetMappedBuffer = 486¶
- cuExternalMemoryGetMappedMipmappedArray = 487¶
- cuFlushGPUDirectRDMAWrites = 627¶
- cuFuncGetAttribute = 85¶
- cuFuncGetModule = 566¶
- cuFuncGetName = 718¶
- cuFuncGetParamInfo = 733¶
- cuFuncIsLoaded = 741¶
- cuFuncLoad = 742¶
- cuFuncSetAttribute = 481¶
- cuFuncSetBlockShape = 83¶
- cuFuncSetCacheConfig = 86¶
- cuGLCtxCreate = 174¶
- cuGLCtxCreate_v2 = 239¶
- cuGLGetDevices = 333¶
- cuGLGetDevices_v2 = 385¶
- cuGLInit = 178¶
- cuGLMapBufferObject = 180¶
- cuGLMapBufferObjectAsync = 184¶
- cuGLMapBufferObjectAsync_v2 = 270¶
- cuGLMapBufferObjectAsync_v2_ptsz = 445¶
- cuGLMapBufferObject_v2 = 269¶
- cuGLMapBufferObject_v2_ptds = 417¶
- cuGLRegisterBufferObject = 179¶
- cuGLSetBufferObjectMapFlags = 183¶
- cuGLUnmapBufferObject = 181¶
- cuGLUnmapBufferObjectAsync = 185¶
- cuGLUnregisterBufferObject = 182¶
- cuGetErrorName = 373¶
- cuGetErrorString = 372¶
- cuGetExportTable = 135¶
- cuGetProcAddress = 626¶
- cuGetProcAddress_v2 = 677¶
- cuGraphAddBatchMemOpNode = 669¶
- cuGraphAddChildGraphNode = 525¶
- cuGraphAddDependencies = 518¶
- cuGraphAddDependencies_v2 = 727¶
- cuGraphAddEmptyNode = 526¶
- cuGraphAddEventRecordNode = 589¶
- cuGraphAddEventWaitNode = 590¶
- cuGraphAddExternalSemaphoresSignalNode = 618¶
- cuGraphAddExternalSemaphoresWaitNode = 621¶
- cuGraphAddHostNode = 530¶
- cuGraphAddKernelNode = 502¶
- cuGraphAddKernelNode_v2 = 689¶
- cuGraphAddMemAllocNode = 638¶
- cuGraphAddMemFreeNode = 639¶
- cuGraphAddMemcpyNode = 504¶
- cuGraphAddMemsetNode = 506¶
- cuGraphAddNode = 712¶
- cuGraphAddNode_v2 = 723¶
- cuGraphBatchMemOpNodeGetParams = 670¶
- cuGraphBatchMemOpNodeSetParams = 671¶
- cuGraphChildGraphNodeGetGraph = 529¶
- cuGraphClone = 523¶
- cuGraphConditionalHandleCreate = 722¶
- cuGraphCreate = 501¶
- cuGraphDebugDotPrint = 628¶
- cuGraphDestroy = 517¶
- cuGraphDestroyNode = 522¶
- cuGraphEventRecordNodeGetEvent = 591¶
- cuGraphEventRecordNodeSetEvent = 593¶
- cuGraphEventWaitNodeGetEvent = 592¶
- cuGraphEventWaitNodeSetEvent = 594¶
- cuGraphExecBatchMemOpNodeSetParams = 672¶
- cuGraphExecChildGraphNodeSetParams = 586¶
- cuGraphExecDestroy = 516¶
- cuGraphExecEventRecordNodeSetEvent = 595¶
- cuGraphExecEventWaitNodeSetEvent = 596¶
- cuGraphExecExternalSemaphoresSignalNodeSetParams = 624¶
- cuGraphExecExternalSemaphoresWaitNodeSetParams = 625¶
- cuGraphExecGetFlags = 658¶
- cuGraphExecHostNodeSetParams = 564¶
- cuGraphExecKernelNodeSetParams = 538¶
- cuGraphExecKernelNodeSetParams_v2 = 692¶
- cuGraphExecMemcpyNodeSetParams = 562¶
- cuGraphExecMemsetNodeSetParams = 563¶
- cuGraphExecNodeSetParams = 714¶
- cuGraphExecUpdate = 561¶
- cuGraphExecUpdate_v2 = 696¶
- cuGraphExternalSemaphoresSignalNodeGetParams = 619¶
- cuGraphExternalSemaphoresSignalNodeSetParams = 620¶
- cuGraphExternalSemaphoresWaitNodeGetParams = 622¶
- cuGraphExternalSemaphoresWaitNodeSetParams = 623¶
- cuGraphGetEdges = 535¶
- cuGraphGetEdges_v2 = 724¶
- cuGraphGetNodes = 534¶
- cuGraphGetRootNodes = 510¶
- cuGraphHostNodeGetParams = 531¶
- cuGraphHostNodeSetParams = 533¶
- cuGraphInstantiate = 513¶
- cuGraphInstantiateWithFlags = 643¶
- cuGraphInstantiateWithParams = 656¶
- cuGraphInstantiateWithParams_ptsz = 657¶
- cuGraphInstantiate_v2 = 578¶
- cuGraphKernelNodeCopyAttributes = 569¶
- cuGraphKernelNodeGetAttribute = 570¶
- cuGraphKernelNodeGetParams = 503¶
- cuGraphKernelNodeGetParams_v2 = 690¶
- cuGraphKernelNodeSetAttribute = 571¶
- cuGraphKernelNodeSetParams = 521¶
- cuGraphKernelNodeSetParams_v2 = 691¶
- cuGraphLaunch = 514¶
- cuGraphLaunch_ptsz = 515¶
- cuGraphMemAllocNodeGetParams = 648¶
- cuGraphMemFreeNodeGetParams = 649¶
- cuGraphMemcpyNodeGetParams = 505¶
- cuGraphMemcpyNodeSetParams = 520¶
- cuGraphMemsetNodeGetParams = 507¶
- cuGraphMemsetNodeSetParams = 508¶
- cuGraphNodeFindInClone = 524¶
- cuGraphNodeGetDependencies = 511¶
- cuGraphNodeGetDependencies_v2 = 725¶
- cuGraphNodeGetDependentNodes = 512¶
- cuGraphNodeGetDependentNodes_v2 = 726¶
- cuGraphNodeGetEnabled = 651¶
- cuGraphNodeGetType = 509¶
- cuGraphNodeSetEnabled = 650¶
- cuGraphNodeSetParams = 713¶
- cuGraphReleaseUserObject = 637¶
- cuGraphRemoveDependencies = 519¶
- cuGraphRemoveDependencies_v2 = 728¶
- cuGraphRetainUserObject = 636¶
- cuGraphUpload = 580¶
- cuGraphUpload_ptsz = 581¶
- cuGraphicsD3D10RegisterResource = 140¶
- cuGraphicsD3D11RegisterResource = 153¶
- cuGraphicsD3D9RegisterResource = 156¶
- cuGraphicsEGLRegisterImage = 390¶
- cuGraphicsGLRegisterBuffer = 175¶
- cuGraphicsGLRegisterImage = 176¶
- cuGraphicsMapResources = 133¶
- cuGraphicsMapResources_ptsz = 443¶
- cuGraphicsResourceGetMappedEglFrame = 449¶
- cuGraphicsResourceGetMappedMipmappedArray = 360¶
- cuGraphicsResourceGetMappedPointer = 130¶
- cuGraphicsResourceGetMappedPointer_v2 = 258¶
- cuGraphicsResourceSetMapFlags = 132¶
- cuGraphicsResourceSetMapFlags_v2 = 380¶
- cuGraphicsSubResourceGetMappedArray = 129¶
- cuGraphicsUnmapResources = 134¶
- cuGraphicsUnmapResources_ptsz = 444¶
- cuGraphicsUnregisterResource = 128¶
- cuGraphicsVDPAURegisterOutputSurface = 189¶
- cuGraphicsVDPAURegisterVideoSurface = 188¶
- cuGreenCtxCreate = 743¶
- cuGreenCtxDestroy = 744¶
- cuGreenCtxGetDevResource = 747¶
- cuGreenCtxRecordEvent = 749¶
- cuGreenCtxStreamCreate = 758¶
- cuGreenCtxWaitEvent = 750¶
- cuImportExternalMemory = 485¶
- cuImportExternalSemaphore = 489¶
- cuInit = 1¶
- cuIpcCloseMemHandle = 330¶
- cuIpcGetEventHandle = 334¶
- cuIpcGetMemHandle = 328¶
- cuIpcOpenEventHandle = 335¶
- cuIpcOpenMemHandle = 329¶
- cuIpcOpenMemHandle_v2 = 567¶
- cuKernelGetAttribute = 686¶
- cuKernelGetFunction = 683¶
- cuKernelGetLibrary = 754¶
- cuKernelGetName = 719¶
- cuKernelGetParamInfo = 734¶
- cuKernelSetAttribute = 687¶
- cuKernelSetCacheConfig = 688¶
- cuLaunch = 115¶
- cuLaunchCooperativeKernel = 477¶
- cuLaunchCooperativeKernelMultiDevice = 480¶
- cuLaunchCooperativeKernel_ptsz = 478¶
- cuLaunchGrid = 116¶
- cuLaunchGridAsync = 117¶
- cuLaunchHostFunc = 527¶
- cuLaunchHostFunc_ptsz = 528¶
- cuLaunchKernel = 307¶
- cuLaunchKernelEx = 652¶
- cuLaunchKernelEx_ptsz = 653¶
- cuLaunchKernel_ptsz = 442¶
- cuLibraryEnumerateKernels = 740¶
- cuLibraryGetGlobal = 684¶
- cuLibraryGetKernel = 681¶
- cuLibraryGetKernelCount = 739¶
- cuLibraryGetManaged = 685¶
- cuLibraryGetModule = 682¶
- cuLibraryGetUnifiedFunction = 700¶
- cuLibraryLoadData = 678¶
- cuLibraryLoadFromFile = 679¶
- cuLibraryUnload = 680¶
- cuLinkAddData = 363¶
- cuLinkAddData_v2 = 382¶
- cuLinkAddFile = 364¶
- cuLinkAddFile_v2 = 383¶
- cuLinkComplete = 365¶
- cuLinkCreate = 362¶
- cuLinkCreate_v2 = 381¶
- cuLinkDestroy = 366¶
- cuLogsCurrent = 765¶
- cuLogsDumpToFile = 766¶
- cuLogsDumpToMemory = 767¶
- cuLogsRegisterCallback = 763¶
- cuLogsUnregisterCallback = 764¶
- cuMemAddressFree = 548¶
- cuMemAddressReserve = 547¶
- cuMemAdvise = 457¶
- cuMemAdvise_v2 = 715¶
- cuMemAlloc = 29¶
- cuMemAllocAsync = 598¶
- cuMemAllocAsync_ptsz = 599¶
- cuMemAllocFromPoolAsync = 611¶
- cuMemAllocFromPoolAsync_ptsz = 612¶
- cuMemAllocHost = 37¶
- cuMemAllocHost_v2 = 294¶
- cuMemAllocManaged = 371¶
- cuMemAllocPitch = 31¶
- cuMemAllocPitch_v2 = 244¶
- cuMemAlloc_v2 = 243¶
- cuMemBatchDecompressAsync = 761¶
- cuMemBatchDecompressAsync_ptsz = 762¶
- cuMemCreate = 549¶
- cuMemFree = 33¶
- cuMemFreeAsync = 600¶
- cuMemFreeAsync_ptsz = 601¶
- cuMemFreeHost = 38¶
- cuMemFree_v2 = 245¶
- cuMemGetAccess = 558¶
- cuMemGetAddressRange = 35¶
- cuMemGetAddressRange_v2 = 246¶
- cuMemGetAllocationGranularity = 556¶
- cuMemGetAllocationPropertiesFromHandle = 557¶
- cuMemGetHandleForAddressRange = 674¶
- cuMemGetInfo = 27¶
- cuMemGetInfo_v2 = 242¶
- cuMemHostAlloc = 39¶
- cuMemHostAlloc_v2 = 271¶
- cuMemHostGetDevicePointer = 40¶
- cuMemHostGetDevicePointer_v2 = 247¶
- cuMemHostGetFlags = 42¶
- cuMemHostRegister = 301¶
- cuMemHostRegister_v2 = 379¶
- cuMemHostUnregister = 302¶
- cuMemMap = 551¶
- cuMemMapArrayAsync = 584¶
- cuMemMapArrayAsync_ptsz = 585¶
- cuMemPeerGetDevicePointer = 317¶
- cuMemPeerRegister = 315¶
- cuMemPeerUnregister = 316¶
- cuMemPoolCreate = 607¶
- cuMemPoolDestroy = 608¶
- cuMemPoolExportPointer = 615¶
- cuMemPoolGetAccess = 617¶
- cuMemPoolGetAttribute = 604¶
- cuMemPoolImportPointer = 616¶
- cuMemPoolSetAccess = 605¶
- cuMemPoolSetAttribute = 603¶
- cuMemPoolTrimTo = 602¶
- cuMemPrefetchAsync = 467¶
- cuMemPrefetchAsync_ptsz = 468¶
- cuMemPrefetchAsync_v2 = 716¶
- cuMemPrefetchAsync_v2_ptsz = 717¶
- cuMemRangeGetAttribute = 471¶
- cuMemRangeGetAttributes = 472¶
- cuMemRelease = 550¶
- cuMemRetainAllocationHandle = 565¶
- cuMemSetAccess = 553¶
- cuMemUnmap = 552¶
- cuMemcpy = 305¶
- cuMemcpy2D = 56¶
- cuMemcpy2DAsync = 68¶
- cuMemcpy2DAsync_v2 = 289¶
- cuMemcpy2DAsync_v2_ptsz = 424¶
- cuMemcpy2DUnaligned = 57¶
- cuMemcpy2DUnaligned_v2 = 288¶
- cuMemcpy2DUnaligned_v2_ptds = 406¶
- cuMemcpy2D_v2 = 287¶
- cuMemcpy2D_v2_ptds = 405¶
- cuMemcpy3D = 58¶
- cuMemcpy3DAsync = 69¶
- cuMemcpy3DAsync_v2 = 291¶
- cuMemcpy3DAsync_v2_ptsz = 425¶
- cuMemcpy3DBatchAsync = 778¶
- cuMemcpy3DBatchAsync_ptsz = 779¶
- cuMemcpy3DPeer = 320¶
- cuMemcpy3DPeerAsync = 321¶
- cuMemcpy3DPeerAsync_ptsz = 427¶
- cuMemcpy3DPeer_ptds = 410¶
- cuMemcpy3D_v2 = 290¶
- cuMemcpy3D_v2_ptds = 407¶
- cuMemcpyAsync = 306¶
- cuMemcpyAsync_ptsz = 418¶
- cuMemcpyAtoA = 55¶
- cuMemcpyAtoA_v2 = 286¶
- cuMemcpyAtoA_v2_ptds = 404¶
- cuMemcpyAtoD = 51¶
- cuMemcpyAtoD_v2 = 284¶
- cuMemcpyAtoD_v2_ptds = 401¶
- cuMemcpyAtoH = 54¶
- cuMemcpyAtoHAsync = 67¶
- cuMemcpyAtoHAsync_v2 = 283¶
- cuMemcpyAtoHAsync_v2_ptsz = 420¶
- cuMemcpyAtoH_v2 = 282¶
- cuMemcpyAtoH_v2_ptds = 403¶
- cuMemcpyBatchAsync = 776¶
- cuMemcpyBatchAsync_ptsz = 777¶
- cuMemcpyDtoA = 49¶
- cuMemcpyDtoA_v2 = 285¶
- cuMemcpyDtoA_v2_ptds = 400¶
- cuMemcpyDtoD = 47¶
- cuMemcpyDtoDAsync = 64¶
- cuMemcpyDtoDAsync_v2 = 281¶
- cuMemcpyDtoDAsync_v2_ptsz = 423¶
- cuMemcpyDtoD_v2 = 280¶
- cuMemcpyDtoD_v2_ptds = 399¶
- cuMemcpyDtoH = 45¶
- cuMemcpyDtoHAsync = 62¶
- cuMemcpyDtoHAsync_v2 = 279¶
- cuMemcpyDtoHAsync_v2_ptsz = 422¶
- cuMemcpyDtoH_v2 = 278¶
- cuMemcpyDtoH_v2_ptds = 398¶
- cuMemcpyHtoA = 53¶
- cuMemcpyHtoAAsync = 66¶
- cuMemcpyHtoAAsync_v2 = 293¶
- cuMemcpyHtoAAsync_v2_ptsz = 419¶
- cuMemcpyHtoA_v2 = 292¶
- cuMemcpyHtoA_v2_ptds = 402¶
- cuMemcpyHtoD = 43¶
- cuMemcpyHtoDAsync = 60¶
- cuMemcpyHtoDAsync_v2 = 277¶
- cuMemcpyHtoDAsync_v2_ptsz = 421¶
- cuMemcpyHtoD_v2 = 276¶
- cuMemcpyHtoD_v2_ptds = 397¶
- cuMemcpyPeer = 318¶
- cuMemcpyPeerAsync = 319¶
- cuMemcpyPeerAsync_ptsz = 426¶
- cuMemcpyPeer_ptds = 409¶
- cuMemcpy_ptds = 408¶
- cuMemcpy_v2 = 248¶
- cuMemsetD16 = 73¶
- cuMemsetD16Async = 218¶
- cuMemsetD16Async_ptsz = 429¶
- cuMemsetD16_v2 = 250¶
- cuMemsetD16_v2_ptds = 412¶
- cuMemsetD2D16 = 79¶
- cuMemsetD2D16Async = 224¶
- cuMemsetD2D16Async_ptsz = 432¶
- cuMemsetD2D16_v2 = 253¶
- cuMemsetD2D16_v2_ptds = 415¶
- cuMemsetD2D32 = 81¶
- cuMemsetD2D32Async = 226¶
- cuMemsetD2D32Async_ptsz = 433¶
- cuMemsetD2D32_v2 = 254¶
- cuMemsetD2D32_v2_ptds = 416¶
- cuMemsetD2D8 = 77¶
- cuMemsetD2D8Async = 222¶
- cuMemsetD2D8Async_ptsz = 431¶
- cuMemsetD2D8_v2 = 252¶
- cuMemsetD2D8_v2_ptds = 414¶
- cuMemsetD32 = 75¶
- cuMemsetD32Async = 220¶
- cuMemsetD32Async_ptsz = 430¶
- cuMemsetD32_v2 = 251¶
- cuMemsetD32_v2_ptds = 413¶
- cuMemsetD8 = 71¶
- cuMemsetD8Async = 216¶
- cuMemsetD8Async_ptsz = 428¶
- cuMemsetD8_v2 = 249¶
- cuMemsetD8_v2_ptds = 411¶
- cuMipmappedArrayCreate = 347¶
- cuMipmappedArrayDestroy = 349¶
- cuMipmappedArrayGetLevel = 348¶
- cuMipmappedArrayGetMemoryRequirements = 655¶
- cuMipmappedArrayGetSparseProperties = 583¶
- cuModuleEnumerateFunctions = 738¶
- cuModuleGetFunction = 23¶
- cuModuleGetFunctionCount = 737¶
- cuModuleGetGlobal = 24¶
- cuModuleGetGlobal_v2 = 241¶
- cuModuleGetLoadingMode = 673¶
- cuModuleGetSurfRef = 190¶
- cuModuleGetTexRef = 26¶
- cuModuleLoad = 18¶
- cuModuleLoadData = 19¶
- cuModuleLoadDataEx = 20¶
- cuModuleLoadFatBinary = 21¶
- cuModuleUnload = 22¶
- cuMulticastAddDevice = 707¶
- cuMulticastBindAddr = 709¶
- cuMulticastBindMem = 708¶
- cuMulticastCreate = 706¶
- cuMulticastGetGranularity = 711¶
- cuMulticastUnbind = 710¶
- cuNNSetAllocator = 466¶
- cuNVNbufferGetPointer = 464¶
- cuNVNtextureGetArray = 465¶
- cuOccupancyAvailableDynamicSMemPerBlock = 543¶
- cuOccupancyMaxActiveBlocksPerMultiprocessor = 374¶
- cuOccupancyMaxActiveBlocksPerMultiprocessorWithFlags = 451¶
- cuOccupancyMaxActiveClusters = 676¶
- cuOccupancyMaxPotentialBlockSize = 384¶
- cuOccupancyMaxPotentialBlockSizeWithFlags = 452¶
- cuOccupancyMaxPotentialClusterSize = 675¶
- cuParamSetSize = 110¶
- cuParamSetTexRef = 114¶
- cuParamSetf = 112¶
- cuParamSeti = 111¶
- cuParamSetv = 113¶
- cuPointerGetAttribute = 310¶
- cuPointerGetAttributes = 450¶
- cuPointerSetAttribute = 378¶
- cuProfilerInitialize = 311¶
- cuProfilerStart = 308¶
- cuProfilerStop = 309¶
- cuSignalExternalSemaphoresAsync = 490¶
- cuSignalExternalSemaphoresAsync_ptsz = 491¶
- cuStreamAddCallback = 346¶
- cuStreamAddCallback_ptsz = 437¶
- cuStreamAttachMemAsync = 377¶
- cuStreamAttachMemAsync_ptsz = 438¶
- cuStreamBatchMemOp = 462¶
- cuStreamBatchMemOp_ptsz = 463¶
- cuStreamBatchMemOp_v2 = 667¶
- cuStreamBatchMemOp_v2_ptsz = 668¶
- cuStreamBeginCapture = 495¶
- cuStreamBeginCaptureToGraph = 720¶
- cuStreamBeginCaptureToGraph_ptsz = 721¶
- cuStreamBeginCapture_ptsz = 496¶
- cuStreamBeginCapture_v2 = 539¶
- cuStreamBeginCapture_v2_ptsz = 540¶
- cuStreamCopyAttributes = 572¶
- cuStreamCopyAttributes_ptsz = 573¶
- cuStreamCreate = 124¶
- cuStreamCreateWithPriority = 367¶
- cuStreamDestroy = 127¶
- cuStreamDestroy_v2 = 326¶
- cuStreamEndCapture = 497¶
- cuStreamEndCapture_ptsz = 498¶
- cuStreamGetAttribute = 574¶
- cuStreamGetAttribute_ptsz = 575¶
- cuStreamGetCaptureInfo = 536¶
- cuStreamGetCaptureInfo_ptsz = 537¶
- cuStreamGetCaptureInfo_v2 = 629¶
- cuStreamGetCaptureInfo_v2_ptsz = 630¶
- cuStreamGetCaptureInfo_v3 = 729¶
- cuStreamGetCaptureInfo_v3_ptsz = 730¶
- cuStreamGetCtx = 483¶
- cuStreamGetCtx_ptsz = 484¶
- cuStreamGetCtx_v2 = 759¶
- cuStreamGetCtx_v2_ptsz = 760¶
- cuStreamGetDevice = 774¶
- cuStreamGetDevice_ptsz = 775¶
- cuStreamGetFlags = 369¶
- cuStreamGetFlags_ptsz = 435¶
- cuStreamGetGreenCtx = 752¶
- cuStreamGetId = 693¶
- cuStreamGetId_ptsz = 694¶
- cuStreamGetPriority = 368¶
- cuStreamGetPriority_ptsz = 434¶
- cuStreamIsCapturing = 499¶
- cuStreamIsCapturing_ptsz = 500¶
- cuStreamQuery = 125¶
- cuStreamQuery_ptsz = 439¶
- cuStreamSetAttribute = 576¶
- cuStreamSetAttribute_ptsz = 577¶
- cuStreamSetFlags = 559¶
- cuStreamSetFlags_ptsz = 560¶
- cuStreamSynchronize = 126¶
- cuStreamSynchronize_ptsz = 440¶
- cuStreamUpdateCaptureDependencies = 631¶
- cuStreamUpdateCaptureDependencies_ptsz = 632¶
- cuStreamUpdateCaptureDependencies_v2 = 731¶
- cuStreamUpdateCaptureDependencies_v2_ptsz = 732¶
- cuStreamWaitEvent = 295¶
- cuStreamWaitEvent_ptsz = 436¶
- cuStreamWaitValue32 = 458¶
- cuStreamWaitValue32_ptsz = 459¶
- cuStreamWaitValue32_v2 = 659¶
- cuStreamWaitValue32_v2_ptsz = 660¶
- cuStreamWaitValue64 = 473¶
- cuStreamWaitValue64_ptsz = 474¶
- cuStreamWaitValue64_v2 = 661¶
- cuStreamWaitValue64_v2_ptsz = 662¶
- cuStreamWriteValue32 = 460¶
- cuStreamWriteValue32_ptsz = 461¶
- cuStreamWriteValue32_v2 = 663¶
- cuStreamWriteValue32_v2_ptsz = 664¶
- cuStreamWriteValue64 = 475¶
- cuStreamWriteValue64_ptsz = 476¶
- cuStreamWriteValue64_v2 = 665¶
- cuStreamWriteValue64_v2_ptsz = 666¶
- cuSurfObjectCreate = 343¶
- cuSurfObjectDestroy = 344¶
- cuSurfObjectGetResourceDesc = 345¶
- cuSurfRefCreate = 191¶
- cuSurfRefDestroy = 192¶
- cuSurfRefGetArray = 196¶
- cuSurfRefGetFormat = 195¶
- cuSurfRefSetArray = 194¶
- cuSurfRefSetFormat = 193¶
- cuTensorMapEncodeIm2col = 698¶
- cuTensorMapEncodeIm2colWide = 781¶
- cuTensorMapEncodeTiled = 697¶
- cuTensorMapReplaceAddress = 699¶
- cuTexObjectCreate = 339¶
- cuTexObjectDestroy = 340¶
- cuTexObjectGetResourceDesc = 341¶
- cuTexObjectGetResourceViewDesc = 361¶
- cuTexObjectGetTextureDesc = 342¶
- cuTexRefCreate = 92¶
- cuTexRefDestroy = 93¶
- cuTexRefGetAddress = 103¶
- cuTexRefGetAddressMode = 106¶
- cuTexRefGetAddress_v2 = 257¶
- cuTexRefGetArray = 105¶
- cuTexRefGetBorderColor = 456¶
- cuTexRefGetFilterMode = 107¶
- cuTexRefGetFlags = 109¶
- cuTexRefGetFormat = 108¶
- cuTexRefGetMaxAnisotropy = 359¶
- cuTexRefGetMipmapFilterMode = 356¶
- cuTexRefGetMipmapLevelBias = 357¶
- cuTexRefGetMipmapLevelClamp = 358¶
- cuTexRefGetMipmappedArray = 355¶
- cuTexRefSetAddress = 95¶
- cuTexRefSetAddress2D = 97¶
- cuTexRefSetAddress2D_v2 = 256¶
- cuTexRefSetAddress2D_v3 = 327¶
- cuTexRefSetAddressMode = 100¶
- cuTexRefSetAddress_v2 = 255¶
- cuTexRefSetArray = 94¶
- cuTexRefSetBorderColor = 455¶
- cuTexRefSetFilterMode = 101¶
- cuTexRefSetFlags = 102¶
- cuTexRefSetFormat = 99¶
- cuTexRefSetMaxAnisotropy = 354¶
- cuTexRefSetMipmapFilterMode = 351¶
- cuTexRefSetMipmapLevelBias = 352¶
- cuTexRefSetMipmapLevelClamp = 353¶
- cuTexRefSetMipmappedArray = 350¶
- cuThreadExchangeStreamCaptureMode = 541¶
- cuUserObjectCreate = 633¶
- cuUserObjectRelease = 635¶
- cuUserObjectRetain = 634¶
- cuVDPAUCtxCreate = 187¶
- cuVDPAUCtxCreate_v2 = 240¶
- cuVDPAUGetDevice = 186¶
- cuWGLGetDevice = 177¶
- cuWaitExternalSemaphoresAsync = 492¶
- cuWaitExternalSemaphoresAsync_ptsz = 493¶
- class cupti.cupti.runtime_api_trace_cbid(value)¶
Bases:
IntEnumSee CUpti_runtime_api_trace_cbid.
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- SIZE = 487¶
- cuda470_v12060 = 470¶
- cuda471_v12060 = 471¶
- cuda472_v12060 = 472¶
- cuda473_v12060 = 473¶
- cuda474_v12060 = 474¶
- cuda475_v12060 = 475¶
- cuda476_v12060 = 476¶
- cuda477_v12060 = 477¶
- cuda478_v12060 = 478¶
- cuda479_v12060 = 479¶
- cudaArrayGetInfo_v4010 = 181¶
- cudaArrayGetMemoryRequirements_v11060 = 428¶
- cudaArrayGetPlane_v11020 = 381¶
- cudaArrayGetSparseProperties_v11010 = 359¶
- cudaBindSurfaceToArray_v3020 = 61¶
- cudaBindTexture2D_v3020 = 56¶
- cudaBindTextureToArray_v3020 = 57¶
- cudaBindTextureToMipmappedArray_v5000 = 195¶
- cudaBindTexture_v3020 = 55¶
- cudaChooseDevice_v3020 = 5¶
- cudaConfigureCall_v3020 = 8¶
- cudaCreateChannelDesc_v3020 = 7¶
- cudaCreateSurfaceObject_v5000 = 189¶
- cudaCreateTextureObject_v2_v11080 = 434¶
- cudaCreateTextureObject_v5000 = 185¶
- cudaCtxResetPersistingL2Cache_v11000 = 337¶
- cudaD3D10GetDevice_v3020 = 88¶
- cudaD3D10GetDevices_v3020 = 89¶
- cudaD3D10GetDirect3DDevice_v3020 = 149¶
- cudaD3D10MapResources_v3020 = 94¶
- cudaD3D10RegisterResource_v3020 = 92¶
- cudaD3D10ResourceGetMappedArray_v3020 = 98¶
- cudaD3D10ResourceGetMappedPitch_v3020 = 101¶
- cudaD3D10ResourceGetMappedPointer_v3020 = 99¶
- cudaD3D10ResourceGetMappedSize_v3020 = 100¶
- cudaD3D10ResourceGetSurfaceDimensions_v3020 = 97¶
- cudaD3D10ResourceSetMapFlags_v3020 = 96¶
- cudaD3D10SetDirect3DDevice_v3020 = 90¶
- cudaD3D10UnmapResources_v3020 = 95¶
- cudaD3D10UnregisterResource_v3020 = 93¶
- cudaD3D11GetDevice_v3020 = 84¶
- cudaD3D11GetDevices_v3020 = 85¶
- cudaD3D11GetDirect3DDevice_v3020 = 148¶
- cudaD3D11SetDirect3DDevice_v3020 = 86¶
- cudaD3D9Begin_v3020 = 117¶
- cudaD3D9End_v3020 = 118¶
- cudaD3D9GetDevice_v3020 = 102¶
- cudaD3D9GetDevices_v3020 = 103¶
- cudaD3D9GetDirect3DDevice_v3020 = 105¶
- cudaD3D9MapResources_v3020 = 109¶
- cudaD3D9MapVertexBuffer_v3020 = 121¶
- cudaD3D9RegisterResource_v3020 = 107¶
- cudaD3D9RegisterVertexBuffer_v3020 = 119¶
- cudaD3D9ResourceGetMappedArray_v3020 = 113¶
- cudaD3D9ResourceGetMappedPitch_v3020 = 116¶
- cudaD3D9ResourceGetMappedPointer_v3020 = 114¶
- cudaD3D9ResourceGetMappedSize_v3020 = 115¶
- cudaD3D9ResourceGetSurfaceDimensions_v3020 = 112¶
- cudaD3D9ResourceSetMapFlags_v3020 = 111¶
- cudaD3D9SetDirect3DDevice_v3020 = 104¶
- cudaD3D9UnmapResources_v3020 = 110¶
- cudaD3D9UnmapVertexBuffer_v3020 = 122¶
- cudaD3D9UnregisterResource_v3020 = 108¶
- cudaD3D9UnregisterVertexBuffer_v3020 = 120¶
- cudaDestroyExternalMemory_v10000 = 277¶
- cudaDestroyExternalSemaphore_v10000 = 283¶
- cudaDestroySurfaceObject_v5000 = 190¶
- cudaDestroyTextureObject_v5000 = 186¶
- cudaDeviceCanAccessPeer_v4000 = 154¶
- cudaDeviceDisablePeerAccess_v4000 = 156¶
- cudaDeviceEnablePeerAccess_v4000 = 155¶
- cudaDeviceFlushGPUDirectRDMAWrites_v11030 = 405¶
- cudaDeviceGetAttribute_v5000 = 200¶
- cudaDeviceGetByPCIBusId_v4010 = 173¶
- cudaDeviceGetCacheConfig_v3020 = 168¶
- cudaDeviceGetDefaultMemPool_v11020 = 372¶
- cudaDeviceGetGraphMemAttribute_v11040 = 424¶
- cudaDeviceGetLimit_v3020 = 166¶
- cudaDeviceGetMemPool_v11020 = 386¶
- cudaDeviceGetNvSciSyncAttributes_v10020 = 328¶
- cudaDeviceGetP2PAttribute_v8000 = 255¶
- cudaDeviceGetPCIBusId_v4010 = 174¶
- cudaDeviceGetStreamPriorityRange_v5050 = 205¶
- cudaDeviceGetTexture1DLinearMaxWidth_v11010 = 347¶
- cudaDeviceGraphMemTrim_v11040 = 423¶
- cudaDeviceRegisterAsyncNotification_v12040 = 465¶
- cudaDeviceReset_v3020 = 164¶
- cudaDeviceSetCacheConfig_v3020 = 169¶
- cudaDeviceSetGraphMemAttribute_v11040 = 425¶
- cudaDeviceSetLimit_v3020 = 167¶
- cudaDeviceSetMemPool_v11020 = 385¶
- cudaDeviceSynchronize_v3020 = 165¶
- cudaDeviceUnregisterAsyncNotification_v12040 = 466¶
- cudaDriverGetVersion_v3020 = 1¶
- cudaEGLStreamConsumerAcquireFrame_v7000 = 259¶
- cudaEGLStreamConsumerConnectWithFlags_v7000 = 268¶
- cudaEGLStreamConsumerConnect_v7000 = 257¶
- cudaEGLStreamConsumerDisconnect_v7000 = 258¶
- cudaEGLStreamConsumerReleaseFrame_v7000 = 260¶
- cudaEGLStreamProducerConnect_v7000 = 261¶
- cudaEGLStreamProducerDisconnect_v7000 = 262¶
- cudaEGLStreamProducerPresentFrame_v7000 = 263¶
- cudaEGLStreamProducerReturnFrame_v7000 = 264¶
- cudaEventCreateFromEGLSync_v9000 = 271¶
- cudaEventCreateWithFlags_v3020 = 134¶
- cudaEventCreate_v3020 = 133¶
- cudaEventDestroy_v3020 = 136¶
- cudaEventElapsedTime_v2_v12080 = 486¶
- cudaEventElapsedTime_v3020 = 139¶
- cudaEventQuery_v3020 = 138¶
- cudaEventRecordWithFlags_ptsz_v11010 = 371¶
- cudaEventRecordWithFlags_v11010 = 370¶
- cudaEventRecord_ptsz_v7000 = 242¶
- cudaEventRecord_v3020 = 135¶
- cudaEventSynchronize_v3020 = 137¶
- cudaExternalMemoryGetMappedBuffer_v10000 = 275¶
- cudaExternalMemoryGetMappedMipmappedArray_v10000 = 276¶
- cudaFreeArray_v3020 = 24¶
- cudaFreeAsync_ptsz_v11020 = 376¶
- cudaFreeAsync_v11020 = 375¶
- cudaFreeHost_v3020 = 26¶
- cudaFreeMipmappedArray_v5000 = 194¶
- cudaFree_v3020 = 22¶
- cudaFuncGetAttributes_v3020 = 15¶
- cudaFuncGetName_v12030 = 451¶
- cudaFuncGetParamInfo_v12040 = 467¶
- cudaFuncSetAttribute_v9000 = 273¶
- cudaFuncSetCacheConfig_v3020 = 14¶
- cudaGLGetDevices_v4010 = 175¶
- cudaGLMapBufferObjectAsync_v3020 = 69¶
- cudaGLMapBufferObject_v3020 = 65¶
- cudaGLRegisterBufferObject_v3020 = 64¶
- cudaGLSetBufferObjectMapFlags_v3020 = 68¶
- cudaGLSetGLDevice_v3020 = 63¶
- cudaGLUnmapBufferObjectAsync_v3020 = 70¶
- cudaGLUnmapBufferObject_v3020 = 66¶
- cudaGLUnregisterBufferObject_v3020 = 67¶
- cudaGetChannelDesc_v3020 = 6¶
- cudaGetDeviceCount_v3020 = 3¶
- cudaGetDeviceFlags_v7000 = 212¶
- cudaGetDeviceProperties_v2_v12000 = 440¶
- cudaGetDeviceProperties_v3020 = 4¶
- cudaGetDevice_v3020 = 17¶
- cudaGetDriverEntryPointByVersion_ptsz_v12050 = 469¶
- cudaGetDriverEntryPointByVersion_v12050 = 468¶
- cudaGetDriverEntryPoint_ptsz_v11030 = 407¶
- cudaGetDriverEntryPoint_v11030 = 406¶
- cudaGetErrorName_v6050 = 209¶
- cudaGetErrorString_v3020 = 12¶
- cudaGetFuncBySymbol_v11000 = 336¶
- cudaGetKernel_v12000 = 439¶
- cudaGetLastError_v3020 = 10¶
- cudaGetMipmappedArrayLevel_v5000 = 193¶
- cudaGetSurfaceObjectResourceDesc_v5000 = 191¶
- cudaGetSurfaceReference_v3020 = 62¶
- cudaGetSymbolAddress_v3020 = 53¶
- cudaGetSymbolSize_v3020 = 54¶
- cudaGetTextureAlignmentOffset_v3020 = 59¶
- cudaGetTextureObjectResourceDesc_v5000 = 187¶
- cudaGetTextureObjectResourceViewDesc_v5000 = 199¶
- cudaGetTextureObjectTextureDesc_v2_v11080 = 435¶
- cudaGetTextureObjectTextureDesc_v5000 = 188¶
- cudaGetTextureReference_v3020 = 60¶
- cudaGraphAddChildGraphNode_v10000 = 298¶
- cudaGraphAddDependencies_v10000 = 307¶
- cudaGraphAddDependencies_v2_v12030 = 458¶
- cudaGraphAddEmptyNode_v10000 = 300¶
- cudaGraphAddEventRecordNode_v11010 = 362¶
- cudaGraphAddEventWaitNode_v11010 = 365¶
- cudaGraphAddExternalSemaphoresSignalNode_v11020 = 397¶
- cudaGraphAddExternalSemaphoresWaitNode_v11020 = 400¶
- cudaGraphAddHostNode_v10000 = 296¶
- cudaGraphAddKernelNode_v10000 = 289¶
- cudaGraphAddMemAllocNode_v11040 = 419¶
- cudaGraphAddMemFreeNode_v11040 = 421¶
- cudaGraphAddMemcpyNode1D_v11010 = 352¶
- cudaGraphAddMemcpyNodeFromSymbol_v11010 = 351¶
- cudaGraphAddMemcpyNodeToSymbol_v11010 = 350¶
- cudaGraphAddMemcpyNode_v10000 = 290¶
- cudaGraphAddMemsetNode_v10000 = 293¶
- cudaGraphAddNode_v12020 = 445¶
- cudaGraphAddNode_v2_v12030 = 460¶
- cudaGraphChildGraphNodeGetGraph_v10000 = 299¶
- cudaGraphClone_v10000 = 301¶
- cudaGraphConditionalHandleCreate_v12030 = 454¶
- cudaGraphCreate_v10000 = 286¶
- cudaGraphDebugDotPrint_v11030 = 408¶
- cudaGraphDestroyNode_v10000 = 309¶
- cudaGraphDestroy_v10000 = 314¶
- cudaGraphEventRecordNodeGetEvent_v11010 = 363¶
- cudaGraphEventRecordNodeSetEvent_v11010 = 364¶
- cudaGraphEventWaitNodeGetEvent_v11010 = 366¶
- cudaGraphEventWaitNodeSetEvent_v11010 = 367¶
- cudaGraphExecChildGraphNodeSetParams_v11010 = 361¶
- cudaGraphExecDestroy_v10000 = 313¶
- cudaGraphExecEventRecordNodeSetEvent_v11010 = 368¶
- cudaGraphExecEventWaitNodeSetEvent_v11010 = 369¶
- cudaGraphExecExternalSemaphoresSignalNodeSetParams_v11020 = 403¶
- cudaGraphExecExternalSemaphoresWaitNodeSetParams_v11020 = 404¶
- cudaGraphExecGetFlags_v12000 = 438¶
- cudaGraphExecHostNodeSetParams_v10020 = 334¶
- cudaGraphExecKernelNodeSetParams_v10010 = 326¶
- cudaGraphExecMemcpyNodeSetParams1D_v11010 = 358¶
- cudaGraphExecMemcpyNodeSetParamsFromSymbol_v11010 = 357¶
- cudaGraphExecMemcpyNodeSetParamsToSymbol_v11010 = 356¶
- cudaGraphExecMemcpyNodeSetParams_v10020 = 332¶
- cudaGraphExecMemsetNodeSetParams_v10020 = 333¶
- cudaGraphExecNodeSetParams_v12020 = 447¶
- cudaGraphExecUpdate_v10020 = 335¶
- cudaGraphExternalSemaphoresSignalNodeGetParams_v11020 = 398¶
- cudaGraphExternalSemaphoresSignalNodeSetParams_v11020 = 399¶
- cudaGraphExternalSemaphoresWaitNodeGetParams_v11020 = 401¶
- cudaGraphExternalSemaphoresWaitNodeSetParams_v11020 = 402¶
- cudaGraphGetEdges_v10000 = 323¶
- cudaGraphGetEdges_v2_v12030 = 455¶
- cudaGraphGetNodes_v10000 = 322¶
- cudaGraphGetRootNodes_v10000 = 304¶
- cudaGraphHostNodeGetParams_v10000 = 297¶
- cudaGraphHostNodeSetParams_v10000 = 321¶
- cudaGraphInstantiateWithFlags_v11040 = 418¶
- cudaGraphInstantiateWithParams_ptsz_v12000 = 437¶
- cudaGraphInstantiateWithParams_v12000 = 436¶
- cudaGraphInstantiate_v10000 = 310¶
- cudaGraphInstantiate_v12000 = 443¶
- cudaGraphKernelNodeCopyAttributes_v11000 = 338¶
- cudaGraphKernelNodeGetAttribute_v11000 = 339¶
- cudaGraphKernelNodeGetParams_v10000 = 287¶
- cudaGraphKernelNodeSetAttribute_v11000 = 340¶
- cudaGraphKernelNodeSetParams_v10000 = 288¶
- cudaGraphLaunch_ptsz_v10000 = 312¶
- cudaGraphLaunch_v10000 = 311¶
- cudaGraphMemAllocNodeGetParams_v11040 = 420¶
- cudaGraphMemFreeNodeGetParams_v11040 = 422¶
- cudaGraphMemcpyNodeGetParams_v10000 = 291¶
- cudaGraphMemcpyNodeSetParams1D_v11010 = 355¶
- cudaGraphMemcpyNodeSetParamsFromSymbol_v11010 = 354¶
- cudaGraphMemcpyNodeSetParamsToSymbol_v11010 = 353¶
- cudaGraphMemcpyNodeSetParams_v10000 = 292¶
- cudaGraphMemsetNodeGetParams_v10000 = 294¶
- cudaGraphMemsetNodeSetParams_v10000 = 295¶
- cudaGraphNodeFindInClone_v10000 = 302¶
- cudaGraphNodeGetDependencies_v10000 = 305¶
- cudaGraphNodeGetDependencies_v2_v12030 = 456¶
- cudaGraphNodeGetDependentNodes_v10000 = 306¶
- cudaGraphNodeGetDependentNodes_v2_v12030 = 457¶
- cudaGraphNodeGetEnabled_v11060 = 427¶
- cudaGraphNodeGetType_v10000 = 303¶
- cudaGraphNodeSetEnabled_v11060 = 426¶
- cudaGraphNodeSetParams_v12020 = 446¶
- cudaGraphReleaseUserObject_v11030 = 417¶
- cudaGraphRemoveDependencies_v10000 = 308¶
- cudaGraphRemoveDependencies_v2_v12030 = 459¶
- cudaGraphRetainUserObject_v11030 = 416¶
- cudaGraphUpload_ptsz_v10000 = 349¶
- cudaGraphUpload_v10000 = 348¶
- cudaGraphicsD3D10RegisterResource_v3020 = 91¶
- cudaGraphicsD3D11RegisterResource_v3020 = 87¶
- cudaGraphicsD3D9RegisterResource_v3020 = 106¶
- cudaGraphicsEGLRegisterImage_v7000 = 256¶
- cudaGraphicsGLRegisterBuffer_v3020 = 73¶
- cudaGraphicsGLRegisterImage_v3020 = 72¶
- cudaGraphicsMapResources_v3020 = 76¶
- cudaGraphicsResourceGetMappedEglFrame_v7000 = 265¶
- cudaGraphicsResourceGetMappedMipmappedArray_v5000 = 196¶
- cudaGraphicsResourceGetMappedPointer_v3020 = 78¶
- cudaGraphicsResourceSetMapFlags_v3020 = 75¶
- cudaGraphicsSubResourceGetMappedArray_v3020 = 79¶
- cudaGraphicsUnmapResources_v3020 = 77¶
- cudaGraphicsUnregisterResource_v3020 = 74¶
- cudaGraphicsVDPAURegisterOutputSurface_v3020 = 83¶
- cudaGraphicsVDPAURegisterVideoSurface_v3020 = 82¶
- cudaHostAlloc_v3020 = 27¶
- cudaHostGetDevicePointer_v3020 = 28¶
- cudaHostGetFlags_v3020 = 29¶
- cudaHostRegister_v4000 = 152¶
- cudaHostUnregister_v4000 = 153¶
- cudaImportExternalMemory_v10000 = 274¶
- cudaImportExternalSemaphore_v10000 = 278¶
- cudaInitDevice_v12000 = 444¶
- cudaIpcCloseMemHandle_v4010 = 180¶
- cudaIpcGetEventHandle_v4010 = 176¶
- cudaIpcGetMemHandle_v4010 = 178¶
- cudaIpcOpenEventHandle_v4010 = 177¶
- cudaIpcOpenMemHandle_v4010 = 179¶
- cudaLaunchCooperativeKernelMultiDevice_v9000 = 272¶
- cudaLaunchCooperativeKernel_ptsz_v9000 = 270¶
- cudaLaunchCooperativeKernel_v9000 = 269¶
- cudaLaunchHostFunc_ptsz_v10000 = 285¶
- cudaLaunchHostFunc_v10000 = 284¶
- cudaLaunchKernelExC_ptsz_v11060 = 431¶
- cudaLaunchKernelExC_v11060 = 430¶
- cudaLaunchKernel_ptsz_v7000 = 214¶
- cudaLaunchKernel_v7000 = 211¶
- cudaLaunch_ptsz_v7000 = 213¶
- cudaLaunch_v3020 = 13¶
- cudaMalloc3DArray_v3020 = 141¶
- cudaMalloc3D_v3020 = 140¶
- cudaMallocArray_v3020 = 23¶
- cudaMallocAsync_ptsz_v11020 = 374¶
- cudaMallocAsync_v11020 = 373¶
- cudaMallocFromPoolAsync_ptsz_v11020 = 392¶
- cudaMallocFromPoolAsync_v11020 = 391¶
- cudaMallocHost_v3020 = 25¶
- cudaMallocManaged_v6000 = 206¶
- cudaMallocMipmappedArray_v5000 = 192¶
- cudaMallocPitch_v3020 = 21¶
- cudaMalloc_v3020 = 20¶
- cudaMemAdvise_v2_v12020 = 448¶
- cudaMemAdvise_v8000 = 254¶
- cudaMemGetInfo_v3020 = 30¶
- cudaMemPoolCreate_v11020 = 383¶
- cudaMemPoolDestroy_v11020 = 384¶
- cudaMemPoolExportPointer_v11020 = 389¶
- cudaMemPoolGetAccess_v11020 = 382¶
- cudaMemPoolGetAttribute_v11020 = 379¶
- cudaMemPoolImportPointer_v11020 = 390¶
- cudaMemPoolSetAccess_v11020 = 380¶
- cudaMemPoolSetAttribute_v11020 = 378¶
- cudaMemPoolTrimTo_v11020 = 377¶
- cudaMemPrefetchAsync_ptsz_v8000 = 253¶
- cudaMemPrefetchAsync_v2_ptsz_v12020 = 450¶
- cudaMemPrefetchAsync_v2_v12020 = 449¶
- cudaMemPrefetchAsync_v8000 = 252¶
- cudaMemRangeGetAttribute_v8000 = 266¶
- cudaMemRangeGetAttributes_v8000 = 267¶
- cudaMemcpy2DArrayToArray_ptds_v7000 = 222¶
- cudaMemcpy2DArrayToArray_v3020 = 38¶
- cudaMemcpy2DAsync_ptsz_v7000 = 228¶
- cudaMemcpy2DAsync_v3020 = 44¶
- cudaMemcpy2DFromArrayAsync_ptsz_v7000 = 230¶
- cudaMemcpy2DFromArrayAsync_v3020 = 46¶
- cudaMemcpy2DFromArray_ptds_v7000 = 220¶
- cudaMemcpy2DFromArray_v3020 = 36¶
- cudaMemcpy2DToArrayAsync_ptsz_v7000 = 229¶
- cudaMemcpy2DToArrayAsync_v3020 = 45¶
- cudaMemcpy2DToArray_ptds_v7000 = 218¶
- cudaMemcpy2DToArray_v3020 = 34¶
- cudaMemcpy2D_ptds_v7000 = 216¶
- cudaMemcpy2D_v3020 = 32¶
- cudaMemcpy3DAsync_ptsz_v7000 = 246¶
- cudaMemcpy3DAsync_v3020 = 145¶
- cudaMemcpy3DBatchAsync_ptsz_v12080 = 485¶
- cudaMemcpy3DBatchAsync_v12080 = 484¶
- cudaMemcpy3DPeerAsync_ptsz_v7000 = 250¶
- cudaMemcpy3DPeerAsync_v4000 = 163¶
- cudaMemcpy3DPeer_ptds_v7000 = 249¶
- cudaMemcpy3DPeer_v4000 = 162¶
- cudaMemcpy3D_ptds_v7000 = 245¶
- cudaMemcpy3D_v3020 = 144¶
- cudaMemcpyArrayToArray_ptds_v7000 = 221¶
- cudaMemcpyArrayToArray_v3020 = 37¶
- cudaMemcpyAsync_ptsz_v7000 = 225¶
- cudaMemcpyAsync_v3020 = 41¶
- cudaMemcpyBatchAsync_ptsz_v12080 = 483¶
- cudaMemcpyBatchAsync_v12080 = 482¶
- cudaMemcpyFromArrayAsync_ptsz_v7000 = 227¶
- cudaMemcpyFromArrayAsync_v3020 = 43¶
- cudaMemcpyFromArray_ptds_v7000 = 219¶
- cudaMemcpyFromArray_v3020 = 35¶
- cudaMemcpyFromSymbolAsync_ptsz_v7000 = 232¶
- cudaMemcpyFromSymbolAsync_v3020 = 48¶
- cudaMemcpyFromSymbol_ptds_v7000 = 224¶
- cudaMemcpyFromSymbol_v3020 = 40¶
- cudaMemcpyPeerAsync_v4000 = 161¶
- cudaMemcpyPeer_v4000 = 160¶
- cudaMemcpyToArrayAsync_ptsz_v7000 = 226¶
- cudaMemcpyToArrayAsync_v3020 = 42¶
- cudaMemcpyToArray_ptds_v7000 = 217¶
- cudaMemcpyToArray_v3020 = 33¶
- cudaMemcpyToSymbolAsync_ptsz_v7000 = 231¶
- cudaMemcpyToSymbolAsync_v3020 = 47¶
- cudaMemcpyToSymbol_ptds_v7000 = 223¶
- cudaMemcpyToSymbol_v3020 = 39¶
- cudaMemcpy_ptds_v7000 = 215¶
- cudaMemcpy_v3020 = 31¶
- cudaMemset2DAsync_ptsz_v7000 = 236¶
- cudaMemset2DAsync_v3020 = 52¶
- cudaMemset2D_ptds_v7000 = 234¶
- cudaMemset2D_v3020 = 50¶
- cudaMemset3DAsync_ptsz_v7000 = 244¶
- cudaMemset3DAsync_v3020 = 143¶
- cudaMemset3D_ptds_v7000 = 243¶
- cudaMemset3D_v3020 = 142¶
- cudaMemsetAsync_ptsz_v7000 = 235¶
- cudaMemsetAsync_v3020 = 51¶
- cudaMemset_ptds_v7000 = 233¶
- cudaMemset_v3020 = 49¶
- cudaMipmappedArrayGetMemoryRequirements_v11060 = 429¶
- cudaMipmappedArrayGetSparseProperties_v11010 = 360¶
- cudaOccupancyAvailableDynamicSMemPerBlock_v10200 = 329¶
- cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags_v7000 = 251¶
- cudaOccupancyMaxActiveBlocksPerMultiprocessor_v6000 = 207¶
- cudaOccupancyMaxActiveBlocksPerMultiprocessor_v6050 = 210¶
- cudaOccupancyMaxActiveClusters_v11070 = 433¶
- cudaOccupancyMaxPotentialClusterSize_v11070 = 432¶
- cudaPeekAtLastError_v3020 = 11¶
- cudaPeerGetDevicePointer_v4000 = 159¶
- cudaPeerRegister_v4000 = 157¶
- cudaPeerUnregister_v4000 = 158¶
- cudaPointerGetAttributes_v4000 = 151¶
- cudaProfilerInitialize_v4000 = 170¶
- cudaProfilerStart_v4000 = 171¶
- cudaProfilerStop_v4000 = 172¶
- cudaRuntimeGetVersion_v3020 = 2¶
- cudaSetDeviceFlags_v3020 = 19¶
- cudaSetDevice_v3020 = 16¶
- cudaSetDoubleForDevice_v3020 = 124¶
- cudaSetDoubleForHost_v3020 = 125¶
- cudaSetValidDevices_v3020 = 18¶
- cudaSetupArgument_v3020 = 9¶
- cudaSignalExternalSemaphoresAsync_ptsz_v10000 = 280¶
- cudaSignalExternalSemaphoresAsync_v10000 = 279¶
- cudaSignalExternalSemaphoresAsync_v2_ptsz_v11020 = 394¶
- cudaSignalExternalSemaphoresAsync_v2_v11020 = 393¶
- cudaStreamAddCallback_ptsz_v7000 = 248¶
- cudaStreamAddCallback_v5000 = 197¶
- cudaStreamAttachMemAsync_ptsz_v7000 = 241¶
- cudaStreamAttachMemAsync_v6000 = 208¶
- cudaStreamBeginCaptureToGraph_ptsz_v12030 = 453¶
- cudaStreamBeginCaptureToGraph_v12030 = 452¶
- cudaStreamBeginCapture_ptsz_v10000 = 316¶
- cudaStreamBeginCapture_v10000 = 315¶
- cudaStreamCopyAttributes_ptsz_v11000 = 342¶
- cudaStreamCopyAttributes_v11000 = 341¶
- cudaStreamCreateWithFlags_v5000 = 198¶
- cudaStreamCreateWithPriority_v5050 = 202¶
- cudaStreamCreate_v3020 = 129¶
- cudaStreamDestroy_v3020 = 130¶
- cudaStreamDestroy_v5050 = 201¶
- cudaStreamEndCapture_ptsz_v10000 = 320¶
- cudaStreamEndCapture_v10000 = 319¶
- cudaStreamGetAttribute_ptsz_v11000 = 344¶
- cudaStreamGetAttribute_v11000 = 343¶
- cudaStreamGetCaptureInfo_ptsz_v10010 = 325¶
- cudaStreamGetCaptureInfo_v10010 = 324¶
- cudaStreamGetCaptureInfo_v2_ptsz_v11030 = 410¶
- cudaStreamGetCaptureInfo_v2_v11030 = 409¶
- cudaStreamGetCaptureInfo_v3_ptsz_v12030 = 462¶
- cudaStreamGetCaptureInfo_v3_v12030 = 461¶
- cudaStreamGetDevice_ptsz_v12080 = 481¶
- cudaStreamGetDevice_v12080 = 480¶
- cudaStreamGetFlags_ptsz_v7000 = 238¶
- cudaStreamGetFlags_v5050 = 204¶
- cudaStreamGetId_ptsz_v12000 = 442¶
- cudaStreamGetId_v12000 = 441¶
- cudaStreamGetPriority_ptsz_v7000 = 237¶
- cudaStreamGetPriority_v5050 = 203¶
- cudaStreamIsCapturing_ptsz_v10000 = 318¶
- cudaStreamIsCapturing_v10000 = 317¶
- cudaStreamQuery_ptsz_v7000 = 240¶
- cudaStreamQuery_v3020 = 132¶
- cudaStreamSetAttribute_ptsz_v11000 = 346¶
- cudaStreamSetAttribute_v11000 = 345¶
- cudaStreamSetFlags_ptsz_v10200 = 331¶
- cudaStreamSetFlags_v10200 = 330¶
- cudaStreamSynchronize_ptsz_v7000 = 239¶
- cudaStreamSynchronize_v3020 = 131¶
- cudaStreamUpdateCaptureDependencies_ptsz_v11030 = 412¶
- cudaStreamUpdateCaptureDependencies_v11030 = 411¶
- cudaStreamUpdateCaptureDependencies_v2_ptsz_v12030 = 464¶
- cudaStreamUpdateCaptureDependencies_v2_v12030 = 463¶
- cudaStreamWaitEvent_ptsz_v7000 = 247¶
- cudaStreamWaitEvent_v3020 = 147¶
- cudaThreadExchangeStreamCaptureMode_v10010 = 327¶
- cudaThreadExit_v3020 = 123¶
- cudaThreadGetCacheConfig_v3020 = 150¶
- cudaThreadGetLimit_v3020 = 127¶
- cudaThreadSetCacheConfig_v3020 = 146¶
- cudaThreadSetLimit_v3020 = 128¶
- cudaThreadSynchronize_v3020 = 126¶
- cudaUnbindTexture_v3020 = 58¶
- cudaUserObjectCreate_v11030 = 413¶
- cudaUserObjectRelease_v11030 = 415¶
- cudaUserObjectRetain_v11030 = 414¶
- cudaVDPAUGetDevice_v3020 = 80¶
- cudaVDPAUSetVDPAUDevice_v3020 = 81¶
- cudaWGLGetDevice_v3020 = 71¶
- cudaWaitExternalSemaphoresAsync_ptsz_v10000 = 282¶
- cudaWaitExternalSemaphoresAsync_v10000 = 281¶
- cudaWaitExternalSemaphoresAsync_v2_ptsz_v11020 = 396¶
- cudaWaitExternalSemaphoresAsync_v2_v11020 = 395¶
- cupti.cupti.activity_configure_unified_memory_counter(intptr_t config, uint32_t count)¶
Set Unified Memory Counter configuration.
- Args:
config (intptr_t): A pointer to
CUpti_ActivityUnifiedMemoryCounterConfigstructures containing Unified Memory counter configuration. count (uint32_t): Number of Unified Memory counter configuration structures.
See also
cuptiActivityConfigureUnifiedMemoryCounter
- cupti.cupti.activity_disable(int kind)¶
Disable collection of a specific kind of activity record.
- Args:
kind (CUpti_ActivityKind): The kind of activity record to stop collecting.
See also
cuptiActivityDisable
- cupti.cupti.activity_disable_context(intptr_t context, int kind)¶
Disable collection of a specific kind of activity record for a context.
- Args:
context (intptr_t): The context for which activity is to be disabled. kind (CUpti_ActivityKind): The kind of activity record to stop collecting.
See also
cuptiActivityDisableContext
- cupti.cupti.activity_enable(int kind)¶
Enable collection of a specific kind of activity record.
- Args:
kind (CUpti_ActivityKind): The kind of activity record to collect.
See also
cuptiActivityEnable
- cupti.cupti.activity_enable_allocation_source(uint8_t enable)¶
Enables tracking the source library for memory allocation requests.
- Args:
enable (uint8_t): is a boolean, denoting whether the source library of the memory allocation request needs to be tracked.
See also
cuptiActivityEnableAllocationSource
- cupti.cupti.activity_enable_and_dump(int kind)¶
Enable collection of a specific kind of activity record. For certain activity kinds it dumps existing records.
- Args:
kind (CUpti_ActivityKind): The kind of activity record to collect.
See also
cuptiActivityEnableAndDump
- cupti.cupti.activity_enable_context(intptr_t context, int kind)¶
Enable collection of a specific kind of activity record for a context.
- Args:
context (intptr_t): The context for which activity is to be enabled. kind (CUpti_ActivityKind): The kind of activity record to collect.
See also
cuptiActivityEnableContext
- cupti.cupti.activity_enable_device_graph(uint8_t enable)¶
Controls the collection of records for device launched graphs.
- Args:
enable (uint8_t): is a boolean, denoting whether these records should be collected.
See also
cuptiActivityEnableDeviceGraph
- cupti.cupti.activity_enable_driver_api(uint32_t cbid, uint8_t enable)¶
Controls the collection of activity records for specific CUDA Driver APIs.
- Args:
cbid (uint32_t): callback id of the CUDA Driver API. This can be found in the header
cupti_driver_cbid.h. enable (uint8_t): is a boolean, denoting whether to enable or disable the collection.
See also
cuptiActivityEnableDriverApi
- cupti.cupti.activity_enable_hw_trace(uint8_t enable)¶
Enables the collection of CUDA kernel timestamps through HW events.
- Args:
enable (uint8_t): is a boolean, denoting whether to enable or disable the collection through HW events.
See also
cuptiActivityEnableHWTrace
- cupti.cupti.activity_enable_latency_timestamps(uint8_t enable)¶
Controls the collection of queued and submitted timestamps for kernels.
- Args:
enable (uint8_t): is a boolean, denoting whether these timestamps should be collected.
See also
cuptiActivityEnableLatencyTimestamps
- cupti.cupti.activity_enable_launch_attributes(uint8_t enable)¶
Controls the collection of launch attributes for kernels.
- Args:
enable (uint8_t): is a boolean denoting whether these launch attributes should be collected.
See also
cuptiActivityEnableLaunchAttributes
- cupti.cupti.activity_enable_runtime_api(uint32_t cbid, uint8_t enable)¶
Controls the collection of activity records for specific CUDA Runtime APIs.
- Args:
cbid (uint32_t): callback id of the CUDA Runtime API. This can be found in the header
cupti_runtime_cbid.h. enable (uint8_t): is a boolean, denoting whether to enable or disable the collection.
See also
cuptiActivityEnableRuntimeApi
- cupti.cupti.activity_flush_all(uint32_t flag)¶
Request to deliver activity records via the buffer completion callback.
- Args:
flag (uint32_t): The flag can be set to indicate a forced flush. See CUpti_ActivityFlag.
See also
cuptiActivityFlushAll
- cupti.cupti.activity_flush_period(uint32_t time)¶
Sets the flush period for the worker thread.
- Args:
time (uint32_t): flush period in milliseconds (ms).
See also
cuptiActivityFlushPeriod
- cupti.cupti.activity_get_attribute(int attr, intptr_t value_size, intptr_t value)¶
Read an activity API attribute.
- Args:
attr (CUpti_ActivityAttribute): The attribute to read. value_size (intptr_t): Size of buffer pointed by the value, and returns the number of bytes written to
value. value (intptr_t): Returns the value of the attribute.
See also
cuptiActivityGetAttribute
- cupti.cupti.activity_get_num_dropped_records(intptr_t context, uint32_t stream_id, intptr_t dropped)¶
Get the number of activity records that were dropped of insufficient buffer space.
- Args:
context (intptr_t): The context, or NULL to get dropped count from global queue. stream_id (uint32_t): The stream ID. dropped (intptr_t): The number of records that were dropped since the last call to this function.
See also
cuptiActivityGetNumDroppedRecords
- cupti.cupti.activity_pop_external_correlation_id(int kind) uint64_t¶
Pop an external correlation id for the calling thread.
- Args:
kind (CUpti_ExternalCorrelationKind): The kind of external API activities should be correlated with.
- Returns:
uint64_t: If the function returns successful, contains the last external correlation id for this
kind, can be NULL.
See also
cuptiActivityPopExternalCorrelationId
- cupti.cupti.activity_push_external_correlation_id(int kind, uint64_t id)¶
Push an external correlation id for the calling thread.
- Args:
kind (CUpti_ExternalCorrelationKind): The kind of external API activities should be correlated with. id (uint64_t): External correlation id.
See also
cuptiActivityPushExternalCorrelationId
- cupti.cupti.activity_register_callbacks(func_buffer_requested, func_buffer_completed)¶
Registers callback functions with CUPTI for activity buffer handling.
- Args:
func_buffer_requested (function): callback which is invoked when an empty buffer is requested by CUPTI. func_buffer_completed (function): callback which is invoked when a buffer containing activity records is available from CUPTI.
See also
cuptiActivityRegisterCallbacks
- cupti.cupti.activity_register_timestamp_callback(func_timestamp)¶
Registers callback function with CUPTI for providing timestamp.
- Args:
func_timestamp (function): callback which is invoked when a timestamp is needed by CUPTI.
See also
cuptiActivityRegisterTimestampCallback
- cupti.cupti.activity_set_attribute(int attr, intptr_t value_size, intptr_t value)¶
Write an activity API attribute.
- Args:
attr (CUpti_ActivityAttribute): The attribute to write. value_size (intptr_t): The size, in bytes, of the value. value (intptr_t): The attribute value to write.
See also
cuptiActivitySetAttribute
- cupti.cupti.compute_capability_supported(int major, int minor) int¶
Check support for a compute capability.
- Args:
major (int): The major revision number of the compute capability. minor (int): The minor revision number of the compute capability.
- Returns:
int: Pointer to an integer to return the support status.
See also
cuptiComputeCapabilitySupported
- cupti.cupti.device_supported(int dev) int¶
Check support for a compute device.
- Args:
dev (int): The device handle returned by CUDA Driver API cuDeviceGet.
- Returns:
int: Pointer to an integer to return the support status.
See also
cuptiDeviceSupported
- cupti.cupti.device_virtualization_mode(int dev) int¶
Query the virtualization mode of the device.
- Args:
dev (int): The device handle returned by CUDA Driver API cuDeviceGet.
- Returns:
int: Pointer to an CUpti_DeviceVirtualizationMode to return the virtualization mode.
See also
cuptiDeviceVirtualizationMode
- cupti.cupti.enable_all_domains(uint32_t enable, intptr_t subscriber)¶
Enable or disable all callbacks in all domains.
- Args:
enable (uint32_t): New enable state for all callbacks in all domain. Zero disables all callbacks, non-zero enables all callbacks. subscriber (intptr_t): Handle to callback subscription.
See also
cuptiEnableAllDomains
- cupti.cupti.enable_callback(uint32_t enable, intptr_t subscriber, int domain, uint32_t cbid)¶
Enable or disabled callbacks for a specific domain and callback ID.
- Args:
enable (uint32_t): New enable state for the callback. Zero disables the callback, non-zero enables the callback. subscriber (intptr_t): Handle to callback subscription. domain (CUpti_CallbackDomain): The domain of the callback. cbid (uint32_t): The ID of the callback.
See also
cuptiEnableCallback
- cupti.cupti.enable_domain(uint32_t enable, intptr_t subscriber, int domain)¶
Enable or disabled all callbacks for a specific domain.
- Args:
enable (uint32_t): New enable state for all callbacks in the domain. Zero disables all callbacks, non-zero enables all callbacks. subscriber (intptr_t): Handle to callback subscription. domain (CUpti_CallbackDomain): The domain of the callback.
See also
cuptiEnableDomain
- cupti.cupti.finalize()¶
Detach CUPTI from the running process.
See also
cuptiFinalize
- cupti.cupti.get_auto_boost_state(intptr_t context, intptr_t state)¶
Get auto boost state.
- Args:
context (intptr_t): A valid CUcontext. state (intptr_t): A pointer to
CUpti_ActivityAutoBoostStatestructure which contains the current state and the id of the process that has requested the current state.
See also
cuptiGetAutoBoostState
- cupti.cupti.get_callback_name(int domain, uint32_t cbid)¶
Get the name of a callback for a specific domain and callback ID.
- Args:
domain (CUpti_CallbackDomain): The domain of the callback. cbid (uint32_t): The ID of the callback.
- Returns:
name (str): Returns name of the callback for the specified domain and callback ID
See also
cuptiGetCallbackName
- cupti.cupti.get_callback_state(intptr_t subscriber, int domain, uint32_t cbid) uint32_t¶
Get the current enabled/disabled state of a callback for a specific domain and function ID.
- Args:
subscriber (intptr_t): Handle to the initialize subscriber. domain (CUpti_CallbackDomain): The domain of the callback. cbid (uint32_t): The ID of the callback.
- Returns:
uint32_t: Returns non-zero if callback enabled, zero if not enabled.
See also
cuptiGetCallbackState
- cupti.cupti.get_context_id(intptr_t context) uint32_t¶
Get the ID of a context.
- Args:
context (intptr_t): The context.
- Returns:
uint32_t: Returns a process-unique ID for the context.
See also
cuptiGetContextId
- cupti.cupti.get_device_id(intptr_t context) uint32_t¶
Get the ID of a device.
- Args:
context (intptr_t): The context, or NULL to indicate the current context.
- Returns:
uint32_t: Returns the ID of the device that is current for the calling thread.
See also
cuptiGetDeviceId
- cupti.cupti.get_graph_exec_id(intptr_t graph_exec) uint32_t¶
Get the unique ID of executable graph.
- Args:
graph_exec (intptr_t): The executable graph.
- Returns:
uint32_t: Returns the unique ID of the executable graph.
See also
cuptiGetGraphExecId
- cupti.cupti.get_graph_id(intptr_t graph) uint32_t¶
Get the unique ID of graph.
- Args:
graph (intptr_t): The graph.
- Returns:
uint32_t: Returns the unique ID of the graph.
See also
cuptiGetGraphId
- cupti.cupti.get_graph_node_id(intptr_t node) uint64_t¶
Get the unique ID of a graph node.
- Args:
node (intptr_t): The graph node.
- Returns:
uint64_t: Returns the unique ID of the node.
See also
cuptiGetGraphNodeId
- cupti.cupti.get_last_error()¶
Returns the last error from a cupti call or callback.
See also
cuptiGetLastError
- cupti.cupti.get_stream_id_ex(intptr_t context, intptr_t stream, uint8_t per_thread_stream) uint32_t¶
Get the ID of a stream.
- Args:
context (intptr_t): If non-NULL then the stream is checked to ensure that it belongs to this context. Typically this parameter should be null. stream (intptr_t): The stream. per_thread_stream (uint8_t): Flag to indicate if program is compiled for per-thread streams.
- Returns:
uint32_t: Returns a context-unique ID for the stream.
See also
cuptiGetStreamIdEx
- cupti.cupti.get_thread_id_type() int¶
Get the thread-id type.
- Returns:
int: .
See also
cuptiGetThreadIdType
- cupti.cupti.get_timestamp() uint64_t¶
Get the CUPTI timestamp.
- Returns:
uint64_t: Returns the CUPTI timestamp.
See also
cuptiGetTimestamp
- cupti.cupti.set_thread_id_type(int type)¶
Set the thread-id type.
- Args:
type (CUpti_ActivityThreadIdType): .
See also
cuptiSetThreadIdType
- cupti.cupti.subscribe(callback, userdata) intptr_t¶
Initialize a callback subscriber with a callback function and user data.
- Args:
callback (CUpti_CallbackFunc): The callback function. userdata (intptr_t): A pointer to user data. This data will be passed to the callback function via the
userdataparameter.- Returns:
intptr_t: Returns handle to initialize subscriber.
See also
cuptiSubscribe
- cupti.cupti.supported_domains()¶
Get the available callback domains.
- Returns:
list[cupti.cupti.CallbackDomain]: List of all available callback domains
See also
cuptiSupportedDomains
- cupti.cupti.unsubscribe(intptr_t subscriber)¶
Unregister a callback subscriber.
- Args:
subscriber (intptr_t): Handle to the initialize subscriber.
See also
cuptiUnsubscribe