2. CUPTI Python API Reference¶
2.1. Documentation Issues¶
The CUPTI Python API Reference section of the document is automatically generated and has some issues:
All the CUPTI Python enumerations, functions and classes are listed together in a single section.
The members of the python classes are not adequately documented. To get more information for a member, please refer the CUPTI C documentation.
The
kind
member of python classes has typeint
, instead ofcupti.cupti.ActivityKind
. While using thekind
member, please usecupti.cupti.ActivityKind
to get the enum value.The “See also” section for each CUPTI Python API function shows the corresponding CUPTI C API name. But it does not provide a link to the CUPTI C documentation.
The “See <data type name>” references in the generated documentation point to the original C data structures instead of their Python equivalents.
2.2. API Reference¶
- exception cupti.cupti.cuptiError(status: int)¶
Bases:
Exception
- class cupti.cupti.ActivityAPI(size=1)¶
Bases:
object
- cbid¶
cbid : int
- correlation_id¶
correlation_id : int
- end¶
end : int
- kind¶
kind : int
- process_id¶
process_id : int
- return_value¶
return_value : int
- start¶
start : int
- thread_id¶
thread_id : int
- class cupti.cupti.ActivityAttribute(value)¶
Bases:
IntEnum
See CUpti_ActivityAttribute.
- DEVICE_BUFFER_FORCE_INT = 2147483647¶
- DEVICE_BUFFER_POOL_LIMIT = 2¶
- DEVICE_BUFFER_PRE_ALLOCATE_VALUE = 6¶
- DEVICE_BUFFER_SIZE = 0¶
- DEVICE_BUFFER_SIZE_CDP = 1¶
- MEM_ALLOCATION_TYPE_HOST_PINNED = 8¶
- PER_THREAD_ACTIVITY_BUFFER = 9¶
- PROFILING_SEMAPHORE_POOL_LIMIT = 4¶
- PROFILING_SEMAPHORE_POOL_SIZE = 3¶
- PROFILING_SEMAPHORE_PRE_ALLOCATE_VALUE = 7¶
- ZEROED_OUT_ACTIVITY_BUFFER = 5¶
- class cupti.cupti.ActivityAutoBoostState(size=1)¶
Bases:
object
- enabled¶
enabled : int
- pid¶
pid : int
- class cupti.cupti.ActivityCdpKernel(size=1)¶
Bases:
object
- block_x¶
block_x : int
- block_y¶
block_y : int
- block_z¶
block_z : int
- cache_config¶
cache_config : cupti.cupti.CacheConfig
- completed¶
completed : int
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
dynamic_shared_memory : int
- end¶
end : int
- grid_id¶
grid_id : int
- grid_x¶
grid_x : int
- grid_y¶
grid_y : int
- grid_z¶
grid_z : int
- kind¶
kind : int
- local_memory_per_thread¶
local_memory_per_thread : int
- local_memory_total¶
local_memory_total : int
- name¶
name : str
- pad¶
pad : int
- parent_block_x¶
parent_block_x : int
- parent_block_y¶
parent_block_y : int
- parent_block_z¶
parent_block_z : int
- parent_grid_id¶
parent_grid_id : int
- queued¶
queued : int
- registers_per_thread¶
registers_per_thread : int
shared_memory_config : int
- start¶
start : int
static_shared_memory : int
- stream_id¶
stream_id : int
- submitted¶
submitted : int
- class cupti.cupti.ActivityComputeApiKind(value)¶
Bases:
IntEnum
See CUpti_ActivityComputeApiKind.
- CUDA = 1¶
- CUDA_MPS = 2¶
- FORCE_INT = 2147483647¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityContext3(size=1)¶
Bases:
object
- cig_mode¶
cig_mode : int
- compute_api_kind¶
compute_api_kind : int
- context_id¶
context_id : int
- device_id¶
device_id : int
- is_green_context¶
is_green_context : int
- kind¶
kind : int
- null_stream_id¶
null_stream_id : int
- num_multiprocessors¶
num_multiprocessors : int
- padding¶
padding : int
- padding2¶
padding2 : int
- parent_context_id¶
parent_context_id : int
- class cupti.cupti.ActivityCudaEvent2(size=1)¶
Bases:
object
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- cuda_event_sync_id¶
cuda_event_sync_id : int
- device_id¶
device_id : int
- device_timestamp¶
device_timestamp : int
- event_id¶
event_id : int
- kind¶
kind : int
- pad¶
pad : int
- pad2¶
pad2 : int
- reserved0¶
reserved0 : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityDevice5(size=1)¶
Bases:
object
- compute_capability_major¶
compute_capability_major (~_numpy.uint32):
- compute_capability_minor¶
compute_capability_minor (~_numpy.uint32):
- compute_instance_id¶
compute_instance_id (~_numpy.uint32):
- constant_memory_size¶
constant_memory_size (~_numpy.uint32):
- core_clock_rate¶
core_clock_rate (~_numpy.uint32):
- ecc_enabled¶
ecc_enabled (~_numpy.uint32):
- global_memory_bandwidth¶
global_memory_bandwidth (~_numpy.uint64):
- global_memory_size¶
global_memory_size (~_numpy.uint64):
- gpu_instance_id¶
gpu_instance_id (~_numpy.uint32):
- id¶
id (~_numpy.uint32):
- is_cuda_visible¶
is_cuda_visible (~_numpy.uint8):
- is_mig_enabled¶
is_mig_enabled (~_numpy.uint8):
- is_numa_node¶
is_numa_node (~_numpy.uint32):
- kind¶
kind (~_numpy.int32):
- l2cache_size¶
l2cache_size (~_numpy.uint32):
- max_block_dim_x¶
max_block_dim_x (~_numpy.uint32):
- max_block_dim_y¶
max_block_dim_y (~_numpy.uint32):
- max_block_dim_z¶
max_block_dim_z (~_numpy.uint32):
- max_blocks_per_multiprocessor¶
max_blocks_per_multiprocessor (~_numpy.uint32):
- max_grid_dim_x¶
max_grid_dim_x (~_numpy.uint32):
- max_grid_dim_y¶
max_grid_dim_y (~_numpy.uint32):
- max_grid_dim_z¶
max_grid_dim_z (~_numpy.uint32):
- max_ipc¶
max_ipc (~_numpy.uint32):
- max_registers_per_block¶
max_registers_per_block (~_numpy.uint32):
- max_registers_per_multiprocessor¶
max_registers_per_multiprocessor (~_numpy.uint32):
max_shared_memory_per_block (~_numpy.uint32):
max_shared_memory_per_multiprocessor (~_numpy.uint32):
- max_threads_per_block¶
max_threads_per_block (~_numpy.uint32):
- max_warps_per_multiprocessor¶
max_warps_per_multiprocessor (~_numpy.uint32):
- mig_uuid¶
- name¶
name (~_numpy.intp):
- num_memcpy_engines¶
num_memcpy_engines (~_numpy.uint32):
- num_multiprocessors¶
num_multiprocessors (~_numpy.uint32):
- num_threads_per_warp¶
num_threads_per_warp (~_numpy.uint32):
- numa_id¶
numa_id (~_numpy.uint32):
- uuid¶
- class cupti.cupti.ActivityDeviceAttribute(size=1)¶
Bases:
object
- attribute¶
attribute : cupti.cupti.Attribute
- device_id¶
device_id : int
- kind¶
kind : int
- value¶
value : cupti.cupti.Value
- class cupti.cupti.ActivityDeviceGraphTrace(size=1)¶
Bases:
object
- context_id¶
context_id : int
- device_id¶
device_id : int
- device_launch_mode¶
device_launch_mode : int
- end¶
end : int
- graph_id¶
graph_id : int
- kind¶
kind : int
- launcher_graph_id¶
launcher_graph_id : int
- start¶
start : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityEnvironment(size=1)¶
Bases:
object
- data¶
data : cupti.cupti.Data
- device_id¶
device_id : int
- environment_kind¶
environment_kind : int
- kind¶
kind : int
- timestamp¶
timestamp : int
- class cupti.cupti.ActivityEnvironmentKind(value)¶
Bases:
IntEnum
See CUpti_ActivityEnvironmentKind.
- COOLING = 4¶
- COUNT = 5¶
- KIND_FORCE_INT = 2147483647¶
- POWER = 3¶
- SPEED = 1¶
- TEMPERATURE = 2¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityExternalCorrelation(size=1)¶
Bases:
object
- correlation_id¶
correlation_id : int
- external_id¶
external_id : int
- external_kind¶
external_kind : int
- kind¶
kind : int
- class cupti.cupti.ActivityFlag(value)¶
Bases:
IntEnum
See CUpti_ActivityFlag.
- DEVICE_ATTRIBUTE_CUDEVICE = 1¶
- DEVICE_CONCURRENT_KERNELS = 1¶
- FLUSH_FORCED = 1¶
- FORCE_INT = 2147483647¶
- GLOBAL_ACCESS_KIND_CACHED = 512¶
- GLOBAL_ACCESS_KIND_LOAD = 256¶
- GLOBAL_ACCESS_KIND_SIZE_MASK = 255¶
- INSTRUCTION_CLASS_MASK = 510¶
- INSTRUCTION_VALUE_INVALID = 1¶
- MARKER_COLOR_ARGB = 2¶
- MARKER_COLOR_NONE = 1¶
- MARKER_INSTANTANEOUS = 1¶
- MARKER_START = 2¶
- MARKER_SYNC_ACQUIRE = 8¶
- MARKER_SYNC_ACQUIRE_FAILED = 32¶
- MARKER_SYNC_ACQUIRE_SUCCESS = 16¶
- MARKER_SYNC_RELEASE = 64¶
- MEMCPY_ASYNC = 1¶
- MEMSET_ASYNC = 1¶
- METRIC_OVERFLOWED = 1¶
- METRIC_VALUE_INVALID = 2¶
- NONE = 0¶
- SHARED_ACCESS_KIND_LOAD = 256¶
- SHARED_ACCESS_KIND_SIZE_MASK = 255¶
- THRASHING_IN_CPU = 1¶
- THROTTLING_IN_CPU = 1¶
- class cupti.cupti.ActivityFunction(size=1)¶
Bases:
object
- context_id¶
context_id : int
- function_ind_ex¶
function_ind_ex : int
- id¶
id : int
- kind¶
kind : int
- module_id¶
module_id : int
- name¶
name : str
- pad¶
pad : int
- class cupti.cupti.ActivityGraphTrace2(size=1)¶
Bases:
object
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- end¶
end : int
- end_context_id¶
end_context_id : int
- end_device_id¶
end_device_id : int
- graph_id¶
graph_id : int
- kind¶
kind : int
- start¶
start : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityInstructionClass(value)¶
Bases:
IntEnum
See CUpti_ActivityInstructionClass.
- BARRIER = 17¶
- BIT_CONVERSION = 4¶
- CONSTANT = 11¶
- CONTROL_FLOW = 5¶
- FP_16 = 19¶
- FP_32 = 1¶
- FP_64 = 2¶
- GENERIC = 9¶
- GLOBAL = 6¶
- GLOBAL_ATOMIC = 13¶
- INTEGER = 3¶
- INTER_THREAD_COMMUNICATION = 16¶
- KIND_FORCE_INT = 2147483647¶
- LOCAL = 8¶
- MISCELLANEOUS = 18¶
- SHARED = 7¶
- SHARED_ATOMIC = 14¶
- SURFACE = 10¶
- SURFACE_ATOMIC = 15¶
- TEXTURE = 12¶
- UNIFORM = 20¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityJit2(size=1)¶
Bases:
object
- cache_path¶
cache_path : str
- cache_size¶
cache_size : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- end¶
end : int
- jit_entry_type¶
jit_entry_type : int
- jit_operation_correlation_id¶
jit_operation_correlation_id : int
- jit_operation_type¶
jit_operation_type : int
- kind¶
kind : int
- padding¶
padding : int
- process_id¶
process_id : int
- start¶
start : int
- thread_id¶
thread_id : int
- class cupti.cupti.ActivityJitEntryType(value)¶
Bases:
IntEnum
See CUpti_ActivityJitEntryType.
- INVALID = 0¶
- NVVM_IR_TO_PTX = 2¶
- PTX_TO_CUBIN = 1¶
- TYPE_FORCE_INT = 2147483647¶
- class cupti.cupti.ActivityJitOperationType(value)¶
Bases:
IntEnum
See CUpti_ActivityJitOperationType.
- CACHE_LOAD = 1¶
- CACHE_STORE = 2¶
- COMPILE = 3¶
- INVALID = 0¶
- TYPE_FORCE_INT = 2147483647¶
- class cupti.cupti.ActivityKernel9(size=1)¶
Bases:
object
- block_x¶
block_x : int
- block_y¶
block_y : int
- block_z¶
block_z : int
- cache_config¶
cache_config : cupti.cupti.CacheConfig
- channel_id¶
channel_id : int
- channel_type¶
channel_type : int
- cluster_scheduling_policy¶
cluster_scheduling_policy : int
- cluster_x¶
cluster_x : int
- cluster_y¶
cluster_y : int
- cluster_z¶
cluster_z : int
- completed¶
completed : int
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
dynamic_shared_memory : int
- end¶
end : int
- graph_id¶
graph_id : int
- graph_node_id¶
graph_node_id : int
- grid_id¶
grid_id : int
- grid_x¶
grid_x : int
- grid_y¶
grid_y : int
- grid_z¶
grid_z : int
is_shared_memory_carveout_requested : int
- kind¶
kind : int
- launch_type¶
launch_type : int
- local_memory_per_thread¶
local_memory_per_thread : int
- local_memory_total¶
local_memory_total : int
- local_memory_total_v2¶
local_memory_total_v2 : int
- max_active_clusters¶
max_active_clusters : int
- max_potential_cluster_size¶
max_potential_cluster_size : int
- name¶
name : str
- p_access_policy_window¶
p_access_policy_window : int
- padding¶
padding : int
- partitioned_global_cache_executed¶
partitioned_global_cache_executed : int
- partitioned_global_cache_requested¶
partitioned_global_cache_requested : int
- queued¶
queued : int
- registers_per_thread¶
registers_per_thread : int
- reserved0¶
reserved0 : int
shared_memory_carveout_requested : int
shared_memory_config : int
shared_memory_executed : int
- shmem_limit_config¶
shmem_limit_config : int
- start¶
start : int
static_shared_memory : int
- stream_id¶
stream_id : int
- submitted¶
submitted : int
- class cupti.cupti.ActivityKind(value)¶
Bases:
IntEnum
See CUpti_ActivityKind.
- BRANCH = 16¶
- CDP_KERNEL = 18¶
- CONCURRENT_KERNEL = 10¶
- CONFIDENTIAL_COMPUTE_ROTATION = 53¶
- CONTEXT = 9¶
- COUNT = 56¶
- CUDA_EVENT = 36¶
- DEVICE = 8¶
- DEVICE_ATTRIBUTE = 28¶
- DEVICE_GRAPH_TRACE = 54¶
- DRIVER = 4¶
- ENVIRONMENT = 20¶
- EVENT = 6¶
- EVENT_INSTANCE = 21¶
- EXTERNAL_CORRELATION = 39¶
- FORCE_INT = 2147483647¶
- FUNCTION = 26¶
- GLOBAL_ACCESS = 15¶
- GRAPH_TRACE = 51¶
- INSTANTANEOUS_EVENT = 41¶
- INSTANTANEOUS_EVENT_INSTANCE = 42¶
- INSTANTANEOUS_METRIC = 43¶
- INSTANTANEOUS_METRIC_INSTANCE = 44¶
- INSTRUCTION_CORRELATION = 32¶
- INSTRUCTION_EXECUTION = 24¶
- INTERNAL_LAUNCH_API = 48¶
- INVALID = 0¶
- JIT = 52¶
- KERNEL = 3¶
- MARKER = 12¶
- MARKER_DATA = 13¶
- MEMCPY = 1¶
- MEMCPY2 = 22¶
- MEMORY = 45¶
- MEMORY2 = 49¶
- MEMORY_POOL = 50¶
- MEMSET = 2¶
- MEM_DECOMPRESS = 55¶
- METRIC = 7¶
- METRIC_INSTANCE = 23¶
- MODULE = 27¶
- NAME = 11¶
- NVLINK = 40¶
- OPENACC_DATA = 33¶
- OPENACC_LAUNCH = 34¶
- OPENACC_OTHER = 35¶
- OPENMP = 47¶
- OVERHEAD = 17¶
- PCIE = 46¶
- PC_SAMPLING = 30¶
- PC_SAMPLING_RECORD_INFO = 31¶
- PREEMPTION = 19¶
- RUNTIME = 5¶
- SHARED_ACCESS = 29¶
- SOURCE_LOCATOR = 14¶
- STREAM = 37¶
- SYNCHRONIZATION = 38¶
- UNIFIED_MEMORY_COUNTER = 25¶
- class cupti.cupti.ActivityLaunchType(value)¶
Bases:
IntEnum
See CUpti_ActivityLaunchType.
- CBL_COMMANDLIST = 3¶
- COOPERATIVE_MULTI_DEVICE = 2¶
- COOPERATIVE_SINGLE_DEVICE = 1¶
- REGULAR = 0¶
- class cupti.cupti.ActivityMarker2(size=1)¶
Bases:
object
- domain¶
domain : str
- id¶
id : int
- kind¶
kind : int
- name¶
name : str
- object_id¶
object_id : cupti.cupti.ActivityObjectKindId
- object_kind¶
object_kind : int
- pad¶
pad : int
- timestamp¶
timestamp : int
- class cupti.cupti.ActivityMarkerData(size=1)¶
Bases:
object
- category¶
category : int
- color¶
color : int
- id¶
id : int
- kind¶
kind : int
- payload¶
payload : cupti.cupti.MetricValue
- payload_kind¶
payload_kind : int
- class cupti.cupti.ActivityMemDecompress(size=1)¶
Bases:
object
- channel_id¶
channel_id : int
- channel_type¶
channel_type : int
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- end¶
end : int
- kind¶
kind : int
- number_of_operations¶
number_of_operations : int
- reserved0¶
reserved0 : int
- source_bytes¶
source_bytes : int
- start¶
start : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityMemcpy6(size=1)¶
Bases:
object
- bytes¶
bytes : int
- channel_id¶
channel_id : int
- channel_type¶
channel_type : int
- context_id¶
context_id : int
- copy_count¶
copy_count : int
- copy_kind¶
copy_kind : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- dst_kind¶
dst_kind : int
- end¶
end : int
- graph_id¶
graph_id : int
- graph_node_id¶
graph_node_id : int
- kind¶
kind : int
- pad¶
pad : int
- pad2¶
pad2 : int
- reserved0¶
reserved0 : int
- runtime_correlation_id¶
runtime_correlation_id : int
- src_kind¶
src_kind : int
- start¶
start : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityMemcpyKind(value)¶
Bases:
IntEnum
See CUpti_ActivityMemcpyKind.
- ATOA = 5¶
- ATOD = 6¶
- ATOH = 4¶
- DTOA = 7¶
- DTOD = 8¶
- DTOH = 2¶
- FORCE_INT = 2147483647¶
- HTOA = 3¶
- HTOD = 1¶
- HTOH = 9¶
- PTOP = 10¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityMemcpyPtoP4(size=1)¶
Bases:
object
- bytes¶
bytes : int
- channel_id¶
channel_id : int
- channel_type¶
channel_type : int
- context_id¶
context_id : int
- copy_kind¶
copy_kind : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- dst_context_id¶
dst_context_id : int
- dst_device_id¶
dst_device_id : int
- dst_kind¶
dst_kind : int
- end¶
end : int
- graph_id¶
graph_id : int
- graph_node_id¶
graph_node_id : int
- kind¶
kind : int
- reserved0¶
reserved0 : int
- src_context_id¶
src_context_id : int
- src_device_id¶
src_device_id : int
- src_kind¶
src_kind : int
- start¶
start : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityMemory(size=1)¶
Bases:
object
- address¶
address : int
- alloc_pc¶
alloc_pc : int
- bytes¶
bytes : int
- context_id¶
context_id : int
- device_id¶
device_id : int
- end¶
end : int
- free_pc¶
free_pc : int
- kind¶
kind : int
- memory_kind¶
memory_kind : int
- name¶
name : str
- pad¶
pad : int
- process_id¶
process_id : int
- start¶
start : int
- class cupti.cupti.ActivityMemory4(size=1)¶
Bases:
object
- address¶
address : int
- bytes¶
bytes : int
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- is_async¶
is_async : int
- kind¶
kind : int
- memory_kind¶
memory_kind : int
- memory_operation_type¶
memory_operation_type : int
- memory_pool_config¶
memory_pool_config : cupti.cupti.MemoryPoolConfig
- name¶
name : str
- pad1¶
pad1 : int
- pc¶
pc : int
- process_id¶
process_id : int
- source¶
source : str
- stream_id¶
stream_id : int
- timestamp¶
timestamp : int
- class cupti.cupti.ActivityMemoryKind(value)¶
Bases:
IntEnum
See CUpti_ActivityMemoryKind.
- ARRAY = 4¶
- DEVICE = 3¶
- DEVICE_STATIC = 6¶
- FORCE_INT = 2147483647¶
- MANAGED = 5¶
- MANAGED_STATIC = 7¶
- PAGEABLE = 1¶
- PINNED = 2¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityMemoryOperationType(value)¶
Bases:
IntEnum
See CUpti_ActivityMemoryOperationType.
- ALLOCATION = 1¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- RELEASE = 2¶
- class cupti.cupti.ActivityMemoryPool2(size=1)¶
Bases:
object
- address¶
address : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- kind¶
kind : int
- memory_pool_operation_type¶
memory_pool_operation_type : int
- memory_pool_type¶
memory_pool_type : int
- min_bytes_to_keep¶
min_bytes_to_keep : int
- process_id¶
process_id : int
- release_threshold¶
release_threshold : int
- timestamp¶
timestamp : int
- utilized_size¶
utilized_size : int
- class cupti.cupti.ActivityMemoryPoolOperationType(value)¶
Bases:
IntEnum
See CUpti_ActivityMemoryPoolOperationType.
- CREATED = 1¶
- DESTROYED = 2¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- TRIMMED = 3¶
- class cupti.cupti.ActivityMemoryPoolType(value)¶
Bases:
IntEnum
See CUpti_ActivityMemoryPoolType.
- FORCE_INT = 2147483647¶
- IMPORTED = 2¶
- INVALID = 0¶
- LOCAL = 1¶
- class cupti.cupti.ActivityMemset4(size=1)¶
Bases:
object
- bytes¶
bytes : int
- channel_id¶
channel_id : int
- channel_type¶
channel_type : int
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- device_id¶
device_id : int
- end¶
end : int
- graph_id¶
graph_id : int
- graph_node_id¶
graph_node_id : int
- kind¶
kind : int
- memory_kind¶
memory_kind : int
- pad¶
pad : int
- pad2¶
pad2 : int
- reserved0¶
reserved0 : int
- start¶
start : int
- stream_id¶
stream_id : int
- value¶
value : int
- class cupti.cupti.ActivityModule(size=1)¶
Bases:
object
- context_id¶
context_id : int
- cubin¶
cubin : int
- cubin_size¶
cubin_size : int
- id¶
id : int
- kind¶
kind : int
- class cupti.cupti.ActivityName(size=1)¶
Bases:
object
- kind¶
kind : int
- name¶
name : str
- object_id¶
object_id : cupti.cupti.ActivityObjectKindId
- object_kind¶
object_kind : int
- pad¶
pad : int
- class cupti.cupti.ActivityObjectKind(value)¶
Bases:
IntEnum
See CUpti_ActivityObjectKind.
- CONTEXT = 4¶
- DEVICE = 3¶
- FORCE_INT = 2147483647¶
- PROCESS = 1¶
- STREAM = 5¶
- THREAD = 2¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityObjectKindId¶
Bases:
object
- contextId¶
contextId: ‘uint32_t’
- deviceId¶
deviceId: ‘uint32_t’
- processId¶
processId: ‘uint32_t’
- streamId¶
streamId: ‘uint32_t’
- threadId¶
threadId: ‘uint32_t’
- class cupti.cupti.ActivityOpenAccData(size=1)¶
Bases:
object
- async¶
async : int
- async_map¶
async_map : int
- bytes¶
bytes : int
- cu_context_id¶
cu_context_id : int
- cu_device_id¶
cu_device_id : int
- cu_process_id¶
cu_process_id : int
- cu_stream_id¶
cu_stream_id : int
- cu_thread_id¶
cu_thread_id : int
- device_number¶
device_number : int
- device_ptr¶
device_ptr : int
- device_type¶
device_type : int
- end¶
end : int
- end_line_no¶
end_line_no : int
- event_kind¶
event_kind : int
- external_id¶
external_id : int
- func_end_line_no¶
func_end_line_no : int
- func_line_no¶
func_line_no : int
- func_name¶
func_name : str
- host_ptr¶
host_ptr : int
- implicit¶
implicit : int
- kind¶
kind : int
- line_no¶
line_no : int
- parent_construct¶
parent_construct : int
- src_file¶
src_file : str
- start¶
start : int
- thread_id¶
thread_id : int
- var_name¶
var_name : str
- version¶
version : int
- class cupti.cupti.ActivityOpenAccLaunch(size=1)¶
Bases:
object
- async¶
async : int
- async_map¶
async_map : int
- cu_context_id¶
cu_context_id : int
- cu_device_id¶
cu_device_id : int
- cu_process_id¶
cu_process_id : int
- cu_stream_id¶
cu_stream_id : int
- cu_thread_id¶
cu_thread_id : int
- device_number¶
device_number : int
- device_type¶
device_type : int
- end¶
end : int
- end_line_no¶
end_line_no : int
- event_kind¶
event_kind : int
- external_id¶
external_id : int
- func_end_line_no¶
func_end_line_no : int
- func_line_no¶
func_line_no : int
- func_name¶
func_name : str
- implicit¶
implicit : int
- kernel_name¶
kernel_name : str
- kind¶
kind : int
- line_no¶
line_no : int
- num_gangs¶
num_gangs : int
- num_workers¶
num_workers : int
- parent_construct¶
parent_construct : int
- src_file¶
src_file : str
- start¶
start : int
- thread_id¶
thread_id : int
- vector_length¶
vector_length : int
- version¶
version : int
- class cupti.cupti.ActivityOpenAccOther(size=1)¶
Bases:
object
- async¶
async : int
- async_map¶
async_map : int
- cu_context_id¶
cu_context_id : int
- cu_device_id¶
cu_device_id : int
- cu_process_id¶
cu_process_id : int
- cu_stream_id¶
cu_stream_id : int
- cu_thread_id¶
cu_thread_id : int
- device_number¶
device_number : int
- device_type¶
device_type : int
- end¶
end : int
- end_line_no¶
end_line_no : int
- event_kind¶
event_kind : int
- external_id¶
external_id : int
- func_end_line_no¶
func_end_line_no : int
- func_line_no¶
func_line_no : int
- func_name¶
func_name : str
- implicit¶
implicit : int
- kind¶
kind : int
- line_no¶
line_no : int
- parent_construct¶
parent_construct : int
- src_file¶
src_file : str
- start¶
start : int
- thread_id¶
thread_id : int
- version¶
version : int
- class cupti.cupti.ActivityOpenMp(size=1)¶
Bases:
object
- cu_process_id¶
cu_process_id : int
- cu_thread_id¶
cu_thread_id : int
- end¶
end : int
- event_kind¶
event_kind : int
- kind¶
kind : int
- start¶
start : int
- thread_id¶
thread_id : int
- version¶
version : int
- class cupti.cupti.ActivityOverhead3(size=1)¶
Bases:
object
- correlation_id¶
correlation_id : int
- end¶
end : int
- kind¶
kind : int
- object_id¶
object_id: cupti.cupti.ActivityObjectKindId
- object_kind¶
object_kind : int
- overhead_data¶
overhead_data : int
- overhead_kind¶
overhead_kind : int
- reserved0¶
reserved0 : int
- start¶
start : int
- class cupti.cupti.ActivityOverheadKind(value)¶
Bases:
IntEnum
See CUpti_ActivityOverheadKind.
- ACTIVITY_BUFFER_REQUEST = 458752¶
- COMMAND_BUFFER_FULL = 393216¶
- CUPTI_BUFFER_FLUSH = 65536¶
- CUPTI_INSTRUMENTATION = 131072¶
- CUPTI_RESOURCE = 196608¶
- DRIVER_COMPILER = 1¶
- FORCE_INT = 2147483647¶
- LAZY_FUNCTION_LOADING = 327680¶
- RUNTIME_TRIGGERED_MODULE_LOADING = 262144¶
- UNKNOWN = 0¶
- UVM_ACTIVITY_INIT = 524288¶
- class cupti.cupti.ActivityPCSamplingPeriod(value)¶
Bases:
IntEnum
See CUpti_ActivityPCSamplingPeriod.
- FORCE_INT = 2147483647¶
- HIGH = 4¶
- INVALID = 0¶
- LOW = 2¶
- MAX = 5¶
- MID = 3¶
- MIN = 1¶
- class cupti.cupti.ActivityPCSamplingStallReason(value)¶
Bases:
IntEnum
See CUpti_ActivityPCSamplingStallReason.
- CONSTANT_MEMORY_DEPENDENCY = 7¶
- EXEC_DEPENDENCY = 3¶
- FORCE_INT = 2147483647¶
- INST_FETCH = 2¶
- INVALID = 0¶
- MEMORY_DEPENDENCY = 4¶
- MEMORY_THROTTLE = 9¶
- NONE = 1¶
- NOT_SELECTED = 10¶
- OTHER = 11¶
- PIPE_BUSY = 8¶
- SLEEPING = 12¶
- SYNC = 6¶
- TEXTURE = 5¶
- class cupti.cupti.ActivityPartitionedGlobalCacheConfig(value)¶
Bases:
IntEnum
See CUpti_ActivityPartitionedGlobalCacheConfig.
- FORCE_INT = 2147483647¶
- NOT_SUPPORTED = 1¶
- OFF = 2¶
- ON = 3¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityPreemption(size=1)¶
Bases:
object
- block_x¶
block_x : int
- block_y¶
block_y : int
- block_z¶
block_z : int
- grid_id¶
grid_id : int
- kind¶
kind : int
- pad¶
pad : int
- preemption_kind¶
preemption_kind : int
- timestamp¶
timestamp : int
- class cupti.cupti.ActivityPreemptionKind(value)¶
Bases:
IntEnum
See CUpti_ActivityPreemptionKind.
- FORCE_INT = 2147483647¶
- RESTORE = 2¶
- SAVE = 1¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityStream(size=1)¶
Bases:
object
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- flag¶
flag : int
- kind¶
kind : int
- priority¶
priority : int
- stream_id¶
stream_id : int
- class cupti.cupti.ActivityStreamFlag(value)¶
Bases:
IntEnum
See CUpti_ActivityStreamFlag.
- FLAG_DEFAULT = 1¶
- FLAG_FORCE_INT = 2147483647¶
- FLAG_NON_BLOCKING = 2¶
- FLAG_NULL = 3¶
- FLAG_UNKNOWN = 0¶
- MASK = 65535¶
- class cupti.cupti.ActivitySynchronization2(size=1)¶
Bases:
object
- context_id¶
context_id : int
- correlation_id¶
correlation_id : int
- cuda_event_id¶
cuda_event_id : int
- cuda_event_sync_id¶
cuda_event_sync_id : int
- end¶
end : int
- kind¶
kind : int
- pad¶
pad : int
- return_value¶
return_value : int
- start¶
start : int
- stream_id¶
stream_id : int
- type¶
type : int
- class cupti.cupti.ActivitySynchronizationType(value)¶
Bases:
IntEnum
See CUpti_ActivitySynchronizationType.
- CONTEXT_SYNCHRONIZE = 4¶
- EVENT_SYNCHRONIZE = 1¶
- FORCE_INT = 2147483647¶
- STREAM_SYNCHRONIZE = 3¶
- STREAM_WAIT_EVENT = 2¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityThreadIdType(value)¶
Bases:
IntEnum
See CUpti_ActivityThreadIdType.
- DEFAULT = 0¶
- FORCE_INT = 2147483647¶
- SIZE = 2¶
- SYSTEM = 1¶
- class cupti.cupti.ActivityUnifiedMemoryAccessType(value)¶
Bases:
IntEnum
See CUpti_ActivityUnifiedMemoryAccessType.
- ATOMIC = 3¶
- PREFETCH = 4¶
- READ = 1¶
- UNKNOWN = 0¶
- WRITE = 2¶
- class cupti.cupti.ActivityUnifiedMemoryCounter3(size=1)¶
Bases:
object
- address¶
address : int
- counter_kind¶
counter_kind : int
- dst_id¶
dst_id : int
- end¶
end : int
- kind¶
kind : int
- pad¶
pad : int
- process_id¶
process_id : int
- processors¶
processors: list[int]
- src_id¶
src_id : int
- start¶
start : int
- stream_id¶
stream_id : int
- value¶
value : int
- class cupti.cupti.ActivityUnifiedMemoryCounterConfig(size=1)¶
Bases:
object
- device_id¶
device_id : int
- enable¶
enable : int
- kind¶
kind : int
- scope¶
scope : int
- class cupti.cupti.ActivityUnifiedMemoryCounterKind(value)¶
Bases:
IntEnum
See CUpti_ActivityUnifiedMemoryCounterKind.
- BYTES_TRANSFER_DTOD = 8¶
- BYTES_TRANSFER_DTOH = 2¶
- BYTES_TRANSFER_HTOD = 1¶
- COUNT = 9¶
- CPU_PAGE_FAULT_COUNT = 3¶
- FORCE_INT = 2147483647¶
- GPU_PAGE_FAULT = 4¶
- REMOTE_MAP = 7¶
- THRASHING = 5¶
- THROTTLING = 6¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityUnifiedMemoryCounterScope(value)¶
Bases:
IntEnum
See CUpti_ActivityUnifiedMemoryCounterScope.
- COUNT = 3¶
- FORCE_INT = 2147483647¶
- PROCESS_ALL_DEVICES = 2¶
- PROCESS_SINGLE_DEVICE = 1¶
- UNKNOWN = 0¶
- class cupti.cupti.ActivityUnifiedMemoryMigrationCause(value)¶
Bases:
IntEnum
See CUpti_ActivityUnifiedMemoryMigrationCause.
- ACCESS_COUNTERS = 5¶
- COHERENCE = 2¶
- EVICTION = 4¶
- PREFETCH = 3¶
- UNKNOWN = 0¶
- USER = 1¶
- class cupti.cupti.ActivityUnifiedMemoryRemoteMapCause(value)¶
Bases:
IntEnum
See CUpti_ActivityUnifiedMemoryRemoteMapCause.
- COHERENCE = 1¶
- EVICTION = 5¶
- OUT_OF_MEMORY = 4¶
- POLICY = 3¶
- THRASHING = 2¶
- UNKNOWN = 0¶
- class cupti.cupti.ApiCallbackSite(value)¶
Bases:
IntEnum
See CUpti_ApiCallbackSite.
- API_CBSITE_FORCE_INT = 2147483647¶
- API_ENTER = 0¶
- API_EXIT = 1¶
- class cupti.cupti.CacheConfig¶
Bases:
object
- both¶
both: ‘uint8_t’
- executed¶
executed: ‘uint8_t’
- requested¶
requested: ‘uint8_t’
- class cupti.cupti.CallbackData(size=1)¶
Bases:
object
- callback_site¶
callback_site : int
- cbid¶
- context¶
context : int
- context_uid¶
context_uid : int
- correlation_data¶
correlation_data : int
- correlation_id¶
correlation_id : int
- domain¶
- function_name¶
function_name : str
- function_params¶
function_params : int
- function_params_py_obj¶
- function_return_value¶
function_return_value : int
- symbol_name¶
symbol_name : str
- class cupti.cupti.CallbackDomain(value)¶
Bases:
IntEnum
See CUpti_CallbackDomain.
- DRIVER_API = 1¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- NVTX = 5¶
- RESOURCE = 3¶
- RUNTIME_API = 2¶
- SIZE = 7¶
- STATE = 6¶
- SYNCHRONIZE = 4¶
- class cupti.cupti.CallbackIdResource(value)¶
Bases:
IntEnum
See CUpti_CallbackIdResource.
- CONTEXT_CREATED = 1¶
- CONTEXT_DESTROY_STARTING = 2¶
- CU_INIT_FINISHED = 5¶
- FORCE_INT = 2147483647¶
- GRAPHEXEC_CREATED = 18¶
- GRAPHEXEC_CREATE_STARTING = 17¶
- GRAPHEXEC_DESTROY_STARTING = 19¶
- GRAPHNODE_CLONED = 20¶
- GRAPHNODE_CREATED = 13¶
- GRAPHNODE_CREATE_STARTING = 12¶
- GRAPHNODE_DEPENDENCY_CREATED = 15¶
- GRAPHNODE_DEPENDENCY_DESTROY_STARTING = 16¶
- GRAPHNODE_DESTROY_STARTING = 14¶
- GRAPH_CLONED = 11¶
- GRAPH_CREATED = 9¶
- GRAPH_DESTROY_STARTING = 10¶
- INVALID = 0¶
- MODULE_LOADED = 6¶
- MODULE_PROFILED = 8¶
- MODULE_UNLOAD_STARTING = 7¶
- SIZE = 22¶
- STREAM_ATTRIBUTE_CHANGED = 21¶
- STREAM_CREATED = 3¶
- STREAM_DESTROY_STARTING = 4¶
- class cupti.cupti.CallbackIdState(value)¶
Bases:
IntEnum
See CUpti_CallbackIdState.
- ERROR = 2¶
- FATAL_ERROR = 1¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- SIZE = 4¶
- WARNING = 3¶
- class cupti.cupti.CallbackIdSync(value)¶
Bases:
IntEnum
See CUpti_CallbackIdSync.
- CONTEXT_SYNCHRONIZED = 2¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- SIZE = 3¶
- STREAM_SYNCHRONIZED = 1¶
- class cupti.cupti.ChannelType(value)¶
Bases:
IntEnum
See CUpti_ChannelType.
- ASYNC_MEMCPY = 2¶
- COMPUTE = 1¶
- DECOMP = 3¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- class cupti.cupti.ContextCigMode(value)¶
Bases:
IntEnum
See CUpti_ContextCigMode.
- CIG = 1¶
- CIG_FALLBACK = 2¶
- FORCE_INT = 2147483647¶
- NONE = 0¶
- class cupti.cupti.Data¶
Bases:
object
- clocksThrottleReasons¶
clocksThrottleReasons: ‘int32_t’
- fanSpeed¶
fanSpeed: ‘uint32_t’
- gpuTemperature¶
gpuTemperature: ‘uint32_t’
- memoryClock¶
memoryClock: ‘uint32_t’
- pcieLinkGen¶
pcieLinkGen: ‘uint32_t’
- pcieLinkWidth¶
pcieLinkWidth: ‘uint32_t’
- power¶
power: ‘uint32_t’
- powerLimit¶
powerLimit: ‘uint32_t’
- smClock¶
smClock: ‘uint32_t’
- class cupti.cupti.DevType(value)¶
Bases:
IntEnum
See CUpti_DevType.
- FORCE_INT = 2147483647¶
- GPU = 1¶
- INVALID = 0¶
- NPU = 2¶
- class cupti.cupti.DeviceAttribute(value)¶
Bases:
IntEnum
See CUpti_DeviceAttribute.
- DEVICE_CLASS = 10¶
- FLOP_DP_PER_CYCLE = 12¶
- FLOP_HP_PER_CYCLE = 17¶
- FLOP_SP_PER_CYCLE = 11¶
- FORCE_INT = 2147483647¶
- GLOBAL_MEMORY_BANDWIDTH = 3¶
- GPU_CPU_NVLINK_BW = 19¶
- INSTRUCTION_PER_CYCLE = 4¶
- INSTRUCTION_THROUGHPUT_SINGLE_PRECISION = 5¶
- MAX_EVENT_DOMAIN_ID = 2¶
- MAX_EVENT_ID = 1¶
- MAX_FRAME_BUFFERS = 6¶
- MAX_L2_UNITS = 13¶
- MAX_SHARED_MEMORY_CACHE_CONFIG_PREFER_EQUAL = 16¶
- MAX_SHARED_MEMORY_CACHE_CONFIG_PREFER_L1 = 15¶
- MAX_SHARED_MEMORY_CACHE_CONFIG_PREFER_SHARED = 14¶
- NVLINK_PRESENT = 18¶
- NVSWITCH_PRESENT = 20¶
- PCIE_GEN = 9¶
- PCIE_LINK_RATE = 7¶
- PCIE_LINK_WIDTH = 8¶
- class cupti.cupti.DeviceVirtualizationMode(value)¶
Bases:
IntEnum
See CUpti_DeviceVirtualizationMode.
- FORCE_INT = 2147483647¶
- NONE = 0¶
- PASS_THROUGH = 1¶
- VIRTUAL_GPU = 2¶
- class cupti.cupti.EnvironmentClocksThrottleReason(value)¶
Bases:
IntEnum
See CUpti_EnvironmentClocksThrottleReason.
- FORCE_INT = 2147483647¶
- GPU_IDLE = 1¶
- HW_SLOWDOWN = 8¶
- NONE = 0¶
- SW_POWER_CAP = 4¶
- UNKNOWN = 2147483648¶
- UNSUPPORTED = 1073741824¶
- USER_DEFINED_CLOCKS = 2¶
- class cupti.cupti.ExternalCorrelationKind(value)¶
Bases:
IntEnum
See CUpti_ExternalCorrelationKind.
- CUSTOM0 = 3¶
- CUSTOM1 = 4¶
- CUSTOM2 = 5¶
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- OPENACC = 2¶
- SIZE = 6¶
- UNKNOWN = 1¶
- class cupti.cupti.FuncShmemLimitConfig(value)¶
Bases:
IntEnum
See CUpti_FuncShmemLimitConfig.
- DEFAULT = 0¶
- FORCE_INT = 2147483647¶
- OPTIN = 1¶
- class cupti.cupti.GraphData(size=1)¶
Bases:
object
- dependency¶
dependency : int
- graph¶
graph : int
- graph_exec¶
graph_exec : int
- node¶
node : int
- node_type¶
node_type : int
- original_graph¶
original_graph : int
- original_node¶
original_node : int
- class cupti.cupti.MemoryPoolConfig¶
Bases:
object
- address¶
address: ‘uint64_t’
- memory_pool_type¶
memory_pool_type: ‘int32_t’
- process_id¶
process_id: ‘uint64_t’
- release_threshold¶
release_threshold: ‘uint64_t’
- size¶
size: ‘uint64_t’
- utilized_size¶
utilized_size: ‘uint64_t’
- class cupti.cupti.MetricValue¶
Bases:
object
- metricValueDouble¶
metricValueDouble: ‘double’
- metricValueInt64¶
metricValueInt64: ‘int64_t’
- metricValuePercent¶
metricValuePercent: ‘double’
- metricValueThroughput¶
metricValueThroughput: ‘uint64_t’
- metricValueUint64¶
metricValueUint64: ‘uint64_t’
- metricValueUtilizationLevel¶
metricValueUtilizationLevel: ‘int32_t’
- class cupti.cupti.MetricValueKind(value)¶
Bases:
IntEnum
See CUpti_MetricValueKind.
- DOUBLE = 0¶
- FORCE_INT = 2147483647¶
- INT64 = 4¶
- PERCENT = 2¶
- THROUGHPUT = 3¶
- UINT64 = 1¶
- UTILIZATION_LEVEL = 5¶
- class cupti.cupti.MetricValueUtilizationLevel(value)¶
Bases:
IntEnum
See CUpti_MetricValueUtilizationLevel.
- FORCE_INT = 2147483647¶
- HIGH = 8¶
- IDLE = 0¶
- LOW = 2¶
- MAX = 10¶
- MID = 5¶
- class cupti.cupti.ModuleResourceData(size=1)¶
Bases:
object
- cubin_size¶
cubin_size : int
- module_id¶
module_id : int
- p_cubin¶
p_cubin : str
- class cupti.cupti.OpenAccConstructKind(value)¶
Bases:
IntEnum
See CUpti_OpenAccConstructKind.
- ATOMIC = 8¶
- DATA = 4¶
- DECLARE = 9¶
- ENTER_DATA = 5¶
- EXIT_DATA = 6¶
- FORCE_INT = 2147483647¶
- HOST_DATA = 7¶
- INIT = 10¶
- KERNELS = 2¶
- LOOP = 3¶
- PARALLEL = 1¶
- ROUTINE = 14¶
- RUNTIME_API = 16¶
- SET = 12¶
- SHUTDOWN = 11¶
- UNKNOWN = 0¶
- UPDATE = 13¶
- WAIT = 15¶
- class cupti.cupti.OpenAccEventKind(value)¶
Bases:
IntEnum
See CUpti_OpenAccEventKind.
- ALLOC = 15¶
- COMPUTE_CONSTRUCT = 9¶
- CREATE = 13¶
- DELETE = 14¶
- DEVICE_INIT = 1¶
- DEVICE_SHUTDOWN = 2¶
- ENQUEUE_DOWNLOAD = 6¶
- ENQUEUE_LAUNCH = 4¶
- ENQUEUE_UPLOAD = 5¶
- ENTER_DATA = 11¶
- EXIT_DATA = 12¶
- FORCE_INT = 2147483647¶
- FREE = 16¶
- IMPLICIT_WAIT = 8¶
- INVALID = 0¶
- RUNTIME_SHUTDOWN = 3¶
- UPDATE = 10¶
- WAIT = 7¶
- class cupti.cupti.OpenMpEventKind(value)¶
Bases:
IntEnum
See CUpti_OpenMpEventKind.
- FORCE_INT = 2147483647¶
- IDLE = 4¶
- INVALID = 0¶
- PARALLEL = 1¶
- TASK = 2¶
- THREAD = 3¶
- WAIT_BARRIER = 5¶
- WAIT_TASKWAIT = 6¶
- class cupti.cupti.PcieDeviceType(value)¶
Bases:
IntEnum
See CUpti_PcieDeviceType.
- BRIDGE = 1¶
- FORCE_INT = 2147483647¶
- GPU = 0¶
- class cupti.cupti.ResourceData(size=1)¶
Bases:
object
- context¶
context : int
- resource_descriptor¶
resource_descriptor : int
- class cupti.cupti.Result(value)¶
Bases:
IntEnum
See CUptiResult.
- ERROR_API_NOT_IMPLEMENTED = 11¶
- ERROR_CDP_TRACING_NOT_SUPPORTED = 32¶
- ERROR_CMP_DEVICE_NOT_SUPPORTED = 42¶
- ERROR_CONFIDENTIAL_COMPUTING_NOT_SUPPORTED = 41¶
- ERROR_CUDA_COMPILER_NOT_COMPATIBLE = 34¶
- ERROR_DISABLED = 23¶
- ERROR_FORCE_INT = 2147483647¶
- ERROR_HARDWARE = 9¶
- ERROR_HARDWARE_BUSY = 26¶
- ERROR_INSUFFICIENT_PRIVILEGES = 35¶
- ERROR_INVALID_CONTEXT = 3¶
- ERROR_INVALID_DEVICE = 2¶
- ERROR_INVALID_EVENT_DOMAIN_ID = 4¶
- ERROR_INVALID_EVENT_ID = 5¶
- ERROR_INVALID_EVENT_NAME = 6¶
- ERROR_INVALID_EVENT_VALUE = 22¶
- ERROR_INVALID_HANDLE = 19¶
- ERROR_INVALID_KIND = 21¶
- ERROR_INVALID_METRIC_ID = 16¶
- ERROR_INVALID_METRIC_NAME = 17¶
- ERROR_INVALID_METRIC_VALUE = 25¶
- ERROR_INVALID_MODULE = 24¶
- ERROR_INVALID_OPERATION = 7¶
- ERROR_INVALID_PARAMETER = 1¶
- ERROR_INVALID_STREAM = 20¶
- ERROR_LEGACY_PROFILER_NOT_SUPPORTED = 38¶
- ERROR_MAX_LIMIT_REACHED = 12¶
- ERROR_MIG_DEVICE_NOT_SUPPORTED = 43¶
- ERROR_MULTIPLE_SUBSCRIBERS_NOT_SUPPORTED = 39¶
- ERROR_NOT_COMPATIBLE = 14¶
- ERROR_NOT_INITIALIZED = 15¶
- ERROR_NOT_READY = 13¶
- ERROR_NOT_SUPPORTED = 27¶
- ERROR_OLD_PROFILER_API_INITIALIZED = 36¶
- ERROR_OPENACC_UNDEFINED_ROUTINE = 37¶
- ERROR_OUT_OF_MEMORY = 8¶
- ERROR_PARAMETER_SIZE_NOT_SUFFICIENT = 10¶
- ERROR_QUEUE_EMPTY = 18¶
- ERROR_SLI_DEVICE_NOT_SUPPORTED = 44¶
- ERROR_UM_PROFILING_NOT_SUPPORTED = 28¶
- ERROR_UM_PROFILING_NOT_SUPPORTED_ON_DEVICE = 29¶
- ERROR_UM_PROFILING_NOT_SUPPORTED_ON_NON_P2P_DEVICES = 30¶
- ERROR_UM_PROFILING_NOT_SUPPORTED_WITH_MPS = 31¶
- ERROR_UNKNOWN = 999¶
- ERROR_VIRTUALIZED_DEVICE_INSUFFICIENT_PRIVILEGES = 40¶
- ERROR_VIRTUALIZED_DEVICE_NOT_SUPPORTED = 33¶
- ERROR_WSL_DEVICE_NOT_SUPPORTED = 45¶
- SUCCESS = 0¶
- class cupti.cupti.StateData(size=1)¶
Bases:
object
- notification¶
notification : cupti.cupti.Notification
- class cupti.cupti.StreamAttrData(size=1)¶
Bases:
object
- attr¶
attr : int
- stream¶
stream : int
- value¶
value : int
- class cupti.cupti.SynchronizeData(size=1)¶
Bases:
object
- context¶
context : int
- stream¶
stream : int
- class cupti.cupti.Value¶
Bases:
object
- vDouble¶
vDouble: ‘double’
- vInt32¶
vInt32: ‘int32_t’
- vInt64¶
vInt64: ‘int64_t’
- vUint32¶
vUint32: ‘uint32_t’
- vUint64¶
vUint64: ‘uint64_t’
- class cupti.cupti.driver_api_trace_cbid(value)¶
Bases:
IntEnum
See CUpti_driver_api_trace_cbid.
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- SIZE = 782¶
- cu64Array3DCreate = 230¶
- cu64Array3DGetDescriptor = 231¶
- cu64ArrayCreate = 228¶
- cu64ArrayGetDescriptor = 229¶
- cu64D3D10ResourceGetMappedPitch = 200¶
- cu64D3D10ResourceGetMappedPointer = 198¶
- cu64D3D10ResourceGetMappedSize = 199¶
- cu64D3D10ResourceGetSurfaceDimensions = 201¶
- cu64D3D9MapVertexBuffer = 206¶
- cu64D3D9ResourceGetMappedPitch = 205¶
- cu64D3D9ResourceGetMappedPointer = 203¶
- cu64D3D9ResourceGetMappedSize = 204¶
- cu64D3D9ResourceGetSurfaceDimensions = 202¶
- cu64DeviceTotalMem = 197¶
- cu64GLMapBufferObject = 207¶
- cu64GLMapBufferObjectAsync = 208¶
- cu64GraphicsResourceGetMappedPointer = 131¶
- cu64MemAlloc = 30¶
- cu64MemAllocPitch = 32¶
- cu64MemFree = 34¶
- cu64MemGetAddressRange = 36¶
- cu64MemGetInfo = 28¶
- cu64MemHostAlloc = 215¶
- cu64MemHostGetDevicePointer = 41¶
- cu64Memcpy2D = 232¶
- cu64Memcpy2DAsync = 234¶
- cu64Memcpy2DUnaligned = 233¶
- cu64Memcpy3D = 59¶
- cu64Memcpy3DAsync = 70¶
- cu64MemcpyAtoD = 52¶
- cu64MemcpyDtoA = 50¶
- cu64MemcpyDtoD = 48¶
- cu64MemcpyDtoDAsync = 65¶
- cu64MemcpyDtoH = 46¶
- cu64MemcpyDtoHAsync = 63¶
- cu64MemcpyHtoD = 44¶
- cu64MemcpyHtoDAsync = 61¶
- cu64MemsetD16 = 74¶
- cu64MemsetD16Async = 219¶
- cu64MemsetD2D16 = 80¶
- cu64MemsetD2D16Async = 225¶
- cu64MemsetD2D32 = 82¶
- cu64MemsetD2D32Async = 227¶
- cu64MemsetD2D8 = 78¶
- cu64MemsetD2D8Async = 223¶
- cu64MemsetD32 = 76¶
- cu64MemsetD32Async = 221¶
- cu64MemsetD8 = 72¶
- cu64MemsetD8Async = 217¶
- cu64ModuleGetGlobal = 25¶
- cu64TexRefGetAddress = 104¶
- cu64TexRefSetAddress = 96¶
- cu64TexRefSetAddress2D = 98¶
- cuArray3DCreate = 90¶
- cuArray3DCreate_v2 = 274¶
- cuArray3DGetDescriptor = 91¶
- cuArray3DGetDescriptor_v2 = 275¶
- cuArrayCreate = 87¶
- cuArrayCreate_v2 = 272¶
- cuArrayDestroy = 89¶
- cuArrayGetDescriptor = 88¶
- cuArrayGetDescriptor_v2 = 273¶
- cuArrayGetMemoryRequirements = 654¶
- cuArrayGetPlane = 597¶
- cuArrayGetSparseProperties = 582¶
- cuBinaryFree = 376¶
- cuCheckpointProcessCheckpoint = 771¶
- cuCheckpointProcessGetRestoreThreadId = 768¶
- cuCheckpointProcessGetState = 769¶
- cuCheckpointProcessLock = 770¶
- cuCheckpointProcessRestore = 772¶
- cuCheckpointProcessUnlock = 773¶
- cuCompilePtx = 375¶
- cuCoredumpGetAttribute = 701¶
- cuCoredumpGetAttributeGlobal = 702¶
- cuCoredumpSetAttribute = 703¶
- cuCoredumpSetAttributeGlobal = 704¶
- cuCtxAttach = 12¶
- cuCtxCreate = 10¶
- cuCtxCreate_v2 = 235¶
- cuCtxCreate_v3 = 645¶
- cuCtxCreate_v4 = 757¶
- cuCtxDestroy = 11¶
- cuCtxDestroy_v2 = 322¶
- cuCtxDetach = 13¶
- cuCtxDisablePeerAccess = 314¶
- cuCtxEnablePeerAccess = 313¶
- cuCtxFromGreenCtx = 753¶
- cuCtxGetApiVersion = 296¶
- cuCtxGetCacheConfig = 299¶
- cuCtxGetCurrent = 304¶
- cuCtxGetDevResource = 746¶
- cuCtxGetDevice = 16¶
- cuCtxGetExecAffinity = 646¶
- cuCtxGetFlags = 391¶
- cuCtxGetId = 695¶
- cuCtxGetLimit = 137¶
- cuCtxGetStreamPriorityRange = 370¶
- cuCtxPopCurrent = 15¶
- cuCtxPopCurrent_v2 = 324¶
- cuCtxPushCurrent = 14¶
- cuCtxPushCurrent_v2 = 323¶
- cuCtxRecordEvent = 755¶
- cuCtxResetPersistingL2Cache = 568¶
- cuCtxSetCacheConfig = 300¶
- cuCtxSetCurrent = 303¶
- cuCtxSetFlags = 705¶
- cuCtxSetLimit = 136¶
- cuCtxSynchronize = 17¶
- cuCtxWaitEvent = 756¶
- cuD3D10CtxCreate = 139¶
- cuD3D10CtxCreateOnDevice = 212¶
- cuD3D10CtxCreate_v2 = 236¶
- cuD3D10GetDevice = 138¶
- cuD3D10GetDevices = 211¶
- cuD3D10GetDirect3DDevice = 297¶
- cuD3D10MapResources = 143¶
- cuD3D10RegisterResource = 141¶
- cuD3D10ResourceGetMappedArray = 146¶
- cuD3D10ResourceGetMappedPitch = 149¶
- cuD3D10ResourceGetMappedPitch_v2 = 262¶
- cuD3D10ResourceGetMappedPointer = 147¶
- cuD3D10ResourceGetMappedPointer_v2 = 260¶
- cuD3D10ResourceGetMappedSize = 148¶
- cuD3D10ResourceGetMappedSize_v2 = 261¶
- cuD3D10ResourceGetSurfaceDimensions = 150¶
- cuD3D10ResourceGetSurfaceDimensions_v2 = 263¶
- cuD3D10ResourceSetMapFlags = 145¶
- cuD3D10UnmapResources = 144¶
- cuD3D10UnregisterResource = 142¶
- cuD3D11CtxCreate = 152¶
- cuD3D11CtxCreateOnDevice = 210¶
- cuD3D11CtxCreate_v2 = 237¶
- cuD3D11GetDevice = 151¶
- cuD3D11GetDevices = 209¶
- cuD3D11GetDirect3DDevice = 298¶
- cuD3D9Begin = 168¶
- cuD3D9CtxCreate = 155¶
- cuD3D9CtxCreateOnDevice = 214¶
- cuD3D9CtxCreate_v2 = 238¶
- cuD3D9End = 169¶
- cuD3D9GetDevice = 154¶
- cuD3D9GetDevices = 213¶
- cuD3D9GetDirect3DDevice = 157¶
- cuD3D9MapResources = 160¶
- cuD3D9MapVertexBuffer = 171¶
- cuD3D9MapVertexBuffer_v2 = 268¶
- cuD3D9RegisterResource = 158¶
- cuD3D9RegisterVertexBuffer = 170¶
- cuD3D9ResourceGetMappedArray = 164¶
- cuD3D9ResourceGetMappedPitch = 167¶
- cuD3D9ResourceGetMappedPitch_v2 = 267¶
- cuD3D9ResourceGetMappedPointer = 165¶
- cuD3D9ResourceGetMappedPointer_v2 = 265¶
- cuD3D9ResourceGetMappedSize = 166¶
- cuD3D9ResourceGetMappedSize_v2 = 266¶
- cuD3D9ResourceGetSurfaceDimensions = 163¶
- cuD3D9ResourceGetSurfaceDimensions_v2 = 264¶
- cuD3D9ResourceSetMapFlags = 162¶
- cuD3D9UnmapResources = 161¶
- cuD3D9UnmapVertexBuffer = 172¶
- cuD3D9UnregisterResource = 159¶
- cuD3D9UnregisterVertexBuffer = 173¶
- cuDestroyExternalMemory = 488¶
- cuDestroyExternalSemaphore = 494¶
- cuDevResourceGenerateDesc = 748¶
- cuDevSmResourceSplitByCount = 751¶
- cuDeviceCanAccessPeer = 312¶
- cuDeviceComputeCapability = 6¶
- cuDeviceGet = 3¶
- cuDeviceGetAttribute = 9¶
- cuDeviceGetByPCIBusId = 331¶
- cuDeviceGetCount = 4¶
- cuDeviceGetDefaultMemPool = 606¶
- cuDeviceGetDevResource = 745¶
- cuDeviceGetExecAffinitySupport = 644¶
- cuDeviceGetGraphMemAttribute = 641¶
- cuDeviceGetLuid = 532¶
- cuDeviceGetMemPool = 610¶
- cuDeviceGetName = 5¶
- cuDeviceGetNvSciSyncAttributes = 542¶
- cuDeviceGetP2PAttribute = 454¶
- cuDeviceGetPCIBusId = 332¶
- cuDeviceGetProperties = 8¶
- cuDeviceGetTexture1DLinearMaxWidth = 579¶
- cuDeviceGetUuid = 482¶
- cuDeviceGetUuid_v2 = 647¶
- cuDeviceGraphMemTrim = 640¶
- cuDevicePrimaryCtxGetState = 392¶
- cuDevicePrimaryCtxRelease = 387¶
- cuDevicePrimaryCtxRelease_v2 = 544¶
- cuDevicePrimaryCtxReset = 389¶
- cuDevicePrimaryCtxReset_v2 = 545¶
- cuDevicePrimaryCtxRetain = 386¶
- cuDevicePrimaryCtxSetFlags = 388¶
- cuDevicePrimaryCtxSetFlags_v2 = 546¶
- cuDeviceRegisterAsyncNotification = 735¶
- cuDeviceSetGraphMemAttribute = 642¶
- cuDeviceSetMemPool = 609¶
- cuDeviceTotalMem = 7¶
- cuDeviceTotalMem_v2 = 259¶
- cuDeviceUnregisterAsyncNotification = 736¶
- cuDriverGetVersion = 2¶
- cuEGLStreamConsumerAcquireFrame = 395¶
- cuEGLStreamConsumerConnect = 393¶
- cuEGLStreamConsumerConnectWithFlags = 470¶
- cuEGLStreamConsumerDisconnect = 394¶
- cuEGLStreamConsumerReleaseFrame = 396¶
- cuEGLStreamProducerConnect = 446¶
- cuEGLStreamProducerDisconnect = 447¶
- cuEGLStreamProducerPresentFrame = 448¶
- cuEGLStreamProducerReturnFrame = 453¶
- cuEventCreate = 118¶
- cuEventCreateFromEGLSync = 479¶
- cuEventCreateFromNVNSync = 469¶
- cuEventDestroy = 122¶
- cuEventDestroy_v2 = 325¶
- cuEventElapsedTime = 123¶
- cuEventElapsedTime_v2 = 780¶
- cuEventQuery = 120¶
- cuEventRecord = 119¶
- cuEventRecordWithFlags = 587¶
- cuEventRecordWithFlags_ptsz = 588¶
- cuEventRecord_ptsz = 441¶
- cuEventSynchronize = 121¶
- cuExternalMemoryGetMappedBuffer = 486¶
- cuExternalMemoryGetMappedMipmappedArray = 487¶
- cuFlushGPUDirectRDMAWrites = 627¶
- cuFuncGetAttribute = 85¶
- cuFuncGetModule = 566¶
- cuFuncGetName = 718¶
- cuFuncGetParamInfo = 733¶
- cuFuncIsLoaded = 741¶
- cuFuncLoad = 742¶
- cuFuncSetAttribute = 481¶
- cuFuncSetBlockShape = 83¶
- cuFuncSetCacheConfig = 86¶
- cuGLCtxCreate = 174¶
- cuGLCtxCreate_v2 = 239¶
- cuGLGetDevices = 333¶
- cuGLGetDevices_v2 = 385¶
- cuGLInit = 178¶
- cuGLMapBufferObject = 180¶
- cuGLMapBufferObjectAsync = 184¶
- cuGLMapBufferObjectAsync_v2 = 270¶
- cuGLMapBufferObjectAsync_v2_ptsz = 445¶
- cuGLMapBufferObject_v2 = 269¶
- cuGLMapBufferObject_v2_ptds = 417¶
- cuGLRegisterBufferObject = 179¶
- cuGLSetBufferObjectMapFlags = 183¶
- cuGLUnmapBufferObject = 181¶
- cuGLUnmapBufferObjectAsync = 185¶
- cuGLUnregisterBufferObject = 182¶
- cuGetErrorName = 373¶
- cuGetErrorString = 372¶
- cuGetExportTable = 135¶
- cuGetProcAddress = 626¶
- cuGetProcAddress_v2 = 677¶
- cuGraphAddBatchMemOpNode = 669¶
- cuGraphAddChildGraphNode = 525¶
- cuGraphAddDependencies = 518¶
- cuGraphAddDependencies_v2 = 727¶
- cuGraphAddEmptyNode = 526¶
- cuGraphAddEventRecordNode = 589¶
- cuGraphAddEventWaitNode = 590¶
- cuGraphAddExternalSemaphoresSignalNode = 618¶
- cuGraphAddExternalSemaphoresWaitNode = 621¶
- cuGraphAddHostNode = 530¶
- cuGraphAddKernelNode = 502¶
- cuGraphAddKernelNode_v2 = 689¶
- cuGraphAddMemAllocNode = 638¶
- cuGraphAddMemFreeNode = 639¶
- cuGraphAddMemcpyNode = 504¶
- cuGraphAddMemsetNode = 506¶
- cuGraphAddNode = 712¶
- cuGraphAddNode_v2 = 723¶
- cuGraphBatchMemOpNodeGetParams = 670¶
- cuGraphBatchMemOpNodeSetParams = 671¶
- cuGraphChildGraphNodeGetGraph = 529¶
- cuGraphClone = 523¶
- cuGraphConditionalHandleCreate = 722¶
- cuGraphCreate = 501¶
- cuGraphDebugDotPrint = 628¶
- cuGraphDestroy = 517¶
- cuGraphDestroyNode = 522¶
- cuGraphEventRecordNodeGetEvent = 591¶
- cuGraphEventRecordNodeSetEvent = 593¶
- cuGraphEventWaitNodeGetEvent = 592¶
- cuGraphEventWaitNodeSetEvent = 594¶
- cuGraphExecBatchMemOpNodeSetParams = 672¶
- cuGraphExecChildGraphNodeSetParams = 586¶
- cuGraphExecDestroy = 516¶
- cuGraphExecEventRecordNodeSetEvent = 595¶
- cuGraphExecEventWaitNodeSetEvent = 596¶
- cuGraphExecExternalSemaphoresSignalNodeSetParams = 624¶
- cuGraphExecExternalSemaphoresWaitNodeSetParams = 625¶
- cuGraphExecGetFlags = 658¶
- cuGraphExecHostNodeSetParams = 564¶
- cuGraphExecKernelNodeSetParams = 538¶
- cuGraphExecKernelNodeSetParams_v2 = 692¶
- cuGraphExecMemcpyNodeSetParams = 562¶
- cuGraphExecMemsetNodeSetParams = 563¶
- cuGraphExecNodeSetParams = 714¶
- cuGraphExecUpdate = 561¶
- cuGraphExecUpdate_v2 = 696¶
- cuGraphExternalSemaphoresSignalNodeGetParams = 619¶
- cuGraphExternalSemaphoresSignalNodeSetParams = 620¶
- cuGraphExternalSemaphoresWaitNodeGetParams = 622¶
- cuGraphExternalSemaphoresWaitNodeSetParams = 623¶
- cuGraphGetEdges = 535¶
- cuGraphGetEdges_v2 = 724¶
- cuGraphGetNodes = 534¶
- cuGraphGetRootNodes = 510¶
- cuGraphHostNodeGetParams = 531¶
- cuGraphHostNodeSetParams = 533¶
- cuGraphInstantiate = 513¶
- cuGraphInstantiateWithFlags = 643¶
- cuGraphInstantiateWithParams = 656¶
- cuGraphInstantiateWithParams_ptsz = 657¶
- cuGraphInstantiate_v2 = 578¶
- cuGraphKernelNodeCopyAttributes = 569¶
- cuGraphKernelNodeGetAttribute = 570¶
- cuGraphKernelNodeGetParams = 503¶
- cuGraphKernelNodeGetParams_v2 = 690¶
- cuGraphKernelNodeSetAttribute = 571¶
- cuGraphKernelNodeSetParams = 521¶
- cuGraphKernelNodeSetParams_v2 = 691¶
- cuGraphLaunch = 514¶
- cuGraphLaunch_ptsz = 515¶
- cuGraphMemAllocNodeGetParams = 648¶
- cuGraphMemFreeNodeGetParams = 649¶
- cuGraphMemcpyNodeGetParams = 505¶
- cuGraphMemcpyNodeSetParams = 520¶
- cuGraphMemsetNodeGetParams = 507¶
- cuGraphMemsetNodeSetParams = 508¶
- cuGraphNodeFindInClone = 524¶
- cuGraphNodeGetDependencies = 511¶
- cuGraphNodeGetDependencies_v2 = 725¶
- cuGraphNodeGetDependentNodes = 512¶
- cuGraphNodeGetDependentNodes_v2 = 726¶
- cuGraphNodeGetEnabled = 651¶
- cuGraphNodeGetType = 509¶
- cuGraphNodeSetEnabled = 650¶
- cuGraphNodeSetParams = 713¶
- cuGraphReleaseUserObject = 637¶
- cuGraphRemoveDependencies = 519¶
- cuGraphRemoveDependencies_v2 = 728¶
- cuGraphRetainUserObject = 636¶
- cuGraphUpload = 580¶
- cuGraphUpload_ptsz = 581¶
- cuGraphicsD3D10RegisterResource = 140¶
- cuGraphicsD3D11RegisterResource = 153¶
- cuGraphicsD3D9RegisterResource = 156¶
- cuGraphicsEGLRegisterImage = 390¶
- cuGraphicsGLRegisterBuffer = 175¶
- cuGraphicsGLRegisterImage = 176¶
- cuGraphicsMapResources = 133¶
- cuGraphicsMapResources_ptsz = 443¶
- cuGraphicsResourceGetMappedEglFrame = 449¶
- cuGraphicsResourceGetMappedMipmappedArray = 360¶
- cuGraphicsResourceGetMappedPointer = 130¶
- cuGraphicsResourceGetMappedPointer_v2 = 258¶
- cuGraphicsResourceSetMapFlags = 132¶
- cuGraphicsResourceSetMapFlags_v2 = 380¶
- cuGraphicsSubResourceGetMappedArray = 129¶
- cuGraphicsUnmapResources = 134¶
- cuGraphicsUnmapResources_ptsz = 444¶
- cuGraphicsUnregisterResource = 128¶
- cuGraphicsVDPAURegisterOutputSurface = 189¶
- cuGraphicsVDPAURegisterVideoSurface = 188¶
- cuGreenCtxCreate = 743¶
- cuGreenCtxDestroy = 744¶
- cuGreenCtxGetDevResource = 747¶
- cuGreenCtxRecordEvent = 749¶
- cuGreenCtxStreamCreate = 758¶
- cuGreenCtxWaitEvent = 750¶
- cuImportExternalMemory = 485¶
- cuImportExternalSemaphore = 489¶
- cuInit = 1¶
- cuIpcCloseMemHandle = 330¶
- cuIpcGetEventHandle = 334¶
- cuIpcGetMemHandle = 328¶
- cuIpcOpenEventHandle = 335¶
- cuIpcOpenMemHandle = 329¶
- cuIpcOpenMemHandle_v2 = 567¶
- cuKernelGetAttribute = 686¶
- cuKernelGetFunction = 683¶
- cuKernelGetLibrary = 754¶
- cuKernelGetName = 719¶
- cuKernelGetParamInfo = 734¶
- cuKernelSetAttribute = 687¶
- cuKernelSetCacheConfig = 688¶
- cuLaunch = 115¶
- cuLaunchCooperativeKernel = 477¶
- cuLaunchCooperativeKernelMultiDevice = 480¶
- cuLaunchCooperativeKernel_ptsz = 478¶
- cuLaunchGrid = 116¶
- cuLaunchGridAsync = 117¶
- cuLaunchHostFunc = 527¶
- cuLaunchHostFunc_ptsz = 528¶
- cuLaunchKernel = 307¶
- cuLaunchKernelEx = 652¶
- cuLaunchKernelEx_ptsz = 653¶
- cuLaunchKernel_ptsz = 442¶
- cuLibraryEnumerateKernels = 740¶
- cuLibraryGetGlobal = 684¶
- cuLibraryGetKernel = 681¶
- cuLibraryGetKernelCount = 739¶
- cuLibraryGetManaged = 685¶
- cuLibraryGetModule = 682¶
- cuLibraryGetUnifiedFunction = 700¶
- cuLibraryLoadData = 678¶
- cuLibraryLoadFromFile = 679¶
- cuLibraryUnload = 680¶
- cuLinkAddData = 363¶
- cuLinkAddData_v2 = 382¶
- cuLinkAddFile = 364¶
- cuLinkAddFile_v2 = 383¶
- cuLinkComplete = 365¶
- cuLinkCreate = 362¶
- cuLinkCreate_v2 = 381¶
- cuLinkDestroy = 366¶
- cuLogsCurrent = 765¶
- cuLogsDumpToFile = 766¶
- cuLogsDumpToMemory = 767¶
- cuLogsRegisterCallback = 763¶
- cuLogsUnregisterCallback = 764¶
- cuMemAddressFree = 548¶
- cuMemAddressReserve = 547¶
- cuMemAdvise = 457¶
- cuMemAdvise_v2 = 715¶
- cuMemAlloc = 29¶
- cuMemAllocAsync = 598¶
- cuMemAllocAsync_ptsz = 599¶
- cuMemAllocFromPoolAsync = 611¶
- cuMemAllocFromPoolAsync_ptsz = 612¶
- cuMemAllocHost = 37¶
- cuMemAllocHost_v2 = 294¶
- cuMemAllocManaged = 371¶
- cuMemAllocPitch = 31¶
- cuMemAllocPitch_v2 = 244¶
- cuMemAlloc_v2 = 243¶
- cuMemBatchDecompressAsync = 761¶
- cuMemBatchDecompressAsync_ptsz = 762¶
- cuMemCreate = 549¶
- cuMemFree = 33¶
- cuMemFreeAsync = 600¶
- cuMemFreeAsync_ptsz = 601¶
- cuMemFreeHost = 38¶
- cuMemFree_v2 = 245¶
- cuMemGetAccess = 558¶
- cuMemGetAddressRange = 35¶
- cuMemGetAddressRange_v2 = 246¶
- cuMemGetAllocationGranularity = 556¶
- cuMemGetAllocationPropertiesFromHandle = 557¶
- cuMemGetHandleForAddressRange = 674¶
- cuMemGetInfo = 27¶
- cuMemGetInfo_v2 = 242¶
- cuMemHostAlloc = 39¶
- cuMemHostAlloc_v2 = 271¶
- cuMemHostGetDevicePointer = 40¶
- cuMemHostGetDevicePointer_v2 = 247¶
- cuMemHostGetFlags = 42¶
- cuMemHostRegister = 301¶
- cuMemHostRegister_v2 = 379¶
- cuMemHostUnregister = 302¶
- cuMemMap = 551¶
- cuMemMapArrayAsync = 584¶
- cuMemMapArrayAsync_ptsz = 585¶
- cuMemPeerGetDevicePointer = 317¶
- cuMemPeerRegister = 315¶
- cuMemPeerUnregister = 316¶
- cuMemPoolCreate = 607¶
- cuMemPoolDestroy = 608¶
- cuMemPoolExportPointer = 615¶
- cuMemPoolGetAccess = 617¶
- cuMemPoolGetAttribute = 604¶
- cuMemPoolImportPointer = 616¶
- cuMemPoolSetAccess = 605¶
- cuMemPoolSetAttribute = 603¶
- cuMemPoolTrimTo = 602¶
- cuMemPrefetchAsync = 467¶
- cuMemPrefetchAsync_ptsz = 468¶
- cuMemPrefetchAsync_v2 = 716¶
- cuMemPrefetchAsync_v2_ptsz = 717¶
- cuMemRangeGetAttribute = 471¶
- cuMemRangeGetAttributes = 472¶
- cuMemRelease = 550¶
- cuMemRetainAllocationHandle = 565¶
- cuMemSetAccess = 553¶
- cuMemUnmap = 552¶
- cuMemcpy = 305¶
- cuMemcpy2D = 56¶
- cuMemcpy2DAsync = 68¶
- cuMemcpy2DAsync_v2 = 289¶
- cuMemcpy2DAsync_v2_ptsz = 424¶
- cuMemcpy2DUnaligned = 57¶
- cuMemcpy2DUnaligned_v2 = 288¶
- cuMemcpy2DUnaligned_v2_ptds = 406¶
- cuMemcpy2D_v2 = 287¶
- cuMemcpy2D_v2_ptds = 405¶
- cuMemcpy3D = 58¶
- cuMemcpy3DAsync = 69¶
- cuMemcpy3DAsync_v2 = 291¶
- cuMemcpy3DAsync_v2_ptsz = 425¶
- cuMemcpy3DBatchAsync = 778¶
- cuMemcpy3DBatchAsync_ptsz = 779¶
- cuMemcpy3DPeer = 320¶
- cuMemcpy3DPeerAsync = 321¶
- cuMemcpy3DPeerAsync_ptsz = 427¶
- cuMemcpy3DPeer_ptds = 410¶
- cuMemcpy3D_v2 = 290¶
- cuMemcpy3D_v2_ptds = 407¶
- cuMemcpyAsync = 306¶
- cuMemcpyAsync_ptsz = 418¶
- cuMemcpyAtoA = 55¶
- cuMemcpyAtoA_v2 = 286¶
- cuMemcpyAtoA_v2_ptds = 404¶
- cuMemcpyAtoD = 51¶
- cuMemcpyAtoD_v2 = 284¶
- cuMemcpyAtoD_v2_ptds = 401¶
- cuMemcpyAtoH = 54¶
- cuMemcpyAtoHAsync = 67¶
- cuMemcpyAtoHAsync_v2 = 283¶
- cuMemcpyAtoHAsync_v2_ptsz = 420¶
- cuMemcpyAtoH_v2 = 282¶
- cuMemcpyAtoH_v2_ptds = 403¶
- cuMemcpyBatchAsync = 776¶
- cuMemcpyBatchAsync_ptsz = 777¶
- cuMemcpyDtoA = 49¶
- cuMemcpyDtoA_v2 = 285¶
- cuMemcpyDtoA_v2_ptds = 400¶
- cuMemcpyDtoD = 47¶
- cuMemcpyDtoDAsync = 64¶
- cuMemcpyDtoDAsync_v2 = 281¶
- cuMemcpyDtoDAsync_v2_ptsz = 423¶
- cuMemcpyDtoD_v2 = 280¶
- cuMemcpyDtoD_v2_ptds = 399¶
- cuMemcpyDtoH = 45¶
- cuMemcpyDtoHAsync = 62¶
- cuMemcpyDtoHAsync_v2 = 279¶
- cuMemcpyDtoHAsync_v2_ptsz = 422¶
- cuMemcpyDtoH_v2 = 278¶
- cuMemcpyDtoH_v2_ptds = 398¶
- cuMemcpyHtoA = 53¶
- cuMemcpyHtoAAsync = 66¶
- cuMemcpyHtoAAsync_v2 = 293¶
- cuMemcpyHtoAAsync_v2_ptsz = 419¶
- cuMemcpyHtoA_v2 = 292¶
- cuMemcpyHtoA_v2_ptds = 402¶
- cuMemcpyHtoD = 43¶
- cuMemcpyHtoDAsync = 60¶
- cuMemcpyHtoDAsync_v2 = 277¶
- cuMemcpyHtoDAsync_v2_ptsz = 421¶
- cuMemcpyHtoD_v2 = 276¶
- cuMemcpyHtoD_v2_ptds = 397¶
- cuMemcpyPeer = 318¶
- cuMemcpyPeerAsync = 319¶
- cuMemcpyPeerAsync_ptsz = 426¶
- cuMemcpyPeer_ptds = 409¶
- cuMemcpy_ptds = 408¶
- cuMemcpy_v2 = 248¶
- cuMemsetD16 = 73¶
- cuMemsetD16Async = 218¶
- cuMemsetD16Async_ptsz = 429¶
- cuMemsetD16_v2 = 250¶
- cuMemsetD16_v2_ptds = 412¶
- cuMemsetD2D16 = 79¶
- cuMemsetD2D16Async = 224¶
- cuMemsetD2D16Async_ptsz = 432¶
- cuMemsetD2D16_v2 = 253¶
- cuMemsetD2D16_v2_ptds = 415¶
- cuMemsetD2D32 = 81¶
- cuMemsetD2D32Async = 226¶
- cuMemsetD2D32Async_ptsz = 433¶
- cuMemsetD2D32_v2 = 254¶
- cuMemsetD2D32_v2_ptds = 416¶
- cuMemsetD2D8 = 77¶
- cuMemsetD2D8Async = 222¶
- cuMemsetD2D8Async_ptsz = 431¶
- cuMemsetD2D8_v2 = 252¶
- cuMemsetD2D8_v2_ptds = 414¶
- cuMemsetD32 = 75¶
- cuMemsetD32Async = 220¶
- cuMemsetD32Async_ptsz = 430¶
- cuMemsetD32_v2 = 251¶
- cuMemsetD32_v2_ptds = 413¶
- cuMemsetD8 = 71¶
- cuMemsetD8Async = 216¶
- cuMemsetD8Async_ptsz = 428¶
- cuMemsetD8_v2 = 249¶
- cuMemsetD8_v2_ptds = 411¶
- cuMipmappedArrayCreate = 347¶
- cuMipmappedArrayDestroy = 349¶
- cuMipmappedArrayGetLevel = 348¶
- cuMipmappedArrayGetMemoryRequirements = 655¶
- cuMipmappedArrayGetSparseProperties = 583¶
- cuModuleEnumerateFunctions = 738¶
- cuModuleGetFunction = 23¶
- cuModuleGetFunctionCount = 737¶
- cuModuleGetGlobal = 24¶
- cuModuleGetGlobal_v2 = 241¶
- cuModuleGetLoadingMode = 673¶
- cuModuleGetSurfRef = 190¶
- cuModuleGetTexRef = 26¶
- cuModuleLoad = 18¶
- cuModuleLoadData = 19¶
- cuModuleLoadDataEx = 20¶
- cuModuleLoadFatBinary = 21¶
- cuModuleUnload = 22¶
- cuMulticastAddDevice = 707¶
- cuMulticastBindAddr = 709¶
- cuMulticastBindMem = 708¶
- cuMulticastCreate = 706¶
- cuMulticastGetGranularity = 711¶
- cuMulticastUnbind = 710¶
- cuNNSetAllocator = 466¶
- cuNVNbufferGetPointer = 464¶
- cuNVNtextureGetArray = 465¶
- cuOccupancyAvailableDynamicSMemPerBlock = 543¶
- cuOccupancyMaxActiveBlocksPerMultiprocessor = 374¶
- cuOccupancyMaxActiveBlocksPerMultiprocessorWithFlags = 451¶
- cuOccupancyMaxActiveClusters = 676¶
- cuOccupancyMaxPotentialBlockSize = 384¶
- cuOccupancyMaxPotentialBlockSizeWithFlags = 452¶
- cuOccupancyMaxPotentialClusterSize = 675¶
- cuParamSetSize = 110¶
- cuParamSetTexRef = 114¶
- cuParamSetf = 112¶
- cuParamSeti = 111¶
- cuParamSetv = 113¶
- cuPointerGetAttribute = 310¶
- cuPointerGetAttributes = 450¶
- cuPointerSetAttribute = 378¶
- cuProfilerInitialize = 311¶
- cuProfilerStart = 308¶
- cuProfilerStop = 309¶
- cuSignalExternalSemaphoresAsync = 490¶
- cuSignalExternalSemaphoresAsync_ptsz = 491¶
- cuStreamAddCallback = 346¶
- cuStreamAddCallback_ptsz = 437¶
- cuStreamAttachMemAsync = 377¶
- cuStreamAttachMemAsync_ptsz = 438¶
- cuStreamBatchMemOp = 462¶
- cuStreamBatchMemOp_ptsz = 463¶
- cuStreamBatchMemOp_v2 = 667¶
- cuStreamBatchMemOp_v2_ptsz = 668¶
- cuStreamBeginCapture = 495¶
- cuStreamBeginCaptureToGraph = 720¶
- cuStreamBeginCaptureToGraph_ptsz = 721¶
- cuStreamBeginCapture_ptsz = 496¶
- cuStreamBeginCapture_v2 = 539¶
- cuStreamBeginCapture_v2_ptsz = 540¶
- cuStreamCopyAttributes = 572¶
- cuStreamCopyAttributes_ptsz = 573¶
- cuStreamCreate = 124¶
- cuStreamCreateWithPriority = 367¶
- cuStreamDestroy = 127¶
- cuStreamDestroy_v2 = 326¶
- cuStreamEndCapture = 497¶
- cuStreamEndCapture_ptsz = 498¶
- cuStreamGetAttribute = 574¶
- cuStreamGetAttribute_ptsz = 575¶
- cuStreamGetCaptureInfo = 536¶
- cuStreamGetCaptureInfo_ptsz = 537¶
- cuStreamGetCaptureInfo_v2 = 629¶
- cuStreamGetCaptureInfo_v2_ptsz = 630¶
- cuStreamGetCaptureInfo_v3 = 729¶
- cuStreamGetCaptureInfo_v3_ptsz = 730¶
- cuStreamGetCtx = 483¶
- cuStreamGetCtx_ptsz = 484¶
- cuStreamGetCtx_v2 = 759¶
- cuStreamGetCtx_v2_ptsz = 760¶
- cuStreamGetDevice = 774¶
- cuStreamGetDevice_ptsz = 775¶
- cuStreamGetFlags = 369¶
- cuStreamGetFlags_ptsz = 435¶
- cuStreamGetGreenCtx = 752¶
- cuStreamGetId = 693¶
- cuStreamGetId_ptsz = 694¶
- cuStreamGetPriority = 368¶
- cuStreamGetPriority_ptsz = 434¶
- cuStreamIsCapturing = 499¶
- cuStreamIsCapturing_ptsz = 500¶
- cuStreamQuery = 125¶
- cuStreamQuery_ptsz = 439¶
- cuStreamSetAttribute = 576¶
- cuStreamSetAttribute_ptsz = 577¶
- cuStreamSetFlags = 559¶
- cuStreamSetFlags_ptsz = 560¶
- cuStreamSynchronize = 126¶
- cuStreamSynchronize_ptsz = 440¶
- cuStreamUpdateCaptureDependencies = 631¶
- cuStreamUpdateCaptureDependencies_ptsz = 632¶
- cuStreamUpdateCaptureDependencies_v2 = 731¶
- cuStreamUpdateCaptureDependencies_v2_ptsz = 732¶
- cuStreamWaitEvent = 295¶
- cuStreamWaitEvent_ptsz = 436¶
- cuStreamWaitValue32 = 458¶
- cuStreamWaitValue32_ptsz = 459¶
- cuStreamWaitValue32_v2 = 659¶
- cuStreamWaitValue32_v2_ptsz = 660¶
- cuStreamWaitValue64 = 473¶
- cuStreamWaitValue64_ptsz = 474¶
- cuStreamWaitValue64_v2 = 661¶
- cuStreamWaitValue64_v2_ptsz = 662¶
- cuStreamWriteValue32 = 460¶
- cuStreamWriteValue32_ptsz = 461¶
- cuStreamWriteValue32_v2 = 663¶
- cuStreamWriteValue32_v2_ptsz = 664¶
- cuStreamWriteValue64 = 475¶
- cuStreamWriteValue64_ptsz = 476¶
- cuStreamWriteValue64_v2 = 665¶
- cuStreamWriteValue64_v2_ptsz = 666¶
- cuSurfObjectCreate = 343¶
- cuSurfObjectDestroy = 344¶
- cuSurfObjectGetResourceDesc = 345¶
- cuSurfRefCreate = 191¶
- cuSurfRefDestroy = 192¶
- cuSurfRefGetArray = 196¶
- cuSurfRefGetFormat = 195¶
- cuSurfRefSetArray = 194¶
- cuSurfRefSetFormat = 193¶
- cuTensorMapEncodeIm2col = 698¶
- cuTensorMapEncodeIm2colWide = 781¶
- cuTensorMapEncodeTiled = 697¶
- cuTensorMapReplaceAddress = 699¶
- cuTexObjectCreate = 339¶
- cuTexObjectDestroy = 340¶
- cuTexObjectGetResourceDesc = 341¶
- cuTexObjectGetResourceViewDesc = 361¶
- cuTexObjectGetTextureDesc = 342¶
- cuTexRefCreate = 92¶
- cuTexRefDestroy = 93¶
- cuTexRefGetAddress = 103¶
- cuTexRefGetAddressMode = 106¶
- cuTexRefGetAddress_v2 = 257¶
- cuTexRefGetArray = 105¶
- cuTexRefGetBorderColor = 456¶
- cuTexRefGetFilterMode = 107¶
- cuTexRefGetFlags = 109¶
- cuTexRefGetFormat = 108¶
- cuTexRefGetMaxAnisotropy = 359¶
- cuTexRefGetMipmapFilterMode = 356¶
- cuTexRefGetMipmapLevelBias = 357¶
- cuTexRefGetMipmapLevelClamp = 358¶
- cuTexRefGetMipmappedArray = 355¶
- cuTexRefSetAddress = 95¶
- cuTexRefSetAddress2D = 97¶
- cuTexRefSetAddress2D_v2 = 256¶
- cuTexRefSetAddress2D_v3 = 327¶
- cuTexRefSetAddressMode = 100¶
- cuTexRefSetAddress_v2 = 255¶
- cuTexRefSetArray = 94¶
- cuTexRefSetBorderColor = 455¶
- cuTexRefSetFilterMode = 101¶
- cuTexRefSetFlags = 102¶
- cuTexRefSetFormat = 99¶
- cuTexRefSetMaxAnisotropy = 354¶
- cuTexRefSetMipmapFilterMode = 351¶
- cuTexRefSetMipmapLevelBias = 352¶
- cuTexRefSetMipmapLevelClamp = 353¶
- cuTexRefSetMipmappedArray = 350¶
- cuThreadExchangeStreamCaptureMode = 541¶
- cuUserObjectCreate = 633¶
- cuUserObjectRelease = 635¶
- cuUserObjectRetain = 634¶
- cuVDPAUCtxCreate = 187¶
- cuVDPAUCtxCreate_v2 = 240¶
- cuVDPAUGetDevice = 186¶
- cuWGLGetDevice = 177¶
- cuWaitExternalSemaphoresAsync = 492¶
- cuWaitExternalSemaphoresAsync_ptsz = 493¶
- class cupti.cupti.runtime_api_trace_cbid(value)¶
Bases:
IntEnum
See CUpti_runtime_api_trace_cbid.
- FORCE_INT = 2147483647¶
- INVALID = 0¶
- SIZE = 487¶
- cuda470_v12060 = 470¶
- cuda471_v12060 = 471¶
- cuda472_v12060 = 472¶
- cuda473_v12060 = 473¶
- cuda474_v12060 = 474¶
- cuda475_v12060 = 475¶
- cuda476_v12060 = 476¶
- cuda477_v12060 = 477¶
- cuda478_v12060 = 478¶
- cuda479_v12060 = 479¶
- cudaArrayGetInfo_v4010 = 181¶
- cudaArrayGetMemoryRequirements_v11060 = 428¶
- cudaArrayGetPlane_v11020 = 381¶
- cudaArrayGetSparseProperties_v11010 = 359¶
- cudaBindSurfaceToArray_v3020 = 61¶
- cudaBindTexture2D_v3020 = 56¶
- cudaBindTextureToArray_v3020 = 57¶
- cudaBindTextureToMipmappedArray_v5000 = 195¶
- cudaBindTexture_v3020 = 55¶
- cudaChooseDevice_v3020 = 5¶
- cudaConfigureCall_v3020 = 8¶
- cudaCreateChannelDesc_v3020 = 7¶
- cudaCreateSurfaceObject_v5000 = 189¶
- cudaCreateTextureObject_v2_v11080 = 434¶
- cudaCreateTextureObject_v5000 = 185¶
- cudaCtxResetPersistingL2Cache_v11000 = 337¶
- cudaD3D10GetDevice_v3020 = 88¶
- cudaD3D10GetDevices_v3020 = 89¶
- cudaD3D10GetDirect3DDevice_v3020 = 149¶
- cudaD3D10MapResources_v3020 = 94¶
- cudaD3D10RegisterResource_v3020 = 92¶
- cudaD3D10ResourceGetMappedArray_v3020 = 98¶
- cudaD3D10ResourceGetMappedPitch_v3020 = 101¶
- cudaD3D10ResourceGetMappedPointer_v3020 = 99¶
- cudaD3D10ResourceGetMappedSize_v3020 = 100¶
- cudaD3D10ResourceGetSurfaceDimensions_v3020 = 97¶
- cudaD3D10ResourceSetMapFlags_v3020 = 96¶
- cudaD3D10SetDirect3DDevice_v3020 = 90¶
- cudaD3D10UnmapResources_v3020 = 95¶
- cudaD3D10UnregisterResource_v3020 = 93¶
- cudaD3D11GetDevice_v3020 = 84¶
- cudaD3D11GetDevices_v3020 = 85¶
- cudaD3D11GetDirect3DDevice_v3020 = 148¶
- cudaD3D11SetDirect3DDevice_v3020 = 86¶
- cudaD3D9Begin_v3020 = 117¶
- cudaD3D9End_v3020 = 118¶
- cudaD3D9GetDevice_v3020 = 102¶
- cudaD3D9GetDevices_v3020 = 103¶
- cudaD3D9GetDirect3DDevice_v3020 = 105¶
- cudaD3D9MapResources_v3020 = 109¶
- cudaD3D9MapVertexBuffer_v3020 = 121¶
- cudaD3D9RegisterResource_v3020 = 107¶
- cudaD3D9RegisterVertexBuffer_v3020 = 119¶
- cudaD3D9ResourceGetMappedArray_v3020 = 113¶
- cudaD3D9ResourceGetMappedPitch_v3020 = 116¶
- cudaD3D9ResourceGetMappedPointer_v3020 = 114¶
- cudaD3D9ResourceGetMappedSize_v3020 = 115¶
- cudaD3D9ResourceGetSurfaceDimensions_v3020 = 112¶
- cudaD3D9ResourceSetMapFlags_v3020 = 111¶
- cudaD3D9SetDirect3DDevice_v3020 = 104¶
- cudaD3D9UnmapResources_v3020 = 110¶
- cudaD3D9UnmapVertexBuffer_v3020 = 122¶
- cudaD3D9UnregisterResource_v3020 = 108¶
- cudaD3D9UnregisterVertexBuffer_v3020 = 120¶
- cudaDestroyExternalMemory_v10000 = 277¶
- cudaDestroyExternalSemaphore_v10000 = 283¶
- cudaDestroySurfaceObject_v5000 = 190¶
- cudaDestroyTextureObject_v5000 = 186¶
- cudaDeviceCanAccessPeer_v4000 = 154¶
- cudaDeviceDisablePeerAccess_v4000 = 156¶
- cudaDeviceEnablePeerAccess_v4000 = 155¶
- cudaDeviceFlushGPUDirectRDMAWrites_v11030 = 405¶
- cudaDeviceGetAttribute_v5000 = 200¶
- cudaDeviceGetByPCIBusId_v4010 = 173¶
- cudaDeviceGetCacheConfig_v3020 = 168¶
- cudaDeviceGetDefaultMemPool_v11020 = 372¶
- cudaDeviceGetGraphMemAttribute_v11040 = 424¶
- cudaDeviceGetLimit_v3020 = 166¶
- cudaDeviceGetMemPool_v11020 = 386¶
- cudaDeviceGetNvSciSyncAttributes_v10020 = 328¶
- cudaDeviceGetP2PAttribute_v8000 = 255¶
- cudaDeviceGetPCIBusId_v4010 = 174¶
- cudaDeviceGetStreamPriorityRange_v5050 = 205¶
- cudaDeviceGetTexture1DLinearMaxWidth_v11010 = 347¶
- cudaDeviceGraphMemTrim_v11040 = 423¶
- cudaDeviceRegisterAsyncNotification_v12040 = 465¶
- cudaDeviceReset_v3020 = 164¶
- cudaDeviceSetCacheConfig_v3020 = 169¶
- cudaDeviceSetGraphMemAttribute_v11040 = 425¶
- cudaDeviceSetLimit_v3020 = 167¶
- cudaDeviceSetMemPool_v11020 = 385¶
- cudaDeviceSynchronize_v3020 = 165¶
- cudaDeviceUnregisterAsyncNotification_v12040 = 466¶
- cudaDriverGetVersion_v3020 = 1¶
- cudaEGLStreamConsumerAcquireFrame_v7000 = 259¶
- cudaEGLStreamConsumerConnectWithFlags_v7000 = 268¶
- cudaEGLStreamConsumerConnect_v7000 = 257¶
- cudaEGLStreamConsumerDisconnect_v7000 = 258¶
- cudaEGLStreamConsumerReleaseFrame_v7000 = 260¶
- cudaEGLStreamProducerConnect_v7000 = 261¶
- cudaEGLStreamProducerDisconnect_v7000 = 262¶
- cudaEGLStreamProducerPresentFrame_v7000 = 263¶
- cudaEGLStreamProducerReturnFrame_v7000 = 264¶
- cudaEventCreateFromEGLSync_v9000 = 271¶
- cudaEventCreateWithFlags_v3020 = 134¶
- cudaEventCreate_v3020 = 133¶
- cudaEventDestroy_v3020 = 136¶
- cudaEventElapsedTime_v2_v12080 = 486¶
- cudaEventElapsedTime_v3020 = 139¶
- cudaEventQuery_v3020 = 138¶
- cudaEventRecordWithFlags_ptsz_v11010 = 371¶
- cudaEventRecordWithFlags_v11010 = 370¶
- cudaEventRecord_ptsz_v7000 = 242¶
- cudaEventRecord_v3020 = 135¶
- cudaEventSynchronize_v3020 = 137¶
- cudaExternalMemoryGetMappedBuffer_v10000 = 275¶
- cudaExternalMemoryGetMappedMipmappedArray_v10000 = 276¶
- cudaFreeArray_v3020 = 24¶
- cudaFreeAsync_ptsz_v11020 = 376¶
- cudaFreeAsync_v11020 = 375¶
- cudaFreeHost_v3020 = 26¶
- cudaFreeMipmappedArray_v5000 = 194¶
- cudaFree_v3020 = 22¶
- cudaFuncGetAttributes_v3020 = 15¶
- cudaFuncGetName_v12030 = 451¶
- cudaFuncGetParamInfo_v12040 = 467¶
- cudaFuncSetAttribute_v9000 = 273¶
- cudaFuncSetCacheConfig_v3020 = 14¶
- cudaGLGetDevices_v4010 = 175¶
- cudaGLMapBufferObjectAsync_v3020 = 69¶
- cudaGLMapBufferObject_v3020 = 65¶
- cudaGLRegisterBufferObject_v3020 = 64¶
- cudaGLSetBufferObjectMapFlags_v3020 = 68¶
- cudaGLSetGLDevice_v3020 = 63¶
- cudaGLUnmapBufferObjectAsync_v3020 = 70¶
- cudaGLUnmapBufferObject_v3020 = 66¶
- cudaGLUnregisterBufferObject_v3020 = 67¶
- cudaGetChannelDesc_v3020 = 6¶
- cudaGetDeviceCount_v3020 = 3¶
- cudaGetDeviceFlags_v7000 = 212¶
- cudaGetDeviceProperties_v2_v12000 = 440¶
- cudaGetDeviceProperties_v3020 = 4¶
- cudaGetDevice_v3020 = 17¶
- cudaGetDriverEntryPointByVersion_ptsz_v12050 = 469¶
- cudaGetDriverEntryPointByVersion_v12050 = 468¶
- cudaGetDriverEntryPoint_ptsz_v11030 = 407¶
- cudaGetDriverEntryPoint_v11030 = 406¶
- cudaGetErrorName_v6050 = 209¶
- cudaGetErrorString_v3020 = 12¶
- cudaGetFuncBySymbol_v11000 = 336¶
- cudaGetKernel_v12000 = 439¶
- cudaGetLastError_v3020 = 10¶
- cudaGetMipmappedArrayLevel_v5000 = 193¶
- cudaGetSurfaceObjectResourceDesc_v5000 = 191¶
- cudaGetSurfaceReference_v3020 = 62¶
- cudaGetSymbolAddress_v3020 = 53¶
- cudaGetSymbolSize_v3020 = 54¶
- cudaGetTextureAlignmentOffset_v3020 = 59¶
- cudaGetTextureObjectResourceDesc_v5000 = 187¶
- cudaGetTextureObjectResourceViewDesc_v5000 = 199¶
- cudaGetTextureObjectTextureDesc_v2_v11080 = 435¶
- cudaGetTextureObjectTextureDesc_v5000 = 188¶
- cudaGetTextureReference_v3020 = 60¶
- cudaGraphAddChildGraphNode_v10000 = 298¶
- cudaGraphAddDependencies_v10000 = 307¶
- cudaGraphAddDependencies_v2_v12030 = 458¶
- cudaGraphAddEmptyNode_v10000 = 300¶
- cudaGraphAddEventRecordNode_v11010 = 362¶
- cudaGraphAddEventWaitNode_v11010 = 365¶
- cudaGraphAddExternalSemaphoresSignalNode_v11020 = 397¶
- cudaGraphAddExternalSemaphoresWaitNode_v11020 = 400¶
- cudaGraphAddHostNode_v10000 = 296¶
- cudaGraphAddKernelNode_v10000 = 289¶
- cudaGraphAddMemAllocNode_v11040 = 419¶
- cudaGraphAddMemFreeNode_v11040 = 421¶
- cudaGraphAddMemcpyNode1D_v11010 = 352¶
- cudaGraphAddMemcpyNodeFromSymbol_v11010 = 351¶
- cudaGraphAddMemcpyNodeToSymbol_v11010 = 350¶
- cudaGraphAddMemcpyNode_v10000 = 290¶
- cudaGraphAddMemsetNode_v10000 = 293¶
- cudaGraphAddNode_v12020 = 445¶
- cudaGraphAddNode_v2_v12030 = 460¶
- cudaGraphChildGraphNodeGetGraph_v10000 = 299¶
- cudaGraphClone_v10000 = 301¶
- cudaGraphConditionalHandleCreate_v12030 = 454¶
- cudaGraphCreate_v10000 = 286¶
- cudaGraphDebugDotPrint_v11030 = 408¶
- cudaGraphDestroyNode_v10000 = 309¶
- cudaGraphDestroy_v10000 = 314¶
- cudaGraphEventRecordNodeGetEvent_v11010 = 363¶
- cudaGraphEventRecordNodeSetEvent_v11010 = 364¶
- cudaGraphEventWaitNodeGetEvent_v11010 = 366¶
- cudaGraphEventWaitNodeSetEvent_v11010 = 367¶
- cudaGraphExecChildGraphNodeSetParams_v11010 = 361¶
- cudaGraphExecDestroy_v10000 = 313¶
- cudaGraphExecEventRecordNodeSetEvent_v11010 = 368¶
- cudaGraphExecEventWaitNodeSetEvent_v11010 = 369¶
- cudaGraphExecExternalSemaphoresSignalNodeSetParams_v11020 = 403¶
- cudaGraphExecExternalSemaphoresWaitNodeSetParams_v11020 = 404¶
- cudaGraphExecGetFlags_v12000 = 438¶
- cudaGraphExecHostNodeSetParams_v10020 = 334¶
- cudaGraphExecKernelNodeSetParams_v10010 = 326¶
- cudaGraphExecMemcpyNodeSetParams1D_v11010 = 358¶
- cudaGraphExecMemcpyNodeSetParamsFromSymbol_v11010 = 357¶
- cudaGraphExecMemcpyNodeSetParamsToSymbol_v11010 = 356¶
- cudaGraphExecMemcpyNodeSetParams_v10020 = 332¶
- cudaGraphExecMemsetNodeSetParams_v10020 = 333¶
- cudaGraphExecNodeSetParams_v12020 = 447¶
- cudaGraphExecUpdate_v10020 = 335¶
- cudaGraphExternalSemaphoresSignalNodeGetParams_v11020 = 398¶
- cudaGraphExternalSemaphoresSignalNodeSetParams_v11020 = 399¶
- cudaGraphExternalSemaphoresWaitNodeGetParams_v11020 = 401¶
- cudaGraphExternalSemaphoresWaitNodeSetParams_v11020 = 402¶
- cudaGraphGetEdges_v10000 = 323¶
- cudaGraphGetEdges_v2_v12030 = 455¶
- cudaGraphGetNodes_v10000 = 322¶
- cudaGraphGetRootNodes_v10000 = 304¶
- cudaGraphHostNodeGetParams_v10000 = 297¶
- cudaGraphHostNodeSetParams_v10000 = 321¶
- cudaGraphInstantiateWithFlags_v11040 = 418¶
- cudaGraphInstantiateWithParams_ptsz_v12000 = 437¶
- cudaGraphInstantiateWithParams_v12000 = 436¶
- cudaGraphInstantiate_v10000 = 310¶
- cudaGraphInstantiate_v12000 = 443¶
- cudaGraphKernelNodeCopyAttributes_v11000 = 338¶
- cudaGraphKernelNodeGetAttribute_v11000 = 339¶
- cudaGraphKernelNodeGetParams_v10000 = 287¶
- cudaGraphKernelNodeSetAttribute_v11000 = 340¶
- cudaGraphKernelNodeSetParams_v10000 = 288¶
- cudaGraphLaunch_ptsz_v10000 = 312¶
- cudaGraphLaunch_v10000 = 311¶
- cudaGraphMemAllocNodeGetParams_v11040 = 420¶
- cudaGraphMemFreeNodeGetParams_v11040 = 422¶
- cudaGraphMemcpyNodeGetParams_v10000 = 291¶
- cudaGraphMemcpyNodeSetParams1D_v11010 = 355¶
- cudaGraphMemcpyNodeSetParamsFromSymbol_v11010 = 354¶
- cudaGraphMemcpyNodeSetParamsToSymbol_v11010 = 353¶
- cudaGraphMemcpyNodeSetParams_v10000 = 292¶
- cudaGraphMemsetNodeGetParams_v10000 = 294¶
- cudaGraphMemsetNodeSetParams_v10000 = 295¶
- cudaGraphNodeFindInClone_v10000 = 302¶
- cudaGraphNodeGetDependencies_v10000 = 305¶
- cudaGraphNodeGetDependencies_v2_v12030 = 456¶
- cudaGraphNodeGetDependentNodes_v10000 = 306¶
- cudaGraphNodeGetDependentNodes_v2_v12030 = 457¶
- cudaGraphNodeGetEnabled_v11060 = 427¶
- cudaGraphNodeGetType_v10000 = 303¶
- cudaGraphNodeSetEnabled_v11060 = 426¶
- cudaGraphNodeSetParams_v12020 = 446¶
- cudaGraphReleaseUserObject_v11030 = 417¶
- cudaGraphRemoveDependencies_v10000 = 308¶
- cudaGraphRemoveDependencies_v2_v12030 = 459¶
- cudaGraphRetainUserObject_v11030 = 416¶
- cudaGraphUpload_ptsz_v10000 = 349¶
- cudaGraphUpload_v10000 = 348¶
- cudaGraphicsD3D10RegisterResource_v3020 = 91¶
- cudaGraphicsD3D11RegisterResource_v3020 = 87¶
- cudaGraphicsD3D9RegisterResource_v3020 = 106¶
- cudaGraphicsEGLRegisterImage_v7000 = 256¶
- cudaGraphicsGLRegisterBuffer_v3020 = 73¶
- cudaGraphicsGLRegisterImage_v3020 = 72¶
- cudaGraphicsMapResources_v3020 = 76¶
- cudaGraphicsResourceGetMappedEglFrame_v7000 = 265¶
- cudaGraphicsResourceGetMappedMipmappedArray_v5000 = 196¶
- cudaGraphicsResourceGetMappedPointer_v3020 = 78¶
- cudaGraphicsResourceSetMapFlags_v3020 = 75¶
- cudaGraphicsSubResourceGetMappedArray_v3020 = 79¶
- cudaGraphicsUnmapResources_v3020 = 77¶
- cudaGraphicsUnregisterResource_v3020 = 74¶
- cudaGraphicsVDPAURegisterOutputSurface_v3020 = 83¶
- cudaGraphicsVDPAURegisterVideoSurface_v3020 = 82¶
- cudaHostAlloc_v3020 = 27¶
- cudaHostGetDevicePointer_v3020 = 28¶
- cudaHostGetFlags_v3020 = 29¶
- cudaHostRegister_v4000 = 152¶
- cudaHostUnregister_v4000 = 153¶
- cudaImportExternalMemory_v10000 = 274¶
- cudaImportExternalSemaphore_v10000 = 278¶
- cudaInitDevice_v12000 = 444¶
- cudaIpcCloseMemHandle_v4010 = 180¶
- cudaIpcGetEventHandle_v4010 = 176¶
- cudaIpcGetMemHandle_v4010 = 178¶
- cudaIpcOpenEventHandle_v4010 = 177¶
- cudaIpcOpenMemHandle_v4010 = 179¶
- cudaLaunchCooperativeKernelMultiDevice_v9000 = 272¶
- cudaLaunchCooperativeKernel_ptsz_v9000 = 270¶
- cudaLaunchCooperativeKernel_v9000 = 269¶
- cudaLaunchHostFunc_ptsz_v10000 = 285¶
- cudaLaunchHostFunc_v10000 = 284¶
- cudaLaunchKernelExC_ptsz_v11060 = 431¶
- cudaLaunchKernelExC_v11060 = 430¶
- cudaLaunchKernel_ptsz_v7000 = 214¶
- cudaLaunchKernel_v7000 = 211¶
- cudaLaunch_ptsz_v7000 = 213¶
- cudaLaunch_v3020 = 13¶
- cudaMalloc3DArray_v3020 = 141¶
- cudaMalloc3D_v3020 = 140¶
- cudaMallocArray_v3020 = 23¶
- cudaMallocAsync_ptsz_v11020 = 374¶
- cudaMallocAsync_v11020 = 373¶
- cudaMallocFromPoolAsync_ptsz_v11020 = 392¶
- cudaMallocFromPoolAsync_v11020 = 391¶
- cudaMallocHost_v3020 = 25¶
- cudaMallocManaged_v6000 = 206¶
- cudaMallocMipmappedArray_v5000 = 192¶
- cudaMallocPitch_v3020 = 21¶
- cudaMalloc_v3020 = 20¶
- cudaMemAdvise_v2_v12020 = 448¶
- cudaMemAdvise_v8000 = 254¶
- cudaMemGetInfo_v3020 = 30¶
- cudaMemPoolCreate_v11020 = 383¶
- cudaMemPoolDestroy_v11020 = 384¶
- cudaMemPoolExportPointer_v11020 = 389¶
- cudaMemPoolGetAccess_v11020 = 382¶
- cudaMemPoolGetAttribute_v11020 = 379¶
- cudaMemPoolImportPointer_v11020 = 390¶
- cudaMemPoolSetAccess_v11020 = 380¶
- cudaMemPoolSetAttribute_v11020 = 378¶
- cudaMemPoolTrimTo_v11020 = 377¶
- cudaMemPrefetchAsync_ptsz_v8000 = 253¶
- cudaMemPrefetchAsync_v2_ptsz_v12020 = 450¶
- cudaMemPrefetchAsync_v2_v12020 = 449¶
- cudaMemPrefetchAsync_v8000 = 252¶
- cudaMemRangeGetAttribute_v8000 = 266¶
- cudaMemRangeGetAttributes_v8000 = 267¶
- cudaMemcpy2DArrayToArray_ptds_v7000 = 222¶
- cudaMemcpy2DArrayToArray_v3020 = 38¶
- cudaMemcpy2DAsync_ptsz_v7000 = 228¶
- cudaMemcpy2DAsync_v3020 = 44¶
- cudaMemcpy2DFromArrayAsync_ptsz_v7000 = 230¶
- cudaMemcpy2DFromArrayAsync_v3020 = 46¶
- cudaMemcpy2DFromArray_ptds_v7000 = 220¶
- cudaMemcpy2DFromArray_v3020 = 36¶
- cudaMemcpy2DToArrayAsync_ptsz_v7000 = 229¶
- cudaMemcpy2DToArrayAsync_v3020 = 45¶
- cudaMemcpy2DToArray_ptds_v7000 = 218¶
- cudaMemcpy2DToArray_v3020 = 34¶
- cudaMemcpy2D_ptds_v7000 = 216¶
- cudaMemcpy2D_v3020 = 32¶
- cudaMemcpy3DAsync_ptsz_v7000 = 246¶
- cudaMemcpy3DAsync_v3020 = 145¶
- cudaMemcpy3DBatchAsync_ptsz_v12080 = 485¶
- cudaMemcpy3DBatchAsync_v12080 = 484¶
- cudaMemcpy3DPeerAsync_ptsz_v7000 = 250¶
- cudaMemcpy3DPeerAsync_v4000 = 163¶
- cudaMemcpy3DPeer_ptds_v7000 = 249¶
- cudaMemcpy3DPeer_v4000 = 162¶
- cudaMemcpy3D_ptds_v7000 = 245¶
- cudaMemcpy3D_v3020 = 144¶
- cudaMemcpyArrayToArray_ptds_v7000 = 221¶
- cudaMemcpyArrayToArray_v3020 = 37¶
- cudaMemcpyAsync_ptsz_v7000 = 225¶
- cudaMemcpyAsync_v3020 = 41¶
- cudaMemcpyBatchAsync_ptsz_v12080 = 483¶
- cudaMemcpyBatchAsync_v12080 = 482¶
- cudaMemcpyFromArrayAsync_ptsz_v7000 = 227¶
- cudaMemcpyFromArrayAsync_v3020 = 43¶
- cudaMemcpyFromArray_ptds_v7000 = 219¶
- cudaMemcpyFromArray_v3020 = 35¶
- cudaMemcpyFromSymbolAsync_ptsz_v7000 = 232¶
- cudaMemcpyFromSymbolAsync_v3020 = 48¶
- cudaMemcpyFromSymbol_ptds_v7000 = 224¶
- cudaMemcpyFromSymbol_v3020 = 40¶
- cudaMemcpyPeerAsync_v4000 = 161¶
- cudaMemcpyPeer_v4000 = 160¶
- cudaMemcpyToArrayAsync_ptsz_v7000 = 226¶
- cudaMemcpyToArrayAsync_v3020 = 42¶
- cudaMemcpyToArray_ptds_v7000 = 217¶
- cudaMemcpyToArray_v3020 = 33¶
- cudaMemcpyToSymbolAsync_ptsz_v7000 = 231¶
- cudaMemcpyToSymbolAsync_v3020 = 47¶
- cudaMemcpyToSymbol_ptds_v7000 = 223¶
- cudaMemcpyToSymbol_v3020 = 39¶
- cudaMemcpy_ptds_v7000 = 215¶
- cudaMemcpy_v3020 = 31¶
- cudaMemset2DAsync_ptsz_v7000 = 236¶
- cudaMemset2DAsync_v3020 = 52¶
- cudaMemset2D_ptds_v7000 = 234¶
- cudaMemset2D_v3020 = 50¶
- cudaMemset3DAsync_ptsz_v7000 = 244¶
- cudaMemset3DAsync_v3020 = 143¶
- cudaMemset3D_ptds_v7000 = 243¶
- cudaMemset3D_v3020 = 142¶
- cudaMemsetAsync_ptsz_v7000 = 235¶
- cudaMemsetAsync_v3020 = 51¶
- cudaMemset_ptds_v7000 = 233¶
- cudaMemset_v3020 = 49¶
- cudaMipmappedArrayGetMemoryRequirements_v11060 = 429¶
- cudaMipmappedArrayGetSparseProperties_v11010 = 360¶
- cudaOccupancyAvailableDynamicSMemPerBlock_v10200 = 329¶
- cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags_v7000 = 251¶
- cudaOccupancyMaxActiveBlocksPerMultiprocessor_v6000 = 207¶
- cudaOccupancyMaxActiveBlocksPerMultiprocessor_v6050 = 210¶
- cudaOccupancyMaxActiveClusters_v11070 = 433¶
- cudaOccupancyMaxPotentialClusterSize_v11070 = 432¶
- cudaPeekAtLastError_v3020 = 11¶
- cudaPeerGetDevicePointer_v4000 = 159¶
- cudaPeerRegister_v4000 = 157¶
- cudaPeerUnregister_v4000 = 158¶
- cudaPointerGetAttributes_v4000 = 151¶
- cudaProfilerInitialize_v4000 = 170¶
- cudaProfilerStart_v4000 = 171¶
- cudaProfilerStop_v4000 = 172¶
- cudaRuntimeGetVersion_v3020 = 2¶
- cudaSetDeviceFlags_v3020 = 19¶
- cudaSetDevice_v3020 = 16¶
- cudaSetDoubleForDevice_v3020 = 124¶
- cudaSetDoubleForHost_v3020 = 125¶
- cudaSetValidDevices_v3020 = 18¶
- cudaSetupArgument_v3020 = 9¶
- cudaSignalExternalSemaphoresAsync_ptsz_v10000 = 280¶
- cudaSignalExternalSemaphoresAsync_v10000 = 279¶
- cudaSignalExternalSemaphoresAsync_v2_ptsz_v11020 = 394¶
- cudaSignalExternalSemaphoresAsync_v2_v11020 = 393¶
- cudaStreamAddCallback_ptsz_v7000 = 248¶
- cudaStreamAddCallback_v5000 = 197¶
- cudaStreamAttachMemAsync_ptsz_v7000 = 241¶
- cudaStreamAttachMemAsync_v6000 = 208¶
- cudaStreamBeginCaptureToGraph_ptsz_v12030 = 453¶
- cudaStreamBeginCaptureToGraph_v12030 = 452¶
- cudaStreamBeginCapture_ptsz_v10000 = 316¶
- cudaStreamBeginCapture_v10000 = 315¶
- cudaStreamCopyAttributes_ptsz_v11000 = 342¶
- cudaStreamCopyAttributes_v11000 = 341¶
- cudaStreamCreateWithFlags_v5000 = 198¶
- cudaStreamCreateWithPriority_v5050 = 202¶
- cudaStreamCreate_v3020 = 129¶
- cudaStreamDestroy_v3020 = 130¶
- cudaStreamDestroy_v5050 = 201¶
- cudaStreamEndCapture_ptsz_v10000 = 320¶
- cudaStreamEndCapture_v10000 = 319¶
- cudaStreamGetAttribute_ptsz_v11000 = 344¶
- cudaStreamGetAttribute_v11000 = 343¶
- cudaStreamGetCaptureInfo_ptsz_v10010 = 325¶
- cudaStreamGetCaptureInfo_v10010 = 324¶
- cudaStreamGetCaptureInfo_v2_ptsz_v11030 = 410¶
- cudaStreamGetCaptureInfo_v2_v11030 = 409¶
- cudaStreamGetCaptureInfo_v3_ptsz_v12030 = 462¶
- cudaStreamGetCaptureInfo_v3_v12030 = 461¶
- cudaStreamGetDevice_ptsz_v12080 = 481¶
- cudaStreamGetDevice_v12080 = 480¶
- cudaStreamGetFlags_ptsz_v7000 = 238¶
- cudaStreamGetFlags_v5050 = 204¶
- cudaStreamGetId_ptsz_v12000 = 442¶
- cudaStreamGetId_v12000 = 441¶
- cudaStreamGetPriority_ptsz_v7000 = 237¶
- cudaStreamGetPriority_v5050 = 203¶
- cudaStreamIsCapturing_ptsz_v10000 = 318¶
- cudaStreamIsCapturing_v10000 = 317¶
- cudaStreamQuery_ptsz_v7000 = 240¶
- cudaStreamQuery_v3020 = 132¶
- cudaStreamSetAttribute_ptsz_v11000 = 346¶
- cudaStreamSetAttribute_v11000 = 345¶
- cudaStreamSetFlags_ptsz_v10200 = 331¶
- cudaStreamSetFlags_v10200 = 330¶
- cudaStreamSynchronize_ptsz_v7000 = 239¶
- cudaStreamSynchronize_v3020 = 131¶
- cudaStreamUpdateCaptureDependencies_ptsz_v11030 = 412¶
- cudaStreamUpdateCaptureDependencies_v11030 = 411¶
- cudaStreamUpdateCaptureDependencies_v2_ptsz_v12030 = 464¶
- cudaStreamUpdateCaptureDependencies_v2_v12030 = 463¶
- cudaStreamWaitEvent_ptsz_v7000 = 247¶
- cudaStreamWaitEvent_v3020 = 147¶
- cudaThreadExchangeStreamCaptureMode_v10010 = 327¶
- cudaThreadExit_v3020 = 123¶
- cudaThreadGetCacheConfig_v3020 = 150¶
- cudaThreadGetLimit_v3020 = 127¶
- cudaThreadSetCacheConfig_v3020 = 146¶
- cudaThreadSetLimit_v3020 = 128¶
- cudaThreadSynchronize_v3020 = 126¶
- cudaUnbindTexture_v3020 = 58¶
- cudaUserObjectCreate_v11030 = 413¶
- cudaUserObjectRelease_v11030 = 415¶
- cudaUserObjectRetain_v11030 = 414¶
- cudaVDPAUGetDevice_v3020 = 80¶
- cudaVDPAUSetVDPAUDevice_v3020 = 81¶
- cudaWGLGetDevice_v3020 = 71¶
- cudaWaitExternalSemaphoresAsync_ptsz_v10000 = 282¶
- cudaWaitExternalSemaphoresAsync_v10000 = 281¶
- cudaWaitExternalSemaphoresAsync_v2_ptsz_v11020 = 396¶
- cudaWaitExternalSemaphoresAsync_v2_v11020 = 395¶
- cupti.cupti.activity_configure_unified_memory_counter(intptr_t config, uint32_t count)¶
Set Unified Memory Counter configuration.
- Args:
config (intptr_t): A pointer to
CUpti_ActivityUnifiedMemoryCounterConfig
structures containing Unified Memory counter configuration. count (uint32_t): Number of Unified Memory counter configuration structures.
See also
cuptiActivityConfigureUnifiedMemoryCounter
- cupti.cupti.activity_disable(int kind)¶
Disable collection of a specific kind of activity record.
- Args:
kind (CUpti_ActivityKind): The kind of activity record to stop collecting.
See also
cuptiActivityDisable
- cupti.cupti.activity_disable_context(intptr_t context, int kind)¶
Disable collection of a specific kind of activity record for a context.
- Args:
context (intptr_t): The context for which activity is to be disabled. kind (CUpti_ActivityKind): The kind of activity record to stop collecting.
See also
cuptiActivityDisableContext
- cupti.cupti.activity_enable(int kind)¶
Enable collection of a specific kind of activity record.
- Args:
kind (CUpti_ActivityKind): The kind of activity record to collect.
See also
cuptiActivityEnable
- cupti.cupti.activity_enable_allocation_source(uint8_t enable)¶
Enables tracking the source library for memory allocation requests.
- Args:
enable (uint8_t): is a boolean, denoting whether the source library of the memory allocation request needs to be tracked.
See also
cuptiActivityEnableAllocationSource
- cupti.cupti.activity_enable_and_dump(int kind)¶
Enable collection of a specific kind of activity record. For certain activity kinds it dumps existing records.
- Args:
kind (CUpti_ActivityKind): The kind of activity record to collect.
See also
cuptiActivityEnableAndDump
- cupti.cupti.activity_enable_context(intptr_t context, int kind)¶
Enable collection of a specific kind of activity record for a context.
- Args:
context (intptr_t): The context for which activity is to be enabled. kind (CUpti_ActivityKind): The kind of activity record to collect.
See also
cuptiActivityEnableContext
- cupti.cupti.activity_enable_device_graph(uint8_t enable)¶
Controls the collection of records for device launched graphs.
- Args:
enable (uint8_t): is a boolean, denoting whether these records should be collected.
See also
cuptiActivityEnableDeviceGraph
- cupti.cupti.activity_enable_driver_api(uint32_t cbid, uint8_t enable)¶
Controls the collection of activity records for specific CUDA Driver APIs.
- Args:
cbid (uint32_t): callback id of the CUDA Driver API. This can be found in the header
cupti_driver_cbid.h
. enable (uint8_t): is a boolean, denoting whether to enable or disable the collection.
See also
cuptiActivityEnableDriverApi
- cupti.cupti.activity_enable_hw_trace(uint8_t enable)¶
Enables the collection of CUDA kernel timestamps through HW events.
- Args:
enable (uint8_t): is a boolean, denoting whether to enable or disable the collection through HW events.
See also
cuptiActivityEnableHWTrace
- cupti.cupti.activity_enable_latency_timestamps(uint8_t enable)¶
Controls the collection of queued and submitted timestamps for kernels.
- Args:
enable (uint8_t): is a boolean, denoting whether these timestamps should be collected.
See also
cuptiActivityEnableLatencyTimestamps
- cupti.cupti.activity_enable_launch_attributes(uint8_t enable)¶
Controls the collection of launch attributes for kernels.
- Args:
enable (uint8_t): is a boolean denoting whether these launch attributes should be collected.
See also
cuptiActivityEnableLaunchAttributes
- cupti.cupti.activity_enable_runtime_api(uint32_t cbid, uint8_t enable)¶
Controls the collection of activity records for specific CUDA Runtime APIs.
- Args:
cbid (uint32_t): callback id of the CUDA Runtime API. This can be found in the header
cupti_runtime_cbid.h
. enable (uint8_t): is a boolean, denoting whether to enable or disable the collection.
See also
cuptiActivityEnableRuntimeApi
- cupti.cupti.activity_flush_all(uint32_t flag)¶
Request to deliver activity records via the buffer completion callback.
- Args:
flag (uint32_t): The flag can be set to indicate a forced flush. See CUpti_ActivityFlag.
See also
cuptiActivityFlushAll
- cupti.cupti.activity_flush_period(uint32_t time)¶
Sets the flush period for the worker thread.
- Args:
time (uint32_t): flush period in milliseconds (ms).
See also
cuptiActivityFlushPeriod
- cupti.cupti.activity_get_attribute(int attr, intptr_t value_size, intptr_t value)¶
Read an activity API attribute.
- Args:
attr (CUpti_ActivityAttribute): The attribute to read. value_size (intptr_t): Size of buffer pointed by the value, and returns the number of bytes written to
value
. value (intptr_t): Returns the value of the attribute.
See also
cuptiActivityGetAttribute
- cupti.cupti.activity_get_num_dropped_records(intptr_t context, uint32_t stream_id, intptr_t dropped)¶
Get the number of activity records that were dropped of insufficient buffer space.
- Args:
context (intptr_t): The context, or NULL to get dropped count from global queue. stream_id (uint32_t): The stream ID. dropped (intptr_t): The number of records that were dropped since the last call to this function.
See also
cuptiActivityGetNumDroppedRecords
- cupti.cupti.activity_pop_external_correlation_id(int kind) uint64_t ¶
Pop an external correlation id for the calling thread.
- Args:
kind (CUpti_ExternalCorrelationKind): The kind of external API activities should be correlated with.
- Returns:
uint64_t: If the function returns successful, contains the last external correlation id for this
kind
, can be NULL.
See also
cuptiActivityPopExternalCorrelationId
- cupti.cupti.activity_push_external_correlation_id(int kind, uint64_t id)¶
Push an external correlation id for the calling thread.
- Args:
kind (CUpti_ExternalCorrelationKind): The kind of external API activities should be correlated with. id (uint64_t): External correlation id.
See also
cuptiActivityPushExternalCorrelationId
- cupti.cupti.activity_register_callbacks(func_buffer_requested, func_buffer_completed)¶
Registers callback functions with CUPTI for activity buffer handling.
- Args:
func_buffer_requested (function): callback which is invoked when an empty buffer is requested by CUPTI. func_buffer_completed (function): callback which is invoked when a buffer containing activity records is available from CUPTI.
See also
cuptiActivityRegisterCallbacks
- cupti.cupti.activity_register_timestamp_callback(func_timestamp)¶
Registers callback function with CUPTI for providing timestamp.
- Args:
func_timestamp (function): callback which is invoked when a timestamp is needed by CUPTI.
See also
cuptiActivityRegisterTimestampCallback
- cupti.cupti.activity_set_attribute(int attr, intptr_t value_size, intptr_t value)¶
Write an activity API attribute.
- Args:
attr (CUpti_ActivityAttribute): The attribute to write. value_size (intptr_t): The size, in bytes, of the value. value (intptr_t): The attribute value to write.
See also
cuptiActivitySetAttribute
- cupti.cupti.compute_capability_supported(int major, int minor) int ¶
Check support for a compute capability.
- Args:
major (int): The major revision number of the compute capability. minor (int): The minor revision number of the compute capability.
- Returns:
int: Pointer to an integer to return the support status.
See also
cuptiComputeCapabilitySupported
- cupti.cupti.device_supported(int dev) int ¶
Check support for a compute device.
- Args:
dev (int): The device handle returned by CUDA Driver API cuDeviceGet.
- Returns:
int: Pointer to an integer to return the support status.
See also
cuptiDeviceSupported
- cupti.cupti.device_virtualization_mode(int dev) int ¶
Query the virtualization mode of the device.
- Args:
dev (int): The device handle returned by CUDA Driver API cuDeviceGet.
- Returns:
int: Pointer to an CUpti_DeviceVirtualizationMode to return the virtualization mode.
See also
cuptiDeviceVirtualizationMode
- cupti.cupti.enable_all_domains(uint32_t enable, intptr_t subscriber)¶
Enable or disable all callbacks in all domains.
- Args:
enable (uint32_t): New enable state for all callbacks in all domain. Zero disables all callbacks, non-zero enables all callbacks. subscriber (intptr_t): Handle to callback subscription.
See also
cuptiEnableAllDomains
- cupti.cupti.enable_callback(uint32_t enable, intptr_t subscriber, int domain, uint32_t cbid)¶
Enable or disabled callbacks for a specific domain and callback ID.
- Args:
enable (uint32_t): New enable state for the callback. Zero disables the callback, non-zero enables the callback. subscriber (intptr_t): Handle to callback subscription. domain (CUpti_CallbackDomain): The domain of the callback. cbid (uint32_t): The ID of the callback.
See also
cuptiEnableCallback
- cupti.cupti.enable_domain(uint32_t enable, intptr_t subscriber, int domain)¶
Enable or disabled all callbacks for a specific domain.
- Args:
enable (uint32_t): New enable state for all callbacks in the domain. Zero disables all callbacks, non-zero enables all callbacks. subscriber (intptr_t): Handle to callback subscription. domain (CUpti_CallbackDomain): The domain of the callback.
See also
cuptiEnableDomain
- cupti.cupti.finalize()¶
Detach CUPTI from the running process.
See also
cuptiFinalize
- cupti.cupti.get_auto_boost_state(intptr_t context, intptr_t state)¶
Get auto boost state.
- Args:
context (intptr_t): A valid CUcontext. state (intptr_t): A pointer to
CUpti_ActivityAutoBoostState
structure which contains the current state and the id of the process that has requested the current state.
See also
cuptiGetAutoBoostState
- cupti.cupti.get_callback_name(int domain, uint32_t cbid)¶
Get the name of a callback for a specific domain and callback ID.
- Args:
domain (CUpti_CallbackDomain): The domain of the callback. cbid (uint32_t): The ID of the callback.
- Returns:
name (str): Returns name of the callback for the specified domain and callback ID
See also
cuptiGetCallbackName
- cupti.cupti.get_callback_state(intptr_t subscriber, int domain, uint32_t cbid) uint32_t ¶
Get the current enabled/disabled state of a callback for a specific domain and function ID.
- Args:
subscriber (intptr_t): Handle to the initialize subscriber. domain (CUpti_CallbackDomain): The domain of the callback. cbid (uint32_t): The ID of the callback.
- Returns:
uint32_t: Returns non-zero if callback enabled, zero if not enabled.
See also
cuptiGetCallbackState
- cupti.cupti.get_context_id(intptr_t context) uint32_t ¶
Get the ID of a context.
- Args:
context (intptr_t): The context.
- Returns:
uint32_t: Returns a process-unique ID for the context.
See also
cuptiGetContextId
- cupti.cupti.get_device_id(intptr_t context) uint32_t ¶
Get the ID of a device.
- Args:
context (intptr_t): The context, or NULL to indicate the current context.
- Returns:
uint32_t: Returns the ID of the device that is current for the calling thread.
See also
cuptiGetDeviceId
- cupti.cupti.get_graph_exec_id(intptr_t graph_exec) uint32_t ¶
Get the unique ID of executable graph.
- Args:
graph_exec (intptr_t): The executable graph.
- Returns:
uint32_t: Returns the unique ID of the executable graph.
See also
cuptiGetGraphExecId
- cupti.cupti.get_graph_id(intptr_t graph) uint32_t ¶
Get the unique ID of graph.
- Args:
graph (intptr_t): The graph.
- Returns:
uint32_t: Returns the unique ID of the graph.
See also
cuptiGetGraphId
- cupti.cupti.get_graph_node_id(intptr_t node) uint64_t ¶
Get the unique ID of a graph node.
- Args:
node (intptr_t): The graph node.
- Returns:
uint64_t: Returns the unique ID of the node.
See also
cuptiGetGraphNodeId
- cupti.cupti.get_last_error()¶
Returns the last error from a cupti call or callback.
See also
cuptiGetLastError
- cupti.cupti.get_stream_id_ex(intptr_t context, intptr_t stream, uint8_t per_thread_stream) uint32_t ¶
Get the ID of a stream.
- Args:
context (intptr_t): If non-NULL then the stream is checked to ensure that it belongs to this context. Typically this parameter should be null. stream (intptr_t): The stream. per_thread_stream (uint8_t): Flag to indicate if program is compiled for per-thread streams.
- Returns:
uint32_t: Returns a context-unique ID for the stream.
See also
cuptiGetStreamIdEx
- cupti.cupti.get_thread_id_type() int ¶
Get the thread-id type.
- Returns:
int: .
See also
cuptiGetThreadIdType
- cupti.cupti.get_timestamp() uint64_t ¶
Get the CUPTI timestamp.
- Returns:
uint64_t: Returns the CUPTI timestamp.
See also
cuptiGetTimestamp
- cupti.cupti.set_thread_id_type(int type)¶
Set the thread-id type.
- Args:
type (CUpti_ActivityThreadIdType): .
See also
cuptiSetThreadIdType
- cupti.cupti.subscribe(callback, userdata) intptr_t ¶
Initialize a callback subscriber with a callback function and user data.
- Args:
callback (CUpti_CallbackFunc): The callback function. userdata (intptr_t): A pointer to user data. This data will be passed to the callback function via the
userdata
parameter.- Returns:
intptr_t: Returns handle to initialize subscriber.
See also
cuptiSubscribe
- cupti.cupti.supported_domains()¶
Get the available callback domains.
- Returns:
list[cupti.cupti.CallbackDomain]: List of all available callback domains
See also
cuptiSupportedDomains
- cupti.cupti.unsubscribe(intptr_t subscriber)¶
Unregister a callback subscriber.
- Args:
subscriber (intptr_t): Handle to the initialize subscriber.
See also
cuptiUnsubscribe