cuTENSOR
2.1.0
  • Release Notes
    • cuTENSOR v2.1.0
    • cuTENSOR v2.0.2
    • cuTENSOR v2.0.0
    • cuTENSOR v1.7.0
    • cuTENSOR v1.6.2
    • cuTENSOR v1.6.1
    • cuTENSOR v1.6.0
    • cuTENSOR v1.5.0
    • cuTENSOR v1.4.0
    • cuTENSOR v1.3.3
    • cuTENSOR v1.3.2
    • cuTENSOR v1.3.1
    • cuTENSOR v1.3.0
    • cuTENSOR v1.2.2
    • cuTENSOR v1.2.1
    • cuTENSOR v1.2.0
    • cuTENSOR v1.1.0
    • cuTENSOR v1.0.1
    • cuTENSOR v1.0.0
  • User Guide
    • Nomenclature
    • Einstein Notation
    • Performance Guidelines
    • Software-managed Plan Cache
    • Accuracy Guarantees
    • Scalar Types
    • Supported Unary Operators
    • Supported GPUs
    • CUDA Graph Support
    • Logging
    • Environment Variables
  • Getting Started
    • Installation and Compilation
    • Headers and Data Types
    • Define Tensor Sizes
    • Initialize Tensor Data
    • Create Tensor Descriptors
    • Create Contraction Descriptor
    • Determine Algorithm and Workspace
    • Plan and reduce workspace
    • Execute
  • Transition to cuTENSOR 2.x
    • Overview
    • Differences at a glance
    • Example 1: Migrating a contraction from 1.x to 2.x
    • Example 2: Migrating a reduction operation from 1.x to 2.x
    • Example 3: Migrating a permutation/elementwise operation from 1.x to 2.x
  • Just In Time (JIT) Compilation
    • Introductory Example
    • Reading and writing the kernel cache to disk
  • Plan Cache
    • Incremental Autotuning
    • Introductory Example
    • Advanced Example
  • Multi-GPU support - cuTENSORMg
    • Performance Guidelines
    • Accuracy Guarantees
    • Scalar Types
    • CUDA Graph Support
    • cuTENSORMg Logging
  • API Reference
    • cuTENSOR Data Types
      • cutensorDataType_t
      • cutensorComputeDescriptor_t
      • cutensorHandle_t
      • cutensorTensorDescriptor_t
      • cutensorOperationDescriptor_t
      • cutensorOperationDescriptorAttribute_t
      • cutensorPlanPreference_t
      • cutensorPlanPreferenceAttribute_t
      • cutensorPlan_t
      • cutensorPlanAttribute_t
      • cutensorAutotuneMode_t
      • cutensorJitMode_t
      • cutensorCacheMode_t
      • cutensorAlgo_t
      • cutensorWorksizePreference_t
      • cutensorOperator_t
      • cutensorStatus_t
      • cudaDataType_t
      • cutensorLoggerCallback_t
    • cuTENSOR Functions
      • Helper Functions
        • cutensorCreate()
        • cutensorDestroy()
        • cutensorCreateTensorDescriptor()
        • cutensorDestroyTensorDescriptor()
        • cutensorGetErrorString()
        • cutensorGetVersion()
        • cutensorGetCudartVersion()
      • Element-wise Operations
        • cutensorCreateElementwiseTrinary()
        • cutensorElementwiseTrinaryExecute()
        • cutensorCreateElementwiseBinary()
        • cutensorElementwiseBinaryExecute()
        • cutensorCreatePermutation()
        • cutensorPermute()
      • Contraction Operations
        • cutensorCreateContraction()
        • cutensorContract()
      • Reduction Operations
        • cutensorCreateReduction()
        • cutensorReduce()
      • Generic Operation Functions
        • cutensorDestroyOperationDescriptor()
        • cutensorOperationDescriptorGetAttribute()
        • cutensorOperationDescriptorSetAttribute()
        • cutensorCreatePlanPreference()
        • cutensorDestroyPlanPreference()
        • cutensorPlanPreferenceSetAttribute()
        • cutensorEstimateWorkspaceSize()
        • cutensorCreatePlan()
        • cutensorDestroyPlan()
        • cutensorPlanGetAttribute()
        • cutensorPlanPreferenceSetAttribute()
      • Cache-related Operations
        • cutensorHandleResizePlanCache()
        • cutensorHandleReadPlanCacheFromFile()
        • cutensorHandleWritePlanCacheToFile()
        • cutensorReadKernelCacheFromFile()
        • cutensorWriteKernelCacheToFile()
      • Logger Functions
        • cutensorLoggerSetCallback()
        • cutensorLoggerSetFile()
        • cutensorLoggerOpenFile()
        • cutensorLoggerSetLevel()
        • cutensorLoggerSetMask()
        • cutensorLoggerForceDisable()
  • API Reference - cuTENSORMg
    • General
      • cutensorMgHostDevice_t
      • cutensorMgHandle_t
      • cutensorMgTensorDescriptor_t
      • cutensorMgCreate()
      • cutensorMgDestroy()
      • cutensorMgCreateTensorDescriptor()
      • cutensorMgDestroyTensorDescriptor()
    • Copy Operations
      • cutensorMgCopyDescriptor_t
      • cutensorMgCopyPlan_t
      • cutensorMgCreateCopyDescriptor()
      • cutensorMgDestroyCopyDescriptor()
      • cutensorMgCopyGetWorkspace()
      • cutensorMgCreateCopyPlan()
      • cutensorMgDestroyCopyPlan()
      • cutensorMgCopy()
    • Contraction Operations
      • cutensorMgContractionDescriptor_t
      • cutensorMgContractionFind_t
      • cutensorMgContractionPlan_t
      • cutensorMgAlgo_t
      • cutensorMgCreateContractionDescriptor()
      • cutensorMgDestroyContractionDescriptor()
      • cutensorMgCreateContractionFind()
      • cutensorMgDestroyContractionFind()
      • cutensorMgContractionGetWorkspace()
      • cutensorMgCreateContractionPlan()
      • cutensorMgDestroyContractionPlan()
      • cutensorMgContraction()
  • Software License Agreement
  • Third Party License Agreements
    • HPTT
cuTENSOR
  • Search


Privacy Policy | Manage My Privacy | Do Not Sell or Share My Data | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2019-2024, NVIDIA Corporation and affiliates.

NVIDIA cuTensor v: 2.1.0