Basic functions for library management, in particular library version and device property query functions. More...

Functions
const NppLibraryVersion *	nppGetLibVersion (void)
	Get the NPP library version. More...

NppGpuComputeCapability	nppGetGpuComputeCapability (void)
	What CUDA compute model is supported by the active CUDA device? More...

int	nppGetGpuNumSMs (void)
	Get the number of Streaming Multiprocessors (SM) on the active CUDA device. More...

int	nppGetMaxThreadsPerBlock (void)
	Get the maximum number of threads per block on the active CUDA device. More...

int	nppGetMaxThreadsPerSM (void)
	Get the maximum number of threads per SM for the active GPU. More...

int	nppGetGpuDeviceProperties (int pMaxThreadsPerSM, int pMaxThreadsPerBlock, int *pNumberOfSMs)
	Get the maximum number of threads per SM, maximum threads per block, and number of SMs for the active GPU. More...

const char *	nppGetGpuName (void)
	Get the name of the active CUDA device. More...

cudaStream_t	nppGetStream (void)
	Get the NPP CUDA stream. More...

unsigned int	nppGetStreamNumSMs (void)
	Get the number of SMs on the device associated with the current NPP CUDA stream. More...

unsigned int	nppGetStreamMaxThreadsPerSM (void)
	Get the maximum number of threads per SM on the device associated with the current NPP CUDA stream. More...

void	nppSetStream (cudaStream_t hStream)
	Set the NPP CUDA stream. More...

Detailed Description

Basic functions for library management, in particular library version and device property query functions.

Function Documentation

NppGpuComputeCapability nppGetGpuComputeCapability ( void )

What CUDA compute model is supported by the active CUDA device?

Before trying to call any NPP functions, the user should make a call this function to ensure that the current machine has a CUDA capable device.

Returns: An enum value representing if a CUDA capable device was found and what level of compute capabilities it supports.

int nppGetGpuDeviceProperties	(	int *	pMaxThreadsPerSM,
		int *	pMaxThreadsPerBlock,
		int *	pNumberOfSMs
	)

Get the maximum number of threads per SM, maximum threads per block, and number of SMs for the active GPU.

Returns: cudaSuccess for success, -1 for failure

const char* nppGetGpuName ( void )

Get the name of the active CUDA device.

Returns: Name string of the active graphics-card/compute device in a system.

int nppGetGpuNumSMs ( void )

Get the number of Streaming Multiprocessors (SM) on the active CUDA device.

Returns: Number of SMs of the default CUDA device.

const NppLibraryVersion* nppGetLibVersion ( void )

Get the NPP library version.

Returns: A struct containing separate values for major and minor revision and build number.

int nppGetMaxThreadsPerBlock ( void )

Get the maximum number of threads per block on the active CUDA device.

Returns: Maximum number of threads per block on the active CUDA device.

int nppGetMaxThreadsPerSM ( void )

Get the maximum number of threads per SM for the active GPU.

Returns: Maximum number of threads per SM for the active GPU

cudaStream_t nppGetStream ( void )

Get the NPP CUDA stream.

NPP enables concurrent device tasks via a global stream state varible. The NPP stream by default is set to stream 0, i.e. non-concurrent mode. A user can set the NPP stream to any valid CUDA stream. All CUDA commands issued by NPP (e.g. kernels launched by the NPP library) are then issed to that NPP stream.

unsigned int nppGetStreamMaxThreadsPerSM ( void )

Get the maximum number of threads per SM on the device associated with the current NPP CUDA stream.

NPP enables concurrent device tasks via a global stream state varible. The NPP stream by default is set to stream 0, i.e. non-concurrent mode. A user can set the NPP stream to any valid CUDA stream. All CUDA commands issued by NPP (e.g. kernels launched by the NPP library) are then issed to that NPP stream. This call avoids a cudaGetDeviceProperties() call.

unsigned int nppGetStreamNumSMs ( void )

Get the number of SMs on the device associated with the current NPP CUDA stream.

NPP enables concurrent device tasks via a global stream state varible. The NPP stream by default is set to stream 0, i.e. non-concurrent mode. A user can set the NPP stream to any valid CUDA stream. All CUDA commands issued by NPP (e.g. kernels launched by the NPP library) are then issed to that NPP stream. This call avoids a cudaGetDeviceProperties() call.

void nppSetStream ( cudaStream_t hStream )

Set the NPP CUDA stream.

See Also: nppGetStream()

Functions

Detailed Description

Function Documentation