NVIDIA nvJPEG2000#

The nvJPEG2000 library accelerates the decoding and encoding of JPEG2000 images on NVIDIA GPUs. The library is built on the CUDA platform and is supported on Pascal+ GPU architectures.

Note

Throughout this document, the terms “CPU” and “Host” are used synonymously. Similarly, the terms “GPU” and “Device” are synonymous.

Decoder#

The library utilizes both CPU and GPU for decoding. Tier 2 decode stage (first stage of decode; please refer to the JPEG2000 specification for details) is run on the CPU. All other stages of the decoding process are offloaded to the GPU.

The nvJPEG2000 decoder supports the following:

JPEG2000 Options:

Up to 16 bits per component
No of components : 4
Reversible(5-3) and irreversible(9-7) wavelet transforms
Multiple tiles per image
Up to 32 layers per image
Chroma sub-sampling types - 444, 422, and 420
All progression orders
Image and tile start coordinates should be 0
Tile/Resolution based decoding
Partial decoding of tiles and images
jp2 file format and jpeg2000 codestream
High throughput jpeg2000(All code-blocks have to be HT code-blocks with one HT set and no refinement)

Features:

Decode multiple images in pipeline mode. Refer to nvJPEG2000-Decoder-Pipelined sample for details.
Color conversion to RGB.
Both planar and interleaved outputs are supported.

Encoder#

The library utilizes both GPU and CPU to create JPEG2000 bitstreams. The encode APIs require that the input image is on device memory, and the compressed bitstream will be written to host memory.

The nvJPEG2000 encoder supports the following:

JPEG2000 options

Up to 16 bits per component
No of components : 4
Reversible(5-3) and irreversible(9-7) wavelet transforms
Multiple tiles per image
Single layer per image
All progression orders
Chroma sub-sampling types - 444, 422, and 420
jp2 file format and jpeg2000 codestream
Lossy encode using one of the following options
- Quantization step
- Q-Factor (JPEG like quality)
- PSNR

Features

Both planar and interleaved inputs are supported.

Prerequisites#

CUDA Toolkit version 12.0 and above
CUDA Driver version r525 and above

Platforms Supported#

Linux versions:

Architecture	Distribution Information
Architecture	Name	Version	GCC	GLIBC
x86_64	RHEL / Rocky Linux	10	14.2.1	2.39
		9	11.5.0	2.34
		8	8.5.0	2.28
	Ubuntu	24.04	13.2.0	2.39
	Ubuntu	22.04	12.3.0	2.34
	OpenSUSE Leap	15	7.5.0	2.38
	SUSE SLES	15	7.5.0	2.38
	Debian	12	12.2.0	2.36
	Fedora	42	15.1.1	2.41
arm64-sbsa	RHEL	10	14.2.1	2.39
		9	11.3.1	2.34
		8	8.5.0	2.28
	Ubuntu	24.04	13.3.0	2.39
	Ubuntu	22.04	11.4.0	2.35
	SUSE SLES	15	7.5.0	2.38
	Debian	12	12.2.0	2.36

Windows Versions
- Windows 10/11.
Tegra
- arm64-sbsa builds are supported on Thor starting CUDA 13.0.
- aarch64-jetson binaries based on CUDA 12 are available for Orin.

Thread Safety#

Not all nvJPEG2000 types are thread safe. The following should be instantiated separately for each thread: nvjpeg2kDecodeState_t, nvjpeg2kStream_t, and nvjpeg2kEncodeState_t.