NVIDIA nvJPEG2000#
The nvJPEG2000 library accelerates the decoding and encoding of JPEG2000 images on NVIDIA GPUs. The library is built on the CUDA platform and is supported on Pascal and later GPU architectures.
Note
Throughout this document, the terms “CPU” and “Host” are used synonymously. Similarly, the terms “GPU” and “Device” are synonymous.
Decoder#
The library uses both the CPU and the GPU for decoding. The Tier-2 decode stage (the first stage of decoding; refer to the JPEG2000 specification for details) runs on the CPU. All other stages of the decoding process are offloaded to the GPU.
The nvJPEG2000 decoder supports the following:
JPEG2000 Options:
Up to 16 bits per component
Number of components: 4
Reversible (5-3) and irreversible (9-7) wavelet transforms
Multiple tiles per image
Up to 32 layers per image
Chroma subsampling types: 444, 422, and 420
All progression orders
Image and tile start coordinates must be 0
Tile/Resolution based decoding
Partial decoding of tiles and images
JP2 file format and JPEG2000 codestream
High-Throughput JPEG2000 (all code-blocks must be HT code-blocks with one HT set and no refinement)
Features:
Decode multiple images in pipeline mode. Refer to the nvJPEG2000-Decoder-Pipelined sample for details.
Color conversion to RGB.
Both planar and interleaved outputs are supported.
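Several of the decoder limits listed above (component count, bit depth, zero image and tile origins) can be read from the SIZ marker segment at the start of a JPEG2000 codestream, before any decode is attempted. The following is a minimal sketch of such a pre-check, not part of the nvJPEG2000 API. The names `SizInfo`, `parse_siz`, and `meets_listed_limits` are hypothetical, the field offsets follow the SIZ layout in the JPEG2000 Part 1 specification, and the sketch assumes a raw codestream (a JP2 file wraps the codestream in boxes, which this code does not unwrap). It also assumes the component entry above means "at most 4".

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Subset of SIZ fields relevant to the limits listed above (hypothetical helper type).
struct SizInfo {
    uint32_t image_x0 = 0, image_y0 = 0;  // XOsiz, YOsiz: image start coordinates
    uint32_t tile_x0 = 0, tile_y0 = 0;    // XTOsiz, YTOsiz: tile start coordinates
    uint16_t num_components = 0;          // Csiz
    uint8_t max_precision = 0;            // largest bit depth over all components
};

// JPEG2000 marker segments store integers in big-endian order.
static uint16_t be16(const uint8_t* p) {
    return static_cast<uint16_t>((p[0] << 8) | p[1]);
}
static uint32_t be32(const uint8_t* p) {
    return (uint32_t(p[0]) << 24) | (uint32_t(p[1]) << 16) |
           (uint32_t(p[2]) << 8) | uint32_t(p[3]);
}

// Reads SOC (0xFF4F) followed by SIZ (0xFF51) from the start of a raw codestream.
// Returns false if the buffer is too short or the markers are not where expected.
bool parse_siz(const std::vector<uint8_t>& cs, SizInfo& out) {
    const std::size_t kFixed = 4 + 38;  // SOC + SIZ marker + fixed-length SIZ fields
    if (cs.size() < kFixed) return false;
    if (be16(&cs[0]) != 0xFF4F || be16(&cs[2]) != 0xFF51) return false;

    const uint8_t* siz = &cs[4];  // points at Lsiz
    out.image_x0 = be32(siz + 12);
    out.image_y0 = be32(siz + 16);
    out.tile_x0 = be32(siz + 28);
    out.tile_y0 = be32(siz + 32);
    out.num_components = be16(siz + 36);

    // Each component contributes three bytes: Ssiz, XRsiz, YRsiz.
    if (cs.size() < kFixed + 3u * out.num_components) return false;
    out.max_precision = 0;
    for (uint16_t c = 0; c < out.num_components; ++c) {
        const uint8_t ssiz = siz[38 + 3 * c];
        // Low 7 bits hold (bit depth - 1); the top bit is the signedness flag.
        const uint8_t precision = static_cast<uint8_t>((ssiz & 0x7F) + 1);
        if (precision > out.max_precision) out.max_precision = precision;
    }
    return true;
}

// Checks only the limits that SIZ can answer; layers, progression order,
// and HT constraints live in other marker segments and are not examined.
bool meets_listed_limits(const SizInfo& s) {
    return s.num_components >= 1 && s.num_components <= 4 &&
           s.max_precision <= 16 &&
           s.image_x0 == 0 && s.image_y0 == 0 &&
           s.tile_x0 == 0 && s.tile_y0 == 0;
}
```

A caller would typically run `parse_siz` on the first bytes of the file and fall back to a CPU decoder when `meets_listed_limits` returns false. A passing check does not guarantee that the stream is decodable by the library, since the remaining constraints are not inspected.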
Encoder#
The library uses both the GPU and the CPU to create JPEG2000 bitstreams. The encode APIs require the input image to be in device memory; the compressed bitstream is written to host memory.
The nvJPEG2000 encoder supports the following:
JPEG2000 Options:
Up to 16 bits per component
Number of components: 4
Reversible (5-3) and irreversible (9-7) wavelet transforms
Multiple tiles per image
Single layer per image
All progression orders
Chroma subsampling types: 444, 422, and 420
JP2 file format and JPEG2000 codestream
Lossy encode using one of the following options: quantization step, Q-Factor (JPEG-like quality), or PSNR
Features:
Both planar and interleaved inputs are supported.
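PSNR appears above as one of the lossy encode targets. The source does not state how the library computes it, so the sketch below uses the conventional definition, PSNR = 10·log10(peak² / MSE) with peak = 2^bit_depth − 1, as an assumption. The function name `psnr_db` is hypothetical and the code runs on host buffers only; it is meant for checking a decoded image against the original, not as a description of the encoder's rate control.

```cpp
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <limits>
#include <vector>

// Conventional PSNR in dB between two equally sized sample buffers.
// bit_depth is the component precision (1 to 16, matching the limit listed above).
// Returns NaN for mismatched or empty inputs and +infinity for identical inputs.
double psnr_db(const std::vector<uint16_t>& reference,
               const std::vector<uint16_t>& decoded,
               int bit_depth) {
    if (reference.size() != decoded.size() || reference.empty() ||
        bit_depth < 1 || bit_depth > 16) {
        return std::numeric_limits<double>::quiet_NaN();
    }
    double sum_squared_error = 0.0;
    for (std::size_t i = 0; i < reference.size(); ++i) {
        const double diff = double(reference[i]) - double(decoded[i]);
        sum_squared_error += diff * diff;
    }
    if (sum_squared_error == 0.0) {
        return std::numeric_limits<double>::infinity();
    }
    const double mse = sum_squared_error / double(reference.size());
    const double peak = double((1u << bit_depth) - 1u);
    return 10.0 * std::log10(peak * peak / mse);
}
```

For 8-bit data with a mean squared error of 1, this evaluates to 20·log10(255), roughly 48.13 dB.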
Prerequisites#
CUDA Toolkit version 12.0 or later
CUDA Driver version r525 or later
Platforms Supported#
Linux versions:
| Architecture | Distribution | Version | GCC | GLIBC |
|---|---|---|---|---|
| x86_64 | RHEL / Rocky Linux | 10 | 14.2.1 | 2.39 |
| x86_64 | RHEL / Rocky Linux | 9 | 11.5.0 | 2.34 |
| x86_64 | RHEL / Rocky Linux | 8 | 8.5.0 | 2.28 |
| x86_64 | Ubuntu | 24.04 | 13.2.0 | 2.39 |
| x86_64 | Ubuntu | 22.04 | 12.3.0 | 2.34 |
| x86_64 | OpenSUSE Leap | 15 | 7.5.0 | 2.38 |
| x86_64 | SUSE SLES | 15 | 7.5.0 | 2.38 |
| x86_64 | Debian | 12 | 12.2.0 | 2.36 |
| x86_64 | Fedora | 42 | 15.1.1 | 2.41 |
| arm64-sbsa | RHEL | 10 | 14.2.1 | 2.39 |
| arm64-sbsa | RHEL | 9 | 11.3.1 | 2.34 |
| arm64-sbsa | RHEL | 8 | 8.5.0 | 2.28 |
| arm64-sbsa | Ubuntu | 24.04 | 13.3.0 | 2.39 |
| arm64-sbsa | Ubuntu | 22.04 | 11.4.0 | 2.35 |
| arm64-sbsa | SUSE SLES | 15 | 7.5.0 | 2.38 |
| arm64-sbsa | Debian | 12 | 12.2.0 | 2.36 |
Windows versions:
Windows 10 and Windows 11.
Tegra:
arm64-sbsa builds are supported on Thor starting with CUDA 13.0.
aarch64-jetson binaries based on CUDA 12 are available for Orin.
Thread Safety#
Not all nvJPEG2000 types are thread-safe. Create a separate instance of each of the following types for every thread that uses them: nvjpeg2kDecodeState_t, nvjpeg2kStream_t, and nvjpeg2kEncodeState_t.
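The per-thread rule above maps naturally onto a worker pattern in which each thread constructs its own state objects and never hands them to another thread. The sketch below illustrates only that ownership pattern. `PerThreadCodecState` and `run_workers` are hypothetical stand-ins that do not call the nvJPEG2000 API; in a real application the struct would hold the thread's own nvjpeg2kDecodeState_t, nvjpeg2kStream_t, and nvjpeg2kEncodeState_t.

```cpp
#include <cstddef>
#include <thread>
#include <vector>

// Hypothetical stand-in for the objects that must not be shared across threads.
// Real code would create the library's decode state, stream, and encode state
// in the constructor and release them in the destructor.
struct PerThreadCodecState {
    int images_processed = 0;
};

// Starts num_threads workers; each owns a private PerThreadCodecState.
// Returns the total number of images processed across all workers.
int run_workers(int num_threads, int images_per_thread) {
    std::vector<int> per_thread_counts(static_cast<std::size_t>(num_threads), 0);
    std::vector<std::thread> workers;
    for (int t = 0; t < num_threads; ++t) {
        workers.emplace_back([&per_thread_counts, t, images_per_thread] {
            PerThreadCodecState state;  // created inside the thread, never shared
            for (int i = 0; i < images_per_thread; ++i) {
                ++state.images_processed;  // placeholder for parse + decode work
            }
            // Each worker writes only its own slot, so no locking is needed.
            per_thread_counts[static_cast<std::size_t>(t)] = state.images_processed;
        });
    }
    for (std::thread& worker : workers) worker.join();

    int total = 0;
    for (int count : per_thread_counts) total += count;
    return total;
}
```

Creating the state inside the worker function, rather than in the launching thread, makes the ownership explicit and avoids accidentally capturing one instance in several lambdas.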