VPI - Vision Programming Interface

0.1.0 Release

Gaussian Image Filter

Overview

Gaussian image filter is a low-pass discrete Gaussian filter that smooths out the image by doing a Gaussian-weighted averaging of neighbor pixels of a given input pixel. It produces images with less artifacts than Box Image Filter, but could potentially be more costly to compute.

It supports two modes of operation:

  • Kernel support size is automatically calculated based on the filter standard deviation (sigma).
  • Use both user-provided kernel support size and filter standard deviation.
Input Gaussian kernel Output
7x7 support,

\[ \sigma=1.7 \]

Implementation

Gaussian filter is implemented as a convolution operation on the input image where the kernel has the following weights:

\[ w_g[x,y] = \frac{1}{2\pi\sigma^2} \cdot e^{-\frac{x^2+y^2}{2\sigma^2}} \]

When the input kernel support size is 0 for a given dimension (or both), it is calculated from the given standard deviation by assuming that the weights outside \(\pm3\sigma\) window are zero.

In this case, the following formula is used:

\[ w = \max\{3,2 \times \lceil 3\sigma\rceil-1\} \]

Note
We clamp the minimum kernel size to 3 because a kernel with size 1 doesn't have enough samples to properly characterize a Gaussian function.

Usage

  1. Initialization phase
    1. Include the header that defines the Gaussian filter function.
    2. Define the stream on which the algorithm will be executed, the input and output images.
      VPIStream stream = /*...*/;
      VPIImage input = /*...*/;
    3. Create the output image.
      uint32_t w,h;
      vpiImageGetSize(input, &w, &h);
      vpiImageGetType(input, &type);
      VPIImage output;
      vpiImageCreate(w, h, type, 0, &output);
  2. Processing phase
    1. Submit the algorithm to the stream, input, output images, window size and boundary condition.
      vpiSubmitGaussianImageFilter(stream, input, output, 7, 7, 1.7, 1.7, VPI_BOUNDARY_COND_ZERO);
    2. Optionally, wait until the processing is done.
      vpiStreamSync(stream);

Limitations and Constraints

Gaussian filter is currently implemented using separable image convolver if it satisfies \(max(m,n) \geq 7\), and the input characteristics (size and/or type) satisfies its constraints.

If \(max(m,n) < 7\), it'll use image convolver if input characteristics satisfies its constraints, or else the operation will fail.

Note
Currently image convolver has some limitations on its PVA backend that makes some Gaussian filter operations fail. This is the case when the resulting convolution kernel has more than 49 weights on image types VPI_IMAGE_TYPE_Y8 and VPI_IMAGE_TYPE_Y8I.
VPIImageType
VPIImageType
Image formats.
Definition: Types.h:172
GaussianImageFilter.h
vpiStreamSync
VPIStatus vpiStreamSync(VPIStream stream)
Blocks the calling thread until all submitted commands in this stream queue are done (queue is empty)...
VPIImage
struct VPIImageImpl * VPIImage
Definition: Types.h:153
vpiImageGetSize
VPIStatus vpiImageGetSize(VPIImage img, uint32_t *width, uint32_t *height)
Get the image size in pixels.
vpiImageGetType
VPIStatus vpiImageGetType(VPIImage img, VPIImageType *type)
Get the image type.
vpiImageCreate
VPIStatus vpiImageCreate(uint32_t width, uint32_t height, VPIImageType type, uint32_t flags, VPIImage *img)
Create an empty image instance with the specified flags.
VPI_BOUNDARY_COND_ZERO
All pixels outside the image are considered to be zero.
Definition: Types.h:204
VPIStream
struct VPIStreamImpl * VPIStream
Definition: Types.h:147
vpiSubmitGaussianImageFilter
VPIStatus vpiSubmitGaussianImageFilter(VPIStream stream, VPIImage input, VPIImage output, uint32_t kernelSizeX, uint32_t kernelSizeY, float sigmaX, float sigmaY, VPIBoundaryCond boundary)
Runs a 2D Gaussian filter over an image.