Overview

The Pyramidal Lucas-Kanade (LK) Optical Flow algorithm estimates the 2D translation of sparse feature points from a previous frame to the next. Image pyramids are used to improve the performance and robustness of tracking over larger translations.
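To see why pyramids help with larger translations, the short sketch below prints how large a given motion appears at each pyramid level. The function name `per_level_displacement`, the 24-pixel motion, the 4 levels and the 0.5 scale are illustrative assumptions, not values taken from VPI.

```python
def per_level_displacement(displacement_px, levels, scale):
    """Return the apparent displacement at each pyramid level, level 0 first.

    Hypothetical helper for illustration: each level is assumed to be
    `scale` times the size of the previous one, so motion shrinks equally.
    """
    return [displacement_px * scale ** level for level in range(levels)]


# Assumed example values: a 24 px motion, 4 levels, each half the previous size.
shifts = per_level_displacement(24.0, levels=4, scale=0.5)
for level, shift in enumerate(shifts):
    print(f"level {level}: {shift:g} px")
```

At the coarsest level the motion is only a few pixels, which a small tracking window can capture; the estimate is then refined on the way down to level 0.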

Inputs are the previous image pyramid, the next image pyramid, and the feature points on the previous image.

Outputs are the feature points on the next image and the tracking status of each feature point.


Implementation

Each feature point defines its location in the image with x, y coordinates. These points are then tracked in the next image. The tracking status reports whether each feature point is still being tracked successfully. For more information, see [1] and [2].
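As a rough illustration of the tracking step, the sketch below runs a textbook single-level Lucas-Kanade iteration, in the spirit of [1] and [2], on an analytic test pattern. It is not VPI's implementation: the pattern `f`, the shift `TRUE_SHIFT`, the window size and the iteration count are all assumptions chosen so the example stays self-contained.

```python
import math

TRUE_SHIFT = (0.8, -0.5)  # assumed motion between the two synthetic frames


def f(x, y):
    """Smooth synthetic intensity pattern (hypothetical test image)."""
    return math.sin(0.3 * x) + math.cos(0.25 * y)


def grad_f(x, y):
    """Analytic spatial gradient of the pattern."""
    return 0.3 * math.cos(0.3 * x), -0.25 * math.sin(0.25 * y)


def prev_img(x, y):
    return f(x, y)


def next_img(x, y):
    # Same pattern, moved by TRUE_SHIFT.
    return f(x - TRUE_SHIFT[0], y - TRUE_SHIFT[1])


def lk_track(px, py, half_window=3, iterations=20):
    """Estimate where the point (px, py) of prev_img lands in next_img."""
    window = [(px + i, py + j)
              for j in range(-half_window, half_window + 1)
              for i in range(-half_window, half_window + 1)]

    # 2x2 gradient matrix G accumulated over the window (fixed per point).
    gxx = gxy = gyy = 0.0
    for x, y in window:
        ix, iy = grad_f(x, y)
        gxx += ix * ix
        gxy += ix * iy
        gyy += iy * iy
    det = gxx * gyy - gxy * gxy
    if det < 1e-12:
        return None  # not enough texture to track this point

    dx = dy = 0.0
    for _ in range(iterations):
        # Mismatch vector b built from the residual at the current guess.
        bx = by = 0.0
        for x, y in window:
            ix, iy = grad_f(x, y)
            residual = prev_img(x, y) - next_img(x + dx, y + dy)
            bx += residual * ix
            by += residual * iy
        # Solve G * step = b with the closed-form 2x2 inverse.
        dx += (gyy * bx - gxy * by) / det
        dy += (gxx * by - gxy * bx) / det
    return px + dx, py + dy


tracked = lk_track(10.0, 10.0)
print(tracked)
```

A pyramidal tracker would run this same update at the coarsest level first and use the scaled-up result as the starting guess for the next finer level.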

C API functions

For the list of limitations, constraints and backends that implement the algorithm, consult the reference documentation of the following functions:

Function	Description
vpiInitOpticalFlowPyrLKParams	Initializes VPIOpticalFlowPyrLKParams with default values.
vpiCreateOpticalFlowPyrLK	Creates payload for vpiSubmitOpticalFlowPyrLK.
vpiSubmitOpticalFlowPyrLK	Runs Pyramidal LK Optical Flow on two frames.

Usage

Python

Import the VPI module.
  import vpi
Initialization phase
1. Create the Pyramidal Optical Flow LK object, feeding it the initial frame and the VPI array with the keypoints to track. The CUDA backend will be used to execute the algorithm.
  with vpi.Backend.CUDA:
      optflow = vpi.OpticalFlowPyrLK(frame, curFeatures, 4)
Processing phase
1. Fetch a new frame from the input video sequence into a VPI image.
  while inVideo.read(input)[0]:
2. Feed this VPI image into the optical flow object. It returns the estimated keypoint positions in the passed frame, along with a vector that reports each keypoint's state, i.e., whether it is still being tracked.
      curFeatures, status = optflow(input)
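After each call, the status vector is typically used to drop keypoints that were lost. The sketch below shows that filtering on plain Python lists rather than VPI arrays. The helper `split_by_status` is hypothetical, and the convention that a status of 0 means "tracked" is an assumption; check the reference documentation for the actual encoding.

```python
def split_by_status(points, status, tracked_value=0):
    """Split keypoints into (tracked, lost) lists.

    `tracked_value` is the status value assumed to mean "still tracked";
    this is an assumption, not something stated on this page.
    """
    tracked = [p for p, s in zip(points, status) if s == tracked_value]
    lost = [p for p, s in zip(points, status) if s != tracked_value]
    return tracked, lost


# Made-up keypoints and statuses standing in for the algorithm's outputs.
cur_points = [(12.5, 40.0), (99.0, 17.25), (63.0, 63.0)]
status = [0, 1, 0]

tracked, lost = split_by_status(cur_points, status)
print(len(tracked), "tracked,", len(lost), "lost")
```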

C/C++

Initialization phase
1. Include the header that defines the needed functions and structures.
  #include <vpi/algo/OpticalFlowPyrLK.h>
  
2. Create the stream where the algorithm will be submitted for execution.
  VPIStream stream;
  vpiStreamCreate(0, &stream);
3. Define the required images, pyramids and arrays.
  VPIImage prevImage = /* previous frame */;
  VPIImage curImage = /* current frame, refreshed inside the processing loop */;
  VPIPyramid pyrPrevFrame = /* pyramid out of previous frame */;
  VPIPyramid pyrCurFrame = /* pyramid for current frame */;
  VPIArray arrPrevPts = /* array with previous frame's keypoints, type VPI_ARRAY_TYPE_KEYPOINT_F32 */;
  VPIArray arrCurPts = /* array with current frame's keypoints, type VPI_ARRAY_TYPE_KEYPOINT_F32 */;
  VPIArray arrStatus = /* array with keypoint tracking status, type VPI_ARRAY_TYPE_U8 */;
  VPIArray scores = /* array with keypoint scores, type VPI_ARRAY_TYPE_U8 */;
4. Create the payload that will contain all temporary buffers needed for processing. Its parameters are taken from the input pyramid and image used.
  int levels;
  vpiPyramidGetNumLevels(pyrPrevFrame, &levels);

  float scale;
  vpiPyramidGetScale(pyrPrevFrame, &scale);

  VPIImageFormat format;
  vpiImageGetFormat(prevImage, &format);

  int width, height;
  vpiImageGetSize(prevImage, &width, &height);

  VPIPayload optflow;
  vpiCreateOpticalFlowPyrLK(VPI_BACKEND_CUDA, width, height, format, levels, scale, &optflow);
5. Define the configuration parameters that guide the LK tracking process.
  VPIOpticalFlowPyrLKParams lkParams;
  vpiInitOpticalFlowPyrLKParams(&lkParams);
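The payload is sized from the base image dimensions, the number of pyramid levels and the scale between levels. As a rough aid for reasoning about those parameters, the sketch below lists the dimensions each level would have. The helper `pyramid_level_sizes`, the 1280x720 input and the round-to-nearest rule are assumptions made for illustration; the exact rounding VPI applies is not stated here.

```python
def pyramid_level_sizes(width, height, levels, scale):
    """Return (width, height) for every pyramid level, level 0 first.

    Hypothetical helper: level k is assumed to be scale**k times the base
    size, rounded to the nearest pixel and never smaller than 1x1.
    """
    sizes = []
    for level in range(levels):
        factor = scale ** level
        sizes.append((max(1, round(width * factor)),
                      max(1, round(height * factor))))
    return sizes


# Assumed example: 1280x720 input, 4 levels, each half the previous size.
sizes = pyramid_level_sizes(1280, 720, levels=4, scale=0.5)
for level, (w, h) in enumerate(sizes):
    print(f"level {level}: {w}x{h}")
```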
Processing phase
1. Start the processing loop at the second frame. The algorithm reads the feature points from the previous frame and estimates their positions in the current frame.
  for (int idframe = 1; idframe < frame_count; ++idframe)
  {
2. Fetch a new frame from the input video.
  curImage = /* new frame from video sequence */;
3. Generate image pyramid for the current image using the CUDA backend.
  VPI_CHECK_STATUS(vpiSubmitGaussianPyramidGenerator(stream, VPI_BACKEND_CUDA, curImage, pyrCurFrame, VPI_BORDER_CLAMP));
4. Submit the algorithm to be executed by the CUDA backend. It goes through all input feature points and finds their estimated positions and tracking status in the next image. It is up to the user to decide whether to keep using the tracked feature points or to generate a new set. In this example, the tracked feature points are reused as input for the next frame.
  vpiSubmitOpticalFlowPyrLK(stream, VPI_BACKEND_CUDA, optflow, pyrPrevFrame, pyrCurFrame, arrPrevPts, arrCurPts, arrStatus, &lkParams);
  
5. Wait until the processing is done.
  vpiStreamSync(stream);
  
6. Prepare for the next iteration. Current iteration's *cur* buffers will be used as *prev* buffers for the next iteration.
  VPIImage tmpImg = prevImage;
  prevImage = curImage;
  curImage = tmpImg;

  VPIPyramid tmpPyr = pyrPrevFrame;
  pyrPrevFrame = pyrCurFrame;
  pyrCurFrame = tmpPyr;

  VPIArray tmpArray = arrPrevPts;
  arrPrevPts = arrCurPts;
  arrCurPts = tmpArray;
  }
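The swap at the end of each iteration is a ping-pong buffer pattern: only the handles change places, so no keypoint or pixel data is copied. The sketch below mimics the loop with plain Python lists. The function `fake_track` and its fixed shift are stand-ins invented for illustration and do not perform real tracking.

```python
def fake_track(src, dst, shift=(1.0, 0.5)):
    """Stand-in for the tracker: write src points moved by `shift` into dst."""
    for i, (x, y) in enumerate(src):
        dst[i] = (x + shift[0], y + shift[1])


# Two preallocated buffers, playing the roles of arrPrevPts and arrCurPts.
prev_pts = [(10.0, 10.0), (20.0, 5.0)]
cur_pts = [(0.0, 0.0), (0.0, 0.0)]
buffer_ids = {id(prev_pts), id(cur_pts)}

for frame in range(1, 4):
    fake_track(prev_pts, cur_pts)          # results land in the "cur" buffer
    prev_pts, cur_pts = cur_pts, prev_pts  # swap references, no copy

# After the loop, prev_pts refers to the most recent results.
print(prev_pts)
```

Because the same two buffers are reused for the whole sequence, nothing is allocated inside the loop.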
Cleanup phase
1. Free resources held by the stream, the payload, the pyramids, and the arrays.
  vpiStreamDestroy(stream);
  vpiPayloadDestroy(optflow);
  vpiPyramidDestroy(pyrPrevFrame);
  vpiPyramidDestroy(pyrCurFrame);
  vpiArrayDestroy(arrPrevPts);
  vpiArrayDestroy(arrCurPts);
  vpiArrayDestroy(arrStatus);

For more information, see Pyramidal LK Optical Flow in the "C API Reference" section of VPI - Vision Programming Interface.

Performance

For information on how to use the performance table below, see Algorithm Performance Tables.
Before comparing measurements, consult Comparing Algorithm Elapsed Times.
For further information on how performance was benchmarked, see Performance Benchmark.

References

[1] B. D. Lucas and T. Kanade (1981), "An iterative image registration technique with an application to stereo vision." Proceedings of Imaging Understanding Workshop, pages 121–130.
[2] J. Y. Bouguet (2000), "Pyramidal implementation of the affine Lucas Kanade feature tracker: description of the algorithm." Intel Corporation, Microprocessor Research Labs.