nvidia.dali.fn
The following table lists all operations available in the nvidia.dali.fn module:
| Function | Device support | Short description |
|---|---|---|
| | CPU | Legacy alias for |
| | CPU, GPU | Resamples an audio signal. |
| | CPU | Produces a batch of random integers which can be used as indices for indexing samples in the batch. |
| | CPU, GPU | Flips bounding boxes horizontally or vertically (mirror). |
| | CPU | Transforms bounding boxes so that the boxes remain in the same place in the image after the image is pasted on a larger canvas. |
| | CPU | Transforms bounding boxes so that the boxes remain in the same place in the image after the image is rotated. Boxes that land outside the image with |
| | CPU, GPU | Encodes the input bounding boxes and labels using a set of default boxes (anchors) passed as an argument. |
| | CPU, GPU | Adjusts the brightness of the images. |
| | CPU, GPU | Adjusts the brightness and contrast of the images. |
| | CPU | Legacy alias for |
| | CPU | Legacy alias for |
| | CPU, GPU | Casts a tensor to a different type. |
| | CPU, GPU | Casts the first tensor to the type of the second tensor. |
| | CPU, GPU | Joins the input tensors along an existing axis. |
| | CPU, GPU | Contrast Limited Adaptive Histogram Equalization (CLAHE) operator. Performs local histogram equalization with clipping and bilinear blending of lookup tables (LUTs) between neighboring tiles. This technique enhances local contrast while preventing over-amplification of noise. It attempts to use the same algorithm as OpenCV (https://docs.opencv.org/4.x/d5/daf/tutorial_py_histogram_equalization.html). The input image is divided into rectangular tiles, and histogram equalization is applied to each tile independently. To avoid artifacts at tile boundaries, the lookup tables are bilinearly interpolated between neighboring tiles. Supports both grayscale (1-channel) and RGB (3-channel) uint8 images in HWC layout. |
| | CPU | Legacy alias for |
| | CPU, GPU | Generates random boolean values following a Bernoulli distribution. |
| | CPU, GPU | Converts between various image color models. |
| | CPU, GPU | Adjusts hue, saturation, brightness and contrast of the image. |
| | CPU, GPU | Adjusts the contrast of the images. |
| | CPU, GPU | Transforms vectors or points by flipping (reflecting) their coordinates with respect to a given center. |
| | CPU, GPU | Applies a linear transformation to points or vectors. |
| | CPU, GPU | Creates a copy of the input tensor. |
| | CPU, GPU | Crops the images with the specified window dimensions and window position (upper left corner). |
| | CPU, GPU | Performs fused cropping, normalization, format conversion (NHWC to NCHW) if desired, and type casting. |
| | CPU, GPU | Executes a Python function that operates on DLPack tensors. |
| | CPU, GPU | Saves the images in a batch to disk in PPM format. |
| | CPU, GPU | Extracts one or more elements from the input sequence. |
| | CPU, GPU | Erases one or more regions from the input tensors. |
| | CPU, GPU | Inserts new dimension(s) with extent 1 into the data shape. |
| | CPU, GPU | Allows externally provided data to be passed as an input to the pipeline. |
| | CPU, GPU | Legacy alias for ResizedCropMirror, with antialiasing disabled by default. |
| | CPU | Legacy alias for |
| | CPU, GPU | Flips the images in selected dimensions (horizontal, vertical, and depthwise). |
| | CPU | Returns new data of given shape and type, filled with a fill value. |
| | CPU | Returns new data with the same shape and type as the input data, filled with a fill_value. |
| | CPU, GPU | Applies a Gaussian Blur to the input. |
| | CPU, GPU | Returns a property of the tensor passed as an input. |
| | CPU, GPU | Performs the gridmask augmentation (https://arxiv.org/abs/2001.04086). |
| | CPU, GPU | Adjusts hue, saturation and value (brightness) of the images. |
| | CPU, GPU | Changes the hue level of the image. |
| | CPU, Mixed | Legacy alias for |
| | CPU, Mixed | Legacy alias for |
| | CPU, Mixed | Legacy alias for |
| | CPU, Mixed | Legacy alias for |
| | GPU | Performs a random Jitter augmentation. |
| | CPU, GPU | Introduces JPEG compression artifacts to RGB images. |
| | CPU, GPU | Computes the Laplacian of an input. |
| | CPU, GPU | Maps the input to output by using a lookup table that is specified by keys and values, and a default_value for unspecified keys. |
| | CPU, GPU | Converts a spectrogram to a mel spectrogram by applying a bank of triangular filters. |
| | CPU, GPU | Computes Mel Frequency Cepstral Coefficients (MFCC) from a mel spectrogram. |
| | CPU, GPU | Performs multiple pastes from image batch to each of the outputs. |
| | CPU | Legacy alias for |
| | CPU | Legacy alias for |
| | CPU, GPU | Performs leading and trailing silence detection in an audio buffer. |
| | CPU, GPU | Generates random numbers following a normal distribution. |
| | CPU, GPU | Normalizes the input by removing the mean and dividing by the standard deviation. |
| | CPU, GPU | Invokes an njit-compiled Numba function. |
| | CPU, GPU | Legacy alias for |
| | CPU, GPU | Produces a one-hot encoding of the input. |
| | CPU | Returns new data of given shape and type, filled with ones. |
| | CPU | Returns new data with the same shape and type as the input array, filled with ones. |
| | GPU | Calculates the optical flow between images in the input. |
| | CPU, GPU | Pads all samples with the fill_value in the specified axes to match the largest extent in the batch for those axes or to match the minimum shape specified. |
| | CPU, GPU | Pastes the input images on a larger canvas, where the canvas size is equal to |
| | CPU | Obtains the shape of the encoded image. |
| | CPU, GPU | Marks the input tensor as a sequence. |
| | CPU, GPU | Returns a batch of tensors constructed by selecting tensors from the input based on indices given in the indices argument. |
| | CPU | Calculates the power spectrum of the signal. |
| | CPU, GPU | Applies a preemphasis filter to the input data. |
| | CPU, GPU | Executes a Python function. |
| | CPU | Applies a prospective random crop to an image coordinate space while keeping the bounding boxes, and optionally labels, consistent. |
| | CPU | Produces a cropping window with a randomly selected area and aspect ratio. |
| | CPU, GPU | Performs a crop with a randomly selected area and aspect ratio and resizes it to the specified size. |
| | CPU, GPU | Treats content of the input as if it had a different type, shape, and/or layout. |
| | CPU, GPU | Treats content of the input as if it had a different shape and/or layout. |
| | CPU, GPU | Resizes images. |
| | CPU, GPU | Performs a fused resize, crop, mirror operation. |
| | CPU | Produces a fixed shape cropping window, randomly placed so that as much of the provided region of interest (ROI) as possible is contained in it. |
| | CPU, GPU | Rotates the images by the specified angle. |
| | CPU, GPU | Changes the saturation level of the image. |
| | CPU | Legacy alias for |
| | CPU, GPU | Rearranges frames in a sequence. |
| | CPU, GPU | Returns the shapes of tensors in the input batch. |
| | CPU, GPU | Extracts a subtensor, or slice. |
| | CPU, GPU | Produces a spectrogram from a 1D signal (for example, audio). |
| | CPU, GPU | Performs a sphere augmentation. |
| | CPU, GPU | Removes the dimensions given as axes or axis_names. |
| | CPU | Performs a random crop with bounding boxes where Intersection Over Union (IoU) meets a randomly selected threshold between 0 and 1. |
| | CPU, GPU | Joins the input tensors along a new axis. |
| | CPU | Legacy alias for |
| | CPU, GPU | Converts a magnitude (real, positive) to the decibel scale. |
| | CPU, GPU | Executes a function that operates on Torch tensors. |
| | CPU, GPU | Transposes the tensors by reordering the dimensions based on the perm parameter. |
| | CPU, GPU | Generates random numbers following a uniform distribution. |
| | GPU | Legacy alias for |
| | GPU | Legacy alias for |
| | CPU, GPU | Applies an affine transformation to the images. |
| | CPU, GPU | Performs a water augmentation, which makes the image appear to be underwater. |
| | CPU | Returns new data of given shape and type, filled with zeros. |
| | CPU | Returns new data with the same shape and type as the input array, filled with zeros. |
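
To make one of the descriptions in the table concrete, the sketch below shows in plain Python what flipping bounding boxes horizontally or vertically amounts to. It is not DALI code and does not call anything in nvidia.dali.fn; the function name `flip_boxes`, its parameters, and the choice of normalized (left, top, right, bottom) coordinates are assumptions made for illustration only.

```python
from typing import List, Tuple

# Hypothetical box format for this sketch: (left, top, right, bottom),
# with every coordinate normalized to the range [0, 1].
Box = Tuple[float, float, float, float]


def flip_boxes(boxes: List[Box], horizontal: bool = True, vertical: bool = False) -> List[Box]:
    """Mirror normalized boxes about the vertical and/or horizontal center line."""
    flipped = []
    for left, top, right, bottom in boxes:
        if horizontal:
            # Mirroring about x = 0.5 turns the old right edge into the new left edge.
            left, right = 1.0 - right, 1.0 - left
        if vertical:
            # Mirroring about y = 0.5 turns the old bottom edge into the new top edge.
            top, bottom = 1.0 - bottom, 1.0 - top
        flipped.append((left, top, right, bottom))
    return flipped


boxes = [(0.25, 0.0, 0.5, 0.5)]
print(flip_boxes(boxes))                 # horizontal flip only
print(flip_boxes(boxes, vertical=True))  # horizontal and vertical flip
```

Swapping the edges while mirroring keeps left <= right and top <= bottom, so the result is still a valid box, and applying the same flip twice returns the original coordinates.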