NVIDIA DALI
0.29.0 -2a85811
Audio Processing
Audio Decoder in DALI
  Step-by-step guide
  Verification
Audio spectrogram
  Background
  Reference implementation
  Calculating the spectrogram using DALI
  Mel spectrogram
  Mel-frequency cepstral coefficients (MFCCs)