NVIDIA Docs Hub NVIDIA Networking BlueField DPUs / SuperNICs & DOCA DOCA Documentation v3.0.0 DOCA Erasure Coding

DOCA Erasure Coding

This guide provides instructions on how to use the DOCA Erasure Coding API .

Introduction

Note

This library is currently supported at alpha version.

The DOCA Erasure Coding ( known also as forward error correction or FEC) library provides an API to encode and decode data using hardware acceleration, supporting both host and NVIDIA® BlueField®-3 (and higher) DPU memory regions.

DOCA Erasure Coding recovers lost data fragments by creating generic redundancy fragments (backup). Each redundancy block that the library creates can help recover any block in the original data should a total loss of fragment occur. This increases data redundancy and reduces data overhead.

The library provides an API for executing erasure coding (EC) operations on DOCA buffers residing in either the DPU or host memory.

This document is intended for software developers wishing to accelerate their application's EC memory operations.

Glossary

The following terms aid in the understanding of the information on this page:

Term	Definition
Data	Original data, original blocks, blocks of original data to be protected/preserved
Coding matrix	Coefficients, the matrix used to generate the redundancy blocks and recovery
Redundancy blocks	Codes; encoded data; the extra blocks that help recover data loss
Encoding	The process of creating the redundancy blocks. Encoded data is referred to as the original blocks or redundancy blocks.
Decoding	The process of recovering the data. Decoded data is referred to as the original blocks alone.

Prerequisites

DOCA Erasure Coding library follows the architecture of a DOCA Core Context, it is recommended read the following sections before:

Changes From Previous Releases

Changes in 2.10.0

N/A

Environment

DOCA Erasure Coding-based applications can run either on the host machine or on the BlueField target (NVIDIA® BlueField®-3 and above).

Erasure Coding can be run with BlueField configured in either NIC or DPU mode (as described in BlueField Modes of Operation).

Architecture

DOCA Erasure Coding is a DOCA Context as defined by DOCA Core. This library leverages the DOCA Core architecture to expose asynchronous tasks/events that are offloaded to hardware.

The following diagram presents a high-level view of the EC transmission flow:

Erasure_coding_Transmission-version-1-modificationdate-1744380062660-api-v2.png

M packets are sent from the source (8 in this case).
Before the source send them, the source encode the data by adding to it T redundancy packets (4 in this case).
The packets are transmitted to the destination in UDP protocol. Some packets are lost and N' packets are received (in this case 4 packets are lost and 8 are received).
The destination decodes the data using all the packets available (both original data in green and redundancy data in red) and gets back the M original data packets.

Flows

Regular EC flow consists of the following elements:

Creating redundancy blocks from data (EC create).
Updating redundancy blocks from updated data (EC update).
Recovering data blocks from redundancy blocks (EC recover).

Screenshot_2022-11-30_120911-version-1-modificationdate-1744380062147-api-v2.png

The following sections examine an M:K (where M is the original data and K is redundancy) EC.

Create Redundancy Blocks

The user must perform the following:

Input M data blocks via doca_buf (filled with data, each block size B).
Output K empty blocks via doca_buf (each block size B).
Use DOCA Erasure Coding to create a coding matrix of M by K via doca_buf.
Use DOCA Erasure Coding Create task to get the K output redundancy blocks.

Note

This step can be repeated in a stream use case, as the DPU would not be the recovery or update point.

Screenshot_2022-11-24_101000-version-1-modificationdate-1744380061867-api-v2.png

Recover Block

The user must perform the following:

Input M-L original blocks via doca_buf (blocks that were not impaired).
Input L≤K (any) redundancy blocks via doca_buf (redundancy blocks originating from create/update tasks).
Input bitmask or array, indicating which blocks to recover.
Output L empty blocks via doca_buf (same size of data block).
Use DOCA Erasure Coding to create a recover coding matrix of M by L via doca_buf (unique per bitmask).
Use DOCA Erasure Coding Recover task to get the L output recovered data blocks.

Screenshot_2022-11-24_101058-version-1-modificationdate-1744380062430-api-v2.png

Objects

Device and Device Representor

The DOCA Erasure Coding library requires a DOCA device to operate. The device is used to access memory and perform the encoding and decoding operations. See DOCA Core Device Discovery.

For same Bluefield card, it does not matter which device is used (PF/VF/SF), as all these devices utilize the same HW component. If there are multiple DPUs, then it is possible to create an EC instance per DPU, providing each instance with a device from a different DPU . To access memory that is not local (from the host to the DPU and vice versa), the DPU side of the application must pick a device with an appropriate representor. See DOCA Core Representor Device Discovery.

The device must stay valid until the EC instance is destroyed.

Memory Buffers

Executing any DOCA EC task requires two DOCA buffers, a source buffer and a destination buffer.

Depending on the allocation pattern of the buffers, refer to the Inventory Types table.

Buffers must not be modified or read during the execution of any task.

Configuration Phase

To start using the library, first, you need to go through a configuration phase as described in DOCA Core Context Configuration Phase.

This section describes how to configure and start the context, to allow execution of tasks and retrieval of events.

Configurations

The context can be configured to match the application use case.

To find if a configuration is supported, or what the min/max value, please refer to Device Support.

Mandatory Configurations

These configurations are mandatory and must be set by the application before attempting to start the context:

At least 1 task/event type needs to be configured. See configuration of Tasks.
A device with appropriate support must be provided on creation.

Device Support

DOCA Erasure Coding needs a device to operate. For picking a device, see DOCA Core Device Discovery.

Erasure Coding can be used in BlueField-3 with some limitations (see architecture). Any device can be used PF/VF/SF.

As device capabilities may change in the future, it is recommended to choose your device using the following methods:

doca_ec_cap_task_galois_mul_is_supported
doca_ec_cap_task_create_is_supported
doca_ec_cap_task_update_is_supported
doca_ec_cap_task_recover_is_supported

Some devices can allow different capabilities as follows:

The maximum buffer list length
The maximum block size

Note

Current BlueField-3 limitations:

Data block count range: 1-128
Redundancy block count: 1-32
Block size: 64B-128MB

Buffer Support

Tasks support buffers with the following features:

Buffer Type	Source Buffer	Destination Buffer
Linked list buffer	Depends on the device; check the `max_buf_list_len` capability	Depends on the device; check the `max_buf_list_len` capability
Local mmap buffer	Yes	Yes
Mmap from PCIe export buffer	Yes	Yes
Mmap from RDMA export buffer	No	No

Execution Phase

This section describes execution on CPU or DPU using the DOCA Core Progress Engine .

Matrix Generate

All tasks require a coding matrix.

Matrix Type

DOCA EC provides 2 matrix types which are elaborated on in the following subsections.

Cauchy

Cauchy encoding matrix is constructed so that

6b2ccda51bf76da88d43fbbd1f7e69ffd9ac5646d843cf7ad57c7823a8313127.svg

Where:

Vandermonde

Vandermonde encoding matrix is constructed so that

6c2033d7976a993ef6dedd5f930bffb94c96823cde058177c188eb5f4b5d7df8.svg

Where:

Warning

Vandermonde matrix does not guarantee that every submatrix is invertible (i.e., the decode task may fail in some settings).

Matrix Functionality

Create

An encoding matrix is necessary for executing the create task, to create redundancy blocks.

The matrices used for updates and recovery are based on an encoding matrix.

The following subsections describe the available options for creating matrices.

Generic

Generic creation, with the doca_ec_matrix_create() function, is used for simple setup using one of matrix types provided by the library.

Input:

Name	Description
Type	One of matrix types provided by the library
Data block count	The number of original data blocks
Redundancy block count	The number of redundancy blocks

Custom

Custom creation, with the doca_ec_matrix_create_from_raw() function, is used if the desired type of matrix is not provided by the library.

Input:

Name	Description	Notes
Data	The data of a coding matrix	The size of the data should be `data_block_count`*`rdnc_block_count`
Data block count	The number of original data blocks	–
Redundancy block count	The number of redundancy blocks	–

Update

This matrix is necessary for executing the update task, to update the redundancy blocks after a change in the data blocks.

The matrix is created using the doca_ec_matrix_create_update() function.

Input:

Name	Description	Notes
Coding matrix	A coding matrix created by `doca_ec_matrix_create()` or `doca_ec_matrix_create_from_raw()`	–
Update indices	An array specifying the indices of the updated data blocks	The indices must be in ascending order The indices should match the order of the data blocks in the matrix creation function
Number of updates	The number of updated blocks. The length of the update indices array.	–

Recover

This matrix is necessary for executing the recover task, to recover original data blocks.

The matrix is created using the doca_ec_matrix_create_recover() function.

Input:

Name	Description	Notes
Coding matrix	A coding matrix created by `doca_ec_matrix_create()` or `doca_ec_matrix_create_from_raw()`	–
Missing indices	An array specifying the indices of the missing data blocks	The indices must be in ascending order The indices should match the order of the data blocks in the matrix creation function
Number of missing	The number of updated blocks. The length of the update indices array.	–

Tasks

Task Batching

DOCA Erasure Coding supports task batching mode, which is a task submit mode of work that allows aggregating multiple DOCA tasks of the same type and handling them as a single unit.

Info

For more information on task batching, refer to DOCA Core Task.

DOCA Erasure Coding supports the flags DOCA_TASK_SUBMIT_FLAG_NONE, DOCA_TASK_SUBMIT_FLAG_FLUSH and DOCA_TASK_SUBMIT_FLAG_OPTIMIZE_REPORTS.

Galois Mul Task

This task executes Galois multiplication between the original blocks and the coding matrix.

Task Configuration

Description	API to Set the Configuration	API to Query Support
Enable the task	`doca_ec_task_galois_mul_set_conf`	`doca_ec_cap_task_galois_mul_is_supported`
Maximum block size	–	`doca_ec_cap_get_max_block_size`
Maximum buffer list length	–	`doca_ec_cap_get_max_buf_list_len`

Task Input

Common input as described in DOCA Core Task.

Name	Description	Notes
coding matrix	A coding matrix as created by `doca_ec_matrix_create()` or `doca_ec_matrix_create_from_raw()`	–
source buffer	Source original data buffer, holding a sequence containing all original blocks (e.g., `block_1`, `block_2`, etc.); the order matters	The data length of `src_buf` should be a multiplication of the block size The data length should also be aligned to 64B and with a minimum size of 64B
destination buffer	A destination buffer for the multiplication outcome blocks. T he sequence containing all multiplication outcome blocks ( `dst_block_1`, `dst_block_2`, etc.) is written to it upon successful completion of the task.	The data is written to the tail segment extending the data segment The minimal available memory in `dst_buf` should be the number of redundancy blocks * the block size, aligned to 64B and, in any case, at least 64B.

Note

If a Galois multiplication task matrix is 10x4 (i.e., 10 original blocks, 4 multiplication outcome blocks), and the block size is 64KB:

src_buf data length should be 10x64KB = 640KB
The available memory for writing in dst_buf should be at least 4x64KB = 256KB

Task Output

Common output as described in DOCA Core Task .

Task Completion Success

After the task completes successfully, the following happens:

The destination buffer holds a sequence containing all multiplication outcome blocks (e.g., dst_block_1, dst_block_2 , etc.)
The destination buffer data segment is extended to include the outcome blocks

Task Completion Failure

If the task fails midway:

The context may enter stopping state if a fatal error occurs
The source and destination doca_buf objects are not modified
The destination buffer contents may be modified

Task Limitations

The operation is not atomic
Once the task has been submitted, the source and destination buffer should not be read from/written to
Source and destination buffers must not overlap
Other limitations are described in DOCA Core Task

Create Task

This task creates redundancy blocks for the given original data blocks using a given coding matrix.

Task Configuration

Description	API to Set the Configuration	API to Query Support
Enable the task	`doca_ec_task_create_set_conf`	`doca_ec_cap_task_create_is_supported`
Maximum block size	–	`doca_ec_cap_get_max_block_size`
Maximum buffer list length	–	`doca_ec_cap_get_max_buf_list_len`

Task Input

Common input as described in DOCA Core Task.

Name	Description	Notes
coding matrix	A coding matrix created by `doca_ec_matrix_create()` or `doca_ec_matrix_create_from_raw()`	–
original data blocks	Source original data buffer, holding a sequence containing all original blocks (`block_1`, `block_2`, etc.); the order matters	The data length of `original_data_blocks` should be a multiplication of the block size The data length should also be aligned to 64B and with a minimum size of 64B
redundancy blocks	A destination buffer for the redundancy blocks. The sequence containing all redundancy blocks (`rdnc_block_1`, `rdnc_block_2`, etc.) is written to it upo n successful completion of the task.	The data will be written to the tail segment extending the data segment The minimal available memory in `rdnc_blocks` should be the number of redundancy blocks * the block size, aligned to 64B and, in any case, at least 64B

Note

If a create task matrix is 10x4 (i.e., 10 original blocks, 4 redundancy blocks), and the block size is 64KB:

original_data_blocks data length should be 10x64KB = 640KB
The available memory for writing in redundancy_blocks should be at least 4x64KB = 256KB

Task Output

Common output as described in DOCA Core Task .

Task Completion Success

After the task completes successfully, the following happens:

The destination buffer holds a sequence containing all redundancy blocks (rdnc_block_1, rdnc_block_2, etc.)
The destination buffer data segment is extended to include the redundancy blocks

Task Completion Failure

If the task fails midway:

The context may enter stopping state if a fatal error occurs
The source and destination doca_buf objects are not modified
The destination buffer contents may be modified

Task Limitations

The operation is not atomic
Once the task is submitted, the source and destination buffers should not be read from/written to
Source and destination buffers must not overlap
Other limitations are described in DOCA Core Task

Update Task

This task executes updates the redundancy blocks for the given original data blocks, using an update coding matrix.

Task Configuration

Description	API to Set the Configuration	API to Query Support
Enable the task	`doca_ec_task_update_set_conf`	`doca_ec_cap_task_update_is_supported`
Maximum block size	–	`doca_ec_cap_get_max_block_size`
Maximum buffer list length	–	`doca_ec_cap_get_max_buf_list_len`

Task Input

Common input as described in DOCA Core Task.

Name	Description	Notes
update matrix	An update coding matrix created by `doca_ec_matrix_create_update()` or `doca_ec_matrix_create_from_raw()`	-
original updated and RDNC blocks	A source buffer with data, holding a sequence containing the original data block and its updated data block, for each block that was updated, followed by the old redundancy blocks (`old_data_block_i`, `updated_data_block_i`, `old_data_block_j`, `updated_data_block_j`, ..., `rdnc_block_1`, `rdnc_block_2`, etc.)	The data length of `original_updated_and_rdnc_blocks` should be a multiplication of the block size The data length should also be aligned to 64B and with a minimum size of 64B
updated RDNC blocks	A destination buffer for the updated redundancy blocks. The sequence containing the updated redundancy blocks ( `rdnc_block_1`, `rdnc_block_2`, etc.) is written to it upo n successful completion of the task	The data is written to the tail segment extending the data segment The minimal available memory in `updated_rdnc_blocks` should be the number of redundancy blocks * the block size, aligned to 64B and, in any case, at least 64B

Note

using an update task matrix, in which 3 data block were updated and there are 4 redundancy blocks, and the block size is 64KB:

original_updated_and_rdnc_blocks data length should be (3+3+4=10)x64KB = 640KB
The available memory for writing in updated_rdnc_blocks should be at least 4x64KB = 256KB

Task Output

Common output as described in DOCA Core Task.

Task Completion Success

After the task completes successfully, the following happens:

The destination buffer holds a sequence containing the updated redundancy blocks (rdnc_block_1, rdnc_block_2, etc.)
The destination buffer data segment is extended to include the updated redundancy blocks

Task Completion Failure

If the task fails midway:

The context may enter stopping state if a fatal error occurs
The source and destination doca_buf objects is not modified
The destination buffer contents may be modified

Task Limitations

The operation is not atomic
Once the task has been submitted, the source and destination buffers should not be read from/written to
Source and destination buffers must not overlap
Other limitations described in DOCA Core Task

Recover Task

This task executes recovers data blocks for, using given available original data blocks and redundancy blocks and a given coding matrix.

Task Configuration

Description	API to Set the Configuration	API to Query Support
Enable the task	`doca_ec_task_recover_set_conf`	`doca_ec_cap_task_recover_is_supported`
Maximum block size	–	`doca_ec_cap_get_max_block_size`
Maximum buffer list length	–	`doca_ec_cap_get_max_buf_list_len`

Task Input

Common input as described in DOCA Core Task.

Name	Description	Notes
recover matrix	A coding matrix create by `doca_ec_matrix_create()` or `doca_ec_matrix_create_from_raw()`	–
available blocks	A source buffer with data, holding a sequence containing available data blocks and redundancy blocks (`data_block_a`, `data_block_b`, `data_block_c`, ..., `rdnc_block_x`, `rdnc_block_y`, etc.)	The total number of blocks given should be equal to the number of original data blocks The data length of `available_blocks` should be a multiplication of the block size The data length should also be aligned to 64B and with a minimum size of 64B
recovered data blocks	A destination buffer for the recovered data blocks. The sequence containing the recovered data blocks (`data_block_i`, `data_block_j`, etc.) is written to it upo n successful completion of the task	The data is written to the tail segment extending the data segment The minimal available memory in `recovered_data_blocks` should be the number of missing data blocks * the block size, aligned to 64B and, in any case, at least 64B.

Note

Using a recover task matrix, based on an original 10x4 coding matrix (i.e., 10 original blocks, 4 redundancy blocks), and a block size of 64KB:

10 available blocks should be given in total (e.g., 7 data blocks and 3 redundancy blocks)
available_blocks data length should be 10x64KB = 640KB
The available memory for writing in recovered_data_blocks should be at least 3x64KB = 192KB

Task Output

Common output as described in DOCA Core Task.

Task Completion Success

After the task is completed successfully t he data is transformed to destination.

Task Completion Failure

If the task fails midway:

The context may enter stopping state if a fatal error occurs
The source and destination doca_buf objects are not modified
The destination buffer contents may be modified

Task Limitations

The operation is not atomic
Once the task is submitted, the source and destination buffers should not be read from/written to
Source and destination must not overlap
The number of blocks that can be recovered is limited to the number of redundancy blocks created
Other limitations are described in DOCA Core Task