ExperimentConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
string |
Name of model if invoking task via |
|||||
|
string |
Key for encrypting model checkpoints |
|||||
|
string |
Path to where all the assets generated from a task are stored |
/results |
||||
|
collection |
Configurable parameters to construct the wandb client for a DepthNet experiment |
FALSE |
||||
|
collection |
Configurable parameters to construct the dataset for a DepthNet experiment |
FALSE |
||||
|
collection |
Configurable parameters to construct the model for a DepthNet experiment |
FALSE |
||||
|
collection |
Configurable parameters to construct the inferencer for a DepthNet experiment |
FALSE |
||||
|
collection |
Configurable parameters to construct the evaluator for a DepthNet experiment |
FALSE |
||||
|
collection |
Configurable parameters to construct the trainer for a DepthNet experiment |
FALSE |
||||
|
collection |
Configurable parameters to construct the onnx export for a DepthNet experiment |
FALSE |
||||
|
collection |
Configurable parameters to construct the TensorRT engine builder for a DepthNet experiment |
FALSE |
WandBConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
bool |
True |
|||||
|
string |
TAO Toolkit |
|||||
|
string |
||||||
|
list |
[‘tao-toolkit’] |
FALSE |
||||
|
bool |
FALSE |
|||||
|
bool |
FALSE |
|||||
|
bool |
FALSE |
|||||
|
string |
TAO Toolkit Training |
|||||
|
string |
DepthNetDatasetConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
categorical |
Dataset name |
StereoDataset |
MonoDataset,StereoDataset |
|||
|
bool |
Whether to normalize depth |
FALSE |
||||
|
float |
Maximum depth in meters in MetricDepthAnythingV2 |
1.0 |
inf |
|||
|
float |
Minimum depth in meters in MetricDepthAnythingV2 |
0.0 |
inf |
|||
|
int |
Maximum allowed disparity for which we compute losses during training |
416 |
1 |
416 |
||
|
float |
Baseline for stereo datasets |
0.193001 |
0.0 |
inf |
||
|
float |
Focal length along x-axis |
1998.842 |
0.0 |
inf |
||
|
collection |
Configurable parameters to construct the train dataset for a DepthNet experiment |
FALSE |
||||
|
collection |
Configurable parameters to construct the val dataset for a DepthNet experiment |
FALSE |
||||
|
collection |
Configurable parameters to construct the test dataset for a DepthNet experiment |
FALSE |
||||
|
collection |
Configurable parameters to construct the infer dataset for a DepthNet experiment |
FALSE |
DepthNetModelConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
categorical |
Network name |
MetricDepthAnythingV2 |
FoundationStereo,MetricDepthAnything,RelativeDepthAnything |
|||
|
collection |
Network defined paths for Monocular DepthNet Backbone |
FALSE |
||||
|
collection |
Network defined paths for Edgenext and Depthanythingv2 |
FALSE |
||||
|
list |
Hidden dimensions |
[128, 128, 128] |
FALSE |
|||
|
int |
Width of the correlation pyramid |
4 |
1 |
TRUE |
||
|
int |
cv group |
8 |
1 |
TRUE |
||
|
int |
Train iteration |
22 |
1 |
TRUE |
||
|
int |
Validation iteration |
22 |
1 |
|||
|
int |
Volume dimension |
32 |
1 |
TRUE |
||
|
int |
reduce memory usage |
0 |
0 |
4 |
||
|
bool |
Whether to use mixed precision training |
FALSE |
||||
|
int |
Number of hidden GRU levels |
3 |
1 |
3 |
||
|
int |
Number of levels in the correlation pyramid |
2 |
1 |
2 |
||
|
int |
Resolution of the disparity field (1/2^K) |
2 |
1 |
2 |
||
|
categorical |
DepthAnythingV2 Encoder options |
vitl |
vits,vitl |
|||
|
int |
Maximum disparity of the model used in the training of a stereo model |
416 |
DepthNetInferenceExpConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
int |
Number of GPUs to run the inference job. |
1 |
1 |
|||
|
list |
List of GPU IDs to run the inference on. The length of this list
must be equal to the number of gpus in |
[0] |
FALSE |
|||
|
int |
Number of nodes to run the inference on. If > 1, then multi-node is enabled. |
1 |
1 |
|||
|
string |
Path to the checkpoint used for inference. |
??? |
||||
|
string |
Path to the TensorRT engine to be used for inference.
This only works with |
|||||
|
string |
Path to where all the assets generated from a task are stored. |
|||||
|
int |
Batch size of the input Tensor. This is important if batch_size > 1 for a large dataset. |
-1 |
-1 |
|||
|
float |
Value of the confidence threshold to be used when filtering out the final list of boxes. |
0.5 |
||||
|
int |
Width of the input image tensor. |
1 |
||||
|
int |
Height of the input image tensor. |
1 |
||||
|
bool |
Whether to save the raw pfm output during inference. |
FALSE |
DepthNetEvalExpConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
int |
Number of GPUs to run the evaluation job. |
1 |
1 |
|||
|
list |
List of GPU IDs to run the evaluation on. The length of this list
must be equal to the number of |
[0] |
FALSE |
|||
|
int |
Number of nodes to run the evaluation on. If > 1, then multi-node is enabled. |
1 |
1 |
|||
|
string |
Path to the checkpoint used for evaluation. |
??? |
||||
|
string |
Path to the TensorRT engine to be used for evaluation.
This only works with |
|||||
|
string |
Path to where all the assets generated from a task are stored. |
|||||
|
int |
Batch size of the input Tensor. This is important if |
-1 |
-1 |
|||
|
int |
Width of the input image tensor. |
736 |
1 |
|||
|
int |
Height of the input image tensor. |
320 |
1 |
DepthNetTrainExpConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
int |
Number of GPUs to run the train job. |
1 |
1 |
|||
|
list |
List of GPU IDs to run the training on. The length of this list must be equal to the number of gpus in train.num_gpus. |
[0] |
FALSE |
|||
|
int |
Number of nodes to run the training on. If > 1, then multi-node is enabled. |
1 |
1 |
|||
|
int |
Seed for the initializer in PyTorch. If < 0, disable fixed seed. |
1234 |
-1 |
inf |
||
|
collection |
FALSE |
|||||
|
int |
Number of epochs to run the training. |
10 |
1 |
inf |
||
|
int |
Interval (in epochs) at which a checkpoint is to be saved; helps resume training. |
1 |
1 |
|||
|
categorical |
Unit of the checkpoint interval. |
epoch |
epoch,step |
|||
|
int |
Interval (in epochs) at which a evaluation will be triggered on the validation dataset. |
1 |
1 |
|||
|
string |
Path to the checkpoint from which to resume training. |
|||||
|
string |
Path to where all the assets generated from a task are stored. |
|||||
|
int |
Number of steps to save the checkpoint. |
|||||
|
string |
Path to a pretrained DepthNet model from which to initialize the current training. |
|||||
|
float |
Amount to clip the gradient by L2 Norm. A value of 0.0 specifies no clipping. |
0.1 |
||||
|
bool |
Whether to visualize the dataloader. |
FALSE |
TRUE |
|||
|
int |
Visualization interval in step. |
10 |
TRUE |
|||
|
bool |
Whether to run the trainer in Dry Run mode. This serves as a good means to validate the specification file and run a sanity check on the trainer without actually initializing and running the trainer. |
FALSE |
||||
|
collection |
Hyperparameters to configure the optimizer. |
FALSE |
||||
|
categorical |
Precision on which to run the training. |
fp32 |
bf16,fp32,fp16 |
|||
|
categorical |
Multi-GPU training strategy. DDP (Distributed Data Parallel) and Fully Sharded DDP are supported. |
ddp |
ddp,fsdp |
|||
|
bool |
Whether train is to recompute in backward pass to save GPU memory (TRUE) or store activations (FALSE). |
TRUE |
||||
|
bool |
Whether to display verbose logs to console. |
FALSE |
||||
|
bool |
Whether to use tiled inference, particularly for transformers which expect fixed size of sequences. |
FALSE |
||||
|
string |
Use tiled inference weight type. |
gaussian |
||||
|
list |
Minimum overlap for tile. |
[16, 16] |
FALSE |
|||
|
int |
Interval steps of logging training results and running validation numbers within one epoch. |
500 |
DepthNetExportExpConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
string |
Path to where all the assets generated from a task are stored. |
|||||
|
int |
Index of the GPU to build the TensorRT engine. |
0 |
||||
|
string |
Path to the checkpoint file to run export. |
??? |
||||
|
string |
Path to the onnx model file. |
??? |
||||
|
bool |
Whether to export CPU compatible model. |
FALSE |
||||
|
ordered_int |
Number of channels in the input Tensor. |
3 |
1 |
1,3 |
||
|
int |
Width of the input image tensor. |
960 |
32 |
|||
|
int |
Height of the input image tensor. |
544 |
32 |
|||
|
int |
Operator set version of the ONNX model used to generate TensorRT engine. |
17 |
1 |
|||
|
int |
Batch size of the input Tensor for the engine.
A value of |
-1 |
-1 |
|||
|
bool |
Whether to enable verbose TensorRT logging. |
FALSE |
||||
|
categorical |
File format to export to. |
onnx |
onnx,xdl |
|||
|
int |
Number of GRU iterations to export the model. |
22 |
1 |
DepthNetGenTrtEngineExpConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
string |
Path to where all the assets generated from a task are stored. |
|||||
|
int |
Index of the GPU to build the TensorRT engine. |
0 |
0 |
|||
|
string |
Path to the ONNX model file. |
??? |
||||
|
string |
Path to the TensorRT engine generated should be stored.
This only works with |
??? |
||||
|
string |
Path to a TensorRT timing cache that speeds up engine generation. This will be created/read/updated. |
|||||
|
int |
Batch size of the input tensor for the engine.
A value of |
-1 |
-1 |
|||
|
bool |
Whether to enable verbose TensorRT logging. |
FALSE |
||||
|
collection |
Hyperparameters to configure the TensorRT Engine builder. |
FALSE |
BaseDepthNetDatasetConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
list |
List of data sources for training:
|
[{‘dataset_name’: ‘’, ‘data_file’: ‘’}] |
FALSE |
|||
|
string |
Path to the image directory for tao-deploy |
|||||
|
string |
Path to the right image directory for tao-deploy |
|||||
|
string |
Path to the depth image directory for tao-deploy |
|||||
|
int |
Natch size for training and validation |
1 |
1 |
inf |
TRUE |
|
|
int |
Number of parallel workers processing data |
8 |
1 |
inf |
TRUE |
|
|
bool |
Whether to enable the dataloader to allocate pagelocked memory for faster data copying between the CPU and GPU |
TRUE |
||||
|
collection |
Configuration parameters for data augmentation |
FALSE |
BaseDepthNetDatasetConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
list |
List of data sources for training:
|
[{‘dataset_name’: ‘’, ‘data_file’: ‘’}] |
FALSE |
|||
|
string |
Path to the image directory for tao-deploy |
|||||
|
string |
Path to the right image directory for tao-deploy |
|||||
|
string |
Path to the depth image directory for tao-deploy |
|||||
|
int |
Batch size for training and validation |
1 |
1 |
inf |
TRUE |
|
|
int |
Number of parallel workers processing data |
8 |
1 |
inf |
TRUE |
|
|
bool |
Whether to enable the dataloader to allocate pagelocked memory for faster data copying between the CPU and GPU |
TRUE |
||||
|
collection |
Configuration parameters for data augmentation |
FALSE |
BaseDepthNetDatasetConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
list |
List of data sources for training:
|
[{‘dataset_name’: ‘’, ‘data_file’: ‘’}] |
FALSE |
|||
|
string |
Path to the image directory for tao-deploy |
|||||
|
string |
Path to the right image directory for tao-deploy |
|||||
|
string |
Path to the depth image directory for tao-deploy |
|||||
|
int |
Batch size for training and validation |
1 |
1 |
inf |
TRUE |
|
|
int |
Number of parallel workers processing data |
8 |
1 |
inf |
TRUE |
|
|
bool |
Whether to enable the dataloader to allocate pagelocked memory for faster data copying between the CPU and GPU |
TRUE |
||||
|
collection |
Configuration parameters for data augmentation |
FALSE |
BaseDepthNetDatasetConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
list |
List of data sources for training:
|
[{‘dataset_name’: ‘’, ‘data_file’: ‘’}] |
FALSE |
|||
|
string |
Path to the image directory for tao-deploy |
|||||
|
string |
Path to the right image directory for tao-deploy |
|||||
|
string |
Path to the depth image directory for tao-deploy |
|||||
|
int |
Batch size for training and validation |
1 |
1 |
inf |
TRUE |
|
|
int |
Number of parallel workers processing data |
8 |
1 |
inf |
TRUE |
|
|
bool |
Whether to enable the dataloader to allocate pagelocked memory for faster data copying between the CPU and GPU |
TRUE |
||||
|
collection |
Configuration parameters for data augmentation |
FALSE |
MonoBackBone Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
string |
Path to load DepthAnythingv2 as an encoder for Monocular DepthNet |
|||||
|
bool |
Whether to use batch normalization in Monocular DepthNet |
FALSE |
||||
|
bool |
Whether to use class token |
FALSE |
StereoBackBone Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
string |
Path to load DepthAnythingv2 as an encoder for Stereo DepthNet (FoundationStereo) |
|||||
|
string |
Path to load edgenext encoder for Stereo DepthNet (FoundationStereo) |
|||||
|
bool |
Whether to use batch normalization in DepthAnythingV2 |
FALSE |
||||
|
bool |
Whether to use class token |
FALSE |
CuDNNConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
bool |
FALSE |
|||||
|
bool |
TRUE |
OptimConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
categorical |
Type of optimizer used to train the network |
AdamW |
AdamW,SGD |
|||
|
categorical |
Metric value to be monitored for the |
val_loss |
val_loss,train_loss |
|||
|
float |
Initial learning rate for training the model, excluding the backbone |
0.0001 |
TRUE |
|||
|
float |
Momentum for the AdamW optimizer |
0.9 |
TRUE |
|||
|
float |
Weight decay coefficient |
0.0001 |
TRUE |
|||
|
categorical |
Learning scheduler:
|
MultiStepLR |
MultiStep,StepLR,CustomMultiStepLRScheduler,LambdaLR,PolynomialLR,OneCycleLR,CosineAnnealingLR |
|||
|
list |
Steps at which the learning rate must be decreased This is applicable only with the MultiStep LR |
[1000] |
FALSE |
|||
|
int |
Number of steps to decrease the learning rate in the StepLR |
1000 |
TRUE |
|||
|
float |
Decreasing factor for the learning rate scheduler |
0.1 |
TRUE |
|||
|
float |
Minimum learning rate value for the learning rate scheduler |
1e-07 |
TRUE |
|||
|
int |
Number of steps to perform linear learning rate” warm-up before engaging a learning rate scheduler |
20 |
0 |
inf |
DepthNetTrtConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
int |
Size in megabytes of the workspace TensorRT has for running its optimization tactics and generating the TensorRT engine |
1024 |
0 |
|||
|
int |
Minimum batch size in the optimization profile for the input tensor of the TensorRT engine |
1 |
1 |
|||
|
int |
Optimum batch size in the optimization profile for the input tensor of the TensorRT engine |
1 |
1 |
|||
|
int |
Maximum batch size in the optimization profile for the input tensor of the TensorRT engine |
1 |
1 |
|||
|
list |
List to specify layer precision |
[] |
FALSE |
|||
|
categorical |
Precision to be set for building the TensorRT engine |
FP32 |
FP32,FP16 |
DepthNetAugmentationConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
list |
Input mean for RGB frames |
[0.485, 0.456, 0.406] |
FALSE |
|||
|
list |
Input standard deviation per pixel for RGB frames |
[0.229, 0.224, 0.225] |
FALSE |
|||
|
list |
Crop size for input RGB images [height, width] |
[518, 518] |
FALSE |
|||
|
float |
Minimum scale in data augmentation |
-0.2 |
0.2 |
1 |
||
|
float |
Maximum scale in data augmentation |
0.4 |
-0.2 |
1 |
||
|
bool |
Whether to perform flip in data augmentation |
FALSE |
||||
|
float |
Probability for y jitter |
1.0 |
0.0 |
1.0 |
TRUE |
|
|
list |
Gamma range in data augmentation |
[1, 1, 1, 1] |
FALSE |
|||
|
float |
Probability for asymmetric color augmentation |
0.2 |
0.0 |
1.0 |
TRUE |
|
|
float |
Color jitter brightness |
0.4 |
0.0 |
1.0 |
||
|
float |
Color jitter contrast |
0.4 |
0.0 |
1.0 |
||
|
list |
Color jitter saturation |
[0.0, 1.4] |
FALSE |
|||
|
list |
Hue range in data augmentation |
[-0.027777777777777776, 0.027777777777777776] |
FALSE |
|||
|
float |
Probability for eraser augmentation |
0.5 |
0.0 |
1.0 |
TRUE |
|
|
float |
Probability for spatial augmentation |
1.0 |
0.0 |
1.0 |
TRUE |
|
|
float |
Probability for stretch augmentation |
0.8 |
0.0 |
1.0 |
TRUE |
|
|
float |
Maximum stretch augmentation |
0.2 |
0.0 |
1.0 |
||
|
float |
Probability for horizontal flip augmentation |
0.5 |
0.0 |
1.0 |
TRUE |
|
|
float |
Probability for vertical flip augmentation |
0.5 |
0.0 |
1.0 |
TRUE |
|
|
float |
Probability for horizontal shift augmentation |
0.5 |
0.0 |
1.0 |
TRUE |
|
|
float |
Probability for minimum crop valid disparity ratio |
0.0 |
0.0 |
1.0 |
TRUE |