ExperimentConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
string |
Name of model if invoking task via |
|||||
|
string |
Key for encrypting model checkpoints |
|||||
|
string |
Path to where all the assets generated from a task are stored |
|||||
|
collection |
False |
|||||
|
collection |
Configurable parameters to construct the model for the NVPanoptix3D experiment |
False |
||||
|
collection |
Configurable parameters to construct the dataset for the NVPanoptix3D experiment |
False |
||||
|
collection |
Configurable parameters to construct the trainer for the NVPanoptix3D experiment |
False |
||||
|
collection |
Configurable parameters to construct the inferencer for the NVPanoptix3D experiment |
False |
||||
|
collection |
Configurable parameters to construct the evaluator for the NVPanoptix3D experiment |
False |
||||
|
collection |
Configurable parameters to construct the exporter for the NVPanoptix3D experiment |
False |
||||
|
collection |
Configurable parameters to construct the TensorRT engine builder for a NVPanoptix3D experiment |
False |
WandBConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
bool |
True |
|||||
|
string |
TAO Toolkit |
|||||
|
string |
||||||
|
string |
||||||
|
list |
[‘tao-toolkit’] |
False |
||||
|
bool |
False |
|||||
|
bool |
False |
|||||
|
bool |
False |
|||||
|
string |
TAO Toolkit Training |
|||||
|
string |
NVPanoptix3DModelConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
collection |
Configuration hyper parameters for the NVPanoptix3D Backbone |
False |
||||
|
collection |
Configuration hyper parameters for the Mask2Former Semantic Segmentation Head |
False |
||||
|
collection |
Configuration hyper parameters for the Mask2Former model |
False |
||||
|
collection |
Configuration hyper parameters for the Frustum3D model |
False |
||||
|
collection |
Configuration hyper parameters for the Projection model |
False |
||||
|
categorical |
Segmentation mode |
panoptic |
panoptic,instance,semantic |
|||
|
float |
The value of the threshold to be used when filtering out the object mask |
0.4 |
||||
|
float |
The value of the threshold to be used when evaluating overlap |
0.5 |
||||
|
int |
Keep topk instances per image for instance segmentation |
100 |
NVPanoptix3DDatasetConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
collection |
Configurable parameters to construct the train dataset |
False |
||||
|
collection |
Configurable parameters to construct the validation dataset |
False |
||||
|
collection |
Configurable parameters to construct the test dataset |
False |
||||
|
int |
The number of parallel workers processing data |
8 |
1 |
|||
|
bool |
Flag to allocate pagelocked memory for faster of data between the CPU and GPU |
True |
||||
|
collection |
Configuration parameters for data augmentation |
False |
||||
|
bool |
Flag to enable contiguous IDs for labels |
False |
||||
|
string |
A path to label map file |
|||||
|
categorical |
Dataset name |
front3d |
front3d,matterport,synthetic_hospital,synthetic_warehouse |
|||
|
int |
Downsample factor (1: Synthetic & Front3D, 2: Matterport3D) |
1 |
||||
|
float |
ISO value to reconstruct mesh from TUDF volume |
1.0 |
||||
|
int |
Ignore label value |
255 |
||||
|
int |
Minimum number of pixels required for an instance to be considered valid |
200 |
||||
|
string |
Image format |
RGB |
||||
|
list |
Input image size to resize |
[320, 240] |
False |
|||
|
list |
Image size to process at 3D stage |
[160, 120] |
False |
|||
|
list |
Input depth size to resize |
[120, 160] |
False |
|||
|
bool |
Enable depth truncation in bounds |
False |
||||
|
float |
Min depth value |
0.4 |
||||
|
float |
Max depth value |
6.0 |
||||
|
string |
Relative frustum mask path |
meta/frustum_mask.npz |
||||
|
list |
Value to create occuppancy volume from TUDF volume |
[8.0, 6.0] |
False |
|||
|
list |
truncation range for TUDF volume |
[0.0, 12.0] |
False |
|||
|
bool |
Enable 3d for training |
False |
||||
|
bool |
Enable multi-plane occupancy |
True |
||||
|
float |
Depth scale |
25.0 |
||||
|
int |
Number of thing classes |
9 |
NVPanoptix3DTrainExpConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
int |
The number of GPUs to run the train job |
1 |
1 |
|||
|
list |
List of GPU IDs to run the training on; length of list must equal |
[0] |
False |
|||
|
int |
Number of nodes to run the training on; if > 1, multi-node is enabled |
1 |
1 |
|||
|
int |
Seed for the initializer in PyTorch; if < 0, fixed seed is disabled |
1234 |
-1 |
inf |
||
|
collection |
False |
|||||
|
int |
Number of epochs to run the training |
10 |
1 |
inf |
||
|
int |
The interval (in epochs) at which a checkpoint is saved |
1 |
1 |
|||
|
categorical |
The unit of the checkpoint interval |
epoch |
epoch,step |
|||
|
int |
The interval (in epochs) at which an evaluation is triggered on the validation set |
1 |
1 |
|||
|
string |
Path to the checkpoint to resume training from |
|||||
|
string |
The folder in which to save the experiment |
|||||
|
string |
Path to 2D stage checkpoint to initialize the 3D stage training |
|||||
|
string |
Path to 3D stage checkpoint to initialize the 3D stage training |
|||||
|
int |
The number of iterations between validation checks |
5 |
||||
|
list |
|
[] |
False |
|||
|
float |
Amount to clip the gradient by L2 Norm |
0.1 |
||||
|
|||||||
|
string |
Gradient clip type |
full |
||||
|
bool |
Whether to run the trainer in Dry Run mode |
False |
||||
|
collection |
Hyper parameters to configure the optimizer |
False |
||||
|
categorical |
Precision to run the training on |
fp32 |
fp16,fp32 |
|||
|
categorical |
|
ddp |
ddp,fsdp |
|||
|
bool |
|
True |
||||
|
bool |
Flag to enable printing of detailed learning rate scaling from the optimizer |
False |
||||
|
int |
Number of iterations per epoch |
NVPanoptix3DInferenceExpConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
int |
The number of GPUs to run the evaluation job |
1 |
1 |
|||
|
list |
List of GPU IDs to run the inference on; length of list must equal |
[0] |
False |
|||
|
int |
Number of nodes to run the inference on; if > 1, multi-node is enabled |
1 |
1 |
|||
|
string |
Path to the checkpoint file used for inference |
|||||
|
string |
Path to the TensorRT engine folder to be used for inference |
|||||
|
string |
Path to where all the assets generated from a task are stored |
|||||
|
int |
The batch size of the input tensor; important if |
-1 |
-1 |
|||
|
categorical |
Mode to run inference |
panoptic |
semantic,instance,panoptic |
|||
|
string |
Path to the images directory |
NVPanoptix3DEvaluateExpConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
int |
The number of GPUs to run the evaluation job |
1 |
1 |
|||
|
list |
List of GPU IDs to run the evaluation on; length of list must equal |
[0] |
False |
|||
|
int |
Number of nodes to run the evaluation on; if > 1, multi-node is enabled |
1 |
1 |
|||
|
string |
Path to the checkpoint file used for evaluation |
|||||
|
string |
Path to the TensorRT engine to be used for evaluation; only works with |
|||||
|
string |
Path to where all the assets generated from a task are stored |
|||||
|
int |
The batch size of the input tensor; important if |
-1 |
-1 |
NVPanoptix3DExportExpConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
string |
Path to where all the assets generated from a task are stored |
|||||
|
int |
The index of the GPU used to build the TensorRT engine |
0 |
||||
|
string |
Path to the checkpoint file to run export |
??? |
||||
|
string |
Path to the ONNX model file |
??? |
||||
|
bool |
Flag to export CPU compatible model |
False |
||||
|
ordered_int |
Number of channels in the input tensor |
3 |
1 |
1,3 |
||
|
int |
Width of the input image tensor |
960 |
32 |
|||
|
int |
Height of the input image tensor |
544 |
32 |
|||
|
int |
|
17 |
1 |
|||
|
int |
|
-1 |
-1 |
|||
|
bool |
Flag to enable verbose TensorRT logging |
False |
||||
|
categorical |
File format to export to |
onnx |
onnx,xdl |
|||
|
string |
Path to the ONNX model 2D file |
|||||
|
string |
Path to the ONNX model 3D file |
|||||
|
int |
The maximum number of voxels in the input tensor for the engine |
700000 |
1 |
NVPanoptix3DGenTRTEngineExpConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
string |
Path to where all the assets generated from a task are stored |
|||||
|
int |
The index of the GPU used to build the TensorRT engine |
0 |
0 |
|||
|
string |
Path to the ONNX model file |
??? |
||||
|
string |
Path to the generated TensorRT engine; only works with |
??? |
||||
|
string |
|
|||||
|
int |
|
-1 |
-1 |
|||
|
bool |
Flag to enable verbose TensorRT logging |
False |
||||
|
collection |
Hyper parameters to configure the NVPanoptix3D TensorRT Engine builder |
False |
Backbone Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
categorical |
Type of backbone to use. Available backbone: vggt |
vggt |
vggt |
|||
|
string |
Path to a pretrained backbone file |
SemanticSegmentationHead Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
int |
Common stride |
4 |
2 |
|||
|
int |
Number of transformer encoder layers |
6 |
1 |
|||
|
int |
Convolutional layer dimension |
256 |
1 |
|||
|
int |
Mask head dimension |
256 |
1 |
|||
|
int |
Depth head dimension |
256 |
1 |
|||
|
int |
Ignore value |
255 |
0 |
255 |
||
|
list |
List of feature names for deformable transformer encoder input |
[‘res3’, ‘res4’, ‘res5’] |
False |
|||
|
int |
Number of classes |
13 |
1 |
|||
|
string |
Norm layer type |
GN |
||||
|
list |
List of input feature names |
[‘res2’, ‘res3’, ‘res4’, ‘res5’] |
False |
MaskFormer Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
float |
The probability to drop out |
0 |
0.0 |
1.0 |
||
|
int |
Number of heads |
8 |
||||
|
int |
The number of queries |
100 |
1 |
inf |
||
|
int |
Dimension of the hidden units |
256 |
||||
|
int |
Dimension of the feedforward network in the transformer |
1024 |
1 |
|||
|
int |
Dimension of the feedforward network |
2048 |
1 |
|||
|
int |
Number of decoder layers in the transformer |
10 |
1 |
|||
|
bool |
Whether to add layer norm in the encoder; 1=add layer norm, 0=do not add |
0 |
||||
|
float |
The relative weight of the classification error in the matching cost |
2 |
0.0 |
inf |
||
|
float |
The relative weight of the focal loss of the binary mask in the matching cost |
5 |
0.0 |
inf |
||
|
float |
The relative weight of the dice loss of the binary mask in the matching cost |
5 |
0.0 |
inf |
||
|
float |
The relative weight of the depth loss in the matching cost |
5 |
0.0 |
inf |
||
|
float |
The relative weight of the mp occ loss in the matching cost |
5 |
0.0 |
inf |
||
|
int |
The number of points to sample |
12544 |
||||
|
float |
Oversampling parameter |
3 |
||||
|
float |
Ratio of points that are sampled via important sampling |
0.75 |
||||
|
bool |
Flag to enable deep supervision |
1 |
||||
|
float |
The relative classification weight applied to the no-object category |
0.1 |
||||
|
int |
Size divisibility |
32 |
Frustum3D Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
float |
The truncation value |
3.0 |
||||
|
float |
The iso recon value |
2.0 |
||||
|
float |
The weight of the panoptic loss |
25.0 |
||||
|
list |
The weights of the completion loss |
[50.0, 25.0, 10.0] |
False |
|||
|
float |
The weight of the surface loss |
5.0 |
||||
|
int |
The number of output channels of the UNet |
16 |
||||
|
int |
The number of features of the UNet |
16 |
||||
|
bool |
Whether to use multi-scale |
False |
||||
|
int |
The number of grid dimensions |
256 |
||||
|
int |
The number of frustum dimensions |
256 |
||||
|
int |
The number of signed channel |
3 |
Projection Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
float |
The size of the voxel |
0.03 |
||||
|
bool |
Whether to use signed channel |
1 |
||||
|
int |
The dimension of the depth feature |
256 |
Dataset Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
string |
Root directory of the dataset |
|||||
|
string |
JSON file for image/mask pair |
|||||
|
int |
Batch size |
1 |
1 |
|||
|
int |
Number of workers in the dataloader |
1 |
0 |
Dataset Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
string |
Root directory of the dataset |
|||||
|
string |
JSON file for image/mask pair |
|||||
|
int |
Batch size |
1 |
1 |
|||
|
int |
Number of workers in the dataloader |
1 |
0 |
Dataset Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
string |
Root directory of the dataset |
|||||
|
string |
JSON file for image/mask pair |
|||||
|
int |
Batch size |
1 |
1 |
|||
|
int |
Number of workers in the dataloader |
1 |
0 |
AugmentationConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
list |
A list of sizes to perform random resize |
[448] |
False |
|||
|
int |
The maximum random crop size for training data |
768 |
32 |
960 |
||
|
list |
The random crop size for training data in [H, W] |
[240, 240] |
False |
|||
|
int |
The minimum resize size for test data |
240 |
32 |
960 |
||
|
int |
The maximum resize size for test |
960 |
32 |
960 |
||
|
bool |
Color augmentation |
False |
||||
|
bool |
Enable cropping for input image |
False |
||||
|
list |
Size to crop input image |
[240, 240] |
False |
|||
|
float |
Maximum ratio of crop area that can be occupied by a single semantic category |
1.0 |
0.0 |
1.0 |
||
|
string |
Flip horizontal/vertical |
|||||
|
float |
Flip probability |
0.5 |
0.0 |
1.0 |
||
|
float |
Size divisibility to pad |
-1 |
||||
|
float |
Weight for generated augmentation, 0.0 will disable generated augmentation |
0.0 |
0.0 |
1.0 |
CuDNNConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
bool |
Whether to enable cuDNN benchmark mode |
False |
||||
|
bool |
Whether to enable cuDNN deterministic mode |
True |
OptimConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
categorical |
Type of optimizer used to train the network |
AdamW |
AdamW |
|||
|
categorical |
The metric value to be monitored for the |
val_loss |
val_loss,train_loss |
|||
|
float |
The initial learning rate for training the model |
0.0002 |
0.0 |
1.0 |
True |
|
|
float |
A multiplier for backbone learning rate |
0.1 |
0.0 |
1.0 |
True |
|
|
float |
The momentum for the AdamW optimizer |
0.9 |
0.0 |
1.0 |
True |
|
|
float |
The weight decay coefficient |
0.05 |
0.0 |
1.0 |
True |
|
|
categorical |
|
MultiStep |
MultiStep,Warmuppoly |
|||
|
list |
Learning rate decay epochs |
[88, 96] |
False |
|||
|
float |
Multiplicative factor of learning rate decay |
0.1 |
||||
|
int |
The maximum number of steps to train the model |
160000 |
||||
|
float |
The warmup factor for the learning rate scheduler |
1.0 |
||||
|
int |
The number of warmup iterations |
0 |
NVPanoptix3DTrtConfig Fields#
Field |
value_type |
description |
default_value |
valid_min |
valid_max |
valid_options |
automl_enabled |
|---|---|---|---|---|---|---|---|
|
int |
|
1024 |
0 |
|||
|
int |
|
1 |
1 |
|||
|
int |
|
1 |
1 |
|||
|
int |
|
1 |
1 |
|||
|
list |
The list to specify layer precision |
[] |
False |
|||
|
categorical |
The precision to be set for building the TensorRT engine |
FP32 |
FP32,FP16 |