Open Images Pre-trained Object Detection
Object detection is a popular computer vision technique that can detect one or multiple objects in a frame. Object detection will recognize the individual objects in an image and places bounding boxes around the object. This model object contains pretrained weights that may be used as a starting point with the following object detection networks in TAO Toolkit to facilitate transfer learning.
YOLOv3
YOLOv4
YOLOv4-tiny
FasterRCNN
SSD
DSSD
RetinaNet
It is trained on a subset of the Google OpenImages dataset.
The following backbones are supported with these detection networks:
resnet10/resnet18/resnet34/resnet50/resnet101
vgg16/vgg19
googlenet
mobilenet_v1/mobilenet_v2
squeezenet
darknet19/darknet53
efficientnet_b0
cspdarknet19/cspdarknet53
cspdarknet_tiny
Some combinations might not be supported. See the matrix below for all supported combinations.
Object Detection |
|||||||
Backbone |
FasterRCNN |
SSD |
YOLOv3 |
RetinaNet |
DSSD |
YOLOv4 |
YOLOv4-tiny |
ResNet10/18/34/50/101 |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
|
VGG 16/19 |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
|
GoogLeNet |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
|
MobileNet V1/V2 |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
|
SqueezeNet |
Yes |
Yes |
Yes |
Yes |
Yes |
||
DarkNet 19/53 |
Yes |
Yes |
Yes |
Yes |
Yes |
Yes |
|
CSPDarkNet 19/53 |
Yes |
||||||
CSPDarkNet-tiny |
Yes |
||||||
Efficientnet B0 |
Yes |
Yes |
Yes |
Yes |
|||
Efficientnet B1 |
Yes |
These are unpruned models with just the feature extractor weights, and may not be used without retraining to deploy in a classification application.
Please make sure to set the
all_projections
field toFalse
in the spec file when training a ResNet101 model.
For more instructions on downloading and using the models defined here, refer to the NGC catalog page.