Optical Character Recognition Network

TAO Toolkit v5.2.0

The OCRNet model recognizes optical characters or text on images.

The training algorithm optimizes the network to minimize the connectionist temporal classification (CTC) loss between a ground truth characters sequence of a text image and a predicted characters sequence. Then characters will be decoded from the sequence output of the model through best path decoding method (greedy decoding).

The primary use case for this model is recognizing against characters or text on images.

The datasheet for the model is captured in its model card hosted at NGC.

Previous Optical Character Detection Network
Next PeopleSegNet
© Copyright 2024, NVIDIA. Last updated on Mar 18, 2024.