NVIDIA Docs Hub NVIDIA TAO NVIDIA TAO Toolkit v5.2.0 Optical Character Recognition Network

Optical Character Recognition Network

The OCRNet model recognizes optical characters or text on images.

Training Algorithm

The training algorithm optimizes the network to minimize the connectionist temporal classification (CTC) loss between a ground truth characters sequence of a text image and a predicted characters sequence. Then characters will be decoded from the sequence output of the model through best path decoding method (greedy decoding).

Intended Use Case

The primary use case for this model is recognizing against characters or text on images.

The datasheet for the model is captured in its model card hosted at NGC.

Previous Optical Character Detection Network

Next PeopleSegNet