Optical Character Recognition Network

NVIDIA TAO Release 4.0.1

The OCRNet model recognizes optical characters or text on images.

The training algorithm optimizes the network to minimize the connectionist temporal classification (CTC) loss between a ground truth characters sequence of a text image and a predicted characters sequence. Then characters will be decoded from the sequence output of the model through best path decoding method (greedy decoding).

The primary use case for this model is recognizing against characters or text on images.

The datasheet for the model is captured in its model card hosted at NGC.

© Copyright 2023, NVIDIA.. Last updated on Jul 27, 2023.