Release Notes for NVIDIA NIM for Image OCR#
This documentation contains the release notes for NVIDIA NIM for Image OCR.
Release 1.2.0#
Summary#
This is a General Access release of the NVIDIA NIM for Image OCR. This release contains the following changes:
Added the
NIM_TRITON_DYNAMIC_BATCHING_MAX_QUEUE_DELAY_MICROSECONDS
andNIM_TRITON_MAX_QUEUE_SIZE
environment variables.You can now use the
NIM_TRITON_OPTIMIZATION_MODE
environment variable to optimize for performance or VRAM.Renamed the
NIM_TRITON_MODEL_BATCH_SIZE
environment variable toNIM_TRITON_MAX_BATCH_SIZE
.Added processed images size (in MB) as usage metric in responses.
Reduced container image sizes.
Removed model profiles for A100 PCIe 40GB & H100 PCIe 80GB configurations.
Known Issues#
The
list-model-profiles
command incorrectly lists compatible model profiles as incompatible. Select the profile that matches your hardware configuration. This bug does not impact automatic profile selection.The
list-model-profiles
command fails to run on hosts that don’t have an NVIDIA GPUs, even whenNIM_CPU_ONLY
is set.
Release 1.1.0-rtx (Beta)#
Summary#
This is a public beta release of the NVIDIA NIM for Image OCR. This release contains the following changes:
Added support for GeForce RTX 4090, NVIDIA RTX 6000 Ada Generation, GeForce RTX 5080, and GeForce RTX 5090 for the PaddleOCR NIM.
Added the
NIM_TRITON_MODEL_BATCH_SIZE
environment variable.
Known Issues#
The
list-model-profiles
command incorrectly lists compatible model profiles as incompatible. Select the profile that matches your hardware configuration. This bug does not impact automatic profile selection.
Release 1.0.0#
Summary#
This is a Early Access release of the NVIDIA NIM for Image OCR. This release contains the following changes:
Updates the
/v1/infer
endpoint request and response JSON schemas. The new output schema provides bounding boxes and confidence scores for each text detection. See the API Reference for more details.
Table Detection Model Supported#
PaddleOCR
Release 0.2.1#
Summary#
This is an Early Access release of the NVIDIA NIM for Image OCR. This release contains the following fixes:
Fixes bug where
NIM_HTTP_TRITON_PORT
was not properly setting the port number.Fixes the
/v1/manifest
API described in the API Reference returning an empty result instead of the manifest file.Returns an emtpy string instead of “nan” when no text is detected in an image
Table Detection Model Supported#
PaddleOCR
Release 0.2.0#
Summary#
This is the second Early Access release of the NVIDIA NIM for Image OCR. This release contains the following changes:
Added validation for the base64 decoding format and improved the error message for when the format is incorrect.
Improved startup performance of the NIM.
Added FP16 optimized TRT engines for A100, H100, A10G, L40S.
Table Detection Model Supported#
PaddleOCR
Known Issues#
The
/v1/manifest
API described in the API Reference returns an empty result instead of the manifest file.
Release 0.1.0#
Summary#
This is the first Early Access release of the NVIDIA NIM for Image OCR.
Table Detection Model Supported#
PaddleOCR
Known Issues#
If the input
image_url
does not match the expected format described in the API Reference, the runtime returns an error message, such as{"error": "Incorrect padding"}
, indicating what format error occurred.