Release Notes#

Release 1.3.1#

Key Features and Enhancements#

  • Added a size-optimized container for Parakeet 0.6b CTC en-US ASR NIM with WSL support.

  • Added a profile with lower GPU memory requirement for Parakeet 0.6b CTC en-US model with a batch size of 1 for edge deployment.

Release 1.3.0#

Key Features and Enhancements#

  • New ASR models

    • Canary 1b Multilingual

    • Canary 0.6b Turbo Multilingual

    • Whisper Large v3

    • Conformer Spanish (es-US)

  • Simplified deployment with auto profile selection based on hardware.

  • Support for deployment in WSL2 environment

Known Issues#

  • Canary models may produce incorrect translation output for certain languages like es-ES and ko-KR

  • Deployment with RMIR model format is supported only on GPUs with compute capability <= 9.0.

  • NMT NIM is not supported on WSL.

  • ASR HTTP API for Whisper only accepts file, model and language parameters in the request. Other parameters are ignored.

  • ASR gRPC API for Whisper only supports Offline Recognize API.

  • ASR gRPC API for Whisper does not support customization parameters (e.g. profanity_filter, enable_word_time_offsets, enable_automatic_punctuation, verbatim_transcripts) in RecognitionConfig message.

  • Diarizer function is not supported in this release.

Release 1.2.0#

Key Features and Enhancements#

  • New ASR models

    • Conformer-CTC Spanish (es-US)

    • Whisper Large v3 with support for Multilingual Transcription and Translation to English

Known Issues#

  • ASR HTTP API for Whisper only accepts file, model and language parameters in the request. Other parameters are ignored.

  • ASR gRPC API for Whisper only supports Offline Recognize API.

  • ASR gRPC API for Whisper does not support customization parameters (e.g. profanity_filter, enable_word_time_offsets, enable_automatic_punctuation, verbatim_transcripts) in RecognitionConfig message.

Release 1.1.0#

  • Added support for a single container to serve all models.

  • Added support for deploying TensorRT optimized models on supported GPUs.

Release 1.0.0#

This is the first general release of NVIDIA NIM for Riva, featuring support for the following models:

  • ASR Models

    • Parakeet 1.1b CTC English (en-US)

  • TTS Models

    • Fastpitch HifiGAN English (en-US)

  • NMT Models

    • Megatron 1b Any-to-Any Text-to-Text Translation (32 languages)