Support Matrix

Riva 2.16.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.16.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU NVIDIA A10 GPU NVIDIA H100 GPU NVIDIA L4 GPU NVIDIA L40 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB NMT Models ~4500 Mb per bi-lingual model ~5500 Mb per megatron 500m model ~8500 Mb per megatron 1b model Total 20500 MB

Embedded#

The following table shows the supported hardware for Riva 2.16.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin
Jetson SDK version	JetPack 6.0
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) NMT Models ~5000 MB TTS Models ~2100 MB All Models combined ~9500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.16.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 22.04 CUDA 12.3.52 cuBLAS 12.3.2.9 cuDNN 8.9.6.50 NCCL 2.19.3 TensorRT 8.6.1.6 Triton Inference Server 2.40.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.16.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 22.04 CUDA 12.2.140 cuDNN 8.9.4.25 TensorRT 8.6.2.3 Triton Inference Server 2.40.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.15.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.15.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU NVIDIA A10 GPU NVIDIA H100 GPU NVIDIA L4 GPU NVIDIA L40 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB NMT Models ~4500 Mb per bi-lingual model ~5500 Mb per megatron 500m model ~8500 Mb per megatron 1b model Total 20500 MB

Embedded#

The following table shows the supported hardware for Riva 2.15.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin
Jetson SDK version	JetPack 6.0
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) NMT Models ~5000 MB TTS Models ~2100 MB All Models combined ~9500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.15.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 22.04 CUDA 12.3.52 cuBLAS 12.3.2.9 cuDNN 8.9.6.50 NCCL 2.19.3 TensorRT 8.6.1.6 Triton Inference Server 2.40.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.15.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 22.04 CUDA 12.2.140 cuDNN 8.9.4.25 TensorRT 8.6.2.3 Triton Inference Server 2.40.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.14.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.14.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU NVIDIA A10 GPU NVIDIA H100 GPU NVIDIA L4 GPU NVIDIA L40 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB NMT Models ~4500 Mb per bi-lingual model ~5500 Mb per megatron 500m model ~8500 Mb per megatron 1b model Total 20500 MB

Embedded#

The following table shows the supported hardware for Riva 2.14.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 5.1.1 JetPack 5.1
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) NMT Models ~5000 MB TTS Models ~2100 MB All Models combined ~9500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.14.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.8.89 cuBLAS 11.11.3.6 cuDNN 8.6.0.163 NCCL 2.15.5 TensorRT 8.5.0.12 Triton Inference Server 2.27.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.14.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.4.19 cuDNN 8.6.0.166 TensorRT 8.5.2.2 Triton Inference Server 2.27.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.13.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.13.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU NVIDIA A10 GPU NVIDIA H100 GPU NVIDIA L4 GPU NVIDIA L40 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB NMT Models ~4500 Mb per model Total 20500 MB

Embedded#

The following table shows the supported hardware for Riva 2.13.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 5.1.1 JetPack 5.1
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) NMT Models ~5000 MB TTS Models ~2100 MB All Models combined ~9500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.13.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.8.89 cuBLAS 11.11.3.6 cuDNN 8.6.0.163 NCCL 2.15.5 TensorRT 8.5.0.12 Triton Inference Server 2.27.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.13.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.4.19 cuDNN 8.6.0.166 TensorRT 8.5.2.2 Triton Inference Server 2.27.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.12.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.12.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU NVIDIA A10 GPU NVIDIA H100 GPU NVIDIA L4 GPU NVIDIA L40 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB NMT Models ~4500 Mb per model Total 20500 MB

Embedded#

The following table shows the supported hardware for Riva 2.12.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 5.1.1 JetPack 5.1
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) NMT Models ~5000 MB TTS Models ~2100 MB All Models combined ~9500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.12.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.8.89 cuBLAS 11.11.3.6 cuDNN 8.6.0.163 NCCL 2.15.5 TensorRT 8.5.0.12 Triton Inference Server 2.27.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.12.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.4.19 cuDNN 8.6.0.166 TensorRT 8.5.2.2 Triton Inference Server 2.27.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.11.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.11.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU NVIDIA A10 GPU NVIDIA H100 GPU NVIDIA L4 GPU NVIDIA L40 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB NMT Models ~4500 Mb per model Total 20500 MB

Embedded#

The following table shows the supported hardware for Riva 2.11.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 5.1.1 JetPack 5.1
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) NMT Models ~5000 MB TTS Models ~2100 MB All Models combined ~9500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.11.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.8.89 cuBLAS 11.11.3.6 cuDNN 8.6.0.163 NCCL 2.15.5 TensorRT 8.5.0.12 Triton Inference Server 2.27.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.11.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.4.19 cuDNN 8.6.0.166 TensorRT 8.5.2.2 Triton Inference Server 2.27.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.10.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.10.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU NVIDIA A10 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB NMT Models ~4500 Mb per model Total 20500 MB

Embedded#

The following table shows the supported hardware for Riva 2.10.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 5.1
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) NMT Models ~5000 MB TTS Models ~2100 MB All Models combined ~9500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.10.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.8.89 cuBLAS 11.11.3.6 cuDNN 8.6.0.163 NCCL 2.15.5 TensorRT 8.5.0.12 Triton Inference Server 2.27.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.10.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.4.19 cuDNN 8.6.0.163 TensorRT 8.5.2.2 Triton Inference Server 2.27.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.9.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.9.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU NVIDIA A10 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB NMT Models ~4500 Mb per model Total 20500 MB

Embedded#

The following table shows the supported hardware for Riva 2.9.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 5.0.2
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) TTS Models ~2100 MB All Models combined ~4500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.9.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.8.89 cuBLAS 11.11.3.6 cuDNN 8.6.0.163 NCCL 2.15.5 TensorRT 8.5.0.12 Triton Inference Server 2.27.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.9.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.4.239 cuBLAS 11.6.6.23 cuDNN 8.4.1.50 TensorRT 8.4.1.5 Triton Inference Server 2.27.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.8.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.8.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU NVIDIA A10 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB Total 16000 MB

Embedded#

The following table shows the supported hardware for Riva 2.8.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 5.0.2
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) TTS Models ~2100 MB All Models combined ~4500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.8.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.8.89 cuBLAS 11.11.3.6 cuDNN 8.6.0.163 NCCL 2.15.5 TensorRT 8.5.0.12 Triton Inference Server 2.27.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
TAO Toolkit	TAO Toolkit 21.10 or later
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.8.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.4.239 cuBLAS 11.6.6.23 cuDNN 8.4.1.50 TensorRT 8.4.1.5 Triton Inference Server 2.27.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.7.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.7.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB Total 16000 MB

Embedded#

The following table shows the supported hardware for Riva 2.7.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 5.0.2
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) TTS Models ~2100 MB All Models combined ~4500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.7.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.6.0 cuBLAS 11.8.1.74 cuDNN 8.3.2 NCCL 2.11.4 TensorRT 8.2.3.1 Triton Inference Server 2.19.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
TAO Toolkit	TAO Toolkit 21.10 or later
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.7.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.4.14 cuDNN 8.4.1.50 TensorRT 8.4.1.5 Triton Inference Server 2.21.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.6.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.6.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB Total 16000 MB

Embedded#

The following table shows the supported hardware for Riva 2.6.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 5.0.2
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) TTS Models ~2100 MB All Models combined ~4500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.6.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.6.0 cuBLAS 11.8.1.74 cuDNN 8.3.2 NCCL 2.11.4 TensorRT 8.2.3.1 Triton Inference Server 2.19.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
TAO Toolkit	TAO Toolkit 21.10 or later
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.6.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.4.14 cuDNN 8.4.1.50 TensorRT 8.4.1.5 Triton Inference Server 2.21.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.5.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.5.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB Total 16000 MB

Embedded#

The following table shows the supported hardware for Riva 2.5.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 5.0.2
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) TTS Models ~2100 MB All Models combined ~4500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.5.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.6.0 cuBLAS 11.8.1.74 cuDNN 8.3.2 NCCL 2.11.4 TensorRT 8.2.3.1 Triton Inference Server 2.19.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
TAO Toolkit	TAO Toolkit 21.10 or later
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.5.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.4.14 cuDNN 8.4.1.50 TensorRT 8.4.1.5 Triton Inference Server 2.21.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.4.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.4.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB Total 16000 MB

Embedded#

The following table shows the supported hardware for Riva 2.4.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 5.0 Developer Preview JetPack 5.0.1 Developer Preview
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) TTS Models ~2100 MB All Models combined ~4500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.4.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.6.0 cuBLAS 11.8.1.74 cuDNN 8.3.2 NCCL 2.11.4 TensorRT 8.2.3.1 Triton Inference Server 2.19.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
TAO Toolkit	TAO Toolkit 21.10 or later
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.4.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.4.14 cuDNN 8.3.2.49 TensorRT 8.4.0.11 Triton Inference Server 2.20.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.3.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.3.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB Total 16000 MB

Embedded#

The following table shows the supported hardware for Riva 2.3.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 5.0 Developer Preview JetPack 5.0.1 Developer Preview
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) TTS Models ~2100 MB All Models combined ~4500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.3.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.6.0 cuBLAS 11.8.1.74 cuDNN 8.3.2 NCCL 2.11.4 TensorRT 8.2.3.1 Triton Inference Server 2.19.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
TAO Toolkit	TAO Toolkit 21.10 or later
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.3.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.4.14 cuDNN 8.3.2.49 TensorRT 8.4.0.11 Triton Inference Server 2.20.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.2.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.2.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB Total 16000 MB

Embedded#

The following table shows the supported hardware for Riva 2.2.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson Orin NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 5.0 Developer Preview JetPack 5.0.1 Developer Preview
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) TTS Models ~2100 MB All Models combined ~4500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.2.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.6.0 cuBLAS 11.8.1.74 cuDNN 8.3.2 NCCL 2.11.4 TensorRT 8.2.3.1 Triton Inference Server 2.19.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
TAO Toolkit	TAO Toolkit 21.10 or later
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.2.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.4.14 cuDNN 8.3.2.49 TensorRT 8.4.0.11 Triton Inference Server 2.20.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.1.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.1.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB Total 16000 MB

Embedded#

The following table shows the supported hardware for Riva 2.1.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 4.6.1
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) TTS Models ~2100 MB All Models combined ~4500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.1.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.6.0 cuBLAS 11.8.1.74 cuDNN 8.3.2 NCCL 2.11.4 TensorRT 8.2.3.1 Triton Inference Server 2.19.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
TAO Toolkit	TAO Toolkit 21.10 or later
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.1.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 18.04 CUDA 10.2.300 cuDNN 8.2.1.32 TensorRT 8.2.1.8 Triton Inference Server 2.19.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 2.0.0#

Server Hardware#

Data Center#

The following table shows the supported hardware for Riva 2.0.0 on data center platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB Total 16000 MB

Embedded#

The following table shows the supported hardware for Riva 2.0.0 on embedded platforms.

	Hardware Compatibility
Operating System	Riva server requires Linux AArch64.
GPU Model	Deployment Platforms: NVIDIA Jetson AGX Xavier NVIDIA Jetson NX Xavier
Jetson SDK version	JetPack 4.6
Mic, Camera, and Headset	Microphone: Linux AArch64 with a USB microphone (for example, a Logitech H390 USB computer headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
RAM requirement	ASR Models ~2300 MB NLP Models ~1800 MB (with punctuation) TTS Models ~2100 MB All Models combined ~4500 MB

Server Software#

Data Center#

The following table shows the supported software for Riva 2.0.0 on data center platforms.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.6.0 cuBLAS 11.8.1.74 cuDNN 8.3.2 NCCL 2.11.4 TensorRT 8.2.3.1 Triton Inference Server 2.19.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
TAO Toolkit	TAO Toolkit 21.10 or later
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Embedded#

The following table shows the supported software for Riva 2.0.0 on embedded platforms.

	Software Compatibility
Container	Container OS Ubuntu 18.04 CUDA 10.2.300 cuDNN 8.2.1.32 TensorRT 8.0.1.6 Triton Inference Server 2.13.0
Docker	Docker >= 19.03 with nvidia-docker installed

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 1.10.0 Beta#

Server Hardware#

The following table shows the supported hardware for Riva 1.10.0 Beta.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB Total 16000 MB

Server Software#

The following table shows the supported software for Riva 1.10.0 Beta.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.6.0 cuBLAS 11.8.1.74 cuDNN 8.3.2 NCCL 2.11.4 TensorRT 8.2.3.1 Triton Inference Server 2.19.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
TAO Toolkit	TAO Toolkit 21.10 or later
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	510+ 418.40+, 440.33+, 450.51+, 460.27+, 470.57+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 1.9.0 Beta#

Server Hardware#

The following table shows the supported hardware for Riva 1.9.0 Beta.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU Note Riva is supported on any NVIDIA Volta or later GPU (NVIDIA Turing and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB Total 16000 MB

Server Software#

The following table shows the supported software for Riva 1.9.0 Beta.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.3.0 cuBLAS 11.5.4.8 cuDNN 8.2.2.26 NCCL 2.10.3 TensorRT 8.0.1.6 Triton Inference Server 2.13.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
TAO Toolkit	TAO Toolkit 21.10 or later
NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	465.19.01+ 418.40+, 440.33+, 450.51+, 460.27+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 1.8.0 Beta#

Server Hardware#

The following table shows the supported hardware for Riva 1.8.0 Beta.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU Note Riva is supported on any Volta or later NVIDIA GPU (Volta, Turing, and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500 MB per BERT model TTS Models ~3500 MB Total 16000 MB

Server Software#

The following table shows the supported software for Riva 1.8.0 Beta.

	Software Compatibility
Container	Container OS Ubuntu 20.04 NVIDIA CUDA 11.3.0 NVIDIA cuBLAS 11.5.4.8 NVIDIA cuDNN 8.2.2.26 NVIDIA NCCL 2.10.3 NVIDIA TensorRT 8.0.1.6 NVIDIA Triton Inference Server 2.13.0
MIG (Multi Instance GPU)	On A100 and A30 platforms, MIG is supported provided enough vRam is available for the selected models. Note For further information on Ampere and MIG, refer to Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, refer to Preparing to use NVIDIA Containers.
NVIDIA TAO Toolkit	TAO Toolkit 21.10 or later
NVIDIA NeMo	NeMo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	465.19.01+ 418.40+, 440.33+, 450.51+, 460.27+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 1.7.0 Beta#

Server Hardware#

The following table shows the supported hardware for Riva 1.7.0 Beta.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU NVIDIA A30 GPU Note Riva is supported on any Volta or later NVIDIA GPU (Volta, Turing, and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500MB per BERT model TTS Models ~3500 MB Total 16000 MB

Server Software#

The following table shows the supported software for Riva 1.7.0 Beta.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.3.0 cuBLAS 11.5.1.101 cuDNN 8.2.0.41 NCCL 2.9.6 TensorRT 8.0.1.6 Triton Inference Server 2.13.0
MiG (Multi Instance GPU)	On A100 and A30 platforms MiG is supported provided enough vRam is available for the selected Models. Note For further information on Ampere and MiG, see Ampere Architecture In-Depth:.
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, see Preparing to use NVIDIA Containers.
TAO	TAO Toolkit 21.10 or later
Nemo	Nemo 1.1.0 or later
Helm	Helm charts 3.x
NVIDIA Driver	465.19.01+ 418.40+, 440.33+, 450.51+, 460.27+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 1.6.0 Beta#

Server Hardware#

The following table shows the supported hardware for Riva 1.6.0 Beta.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU Note Riva is supported on any Volta or later NVIDIA GPU (Volta, Turing, and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500MB per BERT model TTS Models ~3500 MB Total 16000 MB

Server Software#

The following table shows the supported software for Riva 1.6.0 Beta.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.3.0 cuBLAS 11.5.1.101 cuDNN 8.2.0.41 NCCL 2.9.6 TensorRT 7.2.3.4 Triton Inference Server 2.9.0
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, see Preparing to use NVIDIA Containers.
Helm	Helm charts 3.x
NVIDIA Driver	465.19.01+ 418.40+, 440.33+, 450.51+, 460.27+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 1.5.0 Beta#

Server Hardware#

The following table shows the supported hardware for Riva 1.5.0 Beta.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU Note Riva is supported on any Volta or later NVIDIA GPU (Volta, Turing, and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500MB per BERT model TTS Models ~3500 MB Total 16000 MB

Server Software#

The following table shows the supported software for Riva 1.5.0 Beta.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.3.0 cuBLAS 11.5.1.101 cuDNN 8.2.0.41 NCCL 2.9.6 TensorRT 7.2.3.4 Triton Inference Server 2.9.0
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, see Preparing to use NVIDIA Containers.
Helm	Helm charts 3.x
NVIDIA Driver	465.19.01+ 418.40+, 440.33+, 450.51+, 460.27+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Riva 1.4.0 Beta#

Server Hardware#

The following table shows the supported hardware for Riva 1.4.0 Beta.

	Hardware Compatibility
Operating System	Riva server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU Note Riva is supported on any Volta or later NVIDIA GPU (Volta, Turing, and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500MB per BERT model TTS Models ~3500 MB Total 16000 MB

Server Software#

The following table shows the supported software for Riva 1.4.0 Beta.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.3.0 cuBLAS 11.5.1.101 cuDNN 8.2.0.41 NCCL 2.9.6 TensorRT 7.2.3.4 Triton Inference Server 2.9.0
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, see Preparing to use NVIDIA Containers.
Helm	Helm charts 3.x
NVIDIA Driver	465.19.01+ 418.40+, 440.33+, 450.51+, 460.27+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Skills Clients#

Riva Speech AI Skills clients do not require a local GPU and have minimal hardware requirements. Refer to Clients in a New Programming Language for creating client bindings in your programming language. The Python section describes the prebuilt Python bindings included in the Riva Quick Start package, which are also based on gRPC.

Jarvis 1.3.0 Beta

Hardware#

The following table shows the supported hardware for Jarvis 1.3.0 Beta.

	Hardware Compatibility
Operating System	Jarvis server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU Note Jarvis is supported on any Volta or later NVIDIA GPU (Volta, Turing, and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy. 16+ GB VRAM is recommended.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500MB per BERT model TTS Models ~3500 MB Total 16000 MB

Software#

The following table shows the supported software for Jarvis 1.3.0 Beta.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.3.0 cuBLAS 11.5.1.101 cuDNN 8.2.0.41 NCCL 2.9.6 TensorRT 7.2.3.4 Triton Inference Server 2.9.0
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, see Preparing to use NVIDIA Containers.
Helm	Helm charts 3.x
NVIDIA Driver	465.19.01+ 418.40+, 440.33+, 450.51+, 460.27+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Jarvis 1.2.x Beta

Hardware#

The following table shows the supported hardware for Jarvis 1.2.0 Beta.

	Hardware Compatibility
Operating System	Jarvis server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU Note Jarvis is supported on any Volta or later NVIDIA GPU (Volta, Turing, and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500MB per BERT model TTS Models ~3500 MB Total 16000 MB

Software#

The following table shows the supported software for Jarvis 1.2.0 Beta.

	Software Compatibility
Container	Container OS Ubuntu 20.04 CUDA 11.3.0 cuBLAS 11.5.1.101 cuDNN 8.2.0.41 NCCL 2.9.6 TensorRT 7.2.3.4 Triton Inference Server 2.9.0
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, see Preparing to use NVIDIA Containers.
Helm	Helm charts 3.x
NVIDIA Driver	465.19.01+ 418.40+, 440.33+, 450.51+, 460.27+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Jarvis 1.1.0 Beta

Hardware#

The following table shows the supported hardware for Jarvis 1.1.0 Beta.

	Hardware Compatibility
Operating System	Jarvis server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU Note Jarvis is supported on any Volta or later NVIDIA GPU (Volta, Turing, and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500MB per BERT model TTS Models ~3500 MB Total 16000 MB

Software#

The following table shows the supported software for Jarvis 1.1.0 Beta.

	Software Compatibility
CUDA	11.2.1
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, see Preparing to use NVIDIA Containers.
Helm	Helm charts 3.x
NVIDIA Driver	460.32.03+ 418.40+, 440.33+, 450.51+ for Data Center GPUs Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

Jarvis 1.0.x Beta

Hardware#

The following table shows the supported hardware for Jarvis 1.0.0 Beta.

	Hardware Compatibility
Operating System	Jarvis server requires Linux x86_64.
GPU Model	Preferred Deployment Platforms: NVIDIA Volta V100 NVIDIA Turing T4 NVIDIA A100 GPU Note Jarvis is supported on any Volta or later NVIDIA GPU (Volta, Turing, and NVIDIA Ampere GPU architecture) for development purposes. Care must be taken to not exceed the memory available when selecting models to deploy.
Mic, Camera, and Headset	Microphone: Linux x86 with USB microphone (for example, a Logitech H390 USB Computer Headset) Headphones: Logitech H340 Logitech H390 Microsoft LX3000
GPU Memory	ASR Models Streaming Models: ~5600 MB Non-streaming models: ~3100 MB NLP Models ~500MB per BERT model TTS Models ~3500 MB Total 16000 MB

Software#

The following table shows the supported software for Jarvis 1.0.0 Beta.

	Software Compatibility
CUDA	11.1
Docker	Docker > 19.02 with nvidia-docker installed For users other than DGX, Docker >= 19.03 is required. Note For DGX users, see Preparing to use NVIDIA Containers.
Helm	Helm charts 3.x
NVIDIA Driver	440.44+ Note For earlier driver versions, refer to the NVIDIA Driver section in Deep Learning Frameworks Support Matrix.

NVIDIA Riva

Contents

Support Matrix#

Riva 2.16.0#

Server Hardware#

Data Center#

Embedded#

Server Software#

Data Center#

Embedded#

Skills Clients#

Riva 2.15.0#

Server Hardware#

Data Center#

Embedded#

Server Software#

Data Center#

Embedded#

Skills Clients#

Riva 2.14.0#

Server Hardware#

Data Center#

Embedded#

Server Software#

Data Center#

Embedded#

Skills Clients#

Riva 2.13.0#

Server Hardware#

Data Center#

Embedded#

Server Software#

Data Center#

Embedded#

Skills Clients#

Riva 2.12.0#

Server Hardware#

Data Center#

Embedded#

Server Software#

Data Center#

Embedded#

Skills Clients#

Riva 2.11.0#

Server Hardware#

Data Center#

Embedded#

Server Software#

Data Center#

Embedded#

Skills Clients#

Riva 2.10.0#

Server Hardware#

Data Center#

Embedded#

Server Software#

Data Center#

Embedded#

Skills Clients#

Riva 2.9.0#

Server Hardware#

Data Center#

Embedded#

Server Software#

Data Center#

Embedded#

Skills Clients#

Riva 2.8.0#

Server Hardware#

Data Center#

Embedded#

Server Software#

Data Center#

Embedded#

Skills Clients#

Riva 2.7.0#

Server Hardware#

Data Center#

Embedded#