Is this page helpful?

Deploy and Run ASR Models as NIM Microservices#

This section provides detailed instructions for deploying and running ASR models using the NVIDIA Riva Python client and sample scripts from the Repository of the client.

Choose a model based on your language, inference mode, and capability requirements. For selection guidance — including a disambiguation table for overlapping models — refer to Choosing a Model. Determine which CONTAINER_ID and NIM_TAGS_SELECTOR environment variables to use from the following table. For a complete list of models, languages, inference modes, and GPU memory requirements, refer to the ASR support matrix.

Model	Languages	Modes	Capabilities	`CONTAINER_ID`	`NIM_TAGS_SELECTOR`
Parakeet CTC English	English	Streaming + Offline	Transcription	`parakeet-1-1b-ctc-en-us`	Refer to the support matrix
Parakeet CTC Vietnamese	Vietnamese, English	Streaming + Offline	Transcription	`parakeet-ctc-0.6b-vi`	Refer to the support matrix
Parakeet CTC Spanish	Spanish, English	Streaming + Offline	Transcription	`parakeet-ctc-0.6b-es`	Refer to the support matrix
Parakeet CTC Mandarin	Mandarin, English	Streaming + Offline	Transcription	`parakeet-ctc-0.6b-zh-cn`	Refer to the support matrix
Parakeet CTC Taiwanese	Taiwanese, Mandarin, English	Streaming + Offline	Transcription	`parakeet-ctc-0.6b-zh-tw`	Refer to the support matrix
Parakeet TDT English	English	Offline only	Transcription, word-level timestamps	`parakeet-0.6b-tdt`	Refer to the support matrix
Parakeet RNNT Multilingual	25+ languages (auto-detect)	Streaming + Offline	Transcription	`parakeet-1-1b-rnnt-multilingual`	Refer to the support matrix
Conformer CTC Spanish	Spanish	Streaming + Offline	Transcription	`riva-asr`	Refer to the support matrix
Whisper Large v3	100+ languages (auto-detect)	Offline only	Transcription, translation to English	`whisper-large-v3`	Refer to the support matrix
Canary 1b Multilingual	26 languages	Offline only	Transcription, bidirectional translation	`canary-1b`	Refer to the support matrix
Nemotron ASR Streaming	English	Streaming only	Transcription	`nemotron-asr-streaming`	Refer to the support matrix