NVIDIA Speech NIM Microservices

NVIDIA Speech NIM microservices are GPU-accelerated Docker containers that provide speech AI capabilities as building blocks for your applications. Each NIM microservice packages a Nemotron model, the full NVIDIA inference stack (CUDA, TensorRT, Triton), and a unified API into a single container. You deploy and scale the container, then interact with it through standard gRPC and HTTP interfaces.

About NVIDIA Speech NIM Microservices

Understand what Speech NIM microservices are, how they work, and what is new in the latest release.

NVIDIA Speech NIM Microservices Overview

Learn about NVIDIA Speech NIM Microservices, including release notes and product overview.

NVIDIA Speech NIM Microservices Overview

How It Works

Learn how the NVIDIA Speech NIM microservices work together to build speech applications.

How NVIDIA Speech NIM Microservices Work

Release Notes

Review the release notes for the NVIDIA Speech NIM microservices.

NVIDIA Speech NIM Microservices Release Notes

Support Matrix

API reference, support matrix, performance benchmarks, and environment variables.

Support Matrix

Get Started

Set up prerequisites, install Speech NIM microservices, and deploy them with Docker or Helm.

About Getting Started

Prerequisites, installation, configuration, and tutorials to get up and running.

About Getting Started with NVIDIA Speech NIM Microservices

About Configuring Speech NIM Deployment

Deploy NVIDIA Speech NIM microservices using Docker or Helm charts.

About Configuring Speech NIM Deployment

Developer Guides

Explore the capabilities of each Speech NIM microservice, including model customization and integration options.

About NVIDIA ASR NIM

Convert speech to text with the NVIDIA ASR NIM microservice, which supports multiple models, languages, and inference modes.

About NVIDIA ASR NIM Microservice

About NVIDIA TTS NIM

Generate natural speech from text with the NVIDIA TTS NIM microservice, which supports multiple voices, multiple languages, and voice cloning.

About NVIDIA TTS NIM Microservice

About NVIDIA NMT NIM

Translate text between 36 languages with the NVIDIA NMT NIM microservice, including translation exclusion and custom dictionaries.

About NVIDIA NMT NIM Microservice

References

Look up API specifications, supported configurations, performance data, and troubleshooting guidance.

API References

gRPC and real-time API references for the ASR, NMT, and TTS NIM microservices.

NVIDIA Nemotron Speech NIM API References

Performance Benchmarks

Latency and throughput benchmarks for ASR, NMT, and TTS NIM microservices across supported GPUs.

Performance References

Troubleshooting Common Issues

Common issues and solutions shared across Speech NIM microservices (ASR, TTS, NMT).

Troubleshooting Common Issues for NVIDIA Speech NIM Microservices