NVIDIA Speech NIM Microservices
NVIDIA Speech NIM microservices are GPU-accelerated Docker containers that provide speech AI capabilities as building blocks for your applications. Each NIM microservice packages an optimized speech model, the NVIDIA inference stack (CUDA, TensorRT, NVIDIA Triton Inference Server), and a unified API into a single container that you deploy, scale, and interact with through standard gRPC and HTTP interfaces.
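As a rough sketch of what deploying one of these containers looks like, the command below starts an ASR NIM with Docker; the image name, tag, ports, and environment variable are illustrative assumptions, so check the NGC catalog and the deployment guide for the current values.

```shell
# Illustrative only: image name/tag, ports, and NGC_API_KEY usage are assumptions.
# Exposes the gRPC API on 50051 and the HTTP API on 9000.
docker run --rm --gpus all \
  -e NGC_API_KEY="$NGC_API_KEY" \
  -p 50051:50051 \
  -p 9000:9000 \
  nvcr.io/nim/nvidia/parakeet-ctc-1.1b-asr:latest
```

Once the container reports ready, clients connect to the published gRPC or HTTP port like any other networked service.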
About NVIDIA Speech NIM Microservices
Understand what Speech NIM microservices are, how they work, and what is new in the latest release.
Read a product overview of the NVIDIA Speech NIM microservices and the capabilities they provide.
Learn how the NVIDIA Speech NIM microservices work together to build speech applications.
Track the release notes for the NVIDIA Speech NIM microservices.
API reference, support matrix, performance benchmarks, and environment variables.
Get Started
Set up prerequisites, install Speech NIM microservices, and deploy them with Docker or Helm.
Prerequisites, installation, configuration, and tutorials to get up and running.
Deploy NVIDIA Speech NIM microservices using Docker or Helm charts.
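After deployment, a quick way to confirm the microservice is up is to probe its health endpoint. The port and path below are common NIM defaults but are assumptions here; confirm them against your deployment's configuration.

```shell
# Assumed defaults: HTTP on port 9000, readiness probe at /v1/health/ready.
curl -s http://localhost:9000/v1/health/ready
```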
Developer Guides
Explore the capabilities of each Speech NIM microservice, including model customization and integration options.
Convert speech to text with the NVIDIA ASR NIM microservice, which supports multiple models, languages, and inference modes.
Generate natural-sounding speech from text with the NVIDIA TTS NIM microservice, including support for multiple voices, languages, and voice cloning.
Translate text between 36 languages with the NVIDIA NMT NIM microservice, including translation exclusion and custom dictionaries.
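To give a feel for how an application talks to these services over gRPC, here is a minimal offline-transcription sketch using the `nvidia-riva-client` Python package. The server address, audio file, and language code are illustrative assumptions, and the sketch requires a running ASR NIM to execute.

```python
# Sketch, assuming the nvidia-riva-client package (pip install nvidia-riva-client)
# and an ASR NIM serving gRPC on localhost:50051.
import riva.client

auth = riva.client.Auth(uri="localhost:50051")  # gRPC endpoint of the deployed NIM
asr = riva.client.ASRService(auth)

config = riva.client.RecognitionConfig(
    language_code="en-US",
    enable_automatic_punctuation=True,
)

with open("sample.wav", "rb") as f:  # hypothetical input file
    audio = f.read()

# Send the whole file in one request (offline mode); streaming mode is also available.
response = asr.offline_recognize(audio, config)
for result in response.results:
    print(result.alternatives[0].transcript)
```

The TTS and NMT microservices follow the same pattern: authenticate once, construct the service stub, then issue synthesis or translation requests over the same gRPC channel.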
References
Look up API specifications, supported configurations, performance data, and troubleshooting guidance.
gRPC and real-time API references for the ASR, NMT, and TTS NIM microservices.
Latency and throughput benchmarks for ASR, NMT, and TTS NIM microservices across supported GPUs.
Common issues and solutions shared across Speech NIM microservices (ASR, TTS, NMT).