NVIDIA Riva

Contents

Getting Started

Overview
Quick Start Guide
NVIDIA AI Enterprise Trial
Release Notes
Support Matrix

Installation

Best Practices
Local (Docker)
Kubernetes
How to Deploy Riva at Scale on AWS with EKS
NVIDIA Fleet Command
- Application Setup
- Deployment Setup

Tutorials

Speech Recognition
Speech Recognition - New Language Adaptation
Cloud Deployment
Speech Synthesis
Translation

Architecture

Overview
Clients in a New Programming Language

Speech Recognition

ASR Overview
Basics of Speech Recognition and Customization of Riva ASR
Basics of Automatic Speech Recognition
- Speech Recognition by Humans
- Speech Recognition by Machines
Evaluation of ASR Accuracy
Riva ASR
Riva Speech Recognition Pipeline
Pipeline Configuration
Performance
ASR Advanced Details
- Confidence Estimates

Speech Synthesis

TTS Overview
- Try It Out
- Pretrained TTS Models
Features
TTS Inference and Customization
TTS Zero Shot
Speaker Adapter for Custom Voice
Custom Models
Performance
TTS Deploy
Run Inference
- Connect to the Riva server and run inference
Phoneme Support
- English-US
Data Collection - Script Generation

Natural Language Processing

NLP Overview
- Punctuation and capitalization
- Checking deployed models
Custom Models
- Pretrained Models

Translation

Translation Overview
Custom Models
- Example
- Supported Models
Performance

SDKs and Sample Apps

Python
- Installation
- Python Example
Command-line Clients
Sample Apps

Reference

Models
gRPC & Protocol Buffers
Troubleshooting
Upgrading
Acknowledgements
- Google APIs
- GoogleTest
- gflags
- Google Logging Library (glog)
- speexdsp
- libFLAC
- gRPC
- Triton Inference Server
- NVlabs cub
- KenLM
- Kaldi
- grpc_health_probe
- OpenFST
- Yamale
- PyTorch
- requests
- PyCUDA
- RapidJSON
- protobuf
- onnx
- librosa
- omegaconf
- utf8proc
- re2
- thrax
- Sparrowhawk
- SentencePiece
- YouTokenToMe
- MS-SNSD
- Silero VAD
End User License Agreement
Notice

By NVIDIA
© Copyright 2024 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
Last updated on Apr 03, 2025.