Toggle navigation sidebar

Toggle in-page Table of Contents

NVIDIA Riva

Getting Started

Overview
Quick Start Guide
Release Notes

Installation

Best Practices
Local (Docker)
Kubernetes
How to Deploy Riva at Scale on AWS with EKS
NVIDIA Fleet Command

Tutorials

Speech Recognition
Speech Recognition - New Language Adaptation
Cloud Deployment
- How to Deploy Riva at Scale on AWS with EKS
Speech Synthesis
Translation

Architecture

Overview
Clients in a New Programming Language

Speech Recognition

ASR Overview
Basics of Speech Recognition and Customization of Riva ASR
Pipeline Configuration
Performance
ASR Advanced Details

Speech Synthesis

TTS Overview
TTS Inference and Customization
Custom Models
Performance
TTS Deploy
Phoneme Support
Data Collection - Script Generation

Natural Language Processing

NLP Overview
Custom Models

Translation

Translation Overview
Custom Models
Performance
Speech-to-Speech Translation (S2S) Overview
Speech-to-Text Translation (S2T) Overview

SDKs and Sample Apps

Python
Command-line Clients
Sample Apps

Reference

Models
gRPC & Protocol Buffers
Troubleshooting
Support Matrix
Archives
Upgrading
Acknowledgements
End User License Agreement
Notice

Models

Models#

Speech Recognition
- Conformer-CTC
- Citrinet
- Jasper
- QuartzNet
- MarbleNet
- TitaNet
Natural Language Processing
- BERT
- DistilBERT
- Megatron
Natural Machine Translation(NMT)
- Transformer based Seq2Seq
Speech Synthesis
- Mel Spectrogram Generators
- Vocoders

previous

AudioCodes VoiceGateway Sample

next

Speech Recognition

By NVIDIA
© Copyright 2022 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
Last updated on Jul 10, 2023.