NVIDIA Riva

Models

  • Speech Recognition
    • Conformer-CTC
    • Citrinet
    • Jasper
    • QuartzNet
    • MarbleNet
    • TitaNet
  • Natural Language Processing
    • BERT
    • DistilBERT
    • Megatron
  • Neural Machine Translation (NMT)
    • Transformer-based Seq2Seq
  • Speech Synthesis
    • Mel Spectrogram Generators
    • Vocoders

By NVIDIA
© Copyright 2022 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
Last updated on Jul 10, 2023.