NVIDIA Riva

Speech Recognition

  • How do I use Riva ASR APIs with out-of-the-box models?
    • NVIDIA Riva Overview
    • Transcription with Riva ASR APIs
    • Go deeper into Riva capabilities
  • How to Improve Recognition of Specific Words
    • Overview of Riva customization techniques
    • 1. Word boosting
    • 2. Custom vocabulary
    • 3. Custom pronunciation (Lexicon mapping)
    • 4. Retrain language model
    • 5. Fine tune the acoustic model
    • Conclusion
  • How do I boost specific words at runtime with word boosting?
    • NVIDIA Riva Overview
    • Word boosting with Riva ASR APIs
    • Go deeper into Riva capabilities
  • How to Customize Riva ASR Vocabulary and Pronunciation with Lexicon Mapping
    • Overview
    • What can be customized?
    • Extending the vocabulary
    • Customizing pronunciation with lexicon mapping
    • Go deeper into Riva capabilities
  • How to pretrain a Riva ASR Language Modeling (n-gram) with TAO Toolkit
    • NVIDIA Riva Overview
    • TAO Toolkit
    • Language Modeling
    • Let’s Dig in: Riva Language Modeling using TAO
    • TAO Toolkit workflow
  • How to Fine-Tune a Riva ASR Acoustic Model (Citrinet) with TAO Toolkit
    • NVIDIA Riva Overview
    • Train Adapt Optimize (TAO) Toolkit
    • Automatic Speech Recognition (ASR)
    • ASR using TAO
    • What’s Next?
  • How to deploy custom Acoustic Model (Citrinet) trained with TAO Toolkit on Riva
    • NVIDIA Riva Overview
    • Train, Adapt, and Optimize TAO Toolkit
    • Prerequisites
    • Riva ServiceMaker
    • Start the Riva Server
    • Run Inference

By NVIDIA
© Copyright 2022 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
Last updated on Dec 13, 2022.