Speech Recognition

Speech Recognition#

How do I use Riva ASR APIs with out-of-the-box models?
- NVIDIA Riva Overview
- Transcription with Riva ASR APIs
- Go deeper into Riva capabilities
How to Customize Riva ASR Vocabulary and Pronunciation with Lexicon Mapping
- Overview
- What can be customized?
- Extending the vocabulary
- Customizing pronunciation with lexicon mapping
- Go deeper into Riva capabilities
How to Deploy a Custom Language Model (n-gram) Trained with NeMo on Riva
- NVIDIA Riva Overview
- NeMo (Neural Modules) and nemo2riva
- Prerequisites
- Riva ServiceMaker
- Start the Riva Server
- Run Inference
How to Deploy a Custom Acoustic Model (Citrinet) Trained with NeMo on Riva
- NVIDIA Riva Overview
- NeMo (Neural Modules) and nemo2riva
- Prerequisites
- Riva ServiceMaker
- Start the Riva Server
- Run Inference
How to Deploy a Custom Acoustic Model (Conformer-CTC) Trained with NeMo on Riva
- NVIDIA Riva Overview
- NeMo (Neural Modules) and nemo2riva
- Prerequisites
- Riva ServiceMaker
- Start the Riva Server
- Run Inference
How to Customize a Riva ASR Acoustic Model (Conformer-CTC) with Adapters
- NVIDIA Riva Overview
- Neural Module (NeMo)
ASR with Adapters
What are Adapters?
Advantages and Limitations of Adapter Training
Preparing the Acoustic Encoder for Adapter Training
Preparing the Model and Dataset for Adaptation
Creating and Training an Adapter
Evaluating the Model
Export the Model to Riva
What’s Next?
How to Fine-Tune a Riva ASR Acoustic Model with NVIDIA NeMo
- NVIDIA Riva Overview
- NeMo (Neural Modules)
- Fine-Tuning an ASR model with NeMo
- More Resources
- What’s Next?
How to Improve Recognition of Specific Words
- Overview of Riva customization techniques
- 1. Word boosting
- 2. Custom vocabulary
- 3. Custom pronunciation (Lexicon mapping)
- 4. Retrain language model
- 5. Fine-tune the acoustic model
Conclusion
How to Improve the Accuracy on Noisy Speech by Fine-Tuning the Acoustic Model (Conformer-CTC) in the Riva ASR Pipeline
- NVIDIA Riva Overview
How to Fine-Tune a Riva ASR Acoustic Model (Conformer-CTC) with TAO Toolkit
- NVIDIA Riva Overview
- Train Adapt Optimize (TAO) Toolkit
- Automatic Speech Recognition (ASR)
- ASR using TAO
- What’s Next?
How to pretrain a Riva ASR Language Modeling (n-gram) with TAO Toolkit
- NVIDIA Riva Overview
- TAO Toolkit
- Language Modeling
- Let’s Dig in: Riva Language Modeling using TAO
- TAO Toolkit workflow
How do I boost specific words at runtime with word boosting?
- NVIDIA Riva Overview
- Word boosting with Riva ASR APIs
- Go deeper into Riva capabilities