NeMo ASR Models
Use NeMo Framework’s automatic speech recognition models for transcription in your audio curation pipelines. This guide covers basic usage and configuration.
Model Selection
NeMo Framework provides pre-trained ASR models through the Hugging Face model hub. For the complete list of available models and their specifications, refer to the NeMo Framework ASR documentation.
Example Model Usage
Basic Usage
Simple ASR Inference
Custom Configuration
Model Caching
Models are automatically downloaded and cached when first loaded:
Resource Configuration
Configure GPU and CPU resources based on your hardware:
Resource requirements vary by model. Test with your specific model to determine optimal settings.