Checkpoints#

There are two main ways to load pretrained checkpoints in NeMo:

Using the restore_from() method to load a local checkpoint file (.nemo), or
Using the from_pretrained() method to download and set up a checkpoint from the cloud.

Note that these instructions are for loading fully trained checkpoints for evaluation or fine-tuning. For resuming an unfinished training experiment, use the Experiment Manager to do so by setting the resume_if_exists flag to True.

Local Checkpoints#

Save Model Checkpoints: NeMo automatically saves final model checkpoints with .nemo suffix. You could also manually save any model checkpoint using model.save_to(<checkpoint_path>.nemo).
Load Model Checkpoints: if you’d like to load a checkpoint saved at <path/to/checkpoint/file.nemo>, use the restore_from() method below, where <MODEL_BASE_CLASS> is the model class of the original checkpoint.

import nemo.collections.audio as nemo_audio
model = nemo_audio.models.<MODEL_BASE_CLASS>.restore_from(restore_path="<path/to/checkpoint/file.nemo>")

Pretrained Checkpoints#

The table below in Audio Models list part of available pre-trained audio processing models including speech processing, restoration and extraction.

Load Model Checkpoints#

The models can be accessed via the from_pretrained() method inside the audio model class. In general, you can load any of these models with code in the following format,

import nemo.collections.audio as nemo_audio
model = nemo_audio.models.<MODEL_BASE_CLASS>.from_pretrained(model_name="<MODEL_NAME>")

where <MODEL_NAME> is the value in Model Name column in the tables in Audio Models. These names are predefined in the each model’s member function self.list_available_models().

Audio Models#

Speech Enhancement Models#

Model Name	Dataset	Sampling Rate	Model Class	Model Card
nvidia/se_den_sb_16k_small	WSJ0+CHiME	16000Hz	nemo.collections.audio.models.SchroedingerBridgeAudioToAudioModel	se_den_sb_16k_small
nvidia/se_der_sb_16k_small	WSJ0+Reverb	16000Hz	nemo.collections.audio.models.SchroedingerBridgeAudioToAudioModel	se_der_sb_16k_small

SSL Models#

Model Name	Dataset	Sampling Rate	Model Class	Model Card
nvidia/sr_ssl_flowmatching_16k_430m	Libri-Light	16000Hz	nemo.collections.audio.models.FlowMatchingAudioToAudioModel	sr_ssl_flowmatching_16k_430m