nemo_automodel.components.optim.scheduler
nemo_automodel.components.optim.scheduler
Learning rate decay and weight decay incr functions.
Module Contents
Classes
Data
API
Anneals learning rate and weight decay.
Parameters:
the optimizer to be used
initial learning rate
maximum learning rate
minimum learning rate
number of warmup steps
number of decay steps
decay style for learning rate
initial weight decay
final weight decay
number of weight decay increment steps
weight decay increment style
whether to use the checkpoint values for the optimizer param scheduler. Defaults to True.
whether to override the optimizer param scheduler values with the class values. Defaults to False.
number of weight decay decay steps. Defaults to None.
decay style for learning rate during weight decay decay steps. Defaults to None.
Return a string representation of the OptimizerParamScheduler.
Auxiliary function for checking the values in the checkpoint and setting them.
Parameters:
class value
checkpoint value
name of the parameter
Learning rate decay functions from: https://openreview.net/pdf?id=BJYwwY9ll pg. 4.
Weight decay incr functions.
Load the state dict.
Parameters:
state dict to be load
Return the state dict.
Set lr for all parameters groups.
Parameters:
number of steps to increment