Algorithms

NeMo RL supports multiple training algorithms for post-training large language models.

Support Matrix

On-policy distillation is also supported in the PyTorch DTensor path.