Model Support

Broad coverage for 🤗Hugging Face models via NeMo AutoModel

NeMo RL supports 🤗Hugging Face models from the following classes, for model sizes under 70B at up to 32k sequence length:

LLMs (AutoModelForCausalLM)
VLMs (AutoModelForImageTextToText)
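
As a rough illustration of the coverage described here (the two Auto classes, under 70B parameters, up to 32k sequence length), a pre-flight check might look like the sketch below. The function `within_automodel_envelope`, the constants, and the reading of "32k" as 32,768 tokens are assumptions made for this example; they are not part of NeMo RL or NeMo AutoModel.

```python
# Hypothetical pre-flight check. All names and thresholds here are
# illustrative assumptions, not NeMo RL APIs.
SUPPORTED_AUTO_CLASSES = {"AutoModelForCausalLM", "AutoModelForImageTextToText"}
MAX_PARAMS = 70e9        # "under 70B" is treated as a strict upper bound
MAX_SEQ_LEN = 32 * 1024  # assumes "32k" means 32,768 tokens


def within_automodel_envelope(auto_class: str, num_params: float, seq_len: int) -> bool:
    """Return True if a model falls inside the stated coverage envelope."""
    return (
        auto_class in SUPPORTED_AUTO_CLASSES
        and num_params < MAX_PARAMS
        and seq_len <= MAX_SEQ_LEN
    )
```

For example, an 8B causal LM at 16k tokens passes this check, while a 70B model or a 64k sequence length does not. Falling outside the envelope in this sketch only means the configuration is beyond the coverage stated on this page.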

Optimal acceleration for top models via NeMo Megatron Bridge

NeMo Megatron-Bridge provides acceleration recipes for the models listed below. Users can also use the online checkpoint conversion (i.e., the “bridge”) by passing a 🤗Hugging Face checkpoint directly.

LLMs:

Qwen: Qwen3.5-9B/35B-A3B/397B-A17B, Qwen3-1.7B/8B/32B, Qwen3-30B-A3B, Qwen3-235B-A22B, Qwen2.5-1.5B/7B/32B
GLM: GLM-4.7-Flash, GLM-5.1
Llama: Llama 3.1/3.3-8B, Llama 3.1/3.3-70B, Llama 3.2-1B
DeepSeek: DeepSeek-V3/R1-671B
Mistral: Mistral-NeMo-12B
Moonlight: Moonlight-16B-A3B
Gemma: Gemma-3-1B/27B
GPT-OSS: GPT-OSS-20B/120B
Nemotron: NVIDIA-Nemotron-3-Super-120B-A12B (available on the super-v3 branch — see the branch-specific guide), Nemotron-3-Nano-30B-A3B, Llama-3.3-Nemotron-Super-49B, Nemotron-Nano-12B-v2

VLMs:

Qwen: Qwen3.5-35B-A3B/397B-A17B, Qwen2.5VL-3B

For benchmarks and fully reproducible YAML recipe configs, see the performance page.
