About Fine-Tuning | NVIDIA NeMo Platform

Learn how to fine-tune models by making requests to NVIDIA NeMo Customizer through the API. You can then deploy the resulting fine-tuned models with NVIDIA NIM.

Fine-Tuning Workflow

At a high level, the fine-tuning workflow consists of the following steps:

1. Create a Model Entity pointing to your base model checkpoint (stored as a FileSet).
2. Format a compatible dataset.
3. Create a customization job referencing the Model Entity.
4. Monitor the job until it completes.
5. On completion, the customization job automatically creates one of the following, depending on the job type:

   - LoRA jobs: an adapter attached to the original Model Entity
   - Full fine-tuning jobs: a new Model Entity with the customized weights

6. Deploy the model using the Deployment Management Service.
7. Evaluate the output model.
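
The monitoring step above can be sketched as a simple polling loop. This is a minimal illustration, not the platform's client: `get_job_status` is a hypothetical helper that you would implement against your deployment's API, and the status strings (`completed`, `failed`, `cancelled`) are assumptions rather than values documented on this page.

```shell
# wait_for_job JOB [MAX_ATTEMPTS] [INTERVAL_SECONDS]
# Polls a job until it reaches a terminal status or the attempts run out.
# Assumes a get_job_status function (hypothetical) that prints the current
# status of the given job on stdout.
wait_for_job() {
  job="$1"
  max_attempts="${2:-60}"
  interval="${3:-30}"
  attempt=0
  while [ "$attempt" -lt "$max_attempts" ]; do
    status="$(get_job_status "$job")"
    case "$status" in
      completed)
        echo "completed"
        return 0
        ;;
      failed|cancelled)
        # Terminal failure states (assumed names): stop polling.
        echo "$status"
        return 1
        ;;
    esac
    attempt=$((attempt + 1))
    sleep "$interval"
  done
  echo "timeout"
  return 2
}
```

The non-zero return codes let a calling script distinguish a failed job (1) from one that is still running when polling gives up (2).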

Container Images

Fine-tuning jobs run in container images published to NVIDIA NGC. Use the image tag that matches your NeMo Platform release:

Image	Purpose
`nvcr.io/nvidia/nemo-platform/nmp-customizer-tasks:<image-tag>`	Shared CPU task steps (file I/O, model entity, model spec) for all customization backends
`nvcr.io/nvidia/nemo-platform/nmp-automodel-training:<image-tag>`	Automodel GPU training step
`nvcr.io/nvidia/nemo-platform/nmp-unsloth-training:<image-tag>`	Unsloth GPU training step

These public images can be pulled directly from nvcr.io:

$ docker pull nvcr.io/nvidia/nemo-platform/nmp-customizer-tasks:<image-tag>
$ docker pull nvcr.io/nvidia/nemo-platform/nmp-automodel-training:<image-tag>
$ docker pull nvcr.io/nvidia/nemo-platform/nmp-unsloth-training:<image-tag>

Most users do not need to pull these images manually. NeMo Platform resolves them from the configured platform image registry and tag. Pull or mirror them when you are operating a self-managed deployment, preloading an air-gapped environment, or validating registry access. If your environment requires authenticated registry pulls, authenticate to nvcr.io with an NGC API key before pulling or mirroring the images.
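
For the mirroring case, a small script can generate the pull, tag, and push commands for all three images. The sketch below only prints the commands so that you can review them before running anything. The target registry `registry.example.com/nemo` and the `IMAGE_TAG` and `NGC_API_KEY` variables are placeholders assumed for illustration, not values defined by NeMo Platform.

```shell
SOURCE_REPO="nvcr.io/nvidia/nemo-platform"
IMAGES="nmp-customizer-tasks nmp-automodel-training nmp-unsloth-training"

# mirror_plan IMAGE_TAG TARGET_REGISTRY
# Prints the docker commands that copy each image to the target registry.
mirror_plan() {
  for name in $IMAGES; do
    echo "docker pull ${SOURCE_REPO}/${name}:$1"
    echo "docker tag ${SOURCE_REPO}/${name}:$1 $2/${name}:$1"
    echo "docker push $2/${name}:$1"
  done
}

# Example usage (placeholder values):
#   # Log in to nvcr.io; the username is the literal string $oauthtoken.
#   echo "$NGC_API_KEY" | docker login nvcr.io --username '$oauthtoken' --password-stdin
#   # Review the plan, then pipe it to sh to execute it.
#   mirror_plan "$IMAGE_TAG" "registry.example.com/nemo"
#   mirror_plan "$IMAGE_TAG" "registry.example.com/nemo" | sh
```

Printing the plan first keeps the script safe to run in environments where registry access has not yet been validated.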