About Fine-Tuning#

Learn how to fine-tune models by making requests to NVIDIA NeMo Customizer through the API. Fine-tuned models you have created can be deployed using NVIDIA NIMs.

Fine-Tuning Workflow#

At a high level, the fine-tuning workflow consists of the following steps:

  1. Create a Model Entity pointing to your base model checkpoint (stored as a FileSet).

  2. Format a compatible dataset.

  3. Create a customization job referencing the Model Entity.

  4. Monitor the job until it completes.

  5. The customization job automatically creates either:

    • LoRA jobs: An adapter attached to the original Model Entity

    • Full fine-tuning jobs: A new Model Entity with the customized weights

  6. Deploy the model using the Deployment Management Service.

  7. Move on to Evaluate the output model.


Model Catalog#

Explore the model families and sizes supported by NVIDIA NeMo Customizer.

Llama Models

View the available Llama models in the model catalog.

Llama Models
Llama Nemotron Models

View the available Llama Nemotron models from NVIDIA, including Nano and Super variants for efficient and advanced instruction tuning.

Llama Nemotron Models
Phi Models

View the available Phi models from Microsoft, designed for strong reasoning capabilities with efficient deployment.

Phi Models
GPT-OSS Models

View the available GPT-OSS models supported for Full SFT customization.

GPT-OSS Models
Embedding Models

View the available embedding models for question-answering and retrieval tasks.

Embedding Models

For hardware compatibility, models requiring 80GB GPU memory work with A100(80GB), H100 and B200 GPUs. Refer to the model catalog pages for specific GPU requirements.

Task Guides#

Perform common fine-tuning tasks.

Manage Customization Jobs

Create, list, view, and cancel customization jobs.

Manage Customization Jobs
Manage Model Entities

Create FileSets and Model Entities to prepare base models for customization.

Manage Model Entities for Customization
Manage Datasets

Upload and manage datasets for training.

Manage Files

Tutorials#

Follow these tutorials to learn how to accomplish common fine-tuning tasks.

Format Training Datasets

Learn how to format datasets for different model types.

Format Training Dataset
Start a LoRA Customization Job

Learn how to start a LoRA customization job using a custom dataset.

LoRA Model Customization Job
Start a Full SFT Customization Job

Learn how to start a SFT customization job using a custom dataset.

Full SFT Customization
Check Customization Job Metrics

Learn how to check job metrics using MLFlow or Weights & Biases.

Checking Your Customization Job Metrics
Optimize Tokens per GPU

Learn how to optimize the token-per-GPU throughput for a LoRA optimization job.

Optimize for Tokens/GPU Throughput

References#

Hyperparameters

View the available hyperparameters and their valid options that you can set when creating a customization job.

Training Configuration
Customizer API

View the OpenAPI specification for Customizer.

NeMo Platform API Reference
Troubleshoot Failed Jobs

View troubleshooting tips for failed jobs.

Troubleshooting NeMo Customizer