Fine-Tuning Tutorials#

Use the tutorials in this section to gain a deeper understanding of how the NVIDIA NeMo Customizer microservice enables fine-tuning tasks.

Tip

Tutorials are organized by complexity and typically build on one another. The tutorials often reference a CUSTOMIZER_BASE_URL whose value will depend on the ingress in your particular cluster. If you are using the minikube demo installation, it will be http://nemo.test. The demo installation’s value for DEPLOYMENT_BASE_URL is http://nemo.test and the value for NIM_PROXY_BASE_URL is http://nim.test. Otherwise, you will need to consult with your own cluster administrator for the ingress values.


Getting Started#

Understanding Configurations and Models

Learn the fundamentals of NeMo Customizer configurations, model types, and how to choose the right approach for your project.

Understanding NeMo Customizer Configurations and Targets

Dataset Preparation#

Format Training Datasets

Learn how to format datasets for different model types.

Format Training Dataset

Customization Jobs#

Fine-Tune a Model with Custom Data Using LoRA

Learn how to perform supervised fine-tuning with LoRA adapters using custom data.

Start a LoRA Model Customization Job
Fine-Tune a Model with Custom Data Processing All Weights

Learn how to perform supervised fine-tuning using custom data by modifying the all training parameters.

Start a Full SFT Customization Job
Align a Model with DPO and Preference Data

Learn how to align a model with DPO (Direct Preference Optimization) to prefer certain kinds of responses over others.

Start a DPO Customization Job
Increase a Smaller Model’s Task Performance to Closer Match a Larger Model

Learn how to perform a Knowledge Distillation (KD) job using a teacher and student model.

Start a Knowledge Distillation (KD) Customization Job
Fine-Tune an Embedding Model With Positive and Negative Samples Using LoRA

Learn how to fine-tune embedding models using LoRA merged training for improved question-answering and retrieval tasks.

Start an Embedding Model Customization Job
Import and Fine-Tune Private Hugging Face Models

Learn how to import a private Hugging Face model and fine-tune it.

Import and Fine-Tune Private HuggingFace Models

Monitoring & Optimization#

Check Customization Job Metrics

Learn how to check job metrics using MLflow or Weights & Biases.

Checking Your Customization Job Metrics
Optimize Tokens per GPU

Learn how to optimize the token-per-GPU throughput for a LoRA optimization job.

Optimize for Tokens/GPU Throughput