NeMo Customizer Microservice Helm Chart#
For deployment guide, see Admin Setup in the NeMo Microservices documentation.
Values#
Key |
Type |
Default |
Description |
---|---|---|---|
apiImage |
object |
This object has the following default values for the NeMo Customizer API only image. |
Customizer API only image configuration. |
apiImage.imagePullPolicy |
string |
|
Image pull policy for the NeMo Customizer API image. |
apiImage.registry |
string |
|
Registry for the NeMo Customizer API image. |
apiImage.repository |
string |
|
Repository for the NeMo Customizer API image. |
customizerConfig |
object |
This object has default values for the following fields. |
Configuration for the NeMo Customizer microservice. |
customizerConfig.entityStoreURL |
string |
|
URL for the NeMo Entity Store microservice. |
customizerConfig.mlflowURL |
string |
|
URL for the MLflow tracking server. |
customizerConfig.models |
object |
This object has default values for the supported models. |
Models configuration. |
customizerConfig.models.”meta/llama-3.1-70b-instruct” |
object |
This object has the following default values for the Llama 3.1 70B Instruct model. |
Llama 3.1 70B Instruct model configuration. |
customizerConfig.models.”meta/llama-3.1-70b-instruct”.enabled |
bool |
|
Whether to enable the model. |
customizerConfig.models.”meta/llama-3.1-70b-instruct”.max_seq_length |
int |
|
Maximum sequence length for input tokens. |
customizerConfig.models.”meta/llama-3.1-70b-instruct”.micro_batch_size |
int |
|
Micro batch size for training. |
customizerConfig.models.”meta/llama-3.1-70b-instruct”.model_path |
string |
|
Path where model files are stored. |
customizerConfig.models.”meta/llama-3.1-70b-instruct”.model_uri |
string |
|
NGC model URI for Llama 3.1 70B Instruct model. |
customizerConfig.models.”meta/llama-3.1-70b-instruct”.num_parameters |
int |
|
Number of model parameters. |
customizerConfig.models.”meta/llama-3.1-70b-instruct”.precision |
string |
|
Model precision format. |
customizerConfig.models.”meta/llama-3.1-70b-instruct”.prompt_template |
string |
|
Template for formatting prompts. |
customizerConfig.models.”meta/llama-3.1-70b-instruct”.training_options |
list |
|
Training options for different fine-tuning methods. |
customizerConfig.models.”meta/llama-3.1-8b-instruct” |
object |
This object has the following default values for the Llama 3.1 8B Instruct model. |
Llama 3.1 8B Instruct model configuration. |
customizerConfig.models.”meta/llama-3.1-8b-instruct”.enabled |
bool |
|
Whether to enable the model. |
customizerConfig.models.”meta/llama-3.1-8b-instruct”.max_seq_length |
int |
|
Maximum sequence length for input tokens. |
customizerConfig.models.”meta/llama-3.1-8b-instruct”.micro_batch_size |
int |
|
Micro batch size for training. |
customizerConfig.models.”meta/llama-3.1-8b-instruct”.model_path |
string |
|
Path where model files are stored. |
customizerConfig.models.”meta/llama-3.1-8b-instruct”.model_uri |
string |
|
NGC model URI for Llama 3.1 8B Instruct model. |
customizerConfig.models.”meta/llama-3.1-8b-instruct”.num_parameters |
int |
|
Number of model parameters. |
customizerConfig.models.”meta/llama-3.1-8b-instruct”.precision |
string |
|
Model precision format. |
customizerConfig.models.”meta/llama-3.1-8b-instruct”.prompt_template |
string |
|
Template for formatting prompts. |
customizerConfig.models.”meta/llama-3.1-8b-instruct”.training_options |
list |
|
Training options for different fine-tuning methods. |
customizerConfig.models.”meta/llama-3.2-1b” |
object |
This object has the following default values for the Llama 3.2 1B model. |
Llama 3.2 1B model configuration. |
customizerConfig.models.”meta/llama-3.2-1b”.enabled |
bool |
|
Whether to enable the model. |
customizerConfig.models.”meta/llama-3.2-1b”.max_seq_length |
int |
|
Maximum sequence length for input tokens. |
customizerConfig.models.”meta/llama-3.2-1b”.micro_batch_size |
int |
|
Micro batch size for training. |
customizerConfig.models.”meta/llama-3.2-1b”.model_path |
string |
|
Path where model files are stored. |
customizerConfig.models.”meta/llama-3.2-1b”.model_uri |
string |
|
NGC model URI for Llama 3.2 1B model. |
customizerConfig.models.”meta/llama-3.2-1b”.num_parameters |
int |
|
Number of model parameters. |
customizerConfig.models.”meta/llama-3.2-1b”.precision |
string |
|
Model precision format. |
customizerConfig.models.”meta/llama-3.2-1b”.prompt_template |
string |
|
Template for formatting prompts. |
customizerConfig.models.”meta/llama-3.2-1b”.training_options |
list |
|
Training options for different fine-tuning methods. |
customizerConfig.models.”meta/llama-3.2-1b-instruct” |
object |
This object has the following default values for the Llama 3.2 1B Instruct model. |
Llama 3.2 1B Instruct model configuration. |
customizerConfig.models.”meta/llama-3.2-1b-instruct”.enabled |
bool |
|
Whether to enable the model. |
customizerConfig.models.”meta/llama-3.2-1b-instruct”.max_seq_length |
int |
|
Maximum sequence length for input tokens. |
customizerConfig.models.”meta/llama-3.2-1b-instruct”.micro_batch_size |
int |
|
Micro batch size for training. |
customizerConfig.models.”meta/llama-3.2-1b-instruct”.model_path |
string |
|
Path where model files are stored. |
customizerConfig.models.”meta/llama-3.2-1b-instruct”.model_uri |
string |
|
NGC model URI for Llama 3.2 1B Instruct model. |
customizerConfig.models.”meta/llama-3.2-1b-instruct”.num_parameters |
int |
|
Number of model parameters. |
customizerConfig.models.”meta/llama-3.2-1b-instruct”.precision |
string |
|
Model precision format. |
customizerConfig.models.”meta/llama-3.2-1b-instruct”.prompt_template |
string |
|
Template for formatting prompts. |
customizerConfig.models.”meta/llama-3.2-1b-instruct”.training_options |
list |
|
Training options for different fine-tuning methods. |
customizerConfig.models.”meta/llama-3.2-3b-instruct” |
object |
This object has the following default values for the Llama 3.2 3B Instruct model. |
Llama 3.2 3B Instruct model configuration. |
customizerConfig.models.”meta/llama-3.2-3b-instruct”.enabled |
bool |
|
Whether to enable the model. |
customizerConfig.models.”meta/llama-3.2-3b-instruct”.max_seq_length |
int |
|
Maximum sequence length for input tokens. |
customizerConfig.models.”meta/llama-3.2-3b-instruct”.micro_batch_size |
int |
|
Micro batch size for training. |
customizerConfig.models.”meta/llama-3.2-3b-instruct”.model_path |
string |
|
Path where model files are stored. |
customizerConfig.models.”meta/llama-3.2-3b-instruct”.model_uri |
string |
|
NGC model URI. |
customizerConfig.models.”meta/llama-3.2-3b-instruct”.num_parameters |
int |
|
Number of model parameters. |
customizerConfig.models.”meta/llama-3.2-3b-instruct”.precision |
string |
|
Model precision format. |
customizerConfig.models.”meta/llama-3.2-3b-instruct”.prompt_template |
string |
|
Template for formatting prompts. |
customizerConfig.models.”meta/llama-3.2-3b-instruct”.training_options |
list |
|
Training options for different fine-tuning methods. |
customizerConfig.models.”meta/llama-3.3-70b-instruct” |
object |
This object has the following default values for the Llama 3.3 70B Instruct model. |
Llama 3.3 70B Instruct model configuration. |
customizerConfig.models.”meta/llama-3.3-70b-instruct”.enabled |
bool |
|
Whether to enable the model. |
customizerConfig.models.”meta/llama-3.3-70b-instruct”.max_seq_length |
int |
|
Maximum sequence length for input tokens. |
customizerConfig.models.”meta/llama-3.3-70b-instruct”.micro_batch_size |
int |
|
Micro batch size for training. |
customizerConfig.models.”meta/llama-3.3-70b-instruct”.model_path |
string |
|
Path where model files are stored. |
customizerConfig.models.”meta/llama-3.3-70b-instruct”.model_uri |
string |
|
NGC model URI for Llama 3.3 70B Instruct model. |
customizerConfig.models.”meta/llama-3.3-70b-instruct”.num_parameters |
int |
|
Number of model parameters. |
customizerConfig.models.”meta/llama-3.3-70b-instruct”.precision |
string |
|
Model precision format. |
customizerConfig.models.”meta/llama-3.3-70b-instruct”.prompt_template |
string |
|
Template for formatting prompts. |
customizerConfig.models.”meta/llama-3.3-70b-instruct”.training_options |
list |
|
Training options for different fine-tuning methods. |
customizerConfig.models.meta/llama3-70b-instruct |
object |
This object has the following default values for the Llama 3 70B Instruct model. |
Llama 3 70B Instruct model configuration. |
customizerConfig.models.meta/llama3-70b-instruct.enabled |
bool |
|
Whether to enable the model. |
customizerConfig.models.meta/llama3-70b-instruct.max_seq_length |
int |
|
Maximum sequence length for input tokens. |
customizerConfig.models.meta/llama3-70b-instruct.micro_batch_size |
int |
|
Micro batch size for training. |
customizerConfig.models.meta/llama3-70b-instruct.model_path |
string |
|
Path where model files are stored. |
customizerConfig.models.meta/llama3-70b-instruct.model_uri |
string |
|
NGC model URI for Llama 3 70B Instruct model. |
customizerConfig.models.meta/llama3-70b-instruct.num_parameters |
int |
|
Number of model parameters. |
customizerConfig.models.meta/llama3-70b-instruct.precision |
string |
|
Model precision format. |
customizerConfig.models.meta/llama3-70b-instruct.prompt_template |
string |
|
Template for formatting prompts. |
customizerConfig.models.meta/llama3-70b-instruct.training_options |
list |
|
Training options for different fine-tuning methods. |
customizerConfig.models.microsoft/phi-4 |
object |
This object has the following default values for the Phi-4. |
Phi-4 model configuration. |
customizerConfig.models.microsoft/phi-4.enabled |
bool |
|
Whether to enable the model. |
customizerConfig.models.microsoft/phi-4.max_seq_length |
int |
|
Maximum sequence length for input tokens. |
customizerConfig.models.microsoft/phi-4.micro_batch_size |
int |
|
Micro batch size for training. |
customizerConfig.models.microsoft/phi-4.model_path |
string |
|
Path where model files are stored. |
customizerConfig.models.microsoft/phi-4.model_uri |
string |
|
NGC model URI for Phi-4 model. |
customizerConfig.models.microsoft/phi-4.num_parameters |
int |
|
Number of model parameters. |
customizerConfig.models.microsoft/phi-4.precision |
string |
|
Model precision format. |
customizerConfig.models.microsoft/phi-4.prompt_template |
string |
|
Template for formatting prompts. |
customizerConfig.models.microsoft/phi-4.training_options |
list |
|
Training options for different fine-tuning methods. |
customizerConfig.nemoDataStoreURL |
string |
|
URL for the NeMo Data Store microservice. |
customizerConfig.openTelemetry |
object |
This object has the following default values for the OpenTelemetry settings. |
OpenTelemetry settings. |
customizerConfig.openTelemetry.enabled |
bool |
|
Whether to enable OpenTelemetry. |
customizerConfig.openTelemetry.exporterOtlpEndpoint |
string |
|
Endpoint to access a custom OTLP collector listening on port 4317. Example: “http://$(HOST_IP):4317”. |
customizerConfig.openTelemetry.logsExporter |
string |
|
Sets the logs exporter type (otlp, console, none). |
customizerConfig.openTelemetry.metricsExporter |
string |
|
Sets the metrics exporter type (otlp, console, none). |
customizerConfig.openTelemetry.tracesExporter |
string |
|
Sets the traces exporter type (otlp, console, none). |
customizerConfig.tolerations |
list |
|
Tolerations on the customization job pods. |
customizerConfig.training |
object |
This object has the following default values for the training configuration. |
Training configuration for customization jobs. |
customizerConfig.training.container_defaults |
object |
|
Default container configuration for training jobs. |
customizerConfig.training.container_defaults.env |
string |
|
Environment variables for the training container. Cannot override env variables reserved by NeMo Customizer. |
customizerConfig.training.pvc.size |
string |
|
Size of the training job PVC. |
customizerConfig.training.pvc.storageClass |
string |
|
Storage class for the training job PVC. |
customizerConfig.training.pvc.volumeAccessMode |
string |
|
Volume access mode for the training job PVC. |
customizerConfig.training.queue |
string |
|
Queue name used by the underlying scheduler of NemoTrainingJob. Maps to “resourceGroup” in NemoTrainingJob. |
customizerConfig.training.workspace_dir |
string |
|
Directory path for training workspace. |
customizerConfig.trainingNetworking |
list |
|
Network configuration for training jobs on Oracle Kubernetes Engine (OKE) on Oracle Cloud Infrastructure (OCI). |
customizerConfig.wandb |
object |
|
Weights and Biases (WandB) Python SDK intialization configuration for logging and monitoring training jobs in WandB. |
customizerConfig.wandb.entity |
string |
|
Reference: https://docs.wandb.ai/ref/python/init/ |
env |
object |
|
A map of environment variables to inject into the NeMo Customizer app container. Example: |
externalDatabase |
object |
This object has the following default values for the external PostgreSQL configuration. |
External PostgreSQL configuration. |
externalDatabase.database |
string |
|
Name of the database to use. |
externalDatabase.existingSecret |
string |
|
Name of an existing secret resource containing the database credentials. |
externalDatabase.existingSecretPasswordKey |
string |
|
Name of an existing secret key containing the database credentials. |
externalDatabase.host |
string |
|
External database host address. |
externalDatabase.password |
string |
|
Password for the non-root username for the NeMo Customizer microservice. |
externalDatabase.port |
int |
|
External database port number. |
externalDatabase.user |
string |
|
Non-root username for the NeMo Customizer microservice. |
fullnameOverride |
string |
|
String to fully override the chart and release name on resulting objects when deployed. |
image |
object |
This object has the following default values for the NeMo Customizer microservice image. |
NeMo Customizer image that supports training and standalone mode. |
image.imagePullPolicy |
string |
|
Image pull policy for the NeMo Customizer image. |
image.registry |
string |
|
Registry for the NeMo Customizer image. |
image.repository |
string |
|
Repository for the NeMo Customizer image. |
imagePullSecrets |
list |
|
Image pull secrets configuration. |
ingress |
object |
This object has the following default values for the Ingress configuration. |
Ingress configuration. |
ingress.annotations |
object |
|
Additional annotations for the Ingress resource. |
ingress.className |
string |
|
Ingress class name. |
ingress.enabled |
bool |
|
Whether to enable the ingress resource. |
ingress.hostname |
string |
|
Hostname for the ingress resource. |
ingress.hosts |
list |
[] |
A list of maps, each containing the keys |
ingress.tls |
list |
|
TLS configuration for the ingress resource. |
logging |
object |
|
Logging configuration. |
logging.logHealthEndpoints |
bool |
|
Enable logging for health endpoints. |
logging.logLevel |
string |
|
Log level for the application. |
modelDownloader |
object |
This object has the following default values for the model downloader. |
Download models to PVC model cache configuration. |
modelDownloader.pollIntervalSeconds |
int |
|
Interval in seconds to poll for model download status. |
modelDownloader.securityContext |
object |
|
Security context for the model downloader. |
modelDownloader.ttlSecondsAfterFinished |
int |
|
Time to live in seconds after the job finishes. |
modelsStorage |
object |
|
Configure the PVC for models mount, where we store the parent/base models. |
modelsStorage.accessModes |
list |
|
Access modes for the persistent volume. |
modelsStorage.enabled |
bool |
|
Enable persistent volume for model storage. |
modelsStorage.size |
string |
|
Size of the persistent volume. |
modelsStorage.storageClassName |
string |
|
Storage class name for the models PVC. Empty string uses the default storage class. |
nameOverride |
string |
|
String to override chart name on resulting objects when deployed. |
nemoDataStoreTools |
object |
This object has the following default values for the NeMo Data Store tools image. |
Tools configuration for downloading and uploading entities to NeMo Data Store. |
nemoDataStoreTools.imagePullSecret |
string |
|
Image pull secret for the NeMo Data Store tools image. |
nemoDataStoreTools.registry |
string |
|
Registry for the NeMo Data Store tools image. |
nemoDataStoreTools.repository |
string |
|
Repository for the NeMo Data Store tools image. |
nemoDataStoreTools.tag |
string |
|
Tag for the NeMo Data Store tools image. |
ngcAPISecret |
string |
|
Secret used for auto hydrating the model cache from NGC for enabled models. |
ngcAPISecretKey |
string |
|
Key in the NGC API secret containing the API key. |
opentelemetry-collector |
object |
This object has the following default values for the Open Telemetry Collector configuration. |
Open Telemetry Collector configuration. |
opentelemetry-collector.config |
object |
|
Base collector configuration for Open Telemetry Collector. |
opentelemetry-collector.enabled |
bool |
|
Switch to enable or disable Open Telemetry Collector. |
opentelemetry-collector.image.repository |
string |
|
Repository for Open Telemetry Collector image. |
opentelemetry-collector.image.tag |
string |
|
Overrides the image tag whose default is the chart appVersion. |
opentelemetry-collector.mode |
string |
|
Deployment mode for Open Telemetry Collector. Valid values are “daemonset”, “deployment”, and “statefulset”. |
postgresql |
object |
This object has the following default values for the PostgreSQL configuration. |
PostgreSQL configuration for the NeMo Customizer microservice. |
postgresql.architecture |
string |
|
PostgreSQL architecture ( |
postgresql.auth.database |
string |
|
Name for a custom database to create. |
postgresql.auth.enablePostgresUser |
bool |
|
Whether to assign a password to the “postgres” admin user. Otherwise, remote access will be blocked for this user. |
postgresql.auth.existingSecret |
string |
|
Name of existing secret to use for PostgreSQL credentials. |
postgresql.auth.password |
string |
|
Password for the custom user to create. |
postgresql.auth.username |
string |
|
Name for a custom user to create. |
postgresql.enabled |
bool |
|
Whether to enable or disable the PostgreSQL helm chart. |
replicaCount |
int |
|
Number of replicas to deploy. |
service |
object |
|
Service configuration. |
service.internalPort |
int |
|
Internal port for the service. |
service.port |
int |
|
External port for the service. |
service.type |
string |
|
Type of Kubernetes service to create. |
serviceAccount |
object |
|
Service account configuration. |
serviceAccount.annotations |
object |
|
Annotations to add to the service account. |
serviceAccount.automountServiceAccountToken |
bool |
|
Automatically mount a ServiceAccount’s API credentials. |
serviceAccount.create |
bool |
|
Specifies whether a service account should be created. |
serviceAccount.name |
string |
|
The name of the service account to use. If not set and create is true, a name is generated. |
useRunAIExecutor |
bool |
|
Enable or disable RunAI executor. |
wandb |
object |
This object has the following default values for the WandB configuration. |
WandB configuration. |
wandb.existingSecret |
string |
|
Name of an existing Kubernetes secret resource for the WandB encryption secret. |
wandb.existingSecretKey |
string |
|
Name of the key in the existing WandB secret containing the secret value. The secret value must be exactly 32 alphanumeric characters: ^[a-zA-Z0-9]{32}$ |
wandb.secretValue |
string |
|
WandB secret value. Must contain exactly 32 alphanumeric characters. Creates a new Kubernetes secret named “wandb-secret” with key-value pair “encryption_key=<wandb.secretValue>”. Ignored if wandb.existingSecret is set. |