nemo_microservices.types.customization_target#
Module Contents#
Classes#
API#
- class nemo_microservices.types.customization_target.CustomizationTarget(/, **data: typing.Any)#
Bases:
nemo_microservices._models.BaseModel- base_model: Optional[str]#
None
Default to being the same as the the configuration entry name, maps to the name in NIM
- created_at: Optional[datetime.datetime]#
None
Timestamp for when the entity was created.
- custom_fields: Optional[Dict[str, object]]#
None
A set of custom fields that the user can define and use for various purposes.
- description: Optional[str]#
None
The description of the entity.
- enabled: Optional[bool]#
None
Enable the model for training jobs
- hf_endpoint: Optional[str]#
None
Configure HuggingFace Hub base URL.
Defaults to NeMo Data Store. Set value as “https://huggingface.co” to download model_uri from HuggingFace.
- id: Optional[str]#
None
The ID of the entity.
With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [
-base58(uuid4())].
- model_path: str#
None
Path to the model checkpoints to use for training.
Absolute path or local path from the models cache
- model_type: Optional[nemo_microservices.types.target_checkpoint_type.TargetCheckpointType]#
None
The detected checkpoint type of the uploaded target.
nemo model checkpoints have these key files: context/nemo_tokenizer/config.json, context/model.yaml, and weights/metadata.json
hf model checkpoints have these key files: config.json, tokenizer.json or tokenizer_config.json, and either model*.safetensors (preferred) or pytorch_model*.bin
hf-lora model checkpoints only contain the LoRA adapter for a HF model, they have these key files: adapter_config.json adapter_model.safetensors
- model_uri: Optional[str]#
None
The URI of the model to download to the model cache at the model_path directory.
To download from NGC, specify ngc://org/optional-team/model-name:version. To download from Nemo Data Store, specify hf://namespace/model-name@checkpoint-name
- name: Optional[str]#
None
The name of the entity.
Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.
- namespace: Optional[str]#
None
The namespace of the entity.
This can be missing for namespace entities or in deployments that don’t use namespaces.
- num_parameters: int#
None
Number of parameters used for training the model
- ownership: Optional[nemo_microservices.types.shared.ownership.Ownership]#
None
Information about ownership of an entity.
If the entity is a namespace, the
access_policieswill typically apply to all entities inside the namespace.
- precision: nemo_microservices.types.shared.model_precision.ModelPrecision#
None
Type of model precision.
Values
"int8"- 8-bit integer precision"bf16"- Brain floating point precision"fp16"- 16-bit floating point precision"fp32"- 32-bit floating point precision"fp8-mixed"- Mixed 8-bit floating point precision available on Hopper and later architectures."bf16-mixed"- Mixed Brain floating point precision
- project: Optional[str]#
None
The URN of the project associated with this entity.
- status: Optional[nemo_microservices.types.target_status.TargetStatus]#
None
Normalized statuses for targets.
CREATED: The target is created, but not yet scheduled.
PENDING: The target is waiting for resource allocation.
DOWNLOADING: The target is downloading.
FAILED: The target failed to execute and terminated.
READY: The target is ready to be used.
CANCELLED: The target download was cancelled.
UNKNOWN: The target status is unknown.
DELETED: The target is deleted.
DELETING: The target is currently being deleted.
DELETE_FAILED: Failed to delete the target.
- tokenizer: Optional[Dict[str, object]]#
None
Overrides for the model tokenizer
- updated_at: Optional[datetime.datetime]#
None
Timestamp for when the entity was last updated.