nemo_curator.utils.gpu_utils
Module Contents
Functions
API
Get number of available CUDA GPUs as a power of 2.
Many models require a power-of-2 GPU count for tensor parallelism. This function returns the largest power of 2 that is less than or equal to the number of available GPUs (for example, 4 when 6 GPUs are available).
Returns: int
The available GPU count rounded down to a power of 2; always at least 1.
Raises:
RuntimeError: If no CUDA GPUs are detected.
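The rounding rule described above can be sketched in a few lines. This is a minimal illustration, not the module's actual implementation: the helper name `largest_power_of_two` is hypothetical, and it takes the GPU count as an argument rather than querying CUDA itself (in practice the count might come from something like `torch.cuda.device_count()`, which is an assumption here).

```python
def largest_power_of_two(gpu_count: int) -> int:
    """Return the largest power of 2 that is <= gpu_count.

    Hypothetical helper for illustration only. Mirrors the documented
    behavior: raises RuntimeError when no GPUs are available.
    """
    if gpu_count < 1:
        raise RuntimeError("No CUDA GPUs detected.")
    # bit_length() gives the position of the highest set bit, so shifting 1
    # left by (bit_length - 1) keeps only that bit, i.e. rounds down to a
    # power of 2.
    return 1 << (gpu_count.bit_length() - 1)


if __name__ == "__main__":
    for n in (1, 2, 3, 6, 8):
        print(n, "->", largest_power_of_two(n))
```

With this rule, a machine with 6 or 7 GPUs would use 4 of them, and a machine with exactly 8 would use all 8.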
Try to get the maximum model length from the Hugging Face AutoConfig for the given model.
Parameters:
model
Model identifier (e.g., "microsoft/phi-4")
cache_dir
Optional cache directory for model config.
Returns: int | None
Max model length if found, None otherwise.
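A lookup of this kind could be sketched as follows. This is an illustration under stated assumptions, not the module's actual code: the function names and the list of candidate attribute names are hypothetical (different model architectures store their context length under different config keys), and only `AutoConfig.from_pretrained` with its `cache_dir` argument is taken from the Transformers library.

```python
from types import SimpleNamespace
from typing import Any, Optional

# Assumed attribute names; the real function may check a different set.
CANDIDATE_KEYS = (
    "max_position_embeddings",
    "n_positions",
    "max_sequence_length",
    "seq_length",
)


def max_len_from_config(config: Any) -> Optional[int]:
    """Return the first positive integer found under a candidate key, else None."""
    for key in CANDIDATE_KEYS:
        value = getattr(config, key, None)
        if isinstance(value, int) and not isinstance(value, bool) and value > 0:
            return value
    return None


def try_get_max_model_len(model: str, cache_dir: Optional[str] = None) -> Optional[int]:
    """Hypothetical wrapper: load the config and return None on any failure."""
    try:
        # Imported lazily so the helper above works without transformers installed.
        from transformers import AutoConfig

        config = AutoConfig.from_pretrained(model, cache_dir=cache_dir)
    except Exception:
        return None
    return max_len_from_config(config)


if __name__ == "__main__":
    # Stand-in config object, so this demo needs no download.
    fake_config = SimpleNamespace(max_position_embeddings=4096)
    print(max_len_from_config(fake_config))
```

Returning None instead of raising matches the documented "try to get" contract: the caller can fall back to a default length when the config is missing or does not expose a recognizable field.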