Container Environments#
This reference documents the default environments available in NeMo Curator containers and their configurations.
Main Container Environment#
The primary NeMo Curator container includes a single conda environment with all necessary dependencies.
Curator Environment#
Property | Value |
---|---|
Environment Name | |
Python Version | 3.12 |
CUDA Version | 12.5.1 (configurable) |
Operating System | Ubuntu 22.04 (configurable) |
Base Image | |
Core Dependencies | |
Installation | NeMo Curator installed with all optional dependencies ( |
Environment Path | Activated by default and added to system PATH: |
Slurm Environment Variables#
When you deploy NeMo Curator on Slurm clusters, the following environment variables configure the runtime environment:
Default Configuration#
Variable | Default Value | Description |
---|---|---|
| | Device type: |
| | Network interface for Dask communication |
| | Network protocol: |
| | Memory limit per worker ( |
| | Delay CUDA context creation for UCX compatibility |
| | Enable automatic GPU memory spilling |
| | GPU memory pool size for scheduler |
| | GPU memory pool size per worker (80–90% of GPU memory) |
| | Direct storage-to-GPU I/O policy |
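Before submitting a job, it can help to see which of these variables are already set in the calling shell. The following is a minimal sketch, not part of NeMo Curator: the function name `show_curator_env` is illustrative, and the list covers only the variable names that appear in the GPU recommendations in this reference. It does not reproduce the defaults that the Slurm scripts apply when a variable is unset.

```shell
#!/bin/sh
# Print the Slurm-related variables named in this reference so their
# effective values can be checked before a job is submitted.
# Variables that are unset or empty are reported as "<unset>".
show_curator_env() {
    for name in DEVICE PROTOCOL INTERFACE RAPIDS_NO_INITIALIZE \
                CUDF_SPILL RMM_WORKER_POOL_SIZE LIBCUDF_CUFILE_POLICY; do
        # Indirect lookup of the variable called $name (POSIX-compatible).
        eval "value=\"\${$name:-<unset>}\""
        printf '%s=%s\n' "$name" "$value"
    done
}

show_curator_env
```

Each line of output has the form `NAME=value`, so the result can be pasted into a job log for later comparison.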
GPU Configuration Recommendations#
For GPU workloads, consider these optimized settings:
export DEVICE="gpu"
export PROTOCOL="ucx" # If your cluster supports it
export INTERFACE="ib0" # If you're using InfiniBand
export RAPIDS_NO_INITIALIZE="0"
export CUDF_SPILL="0"
export RMM_WORKER_POOL_SIZE="80GiB" # Adjust based on your GPU memory
export LIBCUDF_CUFILE_POLICY="ON" # If GPUDirect Storage is available
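The worker pool size is described above as 80–90% of GPU memory, so the value can be derived rather than hard-coded. The sketch below assumes the total GPU memory is known in whole GiB; the helper name `rmm_pool_size` is hypothetical and the result is rounded down.

```shell
#!/bin/sh
# Compute an RMM worker pool size as a percentage of total GPU memory.
# Arguments: total GPU memory in GiB, percentage to reserve (e.g. 90).
rmm_pool_size() {
    total_gib=$1
    percent=$2
    # Integer arithmetic; the result is rounded down to a whole GiB.
    echo "$(( ${total_gib} * ${percent} / 100 ))GiB"
}

# Example: a 40 GiB GPU with 90% reserved for the pool.
# The total could instead come from a query such as
#   nvidia-smi --query-gpu=memory.total --format=csv,noheader,nounits
# which reports MiB and would need converting to GiB first.
RMM_WORKER_POOL_SIZE="$(rmm_pool_size 40 90)"
export RMM_WORKER_POOL_SIZE
echo "$RMM_WORKER_POOL_SIZE"
```

Leaving headroom below 100% keeps some GPU memory available for allocations made outside the pool.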
Automatic Environment Variables#
The Slurm configuration automatically generates these additional environment variables:
Variable | Value | Description |
---|---|---|
| | Directory for Dask logs |
| | Directory for performance profiles |
| | Dask scheduler connection file |
| | Scheduler log file |
| | Job completion marker |
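A client script typically cannot connect until the scheduler connection file exists. The following sketch polls for a file with a timeout; it is an illustration of the idea rather than the logic used by the Slurm scripts, and `wait_for_file` and `SCHEDULER_FILE_PATH` are hypothetical names, not the variables generated by the configuration.

```shell
#!/bin/sh
# Poll until a file exists or a timeout (in seconds) expires.
# Returns 0 if the file appeared, 1 on timeout.
wait_for_file() {
    path=$1
    timeout=$2
    waited=0
    while [ ! -e "$path" ]; do
        if [ "$waited" -ge "$timeout" ]; then
            return 1
        fi
        sleep 1
        waited=$(( ${waited} + 1 ))
    done
    return 0
}

# Hypothetical usage; SCHEDULER_FILE_PATH is a placeholder name:
# if wait_for_file "$SCHEDULER_FILE_PATH" 60; then
#     echo "scheduler is ready"
# else
#     echo "timed out waiting for the scheduler" >&2
# fi
```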
Container Build Arguments#
The main container accepts these build-time arguments for environment customization:
Argument | Default | Description |
---|---|---|
| | CUDA version |
| | Base OS version |
| | Python version |
| - | Container label |
| - | Source repository URL |
| - | Git commit to build from |
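Build-time arguments are passed to Docker with `--build-arg NAME=value`. The sketch below assembles and prints such a command instead of running it. The argument names `CUDA_VERSION_ARG`, `OS_VERSION_ARG`, and `PYTHON_VERSION_ARG` are hypothetical placeholders, not the names used by the NeMo Curator Dockerfile, and the build context `.` is likewise an assumption.

```shell
#!/bin/sh
# Assemble (but do not run) a docker build command that overrides the
# CUDA, OS, and Python versions. The argument names below are
# HYPOTHETICAL placeholders; substitute the names defined in the Dockerfile.
build_command() {
    cuda=$1
    os=$2
    python=$3
    printf 'docker build --build-arg %s=%s --build-arg %s=%s --build-arg %s=%s .\n' \
        "CUDA_VERSION_ARG" "$cuda" \
        "OS_VERSION_ARG" "$os" \
        "PYTHON_VERSION_ARG" "$python"
}

# Print the command for the versions listed in this reference.
build_command 12.5.1 22.04 3.12
```

Printing the command first makes it easy to review the overrides before starting a long image build.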
Environment Usage Examples#
Text Curation#
Uses the default curator environment with CPU or GPU workers depending on the module.
Image Curation#
Requires GPU-enabled workers in the curator environment.