Installation
Using Clara requires the following:
Driver requirements
Clara 4.0 is based on the NVIDIA container for PyTorch, release 21.02: https://docs.nvidia.com/deeplearning/frameworks/pytorch-release-notes/rel_21-02.html#rel_21-02.
Details about the contents of the base container, along with the GPU and driver requirements, can be found at the link above.
If you are using a DGX system, follow the instructions for preparing containers here: https://docs.nvidia.com/deeplearning/frameworks/preparing-containers/index.html.
Download the Docker container image using these commands:
export dockerImage=nvcr.io/nvidia-clara-train/clara-train-sdk:v4.0
docker pull $dockerImage
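To confirm the image was downloaded successfully, you can list the local images (a quick optional check, not part of the official steps):
docker images | grep clara-train-sdk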
Once downloaded, run the container using this command:
docker run -it --rm --shm-size=1G --ulimit memlock=-1 --ulimit stack=67108864 --ipc=host --net=host --mount type=bind,source=/your/dataset/location,target=/workspace/data $dockerImage /bin/bash
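Once the container starts, a simple way to verify that the dataset bind mount worked (assuming the mount target shown above) is to list it from inside the container:
ls /workspace/data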
By default, the container starts in the /opt/nvidia folder. To access local directories from within the container, they must be mounted into it.
To mount a directory, use the -v <source_dir>:<mount_dir> option. Here is an example:
docker run --shm-size=1G --ulimit memlock=-1 --ulimit stack=67108864 -it --rm -v /home/<username>/clara-experiments:/workspace/clara-experiments $dockerImage /bin/bash
This mounts the /home/<username>/clara-experiments directory on your disk to /workspace/clara-experiments inside the container.
More information about mounting directories can be found in the Docker documentation.
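If you want to protect mounted data from accidental modification, Docker's standard :ro suffix makes a mount read-only. The command below is an optional variant of the example above, not part of the official instructions:
docker run -it --rm -v /home/<username>/clara-experiments:/workspace/clara-experiments:ro $dockerImage /bin/bash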
If you are on a network that uses a proxy server to connect to the Internet, you can provide proxy server details when launching the container.
docker run -it --rm -e HTTPS_PROXY=https_proxy_server_ip:https_proxy_server_port -e HTTP_PROXY=http_proxy_server_ip:http_proxy_server_port --shm-size=1G --ulimit memlock=-1 --ulimit stack=67108864 $dockerImage /bin/bash
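Inside the container, you can confirm that the proxy settings were passed through by printing the environment variables:
echo $HTTP_PROXY $HTTPS_PROXY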
For GPU isolation in the container, use the --gpus option, available in recent Docker releases.
docker run -it --rm --gpus=1 --shm-size=1G --ulimit memlock=-1 --ulimit stack=67108864 $dockerImage /bin/bash
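If you need specific GPUs rather than a GPU count, the --gpus option also accepts a device list (the device indices 0 and 1 below are only examples):
docker run -it --rm --gpus '"device=0,1"' --shm-size=1G --ulimit memlock=-1 --ulimit stack=67108864 $dockerImage /bin/bash
Inside the container, nvidia-smi should then list only the selected GPUs.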