Introduction to Using NGC with Alibaba Cloud

These instructions are for using NGC with the International Alibaba Cloud. To view instructions for the Chinese Alibaba Cloud, see http://docs.nvidia.com/ngc.cn/ngc-alibaba-setup-guide-cn/.

NVIDIA makes available on the Alibaba Cloud platform a customized image optimized for the NVIDIA Pascal™ and Volta™ -based Tesla GPUs. Running NGC containers on this virtual machine (VM) instance provides optimum performance for deep learning jobs.

For those familiar with the Alibaba platform, the process of launching the instance is as simple as logging into Alibaba, selecting the NVIDIA GPU Cloud Virtual Machine Image and one of the supported NVIDIA GPU instance types, configuring settings as needed, then launching the VM. After launching the VM, you can SSH into it and start running deep learning jobs using framework containers from the NGC container registry.   

This document provides step-by-step instructions for accomplishing this.

Prerequisites

These instructions assume the following:

  • You have an Alibaba account - https://home-intl.console.aliyun.com/ with permissions to create resources.

  • Browsed the NGC website and identified an available NGC container and tag to run on the VMI.
  • If you plan to use the CLI or Terraform, then the Alibaba CLI must be be installed, with at least the ECS SDK, and you must create SSH keys to use with Alibaba; see setup instructions below.
  • Windows Users: The CLI code snippets are for bash on Linux or Mac OS X. If you are using Windows and want to use the snippets as-is, you can use the Windows Subsystem for Linux and use the bash shell (you will be in Ubuntu Linux).

Additionally, if you plan to access locked NGC containers, you will need to perform the following steps from the NGC website (see NGC Getting Started Guide)
  • Signed up for an NGC account at https://ngc.nvidia.com/signup.
  • Created an NGC API key for access to locked containers within the NGC container registry.