The Project Specification#
Overview of the Project Specification#
A Workbench project is just a git repository with some metadata files that provided information to drive automation and the overall user experience.
The central file for the metadata is the specification file, i.e. /project/.project/spec.yaml
.
You do not need to know anything about the specification file to use Workbench, but you must be very careful if you decide to edit it.
Workbench reads and writes to this file
It is a yaml file, and is thus very particular about formatting
It has a schema that you must follow
It has required fields and optional fields
You can edit the file but you cannot delete it
If you edit the file and things break, they aren’t broken permanently. Just fix the file
Project Specification Concerns#
The specification has four main concerns broken into sections.
You can see and edit the sections in the /project/.project/spec.yaml
file.
meta
: Basic metadata about the project like name and descriptionlayout
: Directory structure and how versioning backends are appliedenvironment
: Information about the container image and environment, including configured applicationsexecution
: Runtime information for the project container
There are some things that aren’t concerns of the specification.
Git history or the git remote being used
The individual files in the project, except potentially the
compose.yaml
file
Example AI Workbench Project Spec File#
The following is an example of an AI Workbench project spec.yaml file.
1specVersion: v2
2specMinorVersion: 1
3meta:
4 name: example-project
5 image: project-example-project
6 description: An example project using PyTorch
7 labels: []
8 createdOn: "2024-01-04T23:32:17Z"
9 defaultBranch: main
10layout:
11- path: code/
12 type: code
13 storage: git
14- path: models/
15 type: models
16 storage: gitlfs
17- path: data/
18 type: data
19 storage: gitlfs
20- path: data/scratch/
21 type: data
22 storage: gitignore
23environment:
24 base:
25 registry: nvcr.io
26 image: nvidia/ai-workbench/pytorch:1.0.2
27 build_timestamp: "20231212000523"
28 name: PyTorch
29 supported_architectures: []
30 cuda_version: "12.2"
31 description: A Pytorch 2.1 environment with CUDA 12.2
32 entrypoint_script: ""
33 labels:
34 - cuda12.2
35 - pytorch2.1
36 apps:
37 - name: jupyterlab
38 type: jupyterlab
39 class: webapp
40 start_command: jupyter lab --allow-root --port 8888 --ip 0.0.0.0 --no-browser
41 --NotebookApp.base_url=\$PROXY_PREFIX --NotebookApp.default_url=/lab --NotebookApp.allow_origin='*'
42 health_check_command: '[ \$(echo url=\$(jupyter lab list | head -n 2 | tail
43 -n 1 | cut -f1 -d'' '' | grep -v ''Currently'' | sed "s@/?@/lab?@g") | curl
44 -o /dev/null -s -w ''%{http_code}'' --config -) == ''200'' ]'
45 timeout_seconds: 90
46 stop_command: jupyter lab stop 8888
47 user_msg: ""
48 icon_url: ""
49 webapp_options:
50 autolaunch: true
51 port: "8888"
52 proxy:
53 trim_prefix: false
54 url_command: jupyter lab list | head -n 2 | tail -n 1 | cut -f1 -d' ' | grep
55 -v 'Currently'
56 - name: tensorboard
57 type: tensorboard
58 class: webapp
59 start_command: tensorboard --logdir \$TENSORBOARD_LOGS_DIRECTORY --path_prefix=\$PROXY_PREFIX
60 --bind_all
61 health_check_command: '[ \$(curl -o /dev/null -s -w ''%{http_code}'' http://localhost:\$TENSORBOARD_PORT\$PROXY_PREFIX/)
62 == ''200'' ]'
63 timeout_seconds: 90
64 stop_command: ""
65 user_msg: ""
66 icon_url: ""
67 webapp_options:
68 autolaunch: true
69 port: "6006"
70 proxy:
71 trim_prefix: false
72 url: http://localhost:6006
73 programming_languages:
74 - python3
75 icon_url: ""
76 image_version: 1.0.3
77 os: linux
78 os_distro: ubuntu
79 os_distro_release: "22.04"
80 schema_version: v2
81 user_info:
82 uid: "1001"
83 gid: "1001"
84 username: "appuser"
85 package_managers:
86 - name: apt
87 binary_path: /usr/bin/apt
88 installed_packages:
89 - curl
90 - git
91 - git-lfs
92 - vim
93 - name: pip
94 binary_path: /usr/local/bin/pip
95 installed_packages:
96 - jupyterlab==4.0.7
97 package_manager_environment:
98 name: ""
99 target: ""
100execution:
101 apps: []
102 resources:
103 gpu:
104 requested: 1
105 sharedMemoryMB: 1024
106 secrets: []
107 mounts:
108 - type: project
109 target: /project/
110 description: project directory
111 options: rw
112 - type: volume
113 target: /data/tensorboard/logs/
114 description: Tensorboard Log Files
115 options: volumeName=tensorboard-logs-volume
AI Workbench Project Spec Definition#
Project Metadata#
The spec file includes project metadata, such as version, name, descrption, and directory structure. The following are the project metadata fields.
Field |
Description |
Example Usage |
---|---|---|
|
The schema version number of the current project spec. |
1specVersion: v2
|
|
The schema minor version number of the current project spec. |
1specMinorVersion: 1
|
|
Metadata for the project to appear correctly in the AI Workbench application. |
— |
|
The name of the project. |
1name: hello-world
|
|
The name of the project container image. This image name is local to the computer and isn’t pushed to a container registry. |
1image: project-hello-world
|
|
The description of the project. |
1description: An example project using PyTorch
|
|
A list of labels for the project. |
— |
|
When the project was created, formatted as an RFC 3339 string. |
1createdOn: "2024-01-04T23:32:17Z"
|
|
The default Git branch for the project. |
1defaultBranch: main
|
|
A list of information about the project directories. AI Workbench uses this information to reconcile configuration, such as by adding a path to the .gitignore or .gitattributes files.
|
1layout:
2 - path: data/scratch/
3 type: data
4 storage: gitignore
|
Project Environment Information#
The spec file includes environment information, such as the container image for the project, and additional packages that are installed in it. You can manually configure this section of the spec, but typically it is populated automatically from the labels of the environment image that you select when you create a project.
Warning
Any manual modifications to the data in the environment.base
section of the spec.yaml file are overridden when the environment version is updated.
In the environment
section of the spec file, all fields are children of the base
field.
1environment:
2 base:
3 ... all fields
The following are the environment.base
fields.
Field |
Description |
Example Usage |
---|---|---|
|
The container registry that has the image. |
1registry: nvcr.io
|
|
The container image on top of which the project container is built. |
1image: nvidia/pytorch:23.12-py3
|
|
The timestamp of the last image build. For the timestamp specify year, month, day, hour, minutes, seconds. |
1build_timestamp: "20231212000523"
|
|
The name of the container. |
1name: PyTorch
|
|
A list of supported architectures that the image is compatible with. |
1supported_architectures:
2
3 - "amd64"
4 - "arm64"
|
|
The version of CUDA installed in the environment, if applicable. This field tells AI Workbench what version of CUDA the host driver must support. If this value is not set correctly, you may experience runtime errors that AI Workbench fails to warn about when starting the container. |
1cuda_version: "12.2"
|
|
A description of the container. |
1description: A Pytorch 2.1 environment with CUDA 12.2
|
|
The path to a script that runs when the project container starts. |
1entrypoint_script: /path/to/script.sh
|
|
A list of labels for the container, such as search term keywords or descriptors. |
1labels:
2 - cuda12.2
3 - pytorch2.1
4 - python3
5 - jupyterlab
|
|
A list of applications installed in the container. For the definition of the fields in |
1apps:
2- name: jupyterlab
3 ... more fields
4- name: tensorboard
5 ... more fields
|
|
A list of programming languages installed in the container. |
1programming_languages:
2 - python3
|
|
A link to the icon or image for the container. |
1icon_url: https://my-website.com/my-image.png
|
|
The version number of the container image, if any. |
1image_version: 1.0.3
|
|
The name of the container operating system. |
1os: linux
|
|
The name of the container operating system distribution. |
1os_distro: ubuntu
|
|
The release version of the container operating system distribution. |
1os_distro_release: "22.04"
|
|
Metadata for the version of the container label schema currently read by AI Workbench. |
1schema_version: v2
|
|
Information about the user that the container processes should run as. |
1user_info:
2 uid: "1001"
3 gid: "1001"
4 username: "appuser"
|
|
A list of package managers installed in the container, and for each one an optional list of installed packages. |
1package_managers:
2- name: conda
3 binary_path: /opt/conda/bin/conda
4 installed_packages:
5 - python=3.9.18
6 - pip
7- name: apt
8 binary_path: /usr/bin/apt
9 installed_packages:
10 - ca-certificates
11 - curl
12- name: pip
13 binary_path: /opt/conda/bin/pip
14 installed_packages: []
|
|
A package manager environment that should be activated before installing packages and starting applications. Useful if your container uses a virtual environment or conda when building. |
1package_manager_environment:
2 name: ""
3 target: ""
|
Project App Information#
The spec file includes information about applications installed in the environment and custom applications, such as commands and parameters for running them. The following are the application information fields.
Field |
Description |
Example Usage |
---|---|---|
|
The name of the application. This name appears in the user interface. |
1name: jupyterlab
|
|
The type of application, used to determine what application-specific automation is run. |
1type: jupyterlab
|
|
The class of application, used to determine what optional configuration options are available. Valid values are |
1class: webapp
|
|
The shell command used to start the application. Must not be a blocking command. |
1start_command: jupyter lab...
|
|
The shell command used to check the health or status of the application. A return of zero means the application is running and healthy. A return of non-zero means that the application is not running or unhealthy. |
1health_check_command: '<code>'
|
|
The number of seconds that AI Workbench waits for the |
1timeout_seconds: 90
|
|
The shell command used to stop the application. |
1stop_command: jupyter lab stop 8888
|
|
An optional message that appears to the user when the application is running. If |
1user_msg: ""
|
|
An optional link to the icon or image used for the application. |
1icon_url: ""
|
|
If
If
|
1webapp_options:
2 autolaunch: true
3 port: "8888"
4 proxy:
5 trim_prefix: false
6 url_command: <your command>
1webapp_options:
2 autolaunch: true
3 port: "6006"
4 proxy:
5 trim_prefix: false
6 url: http://localhost:6006
1process_options:
2 wait_until_finished: true
|
Project Runtime Information#
The spec file includes runtime information, such as environment variables, the number of GPUs to be mounted in the container, and custom application information. The following are the runtime information fields.
Field |
Description |
Example Usage |
---|---|---|
|
Information about how to run the project. |
— |
|
A list of custom applications installed in the project that are not part of the container environment. For the definition of the fields in |
1apps:
2- name: jupyterlab
3 ... more fields
4- name: mychat
5 ... more fields
|
|
Host resources that are requested or required to run the project.
For more information, see GPU Configuration. |
1resources:
2 gpu:
3 requested: 0
4 sharedMemoryMB: 0
|
|
A list of sensitive environment variables to set before the project starts. Specify the name and description of each variable only. The value of each variable is not part of the spec file and is configured at runtime. For more information, see Environment Variables. |
1secrets:
2- variable: secret1
3 description: Secret 1
|
|
A list of external folders and files that are used by the project, and where they should be located in the project container. Required values to configure the mount (like the source directory) are not part of the spec file, and are configured at runtime.
For more information, see AI Workbench Mounts. |
1mounts:
2 - type: project
3 target: /project/
4 description: project directory
5 options: rw
|
Customize Your Container#
If you want to change the behavior of the container environment for a single project, you can manually edit the metadata contained in the spec.yaml file. First you modify your spec.yaml file as described following, and then you rebuild your project environment.
Note
If you want to use one of the pre-built containers and make simple customizations, such as adding packages, see Walkthrough: Customize Your Environment and Development Environments instead.
If you want to create a fully-custom container that you can use for you own projects, or that you can publish and share with other AI Workbench users, see Use Your Own Container instead. This is an advanced scenario.
Use the following list to determine what changes to make to your spec file.
Change Required — Changes that must occur to customize the project container. Without these changes, your project does not build correctly.
Change Recommended — Best-practice changes that you should make to the project container. These are fields that are used by the NVIDIA AI Workbench client software. Your project might still build, and components might still function; however, your experience working on the project inside the AI Workbench UI might be negatively impacted.
Change Optional — Change these fields only if they are relevant to your project.
To customize your container,
navigate to the .project/spec.yaml
file of your project,
and scroll to the environment
section.
Use the information in the following table to edit your spec file.
The full description and example usage for each field is in AI Workbench Project Spec Definition.
Field |
Change? |
Recommended Action |
---|---|---|
|
Required |
Specify the container registry for your container of interest; if you have the Dockerfile, you may need to first build the container and push it to a registry. |
|
Required |
Specify the container image (and tag, if any) for your container of interest; if you have the Dockerfile, you may need to first build the container and push it to a registry. Also include the namespace if needed, but do not include the registry. |
|
Optional |
No manual updates are necessary. AI Workbench updates this field when you build your environment. |
|
Required |
Specify a name for the container. AI Workbench displays the old container information in the UI if left unchanged. |
|
Recommended |
Specify a list of supported architectures for your container, or leave as a blank list if not applicable. |
|
Required |
Specify the version of CUDA installed in this container, or leave blank if not applicable. AI Workbench matches drivers incorrectly if not updated. |
|
Required |
Specify a brief and informative description of the container. AI Workbench displays the old container information in the UI if left unchanged. |
|
Optional |
If your project requires any custom action when the container starts, we provide a location for an entrypoint script you may want to use. Specify the location to the script here. |
|
Recommended |
Specify a list of search term keywords or labels to be attached alongside your container. Consider these as search term keywords or descriptors for your container. |
|
Recommended |
Specify the applications installed in the container. For details, see Project App Information. |
|
Recommended |
Specify the programming languages installed in this container, or leave blank if not applicable. |
|
Optional |
If you would like AI Workbench to display an icon as part of this container, link the URL to the icon image here or leave it blank for defaults. |
|
Recommended |
Specify the version number of the container image, if any. |
|
Recommended |
Specify the name of the container operating system. |
|
Recommended |
Specify the name of the container operating system distribution. |
|
Recommended |
Specify the release version of the container operating system distribution. |
|
Optional |
There is no need to update this field. However, if incorrect the project breaks. The current version is |
|
Recommended |
AI Workbench automatically provisions a user for you when you run the container, but this field overrides that user. |
|
Recommended |
For each package manager in the container, specify the name of the package manager, the complete path to the manager binary, and a comma-separated list of packages installed by that manager. If this remains unchanged, the package manager widget will likely not work, especially if you are working with a venv or conda environment. |
|
Recommended |
Specify the package manager environment that should be activated before installing packages and starting applications. Useful if your container uses a virtual environment or conda when building. |