InfiniBand Cluster Bring-up Procedure

Overview

This document is intended for network operators responsible for bring-up InfiniBand clusters. The purpose of this document is to outline the necessary automation tools, required tests, and essential information needed when installing a new cluster. Additionally, the document provides recommendations and guidance on how to obtain the necessary inputs for these procedures and how to execute the bring-up operations effectively. The document's content is structured logically to facilitate easy reference and understanding.

To complete the bring-up process, follow the checklist here.

Related Documentation

Document Revision History

For the list of changes made to this document, refer to Document Revision History.

© Copyright 2025, NVIDIA. Last updated on Jul 10, 2025.