Prerequisites
This section describes the required tools for executing the InfiniBand cluster maintenance and operational procedures.
UFM - July 2023 SW Version: This entails UFM Enterprise and at least one instance of UFM Telemetry. UFM incorporates an embedded UFM Telemetry instance featuring 120 fundamental debug counters for each port. These counters are collected periodically and are, by default, accessible through an HTTP endpoint. UFM offers multiple mechanisms for pushing (streaming) UFM Telemetry and event streams. Additional information can be found in Retrieving UFM Issues for comprehensive insights.
UFM Installation: Refer to the installation instructions according to the desired UFM software.
UFM
Link to Installation Instructions
UFM Enterprise
UFM Enterprise Appliance
UFM Telemetry
For those opting to use their own server