ClusterMinder Plugin
This plugin is supported on UFM Enterprise Appliance only.
The ClusterMinder plugin collects telemetry data from multiple data sources and aggreats, streams and visualizes the backed. The plugin can cluster/group aggregated Redfish data from multiple machines that allows operational anomaly and misconfigurations detection. The plugin provides Cluster-wide histograms of hardware telemetry which details compute node configuration and inventory, PCIe bus, hardware information (SN and FW version) and health alerts of all relevant devices on each Redfish category.
The plugin can be deployed as a container and supports multiple data sources, including:
Redfish on Host
Redfish on DPU
MLNX Switch Data
DOCA Telemetry Service on DPU (BlueField)
DOCA Telemetry Service on Host
Unmanaged InfiniBand Switches
The plugin can be deployed using the following methods:
On the UFM Appliance
On the UFM Software
To deploy the plugin, follow these steps:
The plugin is included in the default plugin bundle available at NVIDIA's Licensing Portal .
Load the downloaded image onto the UFM server. This can be done either by using the UFM GUI by navigating to the Settings -> Plugins Management tab or by loading the image via the following instructions:
Log in to the UFM server terminal.
Run:
docker load < <path_to_image>
After successfully loading the plugin image, the plugin should become visible within the plugins management table within the UFM GUI. To initiate the plugin’s execution, simply right-click on the respective in the table.
After the successful deployment of the plugin, a new item is shown in the UFM side menu for the ClusterMinder plugin:
Example of Adding Data Source
Example of Adding the Redfish Host
After inputting the "BMC IP", "Protocol","Username" and "Password". Pressing the button tests the connection and allows to hosts if successful.
Example of Removing Data Source
Removing hosts is done through the "Data Sources" section, Right click any available host and click the remove option.