Port Counters
ibdiagnet collects and processes standard InfiniBand port counters and vendor-specific port counters. The following counters are collected by ibdiagnet:
PortCounters (collected by default)
PortCountersExtended (collected by default)
PortRcvErrorDetails (collected by default)
PortXmitDiscardDetails (collected by default)
LLRCounters (collected by default from supporting devices, ConnectX3/SwitchX only)
PerSL/VL counters (for supporting devices when corresponding option is specified)
PortExtendedSpeedCounters (for supporting devices when corresponding option is specified)
Mellanox Diagnostic Counters (for supporting devices when corresponding option is specified)
The following options are applicable when port counters are collected and processed by ibdiagnet:
Parameter | Description | Example |
--per_slvl_cntrs | Provides a report of all per sl/vl port counters (if supported by devices) | - |
--sc | Provides a report of NVIDIA Diagnostic counters in ibdiagnet2.mlnx_cntrs file and ibdiagnet2.db_csv. | - |
--scr | Resets all the NVIDIA Diagnostic counters (should be used with the --sc option). | ibdiagnet --scr --sc |
--extended_speeds <dev-type> | Collects and tests port extended speeds counters. These counters are reported in the ibdiagnet2.db_csv file (PM_INFO section). | - |
--pm_per_lane | Lists all counters per lane (if supported by devices). Should be used in combination with --extended_speeds. | ibdiagnet --extended_speeds all --pm_per_lane |
--pm_get_all | Gets all PM counters. Activates the following flags: | ibdiagnet --pm_get_all |
-P|--counter <<PM>=<value>> | Prints every provided counter whose value is greater than its provided threshold. If 'all' is used, all counters get the same threshold (0 by default). | ibdiagnet -P vl15_dropped=1, port_xmit_discard=1 or ibdiagnet -P vl15_dropped=1 -P port_xmit_discard=1 or ibdiagnet -P all. Supported PM counter names are:
|
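The threshold rule described for -P can be illustrated with a minimal Python sketch. It is not ibdiagnet code: the helper names parse_thresholds and exceeded are hypothetical, the counter values are made up, and the rule that a named threshold takes precedence over 'all' is an assumption.

```python
def parse_thresholds(spec: str) -> dict[str, int]:
    """Parse a comma-separated 'name=value' list, e.g. 'vl15_dropped=1, port_xmit_discard=1'."""
    thresholds = {}
    for item in spec.split(","):
        item = item.strip()
        if not item:
            continue
        name, sep, value = item.partition("=")
        # A bare name such as 'all' falls back to the documented default threshold of 0.
        thresholds[name.strip()] = int(value) if sep else 0
    return thresholds


def exceeded(counters: dict[str, int], thresholds: dict[str, int]) -> dict[str, int]:
    """Return the counters whose value is strictly greater than their threshold."""
    result = {}
    for name, value in counters.items():
        if name in thresholds:
            limit = thresholds[name]
        elif "all" in thresholds:
            # Assumption: 'all' applies to every counter without its own threshold.
            limit = thresholds["all"]
        else:
            continue  # no threshold was requested for this counter
        if value > limit:
            result[name] = value
    return result


# Made-up sample values, for illustration only.
sample = {"vl15_dropped": 3, "port_xmit_discard": 1, "port_rcv_errors": 0}
print(exceeded(sample, parse_thresholds("vl15_dropped=1, port_xmit_discard=1")))
print(exceeded(sample, parse_thresholds("all")))
```

Note that the comparison is strict: a counter equal to its threshold is not reported, which matches the "greater than" wording in the option description.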
Parameter | Description |
--pc | Resets all fabric IB spec compliant port counters (PortCounters and PortCountersExtended), RN, AR and HBF counters. Note: It is recommended to use this option together with --reset_phy_info, because the two options share some counters, and using only one of them can produce confusing results in the next iteration of counter or register collection. |
--pm_clear_all | Clears all PM counters. Activates the following flags: |
Parameter | Description | Example |
--pm_pause_time <seconds> | Specifies a delay (in seconds) between counter samples. If set to 0, only a single sample is taken. (default: 1 second) | ibdiagnet --pm_pause_time 60 |
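When ibdiagnet is launched from a script, the dependencies between the options above (--scr with --sc, --pm_per_lane with --extended_speeds) can be checked before the command runs. The following is a minimal Python sketch under that assumption. The function build_ibdiagnet_args and its parameter names are hypothetical. It only assembles an argument list from the options listed in this section and does not execute anything.

```python
def build_ibdiagnet_args(*, sc=False, scr=False, extended_speeds=None,
                         pm_per_lane=False, pm_pause_time=None,
                         thresholds=None):
    """Assemble an ibdiagnet argument list from options documented in this section."""
    if scr and not sc:
        raise ValueError("--scr should be used together with --sc")
    if pm_per_lane and extended_speeds is None:
        raise ValueError("--pm_per_lane should be used together with --extended_speeds")

    args = ["ibdiagnet"]
    if scr:
        args.append("--scr")
    if sc:
        args.append("--sc")
    if extended_speeds is not None:
        args += ["--extended_speeds", str(extended_speeds)]
    if pm_per_lane:
        args.append("--pm_per_lane")
    if pm_pause_time is not None:
        args += ["--pm_pause_time", str(pm_pause_time)]
    # One -P per counter, as in 'ibdiagnet -P vl15_dropped=1 -P port_xmit_discard=1'.
    for name, value in (thresholds or {}).items():
        args += ["-P", f"{name}={value}"]
    return args


print(build_ibdiagnet_args(extended_speeds="all", pm_per_lane=True))
print(build_ibdiagnet_args(pm_pause_time=60, thresholds={"vl15_dropped": 1}))
```

The returned list can be passed to subprocess.run on a host where ibdiagnet is installed. Building a list rather than a single string avoids shell quoting issues.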