THIS FIELD NOTICE IS PROVIDED ON AN "AS IS" BASIS AND DOES NOT IMPLY ANY KIND OF GUARANTEE OR WARRANTY, INCLUDING THE WARRANTY OF MERCHANTABILITY. YOUR USE OF THE INFORMATION ON THE FIELD NOTICE OR MATERIALS LINKED FROM THE FIELD NOTICE IS AT YOUR OWN RISK. CISCO RESERVES THE RIGHT TO CHANGE OR UPDATE THIS FIELD NOTICE AT ANY TIME.
Revision | Publish Date | Comments |
---|---|---|
1.0 |
19-Oct-10 |
Initial Release |
10.0 |
11-Oct-17 |
Migration to new field notice system |
Affected OS Type | Affected Release | Affected Release Number | Comments |
---|---|---|---|
NON-IOS |
4 |
4.2.(1),4.2.(2) |
Defect ID | Headline |
---|---|
CSCtc17493 | Nexus 7000 May Encounter an Unexpected Supervisor Switchover or Reload Due to High Availability Policy for Diagnostic Service |
Two specific diagnostic tests on the Nexus 7000 can cause an unexpected process error due to a slow resource leak resulting in two scenarios:
1) A supervisor switchover due to the high availability (HA) policy for service diag_port_lb, when dual supervisors are present
2) A system reload due to HA policy for service diag_port_lb, when only a single supervisor is present
This unexpected process error is documented in CSCtc17493.
The NX-OS software supports a Generic Online Diagnostic (GOLD) feature that proactively monitors the health of the switch. For this particular issue, there are two particular diagnostic tests that are affected and would need to be disabled in order to avoid this situation in the 4.2(1) and 4.2(2) releases:
1) The GOLD PortLoopback test. This test is enabled by default as health monitoring. Any ports that fail the loopback test stay in the error-disabled state and are not available for configuration.
2) The StandbyFabricLoopback test. This is a health monitoring test that is enabled by default, designed to verify the integrity of the data path between the Standby supervisor and the Fabric. The diagnostic has the flexibility to define the action on failure: syslog (default action), onboard diagnostic failure logging, supervisor switchover. Multiple actions can be simultaneously triggered. Before Cisco NX-OS Release 4.2(1), such a health monitoring test was supported only for the active supervisor in the chassis.
An unexpected supervisor switchover or system reload may occur on the Nexus 7000 switch.
Solution:
Upgrade to NX-OS release 4.2(2a) or later. NX-OS release 4.2(2a) and later versions can be downloaded from the NX-OS system software page.
Workaround:
If upgrading the NX-OS software is not a viable option, there is a workaround for CSCtc17493.
Disable the periodic packet loopback tests. Please note that disabling these tests does not affect normal functioning of the device.
Disable these tests by entering the following commands:
1) Enter the following command on each of the modules that are present on the device (for line cards only, not applicable to supervisors):
(config)# no diagnostic monitor module < x > test 5,6
2) Enter the following command to disable the bootup diagnostics:
(config)# diagnostic bootup level bypass
3) Enter the following command to save the configuration to startup-config:
(config)# copy running-config startup
Descriptions of these tests (test 5 and test 6) are given below:
switch# show diagnostic description module 1 test 5
A health monitoring test that will test the packet path from the Supervisor card to the physical port in ADMIN DOWN state on Linecards.
switch# show diagnostic description module 1 test 6
A health monitoring test, enabled by default, that does nondisruptive loopback for all LC ports up to the Rewrite Engine.
If you require further assistance, or if you have any further questions regarding this field notice, please contact the Cisco Systems Technical Assistance Center (TAC) by one of the following methods:
Cisco Notification Service—Set up a profile to receive email updates about reliability, safety, network security, and end-of-sale issues for the Cisco products you specify.
Unleash the Power of TAC's Virtual Assistance