Storwize V7000 node canisters may shut down or reboot during normal operation, leading to a loss of host I/O access.
Stowize V7000 node canisters running V126.96.36.199 – V188.8.131.52 code levels may shut down without warning during normal I/O operations.
These shut down events will typically occur on both node canisters in the Storwize V7000 system, with the second node canister shutting down a number of hours after the first. Once the second node canister has shut down, this will cause a loss of host access to disks presented by the Storwize V7000, until at least one of the node canisters has been manually brought back online.
If this issue is encountered on V184.108.40.206 – V220.127.116.11, the recovery action is to reseat each offline node canister in order to bring it back online.
Partial Fix Introduced in V18.104.22.168
A partial fix was introduced in V22.214.171.124, which caused node canisters that experienced this condition to reboot and automatically resume I/O operations, rather than shut down and remain offline. Customers running V126.96.36.199 code are however still exposed to the risk of both node canisters rebooting at the same time, which could lead to a short, temporary outage to host I/O.
This issue has been fully resolved by APAR IC74088 in the V188.8.131.52 release. Please visit the following URL to download the latest V6.1.0.x code: