HP EVA Cache Battery Failure Issue

Posted on July 26, 2010 in VMware | 5 comments

The latest issue I have come across in our HP storage environment involves the storage controller cache battery modules. A module recently failed on one of our 8100 series EVAs. Each controller can hold up to four modules; in our environment, we use two per controller.

A healthy set of modules looks like this:

 

Now, for the EVA we have problems with, it looks like this:

This problem started after this particular module failed. We received a replacement from HP and swapped it in. However, after a few days, the new module was marked as failed again. We received a second replacement from HP and swapped it in. A few days later, same result. When I contacted HP a third time, I explained what had occurred. In response, I received this notification:

This is just another somewhat oddball error of the kind we deal with on a regular basis. Now, on to the fix: restart the controller that holds the replaced module. First, note in Command View which controller is affected. In my case, it is Controller A (just follow the bang indicators).

A restart of the controller should be done during your change / maintenance window (all those years of ITIL ingrained in me!). To do so, you have a few choices.

The first is via Command View:

On the controller’s page, click Shutdown, then choose Restart and select the controller (A or B).

The second is via the SSSU utility (installed as part of Command View):

Restart controller A, but not its peer controller:

RESTART "\Hardware\Rack 1\Enclosure 7\Controller A" NOALL_PEERS
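If you restart controllers on more than one array, it can help to generate the command rather than retype the quoted path each time. The following is only a minimal Python sketch, assuming the object path follows the backslash-separated form \Hardware\Rack N\Enclosure N\Controller X and that ALL_PEERS is the counterpart switch to NOALL_PEERS. The function names are hypothetical helpers of mine, not part of SSSU, and the script only builds the command text; it does not talk to the array.

```python
def controller_path(rack: int, enclosure: int, controller: str) -> str:
    """Build an SSSU-style object path for a controller.

    Assumes the layout \\Hardware\\Rack N\\Enclosure N\\Controller X;
    check the path your own array reports before relying on it.
    """
    parts = [
        "Hardware",
        f"Rack {rack}",
        f"Enclosure {enclosure}",
        f"Controller {controller}",
    ]
    return "\\" + "\\".join(parts)


def restart_command(path: str, all_peers: bool = False) -> str:
    """Return the RESTART command line for one controller.

    The path is wrapped in double quotes because it contains spaces.
    By default the peer controller is left alone (NOALL_PEERS).
    """
    switch = "ALL_PEERS" if all_peers else "NOALL_PEERS"
    return f'RESTART "{path}" {switch}'


# Controller A in rack 1, enclosure 7, leaving its peer running.
print(restart_command(controller_path(1, 7, "A")))
```

The printed line can be pasted into an SSSU session once the correct storage system is selected. Defaulting to NOALL_PEERS is deliberate, so that a careless call never restarts both controllers at once.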

Note that if the controller being restarted is the master controller, its vdisks will transfer to the other controller without any downtime. In my experience, EVAs are a touchy lot, so I prefer the SSSU utility for its halfway decent command-line interface. It is pretty powerful too. I’ll be writing up a blog post on good uses for SSSU in the future.

5 Comments

  1. love the hp battery,it can use for a long time

  2. Interesting. We have the same problem, but with an EVA8000. We have two, but only one has been having these problems. I think we’ve had 6 or 7 faulty batteries during the past 1 1/2 years. Maybe it’s time to try a reboot.

    • Are you running the 6.220 xcs on both of your 8000’s? I know for us, when we have a battery fail, if we replace it without restarting the controller, that new battery will be marked as Failed in a few days tops. Very irritating. Thankfully it’s easy to restart the controller, but we get minor alerts in our environment from doing so related to disk write errors.

    • There is a problem with XCS 6.22 with premature battery failure due to the load cycle of the charger. XCS 6.24 addresses this problem.
      There was also a problem with certain date coded battery bricks which should have been
      resolved by now. There was a customer advisory concerning this.

  3. Good writing, saved

