[cisco-voip] Odd Server issues after Outages

Wed Jun 3 21:43:44 EDT 2015

Not affected according to the tool, although I’d hazard the original RAID card was. The freshly rebuilt servers were installed straight to 10.5, so at least we are on ext4 now.

Thanks!

Matthew G. Loraditch – CCNP-Voice, CCNA-R&S, CCDA
Network Engineer
Direct Voice: 443.541.1518

Facebook<https://www.facebook.com/heliontech?ref=hl> | Twitter<https://twitter.com/HelionTech> | LinkedIn<https://www.linkedin.com/company/helion-technologies?trk=top_nav_home> | G+<https://plus.google.com/+Heliontechnologies/posts>

From: Dave Goodwin [mailto:dave.goodwin at december.net]
Sent: Wednesday, June 3, 2015 6:01 PM
To: Matthew Loraditch
Cc: cisco-voip at puck.nether.net
Subject: Re: [cisco-voip] Odd Server issues after Outages

Matthew, do you happen to know when the servers were purchased, and when the RAID card was replaced? I am wondering whether you are using a version of the 9271CV-8I card that is not new enough to have the fix for the various field notice issues. I have definitely run into this with the C240M3 boxes, which use the same RAID cards as the C220M3, and the only remedy is to replace the card with a fixed one.
http://www.cisco.com/c/en/us/support/docs/field-notices/637/fn63732.html
Aside from that, if you are using 10.5.2, did you install that version fresh, or upgrade to it? A fresh install sets up the filesystem as ext4 (earlier versions use ext3), which has reportedly proven more resilient at recovering lost journals, such as can happen during an unplanned power outage. If you have the hardware issue, ext4 will not fix it, but it may reduce the likelihood of filesystem corruption severe enough to need a rebuild.
-Dave

On Wed, Jun 3, 2015 at 8:50 AM, Matthew Loraditch <MLoraditch at heliontechnologies.com> wrote:
So we have a customer in just a bad grid area. We have UPSs on their BE6Ks, but these still don’t last long enough, nor do we always know about an outage in time to do a graceful shutdown. I’ve had plenty of other customers have outages and unclean shutdowns, and their systems work fine afterwards. This customer continues to have issues where the servers get corrupted to the point of needing rebuilds. Two outages ago we got TAC to replace the RAID card in the host (C220M3 generation). The host reports no issues, but honestly this just seems odd to me. There are no bugs known to me for 10.5.2 that would do this (it always happens on both CUCM and UCXN), so I suspect hardware but am unsure how to track it down.
Any thoughts?

Matthew G. Loraditch – CCNP-Voice, CCNA-R&S, CCDA
Network Engineer
Direct Voice: 443.541.1518

