[cisco-voip] high I/O Wait on one core

Haas, Neal nhaas at co.fresno.ca.us
Fri Apr 5 11:49:51 EDT 2013


Jumping in on the thread, what is your CDR retention set to? Do you redirect to a 3rd party CDR such as ISI? We have only a 30 day retention on our servers I believe. We never use the CDR from the server.

We had a lot of IO when our CDR was set to a few months, The IO was from the deletion of the old CDR at the end of the month. It has been a few years since we changed the settings.


Neal Haas

From: cisco-voip-bounces at puck.nether.net [mailto:cisco-voip-bounces at puck.nether.net] On Behalf Of Erick Wellnitz
Sent: Friday, April 05, 2013 8:21 AM
To: Tom Piscitell (tpiscite)
Cc: cisco-voip
Subject: Re: [cisco-voip] high I/O Wait on one core

caroninit seems to be the biggest offender (by about 50x) in both disk writes and cpu usage.  Am I correct in assuming this has something to do with call detail records?

On Fri, Apr 5, 2013 at 9:02 AM, Tom Piscitell (tpiscite) <tpiscite at cisco.com<mailto:tpiscite at cisco.com>> wrote:
Erick,

You can use the FIOR utility from the CLI to identify which processes are writing to the disk.

admin:utils fior
      utils fior disable
      utils fior enable
      utils fior list
      utils fior start
      utils fior status
      utils fior stop
      utils fior top

Here is a typical use case:

1. Enable the FIOR utility before/during a time of High IO Wait
        admin:utils fior enable
        File I/O Statistics has been enabled.
        admin:utils fior start
        Loading fiostats module: ok
        Enabling fiostats : ok
        File I/O Statistics has been started.

2. Wait a couple minutes. FIOR will poll for data every 5 seconds I believe. Then use utils fior top to see whats hitting the CPU the hardest:

admin:utils fior top ?
Syntax:
utils fior top n sort_by [start=date-time] [stop=date-time]

         n:            number of processes
         sort_by:      read, write, read-rate, write-rate
         date-time:    of the form %H:%M, %H:%M:%S
                                   %a,%H:%M, %a,%H:%M:%S
                                   %Y-%m-%d,%H:%M, %Y-%m-%d %H:%M:%S
Example:
admin:utils fior top 10 write start=2010-04-20 10:00:00 stop=2010-04-20 10:30:00

This of course won't tell you *why* a process is hitting the disk, but it will at least show you who has the most read/writes. To answer the why question you would need to look at traces for the offending process/service.

HTH,
-Tom

On Apr 4, 2013, at 5:43 PM, Erick Wellnitz <ewellnitzvoip at gmail.com<mailto:ewellnitzvoip at gmail.com>> wrote:

> Hello all!
>
> I have a dual 4 core IBM 7835I3 which is my publisher.   One one core of the first CPU the I/O Wait is through the roof.  RTMT shows that writes to the hard drives are at between 600 and 700 MB/s which is exponentially higher than the subscriber on the same model of hardware.
>
> Short of calling TAC is there any way to figure out what is causing the extremely high volume of writes to the drives?  I already stopped most traces and looking at the processes doesn't give any clues.
>
> Thanks again!
>
>
> _______________________________________________
> cisco-voip mailing list
> cisco-voip at puck.nether.net<mailto:cisco-voip at puck.nether.net>
> https://puck.nether.net/mailman/listinfo/cisco-voip

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://puck.nether.net/pipermail/cisco-voip/attachments/20130405/1c851889/attachment.html>


More information about the cisco-voip mailing list