[cisco-voip] DRF Local service stops and starts on it's own - 8.6.2a SU2

Erick B. erickbee at gmail.com
Fri May 3 17:22:19 EDT 2013


DRF MA is fine, it is DRF LA that stops on it's own on both pub and sub.


On Fri, May 3, 2013 at 3:07 PM, Abdul Salam . <salamka at gmail.com> wrote:

> was suspecting a defect , but seems to be fixed in 8.6(2.22900.9)
>
>
>
> *---AS*
>
>
>
> On Sat, May 4, 2013 at 1:31 AM, Abdul Salam . <salamka at gmail.com> wrote:
>
>> Is it a clustering over deployment?
>> Do u have issue with DRF MA service at pub ?
>>
>>
>>
>> *---AS*
>>
>>
>>
>> On Sat, May 4, 2013 at 12:22 AM, Erick B. <erickbee at gmail.com> wrote:
>>
>>> Nothing in the DRF log or traces saying why it stopped really, other
>>> than DRF Local agent might be down which it is.
>>>
>>> 2013-05-02 16:44:04,716 DEBUG [NetServerClient-CM1] -
>>> drfNetServerClient.Reconnect: Sending version id: 8.6.2.22900-9
>>> 2013-05-02 16:44:04,765 DEBUG [NetServerClient-CM1] -
>>> drfNetServerClient.run, i/o exception from host: [CM1], message: null
>>> 2013-05-02 16:44:04,765 INFO [NetMessageDispatch] -
>>> drfMessageReceiver::HandleMessage: Message ID300 has been validated
>>> successfully
>>> 2013-05-02 16:44:04,765 DEBUG [NetServerClient-CM1] -
>>> drfNetServerClient.sleepRandom: sleeping for: 15 seconds
>>> 2013-05-02 16:44:04,765 FATAL [NetMessageDispatch] -
>>> drfLocalAgent.drfLocalWorker: Unable to send 'Local Agent' client
>>> identifier message to Master Agent. This may be due to Master or Local
>>> Agent being down.
>>>
>>>
>>> Here are service monitor logs, it appears as a failure happened and it
>>> tries to restart the service but the restart fails. Then a manual restart
>>> is needed. This is what we are seeing. Most of the time we service go down
>>> for 1-2 minutes then it is back, but sometimes it stays stopped which is
>>> when it fails to restart and further restarts are denied from the logs.
>>>
>>> 16:44:07.560 |servMProc::setCurState() CurState (notRunning) for service
>>> - Cisco DRF Local
>>> 16:44:07.562 |servMDepNode::goInactive(): for Cisco DRF Local
>>> 16:44:07.562 |servMDepNode::notifyStateChange(): for Cisco DRF Local
>>> 16:44:07.562 |
>>> servMDataMgr::dependentStateChange():...
>>> 16:44:07.562 |servMDepNode::checkDependents(): for Cisco DRF Local
>>>
>>> 16:44:07.562 |servMScriptMgdProc::checkHealth(): Either isAlive () or
>>> getAmAction () reported failure. Current state of Service set to
>>> NotRunning. This will trigger a Graceful Restart of the Service Cisco DRF
>>> Local.
>>>
>>> 16:44:07.562 |servMScriptMgdProc::checkState() CurState (notRunning),
>>> DesiredState (running) - Service - Cisco DRF Local
>>> 16:44:07.562 |servMScriptMgdProc::checkState()  Scheduling restarts ret
>>> 16:44:07.562 |servMMgdProc::scheduleRestart : Restart Failed. Stopping
>>> the dependent services
>>> 16:44:07.562 |servMDataMgr::stopService - Stopping service Cisco DRF
>>> Local with dep flag 2 serverDetails.serviceName Cisco Tomcat
>>> 16:44:07.562 |servMDepNode::listActiveDepnt2(): No Dependents found
>>> 16:44:07.562 |servMDepNode::goInactive(): for Cisco DRF Local
>>> 16:44:07.563 |servMDepNode::notifyStateChange(): for Cisco DRF Local
>>> 16:44:07.563 |servMDataMgr::findProc(): Trying to find Cisco DRF Local
>>> in service group Cisco DRF Local
>>> 16:44:07.563 |servMDataMgr::findProc(): iterating... got (Cisco DRF
>>> Local) in (Cisco DRF Local)
>>>  16:44:07.563 |servMMgdProc::gracefulShutdown : inside Cisco DRF Local
>>> 16:44:07.563 |Cisco DRF Local :Beginning graceful shutdown
>>> 16:44:07.563 |servMDepNode::listActiveDepnt1():
>>> 16:44:07.563 |servMScriptMgdProc::checkState() CurState (notRunning),
>>> DesiredState (notRunning) - Service - Cisco DRF Local
>>> 16:44:07.563 |servMScriptMgdProc::checkState()  No change ret for Cisco
>>> DRF Local
>>> 16:44:07.563 |Cisco DRF Local : Restart Denied : Restarted 3 times in
>>> 1200 secs
>>>
>>>
>>>
>>> On Fri, May 3, 2013 at 8:33 AM, Abdul Salam . <salamka at gmail.com> wrote:
>>>
>>>> good to check DRF LA , MA and service mngr logs
>>>>
>>>>
>>>>
>>>> *---AS*
>>>>
>>>>
>>>>
>>>> On Fri, May 3, 2013 at 3:25 AM, Erick B. <erickbee at gmail.com> wrote:
>>>>
>>>>> Hi,
>>>>>
>>>>> Anyone seen the DRF local server stop and start on it's own
>>>>> on 8.6.2.22900-9 (SU2)?
>>>>>
>>>>> The scheduled backup job runs fine and completes each day.
>>>>>
>>>>> Doesn't happen every day, and happens when the back up job is not
>>>>> running.
>>>>> Sometimes the DRF local service doesn't start back up on it's own and
>>>>> needs to be manually restarted.
>>>>>
>>>>> Nothing in the syslog's indicating what the cause or showing DRF
>>>>> stopping or starting. Set DRF traces to debug level and nothing in the DRF
>>>>> debug traces from that time period this happens either. No core dumps.
>>>>>
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> cisco-voip mailing list
>>>>> cisco-voip at puck.nether.net
>>>>> https://puck.nether.net/mailman/listinfo/cisco-voip
>>>>>
>>>>>
>>>>
>>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://puck.nether.net/pipermail/cisco-voip/attachments/20130503/0bd7fc20/attachment.html>


More information about the cisco-voip mailing list