[j-nsp] CFEB crash/reboot on M7i

Juha Suhonen juhas at mmd.net
Mon May 2 04:13:17 EDT 2005


Hi!


We have a M7i whose CFEB mysteriously rebooted last night, after running 
fine for 6 weeks (since the router was installed :-).

Any ideas on what might have caused this? How could we prevent this from 
happening again? I tried to skim through Juniper's website and the list 
archives, but didn't found anything seemingly related.


Here's the output of show log messages | match cfeb

May  1 22:17:58  core-r1.tre1 cfeb System Exception: Vector/Code 0x00700, Signal 4
May  1 22:17:58  core-r1.tre1 cfeb Event occurred at: May  1 22:17:58.276457
May  1 22:17:58  core-r1.tre1 cfeb Juniper Embedded Microkernel Version 7.1R1.3
May  1 22:17:58  core-r1.tre1 cfeb Built by builder on 2005-02-11 04:16:18 UTC
May  1 22:17:58  core-r1.tre1 cfeb Copyright (C) 1998-2005, Juniper Networks, Inc.
May  1 22:17:58  core-r1.tre1 cfeb All rights reserved.
May  1 22:17:58  core-r1.tre1 cfeb Reason string: "Illegal instruction"
May  1 22:17:58  core-r1.tre1 cfeb Context: Thread (Idle)
May  1 22:17:58  core-r1.tre1 cfeb Registers:
May  1 22:17:58  core-r1.tre1 cfeb R00: 0x00000000 R01: 0x007b3ca8 R02: 0x0000339c R03: 0x00000007
May  1 22:17:58  core-r1.tre1 cfeb R04: 0x005c8a24 R05: 0x007b3c98 R06: 0x00000002 R07: 0x007b3ee0
May  1 22:17:58  core-r1.tre1 cfeb R08: 0x007b3d50 R09: 0x006c0000 R10: 0x00000000 R11: 0x00000000
May  1 22:17:58  core-r1.tre1 cfeb R12: 0x20002082 R13: 0x508c0401 R14: 0x00508800 R15: 0xf7893094
May  1 22:17:58  core-r1.tre1 cfeb R16: 0x30804006 R17: 0x54522d40 R18: 0x006c0000 R19: 0x006c0000
May  1 22:17:58  core-r1.tre1 cfeb R20: 0x006c0000 R21: 0x006c0000 R22: 0x00000000 R23: 0xe216509c
May  1 22:17:58  core-r1.tre1 cfeb R24: 0x00000000 R25: 0xe8846262 R26: 0x00000000 R27: 0xe8844eda
May  1 22:17:58  core-r1.tre1 cfeb R28: 0x00000000 R29: 0x00000000 R30: 0x00000000 R31: 0x00001388
May  1 22:17:58  core-r1.tre1 cfeb MSR: 0x0008b030 CTR: 0x00000000 Link:0x00000000 SP:  0x007b3ca8
May  1 22:17:58  core-r1.tre1 cfeb CCR: 0x20002082 XER: 0x00000000 PC:  0x00000000
May  1 22:17:58  core-r1.tre1 cfeb DSISR: 0x00000000 DAR: 0x00000000 K_MSR: 0x00001030
May  1 22:17:58  core-r1.tre1 cfeb Stack Traceback:
May  1 22:17:58  core-r1.tre1 cfeb Frame 01: sp = 0x007b3ca8, pc = 0x00026e30
May  1 22:17:58  core-r1.tre1 cfeb Frame 02: sp = 0x007b3cb0, pc = 0x0002b89c
May  1 22:17:58  core-r1.tre1 cfeb Frame 03: sp = 0x007b3cf8, pc = 0x00026f6c
May  1 22:18:03  core-r1.tre1 craftd[2564]:  Major alarm set, CFEB not online, the box is not forwarding
May  1 22:18:03  core-r1.tre1 chassisd[2562]: CHASSISD_SHUTDOWN_NOTICE: Shutdown reason: CFEB connection lost
May  1 22:18:03  core-r1.tre1 alarmd[2563]: Alarm set: CFEB color=RED, class=CHASSIS, reason=CFEB not online, the box is not forwarding
May  1 22:19:19  core-r1.tre1 chassisd[2562]: CHASSISD_SNMP_TRAP9: SNMP trap generated: FRU power on (jnxFruContentsIndex 6, jnxFruL1Index 1, jnxFruL2Index 0, jnxFruL3Index 0, jnxFruName CFEB, jnxFruType 4, jnxFruSlot 1, jnxFruOfflineReason 2, jnxFruLastPowerOff 358755490, jnxFruLastPowerOn 358755490)
May  1 22:19:19  core-r1.tre1 cfeb CM: ALARM SET: (Major) Slot 0: CFEB not online, the box is not forwarding
May  1 22:19:29  core-r1.tre1 craftd[2564]: Major alarm cleared, CFEB not online, the box is not forwarding
May  1 22:19:29  core-r1.tre1 alarmd[2563]: Alarm cleared: CFEB color=RED, class=CHASSIS, reason=CFEB not online, the box is not forwarding
May  1 22:19:33  core-r1.tre1 cfeb CM: ALARM CLEAR: Slot 0: CFEB not online, the box is not forwarding


Show log messages doesn't seem to show any apparent reasons for this crash 
- the previous entry (in the whole log) is almost 13 hours older:

May  1 09:23:43  core-r1.tre1 cfeb TCP: Bad TCP offset (24, 20) from xxx.yyy.zzz.aaa



 	-- juhas,
 	MMD Networks Oy


More information about the juniper-nsp mailing list