[j-nsp] Odd behaviour testing failover

Gordon Smith gsmith at wxc.co.nz
Wed Sep 7 23:17:01 EDT 2005


Hi Jared,

Yes to everything  :-)

RE's are both RE2.0 
Bios is version 1.2

Cheers,
Gordon


> -----Original Message-----
> From: Jared Gull [mailto:jmgull at yahoo.com] 
> Sent: Thursday, 8 September 2005 3:06 p.m.
> To: Gordon Smith; juniper-nsp at puck.nether.net
> Subject: Re: [j-nsp] Odd behaviour testing failover
> 
> Gordon,
> 
> Are both REs running the same version?  Also, are the REs the 
> same revision (i.e. RE2.0/3.0)?
> 
> Jared
> 
> --- Gordon Smith <gsmith at wxc.co.nz> wrote:
> 
> > Hi all,
> > 
> > I'm seeing odd RE behaviour on an M20 while testing RE failover.
> > >From the log entries, I'm guessing that a snapshot
> > of the current route
> > table entries gets tarballed and passed to the second RE to 
> use while 
> > the control plane re-establishes adjacencies with other devices.
> > 
> > Problem is, the tarball can't be found by the RE, and 
> transit traffic 
> > fails until it re-meshes.
> > 
> > Has anyone come across this before?
> > If so, how do I get this to behave? Graceful failover is enabled on 
> > this box (JUNOS 7.3R1.6)
> > 
> > Also seeing kernel page faults in the logs on the backup RE during 
> > failover. Hard disk failing?
> > 
> > 
> > 
> > Sep  8 14:36:35  jcore2 chassisd[74828]:
> > CHASSISD_SNMP_TRAP7: SNMP trap
> > generated: Fru Online (jnxFruContentsIndex 9, jnxFruL1Index 2, 
> > jnxFruL2Index 0, jnxFruL3Index 0, jnxFruName Routing Engine 1, 
> > jnxFruType 6, jnxFruSlot 2) Sep  8 14:36:35  jcore2 craftd[2599]: 
> > attempt to delete alarm not in list Sep  8 14:36:35  jcore2 
> > craftd[2599]: forwarding display request to
> > chassisd: type = 4, subtype = 44
> > Sep  8 14:36:41  jcore2 rshd[75266]: root at re1 as
> > root: cmd='rcp -T -t
> > /var/db/dcd.snmp_ix+'
> > Sep  8 14:36:42  jcore2 rshd[75298]: root at re1 as
> > root: cmd='mv
> > /var/db/dcd.snmp_ix+ /var/db/dcd.snmp_ix'
> > Sep  8 14:36:59  jcore2 dumpd: Core and context for rpd saved in 
> > /var/tmp/rpd.core-tarball.4.tgz Sep  8 14:37:00  jcore2 dumpd: tar: 
> > rpd.info.4:
> > Cannot stat: No such
> > file or directory tar: Error exit delayed from previous 
> errors Sep  8 
> > 14:37:00  jcore2 dumpd: Unable to create core tarball 
> > /var/tmp/rpd.core-tarball.4.tgz Sep  8 14:37:00  jcore2 dumpd: tar: 
> > rpd.info.4:
> > Cannot stat: No such
> > file or directory tar: Error exit delayed from previous 
> errors Sep  8 
> > 14:37:00  jcore2 dumpd: Unable to create core tarball 
> > /var/tmp/rpd.core-tarball.4.tgz
> > 
> > 
> > 
> > Sep  8 14:36:24  jcore2 /kernel: Trapframe Register
> > Dump:
> > Sep  8 14:36:24  jcore2 /kernel: eax: 00000000  ecx:
> > 085cf000   edx:
> > 085fabb0   ebx: 00000012
> > Sep  8 14:36:24  jcore2 /kernel: esp: bfbff800  ebp:
> > bfbffc28   esi:
> > 085c63a0   edi: 0864c000
> > Sep  8 14:36:24  jcore2 /kernel: eip: 0812f706
> > eflags: 00010206
> > Sep  8 14:36:24  jcore2 /kernel: cs: 001f       ss:
> > 002f        ds:
> > bfbf002f    es: 8868002f
> > Sep  8 14:36:24  jcore2 /kernel: fs: 864002f   
> > trapno: 0000000c
> > err: 00000004
> > Sep  8 14:36:25  jcore2 /kernel: Page table info for PC address
> > 0x812f706: PDE = 0x189a5067, PTE = 2c65425 Sep  8 14:36:25  jcore2 
> > /kernel: Dumping 16 bytes starting at PC address
> > 0x812f706:
> > Sep  8 14:36:25  jcore2 /kernel: 80 b8 10 02 00 00 00 75 0d 
> 80 bd e7 
> > fb ff ff 03 Sep  8 14:36:25  jcore2 /kernel: BAD_PAGE_FAULT: pid
> > 62594 (rpd), uid 0:
> > pc 0x812f706 got a read fault at 0x210, x86 fault flags = 0x4
> > 
> > 
> > _______________________________________________
> > juniper-nsp mailing list juniper-nsp at puck.nether.net 
> > http://puck.nether.net/mailman/listinfo/juniper-nsp
> > 
> 
> 
> 
> 	
> 		
> ______________________________________________________
> Click here to donate to the Hurricane Katrina relief effort.
> http://store.yahoo.com/redcross-donate3/
> 



More information about the juniper-nsp mailing list