[j-nsp] Odd behaviour testing failover

Josef Buchsteiner josefb at juniper.net
Thu Sep 8 04:11:13 EDT 2005



Thursday, September 8, 2005, 4:52:50 AM, you wrote:

GS>    
GS>    
GS> Hi all,
GS>  
GS>  I'm seeing odd RE behaviour on an M20 while testing RE failover.
GS>  From the log entries, I'm guessing that a snapshot of the current route
GS>  table entries gets tarballed and passed to the second RE

     Tarball creation stays local and is done by dumpd. However, you
     had an rpd core, which invokes dumpd to create a tarball file
     with relevant information, such as the config and message log,
     alongside the core file.

GS>  to use while
GS>  the control plane re-establishes adjacencies with other devices.
GS>  
GS>  Problem is, the tarball can't be found by the RE, and transit traffic
GS>  fails until it re-meshes.


     dumpd creates the tar file and rpd writes the core.
     Please check whether you still have an rpd.core file,
     generated by rpd, in the /var/tmp/ directory.
GS>  
GS>  Has anyone come across this before?
GS>  If so, how do I get this to behave? Graceful failover is enabled on this
GS>  box (JUNOS 7.3R1.6)
GS>  
GS>  Also seeing kernel page faults in the logs on the backup RE during
GS>  failover. Hard disk failing?


     This is an rpd page fault. Again, you should have an rpd.core file.
     Please open a case with all the data so we can fix this.

     thanks
     Josef


GS>  
GS>  
GS>  
GS>  Sep  8 14:36:35  jcore2 chassisd[74828]: CHASSISD_SNMP_TRAP7: SNMP trap
GS>  generated: Fru Online (jnxFruContentsIndex 9, jnxFruL1Index 2,
GS>  jnxFruL2Index 0, jnxFruL3Index 0, jnxFruName Routing Engine 1,
GS>  jnxFruType 6, jnxFruSlot 2)
GS>  Sep  8 14:36:35  jcore2 craftd[2599]: attempt to delete alarm not in
GS>  list
GS>  Sep  8 14:36:35  jcore2 craftd[2599]: forwarding display request to
GS>  chassisd: type = 4, subtype = 44
GS>  Sep  8 14:36:41  jcore2 rshd[75266]: root at re1 as root: cmd='rcp -T -t
GS>  /var/db/dcd.snmp_ix+'
GS>  Sep  8 14:36:42  jcore2 rshd[75298]: root at re1 as root: cmd='mv
GS>  /var/db/dcd.snmp_ix+ /var/db/dcd.snmp_ix'
GS>  Sep  8 14:36:59  jcore2 dumpd: Core and context for rpd saved in
GS>  /var/tmp/rpd.core-tarball.4.tgz
GS>  Sep  8 14:37:00  jcore2 dumpd: tar: rpd.info.4: Cannot stat: No such
GS>  file or directory tar: Error exit delayed from previous errors
GS>  Sep  8 14:37:00  jcore2 dumpd: Unable to create core tarball
GS>  /var/tmp/rpd.core-tarball.4.tgz
GS>  Sep  8 14:37:00  jcore2 dumpd: tar: rpd.info.4: Cannot stat: No such
GS>  file or directory tar: Error exit delayed from previous errors
GS>  Sep  8 14:37:00  jcore2 dumpd: Unable to create core tarball
GS>  /var/tmp/rpd.core-tarball.4.tgz
GS>  
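The dumpd entries above are easier to read once the quoted, line-wrapped syslog text is rejoined. The following is a minimal sketch, not a Juniper tool: it assumes each entry starts with a "Mon DD HH:MM:SS" timestamp, that quote markers look like `GS>`, and that the dumpd message wording matches the lines in this thread. The function names `unwrap_quoted_log` and `dumpd_failures` are hypothetical.

```python
import re

# Assumption: every syslog entry begins with a "Mon DD HH:MM:SS" timestamp.
ENTRY_START = re.compile(r"^[A-Z][a-z]{2}\s+\d{1,2}\s+\d{2}:\d{2}:\d{2}\s")
# Assumption: mail quote markers look like "GS>" or ">" at the start of a line.
QUOTE_PREFIX = re.compile(r"^\s*[A-Za-z]*>+\s*")


def unwrap_quoted_log(text):
    """Strip quote markers and rejoin wrapped lines into one string per entry."""
    entries = []
    for raw in text.splitlines():
        line = QUOTE_PREFIX.sub("", raw).strip()
        if not line:
            continue
        if ENTRY_START.match(line) or not entries:
            entries.append(line)          # a new syslog entry
        else:
            entries[-1] += " " + line     # continuation of the previous entry
    return entries


def dumpd_failures(entries):
    """Return (kind, name) pairs for dumpd problems found in the entries."""
    failures = []
    for entry in entries:
        if " dumpd: " not in entry:
            continue
        match = re.search(r"Unable to create core tarball (\S+)", entry)
        if match:
            failures.append(("tarball", match.group(1)))
        match = re.search(r"tar: (\S+): Cannot stat", entry)
        if match:
            failures.append(("missing", match.group(1)))
    return failures


# Lines copied from the log excerpt in this thread.
SAMPLE = """\
GS>  Sep  8 14:36:59  jcore2 dumpd: Core and context for rpd saved in
GS>  /var/tmp/rpd.core-tarball.4.tgz
GS>  Sep  8 14:37:00  jcore2 dumpd: tar: rpd.info.4: Cannot stat: No such
GS>  file or directory tar: Error exit delayed from previous errors
GS>  Sep  8 14:37:00  jcore2 dumpd: Unable to create core tarball
GS>  /var/tmp/rpd.core-tarball.4.tgz
"""

if __name__ == "__main__":
    for kind, name in dumpd_failures(unwrap_quoted_log(SAMPLE)):
        print(kind, name)
```

On the sample, this reports `rpd.info.4` as the file tar could not stat and `/var/tmp/rpd.core-tarball.4.tgz` as the tarball dumpd could not create, which is the information a support case would need.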
GS>  
GS>  
GS>  Sep  8 14:36:24  jcore2 /kernel: Trapframe Register Dump:
GS>  Sep  8 14:36:24  jcore2 /kernel: eax: 00000000  ecx: 085cf000   edx:
GS>  085fabb0   ebx: 00000012
GS>  Sep  8 14:36:24  jcore2 /kernel: esp: bfbff800  ebp: bfbffc28   esi:
GS>  085c63a0   edi: 0864c000
GS>  Sep  8 14:36:24  jcore2 /kernel: eip: 0812f706  eflags: 00010206
GS>  Sep  8 14:36:24  jcore2 /kernel: cs: 001f       ss: 002f        ds:
GS>  bfbf002f    es: 8868002f
GS>  Sep  8 14:36:24  jcore2 /kernel: fs: 864002f    trapno: 0000000c
GS>  err: 00000004
GS>  Sep  8 14:36:25  jcore2 /kernel: Page table info for PC address
GS>  0x812f706: PDE = 0x189a5067, PTE = 2c65425
GS>  Sep  8 14:36:25  jcore2 /kernel: Dumping 16 bytes starting at PC address
GS>  0x812f706:
GS>  Sep  8 14:36:25  jcore2 /kernel: 80 b8 10 02 00 00 00 75 0d 80 bd e7 fb
GS>  ff ff 03
GS>  Sep  8 14:36:25  jcore2 /kernel: BAD_PAGE_FAULT: pid 62594 (rpd), uid 0:
GS>  pc 0x812f706 got a read fault at 0x210, x86 fault flags = 0x4
GS>  
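The BAD_PAGE_FAULT line above can be decoded to see why it points at rpd rather than at a failing disk. This is a minimal sketch: the regular expression is derived only from the single log line in this thread (other JUNOS releases may word it differently), and the function names are hypothetical. The flag decoding uses the standard low three bits of the x86 page-fault error code.

```python
import re

# Pattern assumed from the one BAD_PAGE_FAULT line quoted in this thread.
FAULT_RE = re.compile(
    r"BAD_PAGE_FAULT: pid (?P<pid>\d+) \((?P<proc>[^)]+)\), uid (?P<uid>\d+): "
    r"pc (?P<pc>0x[0-9a-fA-F]+) got a \w+ fault at (?P<addr>0x[0-9a-fA-F]+), "
    r"x86 fault flags = (?P<flags>0x[0-9a-fA-F]+)"
)


def decode_x86_fault_flags(flags):
    """Decode the low three bits of an x86 page-fault error code."""
    return {
        "cause": "protection violation" if flags & 0x1 else "page not present",
        "access": "write" if flags & 0x2 else "read",
        "mode": "user" if flags & 0x4 else "kernel",
    }


def parse_bad_page_fault(message):
    """Parse an unwrapped BAD_PAGE_FAULT message; return None if it does not match."""
    match = FAULT_RE.search(message)
    if match is None:
        return None
    info = {
        "pid": int(match.group("pid")),
        "process": match.group("proc"),
        "pc": int(match.group("pc"), 16),
        "address": int(match.group("addr"), 16),
    }
    info.update(decode_x86_fault_flags(int(match.group("flags"), 16)))
    return info


if __name__ == "__main__":
    line = ("BAD_PAGE_FAULT: pid 62594 (rpd), uid 0: pc 0x812f706 got a read "
            "fault at 0x210, x86 fault flags = 0x4")
    print(parse_bad_page_fault(line))
```

For the line in this thread, flags 0x4 decode to a user-mode read of a page that is not present, at the low address 0x210, in process rpd. That would be consistent with a bad pointer dereference inside rpd, which matches the reading that this is an rpd fault with a corresponding core file.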
GS>  