[nsp] Stable 6500 hybrid code?

Clinton Work work@scripty.com
Wed, 20 Nov 2002 22:24:59 -0700


I have seen many CEF/MLS issues with hybrid IOS.

Your problem sounds like bug-id CSCdy01444. 

I've had a lot of problems with bug-id CSCdu85211, but CatOS 6.3(6)
should contain the fix for it. The workaround for this bug works
fairly well (clear ip ro *).

I have a couple of boxes running 6.3(8) with IOS 12.1(12c)E2 which has
been fairly stable. We have had only one CEF/MLS issue with an OSPF E2
route were the MLS entry would not get created:
  Tried "clear ip ro <route>"
  Tried "clear ip ro *"
  Tried "ip route <network> <mask> <next-hop>"
  We tried the "no mls ip unicast" and the box stopped forwarding traffic
   and we had to reload it.
  The MSFC2 would follow the route, but any upstream box would see a
   routing loop.
  
My group did a lot of testing for deploying 6.3(10) onto catalysts
for L2 switching with some ATM.

I would probably recommend 12.1.12cE5 with 6.3(10).
  


On Wed, Nov 20, 2002 at 08:47:22PM -0800, Steve Francis wrote:
> What are the current recommendations anyone has for stable 6500 code, 
> for hybrid mode SupII/MSFC2?
> 
> (Fairly vanilla BGP, OSPF, HSRP, with some PBR)
> 
> We have been running 6.3(6) CatOS,  12.1(8b)E9 IOS.
> 
> However, this morning we got inconsistency on the CEF tables in the 
> switch and the router.  At first it looked like a RPF error (switch 
> would inconsistently drop packets only if the source address was routed 
> out  one particular peering.) Yet RPF counters did not increment.
> 
> To avoid that, we reloaded the router, then basically nothing worked, 
> and we had to admin down almost all interfaces to get a working network. 
> (While you could ping an interface of the router via a router on a local 
> subnet, and things like the loopback of the router were being advertised 
> in OSPF, you could not ping the loopback from even an adjacent, shared 
> interface router.)  An ACL with the log keyword made individual IP's 
> work, forcing CPU switching.
> 
> At this point the TAC engineer on the router tried "no mls ip unicast ", 
> which caused the whole switch to crash with TLB Exception. (And even 
> more fun - not respond to the console except with garbled Hex. Needed a 
> power cycle.)
> 
> I cannot find any bugs matching what we experienced, so I cant see what 
> versions fix them.
> 
> Most importantly, anyone have recommendations for stable CatOS and IOS?
> 
> Anyone recognize the above bugs?
> 
> Anyone have any idea how to make a 6500 run again if it crashes, and 
> outputs this:
> TLB Exception (load/instruction fetch) occurred.
> 
> Software ver
> sion =  6.3(6)
>               Process ID #1b, Name = Fib
>                                             EPC: 809EFC54
> {stack trace}
> GDB: TLB Exception (load/instruction fetch)
>                                            GDB: The system has trapped 
> into the debugger.
>          GDB: It will hang until examined with gdb.
>                                                    Please use normal 
> gdb. special gdb will not work on this apollo+ board
> ||||$S10#b4
> 
> Getting remote staff to power cycle remote core switches (which, 
> incidentally, failed in such an interesting way that I could still talk 
> to some nodes attached to it, but it seemed to take out most nodes on 
> its functionally paired switch) was not the quickest way to restore service.
> 
> Thx
> 
> _______________________________________________
> cisco-nsp mailing list  real_name)s@puck.nether.net
> http://puck.nether.net/mailman/listinfo/cisco-nsp
> archive at http://puck.nether.net/pipermail/cisco-nsp/

-- 
=========================================================================
Clinton Work                                        clinton@scripty.com
Calgary, Alberta