[nsp] Stable 6500 hybrid code?
Clinton Work
work@scripty.com
Wed, 20 Nov 2002 22:24:59 -0700
I have seen many CEF/MLS issues with hybrid IOS.
Your problem sounds like bug-id CSCdy01444.
I've had a lot of problems with bug-id CSCdu85211, but CatOS 6.3(6)
should contain the fix for it. The workaround for this bug works
fairly well (clear ip ro *).
I have a couple of boxes running 6.3(8) with IOS 12.1(12c)E2 which has
been fairly stable. We have had only one CEF/MLS issue with an OSPF E2
route were the MLS entry would not get created:
Tried "clear ip ro <route>"
Tried "clear ip ro *"
Tried "ip route <network> <mask> <next-hop>"
We tried the "no mls ip unicast" and the box stopped forwarding traffic
and we had to reload it.
The MSFC2 would follow the route, but any upstream box would see a
routing loop.
My group did a lot of testing for deploying 6.3(10) onto catalysts
for L2 switching with some ATM.
I would probably recommend 12.1.12cE5 with 6.3(10).
On Wed, Nov 20, 2002 at 08:47:22PM -0800, Steve Francis wrote:
> What are the current recommendations anyone has for stable 6500 code,
> for hybrid mode SupII/MSFC2?
>
> (Fairly vanilla BGP, OSPF, HSRP, with some PBR)
>
> We have been running 6.3(6) CatOS, 12.1(8b)E9 IOS.
>
> However, this morning we got inconsistency on the CEF tables in the
> switch and the router. At first it looked like a RPF error (switch
> would inconsistently drop packets only if the source address was routed
> out one particular peering.) Yet RPF counters did not increment.
>
> To avoid that, we reloaded the router, then basically nothing worked,
> and we had to admin down almost all interfaces to get a working network.
> (While you could ping an interface of the router via a router on a local
> subnet, and things like the loopback of the router were being advertised
> in OSPF, you could not ping the loopback from even an adjacent, shared
> interface router.) An ACL with the log keyword made individual IP's
> work, forcing CPU switching.
>
> At this point the TAC engineer on the router tried "no mls ip unicast ",
> which caused the whole switch to crash with TLB Exception. (And even
> more fun - not respond to the console except with garbled Hex. Needed a
> power cycle.)
>
> I cannot find any bugs matching what we experienced, so I cant see what
> versions fix them.
>
> Most importantly, anyone have recommendations for stable CatOS and IOS?
>
> Anyone recognize the above bugs?
>
> Anyone have any idea how to make a 6500 run again if it crashes, and
> outputs this:
> TLB Exception (load/instruction fetch) occurred.
>
> Software ver
> sion = 6.3(6)
> Process ID #1b, Name = Fib
> EPC: 809EFC54
> {stack trace}
> GDB: TLB Exception (load/instruction fetch)
> GDB: The system has trapped
> into the debugger.
> GDB: It will hang until examined with gdb.
> Please use normal
> gdb. special gdb will not work on this apollo+ board
> ||||$S10#b4
>
> Getting remote staff to power cycle remote core switches (which,
> incidentally, failed in such an interesting way that I could still talk
> to some nodes attached to it, but it seemed to take out most nodes on
> its functionally paired switch) was not the quickest way to restore service.
>
> Thx
>
> _______________________________________________
> cisco-nsp mailing list real_name)s@puck.nether.net
> http://puck.nether.net/mailman/listinfo/cisco-nsp
> archive at http://puck.nether.net/pipermail/cisco-nsp/
--
=========================================================================
Clinton Work clinton@scripty.com
Calgary, Alberta