[c-nsp] ASR920 randomly loosing layer-2 on a port
Shawn L
shawn at rmrf.us
Tue Jul 12 06:43:01 EDT 2022
We upgraded the 920 to 16.12.06 this morning. No change. Still not
learning MAC addresses on port te0/0/4. So, back to the drawing board.
On Mon, Jul 11, 2022 at 1:31 PM Gert Doering <gert at greenie.muc.de> wrote:
> Hi,
>
> On Mon, Jul 11, 2022 at 04:47:57PM +0000, Brian Turnbow wrote:
> > > On Mon, Jul 11, 2022 at 03:59:02PM +0000, Brian Turnbow wrote:
> > > > Yep, sounds like the infamous uptime over 2 years "feature" from 3.16
> > > (something)..
> > > > Reboot and upgrade was the only way we fixed it....
> > >
> > > Uh. Could you elaborate on what that "feature" is, exactly?
> >
> > It was the bug where after after two years of uptime
> > If an interface went down it would stick as up and not pass traffic
> > You could not provision new interfaces.
> > Counters also stopped working. (we used this to find affected units)
>
> Now THAT is interesting. I'm a bit further distanced from day-to-day
> operations these days (otherwise I might have noticed), but indeed,
> counters didn't work anymore either. "No traffic on this box!" which
> I know to be not true (our daily TSM backups go through there...) - and
> after reboot, "Traffic!".
>
> Very interesting. "Interface itself" counters are all "0", but service
> instance counters (gi0/0/2 si 90) still show traffic. So that's actually
> something our alarming could trigger on "si has > 1 Mbit, interface itself
> has 0"...
>
> [..]
> > Sounds like it may be different.
> > Did the counters work?
> > Maybe they decided to add it into 16.06 , you never know what a BU may
> decide is a must have feature....
>
> Obviously, 16.06 has much improved performance, so 2-year-bugs are now
> hit after 0.5 years already!
>
> OTOH... seems it wasn't actually 27 weeks uptime, but quite a bit more,
> which was just distorted by SNMP uptime wrapping (and our prometheus
> instance not properly distinguishing this for old data, it only recently
> learned to query that other OID).
>
> So, definitely more than 2 years, and traffic counters stopped some 5
> months
> ago... and we did not try to actually bring up anything new since then.
>
> Yeah, thanks a lot for this information. This will be very helpful to
> avoid needless frustration by our on-site people ("it does not link! can
> you please try a different cable? did you get the patch right?").
>
> gert
>
> --
> "If was one thing all people took for granted, was conviction that if you
> feed honest figures into a computer, honest figures come out. Never
> doubted
> it myself till I met a computer with a sense of humor."
> Robert A. Heinlein, The Moon is a Harsh
> Mistress
>
> Gert Doering - Munich, Germany
> gert at greenie.muc.de
>
More information about the cisco-nsp
mailing list