[c-nsp] ASR9k: RIB/FIB convergence
Thomas Schmid
schmid at dfn.de
Tue Aug 21 10:35:29 EDT 2018
Hi,
to give you an update: TAC finally could reproduce the issue in the lab. RIB/FIB sync is thwarted when there's a VSM module installed in the chassis (which we have in all 9k chassis).
Let's see if they can fix it with a SMU ...
Cheers,
Thomas
Am 02.08.2018 um 11:13 schrieb Thomas Schmid:
> Hi all,
>
> sort of a heads up ...
>
> I'd be interested to hear if, and under which circumstances others are seeing this behavior,
> since the root cause is still unknown.
>
> In the beginning there were some anecdotical complaints
> by customers that they experienced persistent reachability problems to some destinations
> when we did a scheduled maintenance in our network somewhere else. Further
> investigations pointed to routing inconsistencies during large RIB changes.
>
> To give you some numbers: we found out that in our environment processing 70k BGP changes
> takes 2-3 min to write the updates to FIB, 700k routes takes 20-30 min!!
>
> During that period, RIB and FIB are not consistent with all the nasty consequences:
> blackholing, routing loops etc.
>
> Convergence time seems to be somehow related to the number of eBGP sessions on the
> box. On routers with less than 200 sessions, convergence time looks ok, from 300+
> sessions on, things get bad.
>
> This affects both XR 5.3.3, 6.2.3 and Typhoon, Tomahawk linecards.
>
> TAC/BU are currently working on this, but they have a hard time to find out what's
> going wrong here. Processing the updates on the RP takes less than 1s,
> but writing the updates to the LC takes forever ...
>
> Thanks,
>
> Thomas
>
>
>
>
More information about the cisco-nsp
mailing list