[c-nsp] ASR9001 BGP scaling and memory shortage

Vladimir Troitskiy ruthenate at gmail.com
Mon May 25 14:00:13 EDT 2020


Hello everyone,

Other list members have a significantly lower memory usage for a BGP
process and a shmwin on ASR9001 routers with more sessions/routes in GRT.

Saku Ytti has suggested me some useful notes which I would like to mention
as a summary for this thread:
- one could use 'hw-module profile scale l3xl' in admin mode to increase an
RLIMIT for a BGP process, even on Typhoon-based platforms (not only on
Trident-based ones as I thought);
- a shmwin shortage is probably caused by per-prefix label mode, the per-ce
mode will be much more scalable. We use the per-prefix mode because of BGP
PIC limitations, but maybe it's time to reconsider the feature-set used.

вт, 19 мая 2020 г. в 20:09, Vladimir Troitskiy <ruthenate at gmail.com>:

> Hello everyone,
>
> ASR9001 has some memory usage limits:
> - 1658M for a BGP process on a RSP
> - 1536M for a shared memory window on a LC
> Those limits seems to be unconfigurable.
>
> Has anybody experienced any issues with these limits on high-loaded
> ASR9001 boxes?
> We have a surprisingly high memory usage while the typical router setup is
> pretty lightweight - 4-5 full feeds (couple of upstreams and RRs). The only
> probably uncommon thing is we use "Internet in a VRF" approach.
>
> #show processes memory detail location 0/RSP0/CPU0
>> Tue May 19 19:39:12.592 Ural
>> JID    Text       Data       Stack      Dynamic    Dyn-Limit  Shm-Tot
>>  Phy-Tot    Process
>> ------ ---------- ---------- ---------- ---------- ---------- ----------
>> ---------- -------
>> 1054           1M         5M       516K      1485M      1658M        76M
>>      1491M  bgp
>>
>
> #show memory summary location 0/0/CPU0
>>
> node:      node0_0_CPU0
>> ------------------------------------------------------------------
>> Physical Memory: 8192M total
>>  Application Memory : 7988M (3811M available)
>>  Image: 75M (bootram: 75M)
>>  Reserved: 128M, IOMem: 0, flashfsys: 0
>>  Total shared window: 1327M
>>
>
> We have already had FIB inconsistency issues due to SHMWIN exhaustion
> despite the fact the total prefix amount was far from the platform limit
> (4M):
>
>> fib_mgr[184]: %OS-SHMWIN-3-ALLOC_ARENA_FAILED : SHMWIN: Failed to
>> allocate new arena from the server : 'SHMWIN_SVR' detected the 'fatal'
>> condition 'VM is exhausted or totally fragmented'
>> fib_mgr[184]: %ROUTING-FIB-3-ASSERT_RL : FIB internal inconsistency
>> detected
>> fib_mgr[184]: %ROUTING-FIB-3-PD_FAIL : FIB platform error:
>> fib_leaf_insert 5204 Cannot insert leaf
>>
>
> What are practical limits for BGP scaling on ASR9001 boxes? Could anyone
> share a memory usage stats?
> --
> Best regards,
> Vladimir Troitsky
>

-- 
Best regards,
Vladimir Troitsky


More information about the cisco-nsp mailing list