[c-nsp] SUP2 on 6509 Crashing with BGP

Alex Rubenstein alex at nac.net
Mon Feb 6 14:54:18 EST 2006


Hello,

We run quite a few S2/MSFC2s with full BGP and some features, and we don't
see anything close to this problem.

One, in particular, has two internal route-reflecting views, a full
transit neighbour, about 20 customer peers, and 15 or so peering sessions.
See the cut/pastes below for info.

As for stability, we have another 6509 in a similar situation:

IOS (tm) c6sup2_rp Software (c6sup2_rp-JSV-M), Version 12.1(20)E3, EARLY DEPLOYMENT RELEASE SOFTWARE (fc1)
xxxxxxxx uptime is 1 year, 34 weeks, 13 hours, 16 minutes

Here are the cut and pastes:

#sho ip bgp sum
BGP router identifier 209.123.xx.x, local AS number 8001
BGP table version is 44596948, main routing table version 44596948
178197 network entries and 471482 paths using 34436658 bytes of memory
99448 BGP path attribute entries using 5967420 bytes of memory
25 BGP rrinfo entries using 616 bytes of memory
55529 BGP AS-PATH entries using 1445336 bytes of memory
1570 BGP community entries using 111684 bytes of memory
182286 BGP route-map cache entries using 2916576 bytes of memory
0 BGP filter-list cache entries using 0 bytes of memory
BGP activity 1602996/32607422 prefixes, 19744438/19272956 paths, scan interval 60 secs
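
As a rough cross-check, the per-structure figures in the summary above can be totalled. This is a minimal Python sketch, assuming the output has been pasted into a string; `bgp_memory_bytes` is a hypothetical helper name, not part of IOS or any Cisco tool. The total only covers the BGP structures listed in this output, so it says nothing about the memory taken by the main routing table or CEF.

```python
import re

# Lines copied from the "sho ip bgp sum" output above.
SUMMARY = """\
178197 network entries and 471482 paths using 34436658 bytes of memory
99448 BGP path attribute entries using 5967420 bytes of memory
25 BGP rrinfo entries using 616 bytes of memory
55529 BGP AS-PATH entries using 1445336 bytes of memory
1570 BGP community entries using 111684 bytes of memory
182286 BGP route-map cache entries using 2916576 bytes of memory
0 BGP filter-list cache entries using 0 bytes of memory
"""


def bgp_memory_bytes(text):
    """Sum every 'using N bytes of memory' figure in the pasted output."""
    return sum(int(n) for n in re.findall(r"using (\d+) bytes of memory", text))


total = bgp_memory_bytes(SUMMARY)
print(f"{total} bytes, about {total / 2**20:.1f} MiB")
```

Comparing this total with the Processor "Used(b)" column from "sho mem sum" gives a feel for how much of the used memory is BGP table data and how much is everything else.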


#sho mem sum
                 Head    Total(b)     Used(b)     Free(b)   Lowest(b)  Largest(b)
Processor   423AD760   432351392   189513076   242838316   201718076   151654464
       I/O    8000000    67108864     8906416    58202448    56914568    57943196
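
To compare boxes quickly, the memory summary above can be reduced to a percentage free and a low-water mark per pool. This is a minimal Python sketch, assuming the three lines are pasted in as text and that data rows always have seven whitespace-separated columns; `parse_mem_summary` and `FIELDS` are hypothetical names used only for illustration.

```python
# Lines copied from the "sho mem sum" output above.
MEM_SUMMARY = """\
                 Head    Total(b)     Used(b)     Free(b)   Lowest(b)  Largest(b)
Processor   423AD760   432351392   189513076   242838316   201718076   151654464
       I/O    8000000    67108864     8906416    58202448    56914568    57943196
"""

FIELDS = ("total", "used", "free", "lowest", "largest")


def parse_mem_summary(text):
    """Return {pool name: {field: bytes}} for each 7-column data row."""
    pools = {}
    for line in text.splitlines():
        cols = line.split()
        if len(cols) != 7:
            continue  # the header row (6 columns) and blank lines are skipped
        name, _head, *sizes = cols  # the Head address is not needed here
        pools[name] = dict(zip(FIELDS, (int(s) for s in sizes)))
    return pools


for name, pool in parse_mem_summary(MEM_SUMMARY).items():
    pct_free = 100 * pool["free"] / pool["total"]
    print(f"{name}: {pct_free:.1f}% free, low-water mark {pool['lowest']} bytes")
```

The "Lowest(b)" column is the figure to watch when bringing up an additional full feed, since it records the least free memory the pool has had since boot.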

Cisco Internetwork Operating System Software
IOS (tm) c6sup2_rp Software (c6sup2_rp-JSV-M), Version 12.1(26)E3, RELEASE SOFTWARE (fc1)
Technical Support: http://www.cisco.com/techsupport
Copyright (c) 1986-2005 by cisco Systems, Inc.
Compiled Mon 15-Aug-05 04:27 by ccai
Image text-base: 0x40008F90, data-base: 0x41B4A000

ROM: System Bootstrap, Version 12.2(17r)S1, RELEASE SOFTWARE (fc1)
BOOTLDR: c6sup2_rp Software (c6sup2_rp-JSV-M), Version 12.1(26)E3, RELEASE SOFTWARE (fc1)

##### uptime is 19 weeks, 6 days, 13 hours, 49 minutes
Time since esd1.nwr switched to active is 19 weeks, 6 days, 13 hours, 49 minutes
System returned to ROM by power-on (SP by power-on)
System restarted at 02:02:55 EDT Tue Sep 20 2005
System image file is "sup-bootflash:c6sup22-jsv-mz.121-26.E3.bin"

cisco WS-C6506 (R7000) processor (revision 2.0) with 458752K/65536K bytes of memory.
Processor board ID TBA04130986
R7000 CPU at 300Mhz, Implementation 39, Rev 3.3, 256KB L2, 1024KB L3 Cache
Last reset from power-on
Bridging software.
X.25 software, Version 3.0.0.
SuperLAT software (copyright 1990 by Meridian Technology Corp).
TN3270 Emulation software.
8 Virtual Ethernet/IEEE 802.3  interface(s)
72 FastEthernet/IEEE 802.3 interface(s)
26 Gigabit Ethernet/IEEE 802.3 interface(s)
381K bytes of non-volatile configuration memory.

32768K bytes of Flash internal SIMM (Sector size 512K).
Configuration register is 0x2102


If the S2/MSFC2 hadn't been abandoned by Cisco, and had CoPP, it would still have a
lot of life left in it. Sad, almost.




On Mon, 6 Feb 2006, Richard J. Sears wrote:

> Hi Stephen -
>
> Thanks for the info - I think the customer is going to bump to SUP720s.
>
> :-)
>
> On Mon, 6 Feb 2006 15:36:25 +0000 (GMT)
> "Stephen J. Wilcox" <steve at telecomplete.co.uk> wrote:
>
>> Hi Richard,
>>  it'll be related to the IOS you are using and the features you have enabled.
>>
>> You should be seeing significant memory being allocated for BGP, Routing, CEF
>>
>> For comparison, I have a couple of 7206s with 256MB RAM doing some simple tasks..
>> taking a reflected full feed (so hopefully minimal duplication of routing data)
>> they're down to about 15MB free - time is running out!
>>
>> Steve
>>
>> On Mon, 6 Feb 2006, Richard J. Sears wrote:
>>
>>> Thanks Jon -
>>>
>>> I was running SUP2s for a long time in three of our 6509s doing full BGP
>>> with 256MB of ram, but that was a couple of years ago and I know the
>>> tables have grown a considerable amount since then.
>>>
>>> One interesting issue is the fact that the tables are taking up a lot
>>> more memory on this router than on my SUP720 routers for the same number
>>> of entries. And the reboot issue is a definite indicator (in my mind
>>> anyway) of some other type of problem.
>>>
>>> I guess we will try the microsoft method of troubleshooting - add more
>>> memory and try again :-)
>>>
>>> Thanks for your input !!
>>>


>>> On Sun, 5 Feb 2006 23:01:37 -0500 (EST)
>>> Jon Lewis <jlewis at lewis.org> wrote:
>>>
>>>> On Sun, 5 Feb 2006, Richard J. Sears wrote:
>>>>
>>>>> Hey Everyone -
>>>>>
>>>>> I am working on a 6509 with a SUP2 in IOS mode. I went to bring up BGP
>>>>> with two peers today and all went well until I tried to bring up the
>>>>> second peer.
>>>>
>>>> What IOS version?  If it's recent, like 12.2SX*, you don't have enough
>>>> RAM.  Regardless, that seems to be what the malloc fails are saying.  On a
>>>> 6509 Sup2 with not quite 2 full views, and 512MB RAM, I have 212390720
>>>> free.  If I had 256MB less RAM, I'd be in trouble.
>>>>
>>>>> -Traceback= 402382D8 4023A738 40234F84 40235784 407A225C 4077A524 4078C7F4 4078CCF4 4077FD18 40784E7C 4022EC9C 4022EC88
>>>>>
>>>>> Then both peers go down and the router reboots itself.
>>>>                                           ^^^^^^^^^^^^^^
>>>>
>>>> You might want to open a TAC case on that.  My experience with other
>>>> ciscos, BGP, full tables, and insufficient RAM, is that the session resets
>>>> and then tries again.  It shouldn't be crashing/rebooting just because it
>>>> ran out of memory.
>>>>
>>>> ----------------------------------------------------------------------
>>>>   Jon Lewis                   |  I route
>>>>   Senior Network Engineer     |  therefore you are
>>>>   Atlantic Net                |
>>>> _________ http://www.lewis.org/~jlewis/pgp for PGP public key_________
>>>
>>>
>>> ******************************************
>>> Richard J. Sears
>>> CCNP/CCDP/F5SE
>>> Vice President & CTO
>>> American Internet Services
>>> ----------------------------------------------------
>>> rsears at americanis.net
>>> http://www.americanis.net
>>> ----------------------------------------------------
>>> 858.576.4272 - Phone
>>> 858.427.2401 - Fax
>>> INOC-DBA - 6130
>>> ----------------------------------------------------
>>>
>>> I fly because it releases my mind
>>> from the tyranny of petty things . .
>>>
>>>
>>> "Work like you don't need the money, love like you've
>>> never been hurt and dance like you do when nobody's
>>> watching."
>>>
>>> _______________________________________________
>>> cisco-nsp mailing list  cisco-nsp at puck.nether.net
>>> https://puck.nether.net/mailman/listinfo/cisco-nsp
>>> archive at http://puck.nether.net/pipermail/cisco-nsp/
>>>
>

-- 
Alex Rubenstein, AR97, K2AHR, alex at nac.net, latency, Al Reuben
Net Access Corporation, 800-NET-ME-36, http://www.nac.net



