[nsp] sudden OSPF failures

Jack.W.Parks at alltel.com Jack.W.Parks at alltel.com
Wed Sep 24 10:21:07 EDT 2003


Are all your 5300 interfaces in OSPF like:

Router ospf 1
  network 192.168.0.0 0.0.255.255 area x

Or are you redistributing connected?  We have run into a feature
limitation with Cisco's implementation of OSPF having a finite amount of
ospf interfaces per box.

>From our TAC case:
found a internal found bug ( CSCin17242
<http://wwwin-metrics.cisco.com/cgi-bin/ddtsdisp.cgi?id=CSCin17242>)
that has the following similarities: 
- very similar error message eg: 
00:39:14: %SYS-2-GETBUF: Bad getbuffer, bytes= 18028
-Process= "OSPF Router", ipl= 0, pid= 148 
- decoded tacebacks are almost the same with identical processes 
The addition of new pvc means extra load on the router to handle ospf
traffic (LSA's), which requires more buffer resources from the huge
buffer. 
The Bad getbuffer error message we received on your router (as well as
from the bug referred above) indicated lack of buffer resources, which
causes ospf 
packets to be dropped (this happens even during the time when the router
is trying to create new buffers). 
The workaround would be to increase the huger buffer size to allocate
more resources. 

Jack
-----Original Message-----
From: cisco-nsp-bounces at puck.nether.net
[mailto:cisco-nsp-bounces at puck.nether.net] On Behalf Of jlewis at lewis.org
Sent: Tuesday, September 23, 2003 3:11 PM
To: cisco-nsp at puck.nether.net
Subject: [nsp] sudden OSPF failures


On our network of roughly 40 routers, 50 access-servers (all cisco),
OSPF 
for one of our smaller remote POPs began breaking down last night.  This

POP has existed for years in its own OSPF area connected via frame to 
a router in area 0.

The POP consists of a 2501 and 4 AS5248's.  The 5200's are in area 1000,
the 2500 has its serial0.1 in area 0 and connects back to one of our
core routers (a 7206 running 122-14.S3) and has e0 in area 1000.

The 2501 was running 11.0(22), but I upgraded it to 12.0(27) last night
while troubleshooting this.  The 5248's run 11.3(11)aAA.  Each AS5248
has a dial pool of a /27 and /28 (just enough IPs for the channelized
T1's on each box).  Last night, for unknown reasons, the dial pool
routes for AS5248-2 started flapping, even when watched from the POP's
2501.  sh ip ro blah (for either fo AS5248-2's dial pools) on the 2501
would alternatively show the /27|/28 routes, or the /17 they're part of
coming from our core.  When the /27|/28 routes were active, they might
last a few seconds to a minute or so...then disappear for similar time.
If I shut down the T1 controllers on AS5248-2, routing instability
ceased and the /27|/28 routes propogated properly.  When unshut, as soon
as dialup users started hitting AS5248-2, the instability returned.  I
didn't see any signifigant ethernet errors, but as we were using a very
old 1912 switch, we swapped it out for a 2924xl.  That made no
difference.

I thought this could perhaps be due to the size of our OSPF routes
(~1300 
routes), so I converted the remote POP into an NSSA area, reducing the 
number of routes seen in that POP to around 150.  That seemed to work
for 
a while (30-60m), but then random routes from the POP ceased to
propogate 
beyond the core router that POP's 2501 connects to.

At the moment, I seem to have things working again, with the remote POP 
ethernet in its own NSSA area, the 2501's s0.1 in area 0, and all the 
dialup pool subnets static routed to each 5248 from the POP's 2501.

Anyone seen stuff like this before?  We've got several other small POPs 
consisting of similar setups (2501's and AS5248's) not having this 
problem.  Most of our network is in area 0...only a few POPs are split
off 
in their own areas.

----------------------------------------------------------------------
 Jon Lewis *jlewis at lewis.org*|  I route
 Senior Network Engineer     |  therefore you are
 Atlantic Net                |  
_________ http://www.lewis.org/~jlewis/pgp for PGP public key_________

_______________________________________________
cisco-nsp mailing list  cisco-nsp at puck.nether.net
https://puck.nether.net/mailman/listinfo/cisco-nsp
archive at http://puck.nether.net/pipermail/cisco-nsp/



More information about the cisco-nsp mailing list