[c-nsp] MPLS "Tag Control" process - what does this do?
Reuben Farrelly
reuben-cisco-nsp at reub.net
Fri Aug 3 08:17:58 EDT 2007
Hi Rodney,
Thanks for taking a look and researching this one for me.
On 3/08/2007 9:42 PM, Rodney Dunn wrote:
> I'm pretty sure Reuben that it's controlling the allocation of
> labels to prefixes. I'd need to dig a bit more on it.
>
> How many routes do you have in the global routing table?
>
> 'sh ip route summ'.
370flinders-r200.32-pe01#show ip route summary
IP routing table name is Default-IP-Routing-Table(0)
Route Source Networks Subnets Overhead Memory (bytes)
connected 0 58 4228 8816
static 1 79 5808 14200
eigrp 9176 0 0 0 0
bgp 9176 42568 8972 6430852 7838160
External: 51540 Internal: 0 Local: 0
ospf 9176 24 573 38336 94824
Intra-area: 8 Inter-area: 5 External-1: 583 External-2: 1
NSSA External-1: 0 NSSA External-2: 0
internal 92 107824
Total 42685 9682 6479224 8063824
370flinders-r200.32-pe01#
There's a full BGP internet feed coming in but it's filtered to the 42000 routes
as seen above (I don't know why, it's an ISP network we have recently taken
over management of so there are still some unanswered questions). The 7200 has
1G of DRAM.
>> Aug 3 04:42:37.479: %TIB-3-REMOTETAG: 150.82.0.0/255.255.0.0, peer
>> 61.84.96.254:0; tag 53643; Failure with LDP Label Release; prefix not in TIB
>> Aug 3 04:43:08.171: %TIB-3-REMOTETAG: 192.232.71.0/255.255.255.0, peer
>> 61.84.96.254:0; tag 54898; Failure with LDP Label Release; prefix not in TIB
>
> Are you running TDP or LDP?
LDP. There are two routers attached to the far end peer which are running TDP
still but these are not directly connected to this router I sent the logs from.
I am intending to change these over to LDP soon so that the whole network is
just using LDP throughout and not a mixture of the two.
> What is the peer and code?
It is also a 7200 NPE-G1 with 12.3(19). There are three core routers - all the
same - linked via ATM in a triangle/meshed MPLS topology. The other two seemed
to be OK.
> I looked around a bit to try and understand that messge.
> Seems it has to do with getting a label for a prefix we don't have.
One of the support guys said something about the router dropping it's CEF table.
Unfortunately I didn't check that at the time, my concern was more on why so
much CPU was being burnt up on that one process.
> Problem Description:
> ====================
> Problem: LDP does not withdraw label before announcing a new label for
> the same FEC.
>
> Solution:
> To fix this problem a new config command is introduced:
> [no] mpls ldp neighbor A.B.C.D implicit-withdraw-label
>
> default Behavior:
> It will follow the LDP standard i.e. LDP will withdraw previously
> advertised label before advertising a new label for a FEC. When
> "mpls ldp neighbor A.B.C.D implicit-withdraw-label" is configured
> LDP will not withdraw the previous label before advertising a new
> label.
>
> default behavior is changed. Now when there is a need to change
> label for a FEC:
> 1. LDP will send a Label withdraw and then after receiving Label
> Release, it will advertise the new binding with a Label Mapping. If
> after sending Label Withdraw, no Label Release is received from a peer(s)
> and sufficient time (Currently set to 5 minutes) has passed, then LDP
> will assume that peer(s) is not capable of sending a label release and it
> will send Label Mapping to the peer.
>
> 2. LDP maintains a list of previous labels for which a Label Release is
> awaited from any peer.
> A new Label Mapping for a FEC is not announced to a peer if a Label
> release for the same FEC is pending from the peer.
>
> that was changes that went in under:
>
> CSCdv74248
> Externally found enhancement defect: Resolved (R)
> LDP session drop after receiving a new label for the same FEC
>
> that you would have in 12.3(19). But what about the peering router?
See above.
> Rodney
I've also just noticed that the 3550 behind this router and another one
connected to that via an ethernet link to that switch also is pretty sick, and
both nearly ran out of memory today. Both have logged a lot of messages about
"Aug 3 11:51:39: %FIB-2-FIBDOWN: CEF has been disabled due to a low memory
condition. It can be re-enabled by configuring "ip cef [distributed]"
'show ip cef' on these switches shows that CEF is now not running. My thinking
is that this is a rather big problem in itself (!) and will try get these
switches reloaded later tonight.
The reason I am bringing this up is that I'm considering if the problem may not
have been directly caused by this 7200, but may be caused by some other external
factor. The only thing which strikes me as a possibility is that someone or
something flooded/redistributed an entire BGP feed into OSPF. Does that sound
like a possibility?
Still doesn't answer quite why the 7200 was chewing so much cpu though
<scratches head>.
Reuben
More information about the cisco-nsp
mailing list