[VoiceOps] Adtran TA900 timing problems
My List Account
mylists at battleop.com
Tue Nov 5 15:44:49 EST 2013
The first time this happened a year or so ago the CLEC CO techs insisted
that I must change to "t1 1 clock source line" even though we had always
used "t1 1 clock source internal" in the past. I've changed half back to
internal and half to line to see what happens a few months from now.
Richey
From: VoiceOps [mailto:voiceops-bounces at voiceops.org] On Behalf Of Paul
Timmins
Sent: Monday, November 04, 2013 5:18 PM
To: myaklin at g4.net
Cc: voiceops at voiceops.org
Subject: Re: [VoiceOps] Adtran TA900 timing problems
There's a few items to this:
The 7204/7206VXR don't have a BITS reference input or a bus to transport
synchronization around the unit. If you're using the PA-MC-T3 or PA-MC-2T3
or the various EC variants, the T3 controller should be set for line
clocking (provided you are coming off a DCS or SONET node that has valid
clock). Only time I'd use clock source internal on the PA-MC-T3 card's T3
ports is if I'm feeding a DS3 mux directly from it, without any DCS or SONET
transport - literally like a breakout box.
The embedded DS1s are completely unlocked from this frequency. You have the
option of internal timing on the embedded DS1s (the default) or line timing
(controller t3 1/0, t1 1 clock source line, etc). Timing is only synchronous
between DS1s on the same DS3 (or maybe same card, I'm not totally clear, and
usually assume it's per-port for safety's sake).
So if you have a TA900, it should be set to line timing, and if it has
multiple DS1s, they must all land on the same DS3 on the 7200VXR. Cisco
routers don't care because they don't have a common sync source internally,
but Adtrans require all connected T1s to run in sync with the chassis, and
the chassis can have a primary and secondary timing source, which can be any
of the DS1s or internal (tertiary is automatically internal if the primary
and secondary are both down).
If the 7200VXR had BITS timing internally, and each port had a chassis sync
source, you could pull T1s from not just this router, but any isochronous
router. But it doesn't, and the DS3 line timing is completely independent of
how the DS1s get clocked. You get the internal free running quartz source,
or the line itself. Forget clocking off an embedded DS1 - each DS1 can run
out of sync with each other just fine.
(This is why you'll see that a PRI coming off the DSX1 port of the unit
won't run in sync with a TDM T1 coming from a real switch. The real switch
uses BITS clocking, and the poor Adtran is off in lala land, with whatever
the crystal oscillator on board the PA-MC-T3 card feels like sending. It's
perfectly stable clocking, mind you, but it exists in a world of its own,
dependent on nothing else)
So a valid config would be like:
controller T3 1/0
clock source line
t1 1 channel-group 0 timeslots 1-24
t1 1 clock source internal (default)
And then on the adtran:
timing-source primary t1 0/1
(these commands are off the top of my head, so if they're a bit off in
syntax, give me partial credit for a good memory, rather than bad markings
for a poor transcription)
And that is, as they say, the other half of the story.
-Paul
On Mon, 11/04/2013 04:54 PM, Matt Yaklin <myaklin at g4.net> wrote:
On Mon, 4 Nov 2013, Jay Hennigan wrote:
> On 11/4/13 9:41 AM, Matt Yaklin wrote:
>>
>> I forgot to mention that the cisco T1s on the DS3 use line
>> timing in my setup. So line and line on both sides. The T1
>> and the Adtran.
>
> Don't do that then. :-)
>
> Every T1 span must have exactly one source of clocking.
>
Yea, I saw that as I was typing out my email but did not
change it. I just did now. Why it was set that way? I am unsure.
Probably because a mistake was made and then duplicated. Then
we probably followed the good old if it aint broken dont fix it.
I am sure I will be punished with a few support calls over the
next day for touching it. ;-)
But... see below...
> My recommendation is to configure the T3 controller on the 7206VXR to
> internal clocking on all of its T1s, and clock the Adtrans from T1 0/1
> (line).
>
> If both sides are set to internal, you'll get clock slips as the
> oscillators drift slightly with respect to each other.
>
> If both sides are set to line, then there's no reference clock. The
> line will probably sync up initially as the free-running T1 will be
> close enough to sync. Over time the frequency will drift up or down
> until one side can no longer sync. In some cases they'll recover on
> their own, and in others you need to shut/no shut or otherwise force a
> resync.
>
> So, don't do that.
>
But things get even more complicated then that if you wish to be
a clocking purist. Here we are talking about timing over a DS3. To
some that is ? humurous ?.
This is a post from the Taqua mailing list from Dave Long. Pretty
interesting if you are talking about timing and DS3s! I obviously had
a mistake in my config but even when corrected it is still, technically,
wrong?
I would love to know what people's opinions are of the post below.
I bet Paul has read it a few times now.
----------------------------------------------------------------------------
--
david.long at taqua.com
Springtime for Switching
Ahhh, its springtime! In the switch supplier business, that means two
things:
1) More release upgrades (since the holiday season is well behind us, we
get larger numbers of upgrade requests)
2) GR-303/PRI issues/outages due to network clocking problems.
Number 2 is somewhat related to number 1, but not always... But, that's
what I'm here to talk about...
When we deployed some of our first switches, we saw network clocking
reference problems due to not getting a valid clock reference to the
switch. We changed our installation process to check for this early in
the deployment process. The T7000 has a lot of logic to constantly
monitor and qualify multiple clock sources, and to switch to secondary and
tertiary references, issue logs/events/alarms as a result. So, we can
spot if we don't have a valid timing reference coming to the switch. We
will typically recommend a BITS clock if we see issues.
We now have 6 years of experience with the Broadband Interface Card (BIC).
And with the BIC came a different deployment model. Not only were the
DS3s deployed to the carrier side of the switch, they were also deployed
on the access side as well. The Digital Loop Carriers (DLCs) are
typically in remote areas, take DS1s muxed up with other DS1s with M13s to
a DS3, many times this transported via fiber/SONET transport to the
central office, where the DS3 is demuxed to DS1 and recombined with other
DS1s, remuxed to DS3s and connected to the T7000 BIC card. The DLCs will
then take their network clocking reference off of the DS1 that leads
(ultimately) to the T7000.
So, what's the problem? It's that a DS3 is not a valid clock transport.
That includes DS1's embedded within DS3's. Now, this is typically the
time where the confusion sets in... This is not a "Taqua rule" or a
"T7000 limitation." This is clearly laid out by Telcordia in the network
clocking standards. But, the Telcordia rule is not just some bureaucratic
decision, it's basically a statement of engineering and physics. The
problem is that a DS3 is not a synchronous transport, it is asynchronous.
When DS1s are packed into a DS3, transported and unpacked, the integrity
of the DS1 is not the same on the output as it is on the input. It is
close, but from a timing perspective, the process adds jitter and wander.
Throw in there Fiber/SONET transport, and bigger problems happen (pointer
adjustment/justification in particular). If I said "why not transport a
clocking reference as part of RTP in a VoIP network?" People would laugh.
But, DS3s have some of the same characteristics (asynchronous transport).
(2011 revision... IEEE 1588 is a protocol that does work over IP networks
to transport a clock synchronous... My point is still valid in that even
1588 does many things to adjust to jitter/wander and was designed to work
in an asynchronous network, where TDM clocking was not).
Keep in mind that clock reference transport requires much more precision
than voice traffic transport. Jitter and wander in the voice path may be
barely noticeable, but if two "ends" of the span are using this for
clocking, the results can be unpredictable. All bets are off when it
comes to High-Level Data Link Control (HDLC) channels. Being "a little
bit off" may mean that HDLC links do not want to sync up at all. For
GR303, these links are the TMC and the EOC. All DLCs need a network
reference for clocking and that can't be an embedded DS1 from a DS3
(unless your M13 mux injects a clock signal on the DS1, but these are
rare). For PRI, these links are D-channels.
You may be saying "but Dave, I've been clocking from embedded DS1s for
years and I've never had a problem?" That is the reason I'm writing this
note. A phenomenon called plesiosynchronous timing occurs in many cases.
This means that the clocking in a network can "coincidently" or "almost"
be synchronized. Your network can work PERFECTLY, for years, right until
it does not work at all. Then, it seems like nothing you do can get the
two ends talking to each other again. Eventually, after power cycling
muxes, DLCs, switches, etc., finally the ends will talk to each other.
The blame will typically lay with whatever the last box was that was power
cycled, kicked, or yelled at rudely.
The other item that conspires in this problem is that DLCs typically have
little if any clocking qualification logic and circuitry. So, even if
they would detect slips or sags in the timing, many are hard pressed to
report it.
Back at Tekelec, when I had responsibility for R&D on the T9000 and T7000,
we had an 8 hour T9000 outage of this nature. We had a team of a dozen
engineers looking at the software, another dozen TAC engineers looking at
all the provisioning. There wasn't anything that seemed to help. The
RDTs were power cycled, the switch was rebooted. Finally some of the
muxes were rebooted, at the same time someone typed the "ls" (directory
listing) command and everything synced up and worked like nothing
happened. It took weeks to convince this customer that the "ls" command
was not the reason for the fix. That customer did not believe that the
network clocking issue was to blame. Some months later, after we formed
the T7000 business unit and I no longer had the T9000, I walked by TAC one
day when they were fighting the same problem with the same customer. I
noticed an almost frantic typing of "ls" by the customer to no avail...
How can this "plesiosynchronous" thing work at all? Well, if you have one
of those "atomic clock" alarm clocks or wristwatches, you have a good
example of how network synchronization works. Your clock or watch has its
own quartz oscillator to keep time. Every so often it looks at the atomic
clock radio broadcast and resynchronizes. If the atomic clock signal gets
a little "loopy" between updates, no one knows. Think of it as the
network gets "into a rhythm" when it is plesiosynchronized. Everything is
fine until something upsets that rhythm.
"So, what upsets this "rhythm" in my network," you ask? A number of
things...
* Anytime any piece of gear is power cycled or rebooted (which
typically happens anytime one of these elements is upgraded). If an
upgrade was being performed, the thought is "something in the new software
load broke my network."
* Another cause is anytime significant change happens to the amount of
traffic over the DS3s. In particular, putting more DS1s into the DS3 can
cause disruption. This changes the packing in the DS3 and will change the
rhythm.
* And, the most hideous of them all is temperature fluctuations in a
collocation. Remember the quartz clock I was talking about? Most/all
DLCs use quartz oscillators of their own. The problem is quartz
oscillators are very susceptible to changing their frequency based on
temperature. And what happens in the spring? Many locations go through a
heating at night, cooling during the day HVAC cycle, where we see broad
temperature fluctuations in the office. There is one particular story
where timing slips occurred every time the CO tech used the door to go
outside for a smoke break.
So, while I've talked mostly about DLCs, the same issues can be found with
any signaling that uses an HDLC channel. In particular, we've seen this
with PRI and SS7. But, mostly the problem is with GR303.
So hopefully, if you've read this far, I've convinced you that this is a
problem. So, if you have this situation in your network, what can you do
about it?
* If you are using fiber equipment, many have the special capabilities
for a separate clocking feed and transport.
* Another is to add BITS timing at each of your locations. There are
BITS vendors that have a variety of "wireless" references that work very
well. Anything from GPS based, to taking a CDMA clock signal over the
air. The only downside is that they can be expensive (a few thousand
dollars) and if you don't have roof access for a GPS and don't have a CDMA
signal, they won't work.
* "Rent" a valid clocking source from your collocation provider.
Hopefully, there is a reasonable charge for this.
* We have heard of an inexpensive network regeneration device that
takes a DS1 signal coming out of a mux and is able to "reformulate" the
network reference before feeding into the DLC. Maybe someone on the
mailing list has heard of this or uses one and can report on it.
We have worked with a few "timing consultants" that can also analyze your
network. If you think have a need for this service, Larry Cooley
(larry.cooley at taqua.com<mailto:larry.cooley at taqua.com
<mailto:larry.cooley%20at%20taqua.com> >) can refer
you to a couple of different shops.
"So Dave, I've read this far and I'm still awake! Do you have any
references that I can read to help me with this insomnia thing?" Why yes,
I do:
* Seriously, the one I like the best is "Engineering Networks for
Synchronization, CCS7, and ISDN" by P.K. Bhatnagar, IEEE Press, 1997 (Our
friend Whit Reeve is the Series Editor). I refer to this one a lot.
* If you have access to Bellcore (Telcordia) Standards:
* "Clocks for the Synchronized Network: Common Generic Criteria."
BELLCORE TA-NWT-001244, issue 2, Nov. 1992.
* "Digital network synchronization plan." BELLCORE Technical
Advisory TA-NWT-000436, issue 2, June 1993.
* American National Standard for Telecommunications. Synchronization
Interface Standard. ANSI T1.101-1994.
Hopefully, this information has been useful. If you have any questions,
send me email directly or reply to this message.
Thanks,
Dave
------------------------------------------------------------------------
> --
> Jay Hennigan - CCIE #7880 - Network Engineering - jay at impulse.net
> Impulse Internet Service - http://www.impulse.net/
> Your local telephone and internet company - 805 884-6323 - WB6RDV
> _______________________________________________
> VoiceOps mailing list
> VoiceOps at voiceops.org
> https://puck.nether.net/mailman/listinfo/voiceops
>
_______________________________________________
VoiceOps mailing list
VoiceOps at voiceops.org
https://puck.nether.net/mailman/listinfo/voiceops
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://puck.nether.net/pipermail/voiceops/attachments/20131105/c0f21033/attachment-0001.html>
More information about the VoiceOps
mailing list