[f-nsp] NI-MLX-10Gx4 shown as "Invalid Module" maybe due to very old code on lp while chassis is on 5.3
Gunther Stammwitz
gstammw at gmx.net
Thu Jul 25 16:24:34 EDT 2013
Wilbur,
thanks for your effort, I'm sure we will solve this mystery :-)
I have now plugged the module in another MLX that runs 5.2 and the symptoms
are the same: Whatever I do I cannot see anything on the rconsole.
There has never been a module in slot 4 before and the config is empty for
this slot:
#show running-config | inc module
module 1 ni-mlx-20-port-1g-100fx
As you said I tried it from SSH as well as from the serial console of the
MP:
#rconsole 4
Remote connection to LP slot 4 established
Press CTRL-X or type 'exit' to disconnect it
(and nothing happens)
When I try "lp boot system interactive 4", nothing happens on the rconsole;
only the log says "Module 4 is reset by mgmt (reason: CLI command)".
Powering off and on again doesn't help. I also tried "lp sync 4", and it fails:
LP SYNC for Slot4: monitor image sync is timed out.
LP SYNC for Slot4: primary image sync is timed out.
LP SYNC for Slot4: No LP secondary image in MP's flash.
The rconsole shows "Remote connection terminated", and when I reconnect once
again it is still stuck.
#show mod
S4: Invalid Module CARD_STATE_REBOOT 0 000c.dbf5.9d90
I really hope that you have some sort of magic that will fix this situation?
Come on ;-)
Well, and if not, this will be a lengthy downgrade procedure, won't it?
When looking in kp.brocade.com the first firmware is 0300 from 2006. We have
03500 from 2008 or 03800 from 2009.
Does Brocade have a downgrade matrix, or do I need to read through all of
these release notes? Maybe there's some sort of rule of thumb?
Thank you once again.
Gunther
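For anyone who wants to script a check for this condition, here is a minimal Python sketch that splits a status row like the "S4: Invalid Module CARD_STATE_REBOOT 0 000c.dbf5.9d90" line quoted above into its fields. The column layout (slot, description, state, ports, starting MAC) is an assumption read off that one line, and the names `SlotStatus`, `parse_slot_row` and `is_stuck_rebooting` are made up for illustration. It only handles rows whose status starts with CARD_STATE_, not rows such as "Active".

```python
import re
from typing import NamedTuple, Optional


class SlotStatus(NamedTuple):
    slot: str
    description: str
    state: str
    ports: int
    mac: str


# Assumed layout: "<slot>: <description> <CARD_STATE_x> <ports> <starting MAC>"
_ROW = re.compile(
    r"^(?P<slot>[SMF]\d+):\s+(?P<desc>.+?)\s+"
    r"(?P<state>CARD_STATE_\w+)\s+(?P<ports>\d+)\s+(?P<mac>\S+)\s*$"
)


def parse_slot_row(line: str) -> Optional[SlotStatus]:
    """Return the parsed row, or None if the line does not fit the assumed layout."""
    m = _ROW.match(line.strip())
    if m is None:
        return None
    return SlotStatus(m["slot"], m["desc"], m["state"], int(m["ports"]), m["mac"])


def is_stuck_rebooting(row: SlotStatus) -> bool:
    """True when the module reports the reboot state seen in this thread."""
    return row.state == "CARD_STATE_REBOOT"
```

Feeding it the line from the message gives a row with slot "S4" and state "CARD_STATE_REBOOT"; lines that do not match (for example an empty "F3:" slot) return None rather than raising.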
From: Wilbur Smith [mailto:wsmith at brocade.com]
Sent: Thursday, 25 July 2013 19:07
To: Gunther Stammwitz; foundry-nsp at puck.nether.net
Subject: RE: [f-nsp] NI-MLX-10Gx4 shown as "Invalid Module" maybe due to
very old code on lp while chassis is on 5.3
Gunther,
Sorry, I was pulled into a project, so I'm just getting a chance to reply.
So far, it looks like you have done everything correctly. I read through
some of the other comments, so I'm also going to recommend you remove the
card type you specified manually through CLI; let's make sure that when you
run a "show module" the entry for this module is completely blank. You may
need to unseat the module to be able to remove the entry.
Next, can you rcon to that module and tell me exactly what you see? If the
module is stuck in a reboot loop, you should see the entire boot process
through the console, including the point where the local app code on the LP
detects a problem and triggers the reboot. If you can't see any output after
performing the rcon to the LP, then we have a larger issue. If this doesn't
seem to work over SSH or Telnet, try a direct connection to the serial
console port on the active MP.
If you are getting output from rcon, then you can hit "b" right after the LP
reboot to break into the module. From here we can delete the Primary and
Secondary flash manually, or force a boot to interactive mode from within
the LP.
Let me know what output you are seeing and I'll try to help you work through
it.
Wilbur
From: Gunther Stammwitz [mailto:gstammw at gmx.net]
Sent: Wednesday, July 24, 2013 3:22 PM
To: Wilbur Smith; foundry-nsp at puck.nether.net
Subject: RE: [f-nsp] NI-MLX-10Gx4 shown as "Invalid Module" maybe due to
very old code on lp while chassis is on 5.3
Hello Wilbur,
thank you very much for your reply.
First of all, I have now removed the manual module-type setting for slot 4,
unplugged the LP from slot 4 and reinserted it.
Now the MLX says:
S4: Invalid Module CARD_STATE_REBOOT 0 000c.dbe1.2f90
This is where I tried "lp boot system interactive 4", which is confirmed
in syslog with "cr1.fra1.mainlab.net Module 4 is reset by mgmt
(reason: CLI command)".
Unfortunately the LP stays in CARD_STATE_REBOOT. The same applies when
powering off and on again. Interactive boot mode didn't help :-(
My software - including mbridge - is up to date:
#show ver
Boot     : Version 5.3.0T165 Copyright (c) 1996-2009 Brocade Communications Systems, Inc.
Monitor  : Version 5.3.0T165 Copyright (c) 1996-2009 Brocade Communications Systems, Inc.
IronWare : Version 5.3.0eT163 Copyright (c) 1996-2009 Brocade Communications Systems, Inc.
Board ID : 00 MBRIDGE Revision : 37
Do you have any other idea how I can get the module out of the reboot loop
and access it?
Your help is appreciated; thanks in advance.
Gunther
From: Wilbur Smith [mailto:wsmith at brocade.com]
Sent: Wednesday, 24 July 2013 22:30
To: Gunther Stammwitz; foundry-nsp at puck.nether.net
Subject: RE: [f-nsp] NI-MLX-10Gx4 shown as "Invalid Module" maybe due to
very old code on lp while chassis is on 5.3
Gunther,
Usually when this happens to me, I can still force the LP to boot into
interactive mode with the command I mentioned and then manually power the
LP off and on ("power-off lp 4", "power-on lp 4"). Since the MP's access to
the LP uses a separate out-of-band link, the MP should still be able to
upgrade the LP if it is in interactive mode.
The MP uses a separate FPGA image to allow it to talk to the chassis
backplane; we call this the MBRIDGE image. I would make sure that the
MBRIDGE image is the recommended release for your version of code and update
if needed. The required version is listed in the release notes for the
specific release of code (just search for mbridge). In rare cases, a patch
release may need a newer FPGA or MBRIDGE image to fix a problem, so make
sure you check the release notes for the exact release (5.3.0 vs. 5.3.0C).
The error you are seeing when you tried to push the fpga code to the LP is
triggered because that LP is stuck in reboot. The MP can't communicate with
that LP to see if it needs the newer version; we need to stop the reboot
loop before we can update the LP. I connected to one of my MLX routers in a
lab and double-checked the command you will need. Make sure you are using
this command to reboot a module in slot 4:
SSH at EA_CORE-1#lp boot system interactive 4
Let me know if this helps with your issue.
Wilbur
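The recovery order described in the message above can be sketched as a small Python helper that only assembles the command strings for a given slot. It does not talk to a router. The function name `lp_recovery_commands` and its parameters are made up for illustration; the command forms are the ones quoted in this thread ("lp boot system interactive", "power-off lp", "power-on lp", and the "copy tftp lp ... fpga-all" line from Gunther's first message), so treat the exact syntax as an assumption to check against the upgrade guide for your release.

```python
from typing import List


def lp_recovery_commands(slot: int, tftp_ip: str, fpga_image: str) -> List[str]:
    """Build the MP CLI commands from this thread, in the order described.

    Nothing is sent anywhere; the caller decides how (and whether) to run them.
    """
    if slot < 1:
        raise ValueError("slot must be a positive integer")
    return [
        # Ask the LP to come up in interactive mode on its next boot.
        f"lp boot system interactive {slot}",
        # Power-cycle the LP so the interactive boot takes effect.
        f"power-off lp {slot}",
        f"power-on lp {slot}",
        # Once the reboot loop is stopped, push the FPGA images (syntax as quoted in the thread).
        f"copy tftp lp {tftp_ip} {fpga_image} fpga-all {slot}",
    ]
```

For slot 4 with the placeholder server and file name used earlier in the thread, `lp_recovery_commands(4, "1.2.3.4", "lpfpga05300e.bin")` returns the four commands in that order.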
From: foundry-nsp [mailto:foundry-nsp-bounces at puck.nether.net] On Behalf Of
Gunther Stammwitz
Sent: Tuesday, July 23, 2013 11:46 AM
To: foundry-nsp at puck.nether.net
Subject: [f-nsp] NI-MLX-10Gx4 shown as "Invalid Module" maybe due to very
old code on lp while chassis is on 5.3
Hello colleagues,
I have more or less a similar problem. I have pushed an NI-MLX-10Gx4 into
a chassis running 5300e and it didn't work: the module in slot 4 is not
even being recognized; it is shown as an "Invalid Module".
Most probably this is due to very old code on the LP while the chassis runs
newer code.
#show mod
Module                                  Status             Ports  Starting MAC
M1 (left): NI-MLX-MR Management Module  Active
M2 (right):
F1: NI-X-SF Switch Fabric Module        Active
F2: NI-X-SF Switch Fabric Module        Active
F3:
S4: Invalid Module                      CARD_STATE_REBOOT  0      xxx
(S4: Configured as NI-MLX-10Gx4 4-port 10GbE Module)
I can neither access it with "rconsole 4" nor does "boot lp boot system
interactive 4" help.
Power-off and power-on didn't help.
Wilbur wrote that MP and LP cannot communicate and that the FPGA needs to be
upgraded. I tried to do so, but "copy tftp lp 1.2.3.4 lpfpga05300e.bin
fpga-all 4" didn't work:
Copying FPGA images to the applicable slot(s), this may take several
minutes...
-> No FPGA image to be copied. :-(
Using "copy tftp lp" with individual FPGA images doesn't work either; the
system always claims that the FPGA doesn't match.
Any idea how to get an LP with very old code working in a chassis running
5.3 code?
Your help is appreciated.
Kind regards
Gunther
From: foundry-nsp [mailto:foundry-nsp-bounces at puck.nether.net] On Behalf Of
Wilbur Smith
Sent: Friday, 5 July 2013 00:53
To: Jeroen Wunnink | Atrato IP Networks; foundry-nsp at puck.nether.net
Subject: Re: [f-nsp] NI-MLX-10Gx8-D falls to CARD_STATE_INTERACTIVE mode
Hello Folks,
Using auto lp-sync is a good improvement over the previous methods of
upgrading a line card, but there are times it may not do the trick. Because
the management modules also use an FPGA (the mbridge) to communicate with
the backplane, you can run into a situation where the older module's FPGA
image cannot talk to the management module. The LP and management modules
have their own separate out-of-band network that bypasses the backplane, so
auto-lp will attempt to stop the reboot cycle and use this connection to
push the correct code to a misbehaving LP. Unfortunately, this may not work
in every situation, so you may need to manually upgrade the code.
If you have a linecard constantly rebooting, you should use "boot lp 3
interactive" from Enable mode to force the card to start in interactive mode
(this will stop the reboot loop). You can then push the correct software and
FPGA image to the card. I tend to use the older "copy tftp image" method
because it will automatically copy the correct IronWare LP code and you can
target a specific LP with it vs. the whole chassis. I also use "copy tftp lp
. Fpga-all lp 3" to automatically sync the correct FPGA code to a specific
module.
Alternatively, you can use the new "copy tftp system" method to automatically
upgrade all code and FPGA images on an LP. The command should only push to
a module that needs to be upgraded, but I would make sure you have an outage
window if you do this on a live system.
There are some good guides on doing this at the Brocade Communities site, and
the software upgrade guide for each version of IronWare has detailed notes on
how to use the methods I mentioned and how to fix common problems when
things go wrong. Check my.brocade.com for more info.
Wilbur