[c-nsp] HEADS UP: vlan_mgr crashing in NX-OS 5.2(3)

Bernhard Schmidt berni at birkenwald.de
Tue Dec 13 05:09:53 EST 2011


Hey,

just a quick heads up, maybe someone is hitting that, too. Since
upgrading our test Nexus 7000 from 5.2(1) to 5.2(3) this morning we have
a failover due to a crashing vlan_mgr process every hour. It turns out
"sh vlan" (which is executed by RANCID every hour) reliably kills the
box.

2011 Dec 13 10:50:23 csr1-test %SYSMGR-3-HEARTBEAT_FAILURE: Service
"vlan_mgr" sent SIGABRT for not setting heartbeat for last 6 periods.
Last hea
rtbeat 138.11 secs ago.
2011 Dec 13 10:50:23 csr1-test %SYSMGR-2-SERVICE_CRASHED: Service
"vlan_mgr" (PID 4747) hasn't caught signal 6 (core will be saved).
2011 Dec 13 10:52:43 csr1-test %SYSMGR-3-HEARTBEAT_FAILURE: Service
"vlan_mgr" sent SIGABRT for not setting heartbeat for last 6 periods.
Last hea
rtbeat 139.80 secs ago.
2011 Dec 13 10:52:43 csr1-test %SYSMGR-2-SERVICE_CRASHED: Service
"vlan_mgr" (PID 10181) hasn't caught signal 6 (core will be saved).
2011 Dec 13 10:55:04 csr1-test %SYSMGR-3-HEARTBEAT_FAILURE: Service
"vlan_mgr" sent SIGABRT for not setting heartbeat for last 6 periods.
Last hea
rtbeat 139.84 secs ago.
2011 Dec 13 10:55:04 csr1-test %SYSMGR-2-SERVICE_CRASHED: Service
"vlan_mgr" (PID 10348) hasn't caught signal 6 (core will be saved).
2011 Dec 13 10:57:25 csr1-test %SYSMGR-3-HEARTBEAT_FAILURE: Service
"vlan_mgr" sent SIGABRT for not setting heartbeat for last 6 periods.
Last hea
rtbeat 140.22 secs ago.
2011 Dec 13 10:57:25 csr1-test %SYSMGR-2-SERVICE_CRASHED: Service
"vlan_mgr" (PID 10513) hasn't caught signal 6 (core will be saved).
2011 Dec 13 10:57:25 csr1-test
%SYSMGR-2-SYSMGR_AUTOCOLLECT_TECH_SUPPORT_LOG: This supervisor will
%temporarily remain online in order to collect s
how tech-support. This behavior is configurable via 'system [no]
auto-collect tech-support'.
2011 Dec 13 10:57:26 csr1-test Dec 13 10:57:25 %KERN-2-SYSTEM_MSG:
Switchover started by redundancy driver - kernel
2011 Dec 13 10:57:26 csr1-test %SYSMGR-2-HASWITCHOVER_PRE_START: This
supervisor is becoming active (pre-start phase).
2011 Dec 13 10:57:26 csr1-test %SYSMGR-2-HASWITCHOVER_START: Supervisor
2 is becoming active.

TAC case is open.

Bernhard



More information about the cisco-nsp mailing list