[j-nsp] Strange ARP issue on M7i
Markus
universe at truemetal.org
Tue Aug 14 09:12:47 EDT 2012
Hi all,
last night I encountered something weird (in my opinion). Not sure if
Juniper related but maybe someone here has seen something like this?
I was experiencing a strange effect that several websites hosted on a
Linux KVM VM didn't load properly. They would load but 90% of the time
hang in some strange way, the browser displaying "Waiting for
www.sitename.com..." after all the page has loaded, or even before
anything of the page was displayed. A minute later it would work
sometimes, but only for a short period of time. After eliminating all
MySQL, Apache, KVM etc. as the source of the problem I logged into the
M7i in front of that host and saw:
admin at ffm01.rt> show arp no-resolve |grep 195.100.100.7
00:25:90:38:66:c6 195.100.100.7 ge-0/0/0.0 none
00:25:90:38:66:c6 195.100.101.34 ge-0/0/0.0 none
With 195.100.100.7 being the KVM host. So I thought: why is 101.34 up?
It's an IP that wasn't in use for years. And in the Juniper config a
whole /24 was still getting routed to it. I thought, OK, the KVM host
got hax0red or something and the intruder assigned 101.34, but couldnt
find anything. 101.34 wasn't reachable from any machine in the same LAN
and the MAC could not be seen either. No traffic to/from it on the
Switch monitoring port either. All I saw was traffic (port scans I
think) to the /24 which ended up on the KVM host (195.100.100.7). That
was an indicator that the KVM host was really also saying "I have
195.100.101.34". Or the Juniper insisted that the IP is at that MAC. I
suspect the latter. I shutdown the KVM host physically and cleared the
ARP cache on the Juniper, 195.100.100.7 was gone, but 195.100.101.34 was
still there with the identical MAC, as before.
I then removed the static route entry for the /24 which was pointing to
195.100.101.34 and only then the arp entry for 195.100.101.34 disappeared!
Isn't that weird? Where did that arp entry come from and why was it
saved on the Juniper for so long, and only got removed after I removed
the static routing of that /24?
I'm running JUNOS 8.0R2.8. :)
This didn't eliminate the problem with the websites reachability, I
think it is something local with my dialup connection as I see a lot of
TCP retransmission errors when accessing all sites on any of the VMs
hosted on that KVM host. Through an alternative dialup provider
everything is fine. Other sites on other boxes in the same LAN work just
fine though via the first provider. The problem comes and goes now.
Really puzzled!
Anyway, can't stop thinking about the ARP thing so I thought I would ask
here! Thank you very much!
Regards
Markus
More information about the juniper-nsp
mailing list