Brief red alarms
Hobbit may not be the issue at all here. You could test this by writing a small script to do a constant ping of host $foo and record the results to a file. You might see similar patterns outside of hobbit. Looking at the current summary page it appears that these hosts are more problematic. Yet, looking at a report (short version attached) for Nov1 to Nov 17 there are clearly quite a few more. 2 10.1.0.32 2 10.1.0.40 2 10.1.0.73 2 10.1.0.92 2 10.3.23.242 4 163.153.65.139 4 atlas.cairodurham.org From the report (which is for conn test only) we see these hosts have 100% uptime. Until a short while ago 10.1.0.73 was at 100% avail. 10.1.0.71 100 10.1.0.72 100 10.1.0.90 100 What makes these three hosts different? Regards, Tim From: Jaime Kikpole [user-c575ba5bb612@xymon.invalid] Sent: Thursday, November 18, 2010 8:59 AM To: xymon at xymon.com Subject: [xymon] Brief red alarms We recently had a major change in our network's design. The entire topology had the change, including IP addresses of servers and the "routes" through switches between things. Ever since then, Xymon is reporting very brief (10-20 seconds) outages of one server or another every 30-60 minutes. I tried this suggestion: http://xymon.sourceforge.net/docs/known-issues.html#netfail No luck. A few minutes later, the server Xymon is running on allegedly failed the "conn" test for about 1 second. Any other ideas? If you want to see the symptoms, look at my Xymon instance at http://cns.cairodurham.org/hobbit/bb2.html. This will show the brief outages that I'm talking about. Thanks in advance, Jaime Kikpole -- Network Administrator Cairo-Durham Central School District http://cns.cairodurham.org
Attachments (1)