Xymon Mailing List Archive search

Bug in ping tests?

list Japheth Cleaver
Wed, 18 Jul 2012 12:44:08 -0700 (PDT)
Message-Id: <user-c711b022d6ae@xymon.invalid>

I haven't seen this in the last year or more on this server.  I had
sporadic issues on another service, but by simply moving hardware (from
dedicated Atom to a ESXi platform) it was resolved.

The page said it was red for 2-6 minutes.  I knew the test happens every
5,
so I would have expected a retest to clear it (hosts were ping responsive
from the shell).

What log are you referring to?
hobbitnet (or bbnet, I forget the process name in 4.2.3)'s output log.
Also hobbitlaunch.log from around the time, just to see if something
abnormally quit.

-jc

On Wed, Jul 18, 2012 at 3:23 PM, <user-87556346d4af@xymon.invalid> wrote:
I have a front page with about a dozen hosts and then sub pages.
Every
CONN test on the front page failed.  Each and every host on the
subpages
(well over a dozen) was just fine.  After 6 minutes I restarted the
hobbitd
processes.  They all came right back.

I am running 4.2.3.  Using fping to check
- hobbitserver.cfg:FPING="/usr/sbin/fping"

Has anyone seen this?

Hmm. It's possible that hobbitnet (?) died or was hung up... Or that the
pages weren't representative of the same run (eg, bbgen could have died
during its generation).

Questions: Do you recall the page timestamps being the same? If you
clicked through to the tests when it was happening, did the (dynamic)
test
page match the (static) color in the grid? Has the problem started
recently, is it repeating, and was there anything interesting in the
logs
at the time?

-jc