Xymon Mailing List Archive search

Test goes purple randomly

list Sanu Mathew
Sat, 25 Oct 2008 12:30:35 +0530
Message-Id: <user-099b5cc2cd35@xymon.invalid>

Folks,

My hobbit server has suddenly started showing conn and ssh entries to be
purple, everytime, i restart the hobbit service on my hobbit server, all
looks green and appropriate for a few mins, and then they are again back to
purple. I have read the below email, but can someone tell me, where to
change the LIFETIME, or does making a change to the LIFETIME fix my problem?

Any help at the earliest would be greatly appreciated...

Thanks,
Sanu

On Sat, Oct 25, 2008 at 7:35 AM, Richard Finegold <user-6a016aac278a@xymon.invalid>wrote:
If one uses a default ratio of 5:30 (5 minute poll, 30 minute expire)
then LIFETIME should be more than "a bit longer than" the life span of
the report. Assuming the philosophy of no less than 5 missed intervals
leading to purple is consistent. Hmm...

Ah, the bb manpage says "sligtly more than" (sic) "is a good idea".
This is contradicted by the LIFETIME default of 30. Two ways come to
mind to resolve this contradiction:
 * Change LIFETIME to 6 or 7 (hobbitd.c, handle_status, validity).
 * Change the manpage's text, from "sligtly more than" to "a multiple
of" (or something similar).

On Thu, Oct 23, 2008 at 6:47 PM, Samuel Cai
<user-ba507acc1d03@xymon.invalid> wrote:
From: Ralph Mitchell [mailto:user-00a5e44c48c0@xymon.invalid]
Sent: Tuesday, September 16, 2008 12:44 PM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] Tricky bug in Purple status determination
[snip]
What you should do is alter the test script to use the "status+LIFETIME"
format, where LIFETIME is the life span of the report, as described in
the bb man page, and make the lifetime a bit longer than the the test
interval.