Xymon Mailing List Archive search

A funny thing happened on the way to the RC...............

3 messages in this thread

list Kent Brodie · Thu, 13 Jul 2006 16:48:08 -0500 ·
So, I updated my server to the RC version today.   Saved all of my
configuration files, and (based on info from this list!), decided to add
fping to my system this time.    Fping installed, no problem.

 
Fired up BB.    Then, everyone's pager went NUTS.    100 pages, times 4
pagers.    WHOOOOPS.    The entire world had uh.....   "lost
connectivity."

 
What had really happened of course is, that the hobbit userid couldn't
run fping, since I had forgotten to suid the executable.    And yes, I
did eventually find it in a log file, but...       But, this had the
very nasty side effect of turning each and every connectivity icon RED.
And then the alerts kicked in.....  And four Duracell batteries died.

 
I don't know how easy it is to implement, but I have a suggestion worth
looking into?

 
If fping "doesn't work" (as opposed to, "cannot ping a host"), the
result status should be CLEAR, not RED.       "test not functional" or
"test disabled" or something.

 
Just a thought!!!!

 
:-)

 
Kent C. Brodie - user-da7f7d5174c0@xymon.invalid

Department of Physiology

Medical College of Wisconsin

(XXX) XXX-XXXX
list Henrik Størner · Fri, 14 Jul 2006 07:36:51 +0200 ·
quoted from Kent Brodie
On Thu, Jul 13, 2006 at 04:48:08PM -0500, Brodie, Kent wrote:
Fired up BB.    Then, everyone's pager went NUTS.    100 pages, times 4
pagers.    WHOOOOPS.    The entire world had uh.....   "lost
connectivity."

What had really happened of course is, that the hobbit userid couldn't
run fping, since I had forgotten to suid the executable.    And yes, I
did eventually find it in a log file, but...       But, this had the
very nasty side effect of turning each and every connectivity icon RED.
And then the alerts kicked in.....  And four Duracell batteries died.
Well, at least Hobbit managed to get your attention that there was some
configuration problem :-)

Seriously - you're right, it shouldn't do that. Such a problem should
end up with a red status in the "bbtest" column and nothing else. I'll
make sure it does that.

Sorry about the trouble!


Regards,
Henrik
list Kent Brodie · Fri, 14 Jul 2006 10:34:48 -0500 ·
Henrik- no trouble at all.   That's why it's a release CANDIDATE-- and
why we're willing to put up with issues like this for the sake of making
a REALLY great product.   There's no way you would find all these little
issues without people like us willing to play around and put it through
its paces.

And yeah, it got my attention.  And my boss's.  And my sysadmin.  And
the pager company who stopped accepting emails after the first 20 or so.

That's what we do.   :-)
quoted from Kent Brodie

Kent C. Brodie - user-da7f7d5174c0@xymon.invalid
Department of Physiology
Medical College of Wisconsin
(XXX) XXX-XXXX
-----Original Message-----
From: Henrik Stoerner [mailto:user-ce4a2c883f75@xymon.invalid] 
Sent: Friday, July 14, 2006 12:37 AM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] A funny thing happened on the way to the

RC...............
quoted from Henrik Størner

On Thu, Jul 13, 2006 at 04:48:08PM -0500, Brodie, Kent wrote:
Fired up BB.    Then, everyone's pager went NUTS.    100 pages, times
4
pagers.    WHOOOOPS.    The entire world had uh.....   "lost
connectivity."

What had really happened of course is, that the hobbit userid couldn't
run fping, since I had forgotten to suid the executable.    And yes, I
did eventually find it in a log file, but...       But, this had the
very nasty side effect of turning each and every connectivity icon
RED.
And then the alerts kicked in.....  And four Duracell batteries died.
Well, at least Hobbit managed to get your attention that there was some
configuration problem :-)

Seriously - you're right, it shouldn't do that. Such a problem should
end up with a red status in the "bbtest" column and nothing else. I'll
make sure it does that.

Sorry about the trouble!


Regards,
Henrik