Xymon Mailing List Archive search

nightly reboots

list Ralph Mitchell
Tue, 13 Jan 2009 12:54:31 -0600
Message-Id: <user-dd430001348f@xymon.invalid>

How much work is the server doing??  The company that just laid me off has
an old single-cpu, 733MHz DL380 running RedHat 7.2.  It runs a lot of bash
scripts out of cron to fetch and check web pages, with the results being
reported back to the same machine.  Last time I was able to see it, there
were over 400 bb-hosts entries and over 2500 reports.  It has a fairly
constant load average of around 5 or 6, spiking to maybe 10 or 11 whenever
the planets align and a lot of stuff happens simultaneously.

As soon as they can figure out how to replace it, Hobbit'll be shutdown, as
it's not one of the officially blessed monitoring systems.  However, even
the folks in their Integration Labs admit they have nothing that can do
quite what I've done with Hobbit, so I imagine they'll end up telling their
customers the monitoring is being downgraded.  I'd love to be a fly on the
wall for *those* conversations...  :)

Ralph Mitchell


On Tue, Jan 13, 2009 at 11:25 AM, Gavin Leonard <user-d65663809eb4@xymon.invalid>wrote:
Ok.. so how do you delay the red alerts? I am wondering if I am just over
loading this system... I may need to build another bb server so I can split
up the work load a bit.. thanks in advance!!

-Gavin

-----Original Message-----
From: Josh Luthman [mailto:user-4c45a83f15cb@xymon.invalid]
Sent: Tuesday, January 13, 2009 9:49 AM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] nightly reboots

I have had the problem where the conn test goes bad for everything
(not every host, just groups based on bb-hosts) since I installed it
at the office.  No idea why :(

What I do is delay the red sms alerts by a few minutes as it is red
for only a few seconds, sometimes a minute.

On 1/13/09, Gavin Leonard <user-d65663809eb4@xymon.invalid> wrote:
All,
                I am having an issue where my hobbit server thinks that
every server it monitors has been rebooted, so I get blasted with sms
messages when this happens. And none of the servers have actually
rebooted
nor has there been any network outages.. ideas?thoughts?


Gavin Leonard

[cid:image001.gif at 01C97562.5DF2D550]

Director, Systems-Network Engineering

T

 XXX-XXX-XXXX

F

 XXX-XXX-XXXX

E

 user-d65663809eb4@xymon.invalid<mailto:user-d65663809eb4@xymon.invalid>


Research | Marketing | Sales Generation

www.progrexion.com<http://www.progrexion.com/>;


This email and its contents are confidential. If you are not the intended
recipient, delete this email and do not use or disclose the information
within this email or its attachments. Thank you.

--
Josh Luthman
Office: XXX-XXX-XXXX
Direct: XXX-XXX-XXXX
XXXX Wayne St
Suite XXXX
Troy, OH XXXXX

Those who don't understand UNIX are condemned to reinvent it, poorly.
--- Henry Spencer