Xymon Mailing List Archive search

Monitoring a simple cluster

list Dan Smith
Thu, 21 Jun 2012 13:17:04 -0400
Message-Id: <83B8ACE778CB4095A0C3E9894BDDC3FA@E012981PC>

Red Hat 5.8 server, 5.5 clients.  Xymon 4.3.7 clients and server. 

 
I am trying to put together a quick cluster monitor using depend and combos,
but I think I'm stuck.  Has anyone else done this?

 
I have two servers: smtp01 and smtp02 that are in an active/passive cluster.

The smtpd process only runs on the server that is active, but there are
other processes that need to be monitored on both hosts (e.g. clurgmgrd).

 
I would like to go red and have an alert if one of the required processes
goes down on either server, but I only want an alert if the active node of
the cluster has a problem with the smtpd process.

 
My initial idea was to have procs go red for clurgmgrd and yellow on smtpd,
use NOPROPYELLOW, and then use a combo so I could see what host was active.

 
hosts.cfg:

1.2.3.4    smtp01 # NOPROPYELLOW:procs

1.2.3.5    smtp02 # NOPROPYELLOW:procs

1.2.3.6    smtpHA # smtp

 
analysis.cfg:

HOST=smtp01

            PROC   clurgmgrd

            PROC   smtpd

HOST=smtp02

            PROC   clurgmgrd

            PROC   smtpd

 
combo.cfg:

smtpHA.procs = (smtp01.conn && smtp01.procs) || (smtp02.conn &&
smtp02.procs)

 
Unfortunately a yellow status is equal to a 1 for the combo, so even if both
sides of the cluster were down, the combo would still show as up
(green&&yellow is 1&&1).

 
Then I thought I could cover the scenario by making a "depends" test, but
depends seems to be focused on disabling tests if another status is
red.which doesn't work either since I don't want the page to go red.

 
This isn't a huge issue because the smtp poll on smtpHA will trigger red if
both nodes are down, but it seems like there should be an easy way to do
this.

 
Am I making it more complicated than it needs to be, or am I better writing
a custom monitor?

 
Thanks!

 
-dan