Xymon Mailing List Archive search

alerts still not alerting

list Daniel J McDonald
Sat, 19 Mar 2005 10:33:09 -0600
Message-Id: <user-c08f9fb7ce54@xymon.invalid>

I'm still flummoxed by hobbit-alerts.  I'm certain I broke something,
because I am not getting any alerts from the box.

The only logs in /var/log/hobbit/page.log are 
2005-03-11 07:49:30 Tried to down BOARDBUSY: Invalid argument
2005-03-14 17:24:21 Tried to down BOARDBUSY: Invalid argument

I see a couple of those in the hobbitlaunch.log file as well, I also see
the following error:
2005-03-19 10:14:21 Task bbdisplay started with PID 7417
2005-03-19 10:14:21 Task bbretest started with PID 7418
2005-03-19 10:14:29 Our child has failed and will not talk to us
2005-03-19 10:14:36 Our child has failed and will not talk to us

Not knowning which child makes it difficult to figure out what is going
on.  bbpage is aparently running - the logfile says process 5892 is
bbpage, and there is a process 5892 still running.

I fixed the "unmatched" syntax error I had before.

Here is a sample host that is not paging.  The info page lists:
Alerting: Service Recipient 1st Delay Stop after Repeat Time of Day
Colors 
conn user-290ce4e24e19@xymon.invalid (R) 30m  - 5d  - red 
telnet user-290ce4e24e19@xymon.invalid (R) 30m  - 5d  - red 

Both telnet and conn have been down on this host for over two hours.

The salient rule is:
HOST=%.
        MAIL=user-290ce4e24e19@xymon.invalid REPEAT=140h DURATION>30m
RECOVERED COLOR="red" UNMATCHED

I imagine I'm doing something terribly silly, but I'm just not clear
what it might be.