I'm still flummoxed by hobbit-alerts. I'm certain I broke something,
because I am not getting any alerts from the box.
The only logs in /var/log/hobbit/page.log are
2005-03-11 07:49:30 Tried to down BOARDBUSY: Invalid argument
2005-03-14 17:24:21 Tried to down BOARDBUSY: Invalid argument
I see a couple of those in the hobbitlaunch.log file as well, I also see
the following error:
2005-03-19 10:14:21 Task bbdisplay started with PID 7417
2005-03-19 10:14:21 Task bbretest started with PID 7418
2005-03-19 10:14:29 Our child has failed and will not talk to us
2005-03-19 10:14:36 Our child has failed and will not talk to us
Not knowning which child makes it difficult to figure out what is going
on. bbpage is aparently running - the logfile says process 5892 is
bbpage, and there is a process 5892 still running.
I fixed the "unmatched" syntax error I had before.
Here is a sample host that is not paging. The info page lists:
Alerting: Service Recipient 1st Delay Stop after Repeat Time of Day
Colors
conn user-290ce4e24e19@xymon.invalid (R) 30m - 5d - red
telnet user-290ce4e24e19@xymon.invalid (R) 30m - 5d - red
Both telnet and conn have been down on this host for over two hours.
The salient rule is:
HOST=%.
MAIL=user-290ce4e24e19@xymon.invalid REPEAT=140h DURATION>30m
RECOVERED COLOR="red" UNMATCHED
I imagine I'm doing something terribly silly, but I'm just not clear
what it might be.