Hi,
We use the RECOVERED keyword for all recipients defined in hobbit-alerts.cfg.
We noticed a problem for hosts where alerting for a given service is excluded during a certain time. When a problem occurs on the service -out of the exclusion time-, the yellow/red alarms get sent. When the problem is resolved though, there is no recovered confirmation message/SMS. This issue is not related to the amount of time the service was down.
Example configuration and logs:
----hobbit-alerts.cfg----
...
...
# Do not send anything for given service(s) during period of time
HOST=test3 SERVICE=http TIME=*:0305:0315
...
...
# Rules by administrator
HOST=test3
MAIL user-5a72e5dcda3f@xymon.invalid REPEAT=24h RECOVERED
SCRIPT /usr/local/sendsms 0123456789 COLOR=red FORMAT=SMS REPEAT=24h RECOVERED
...
...
-----notification.log-----
Mon May 22 10:23:54 2006 test3.http (13.22.8.8) test.example at com 1148286234 600
Mon May 22 10:24:34 2006 test3.http (13.22.8.8) 0123456789 1148286234 600
...
...
------histfile for test3----------
Last 50 log entries (Full HTML log)
Date Status Duration
Mon May 22 10:24:15 2006 green 0:40:50
Mon May 22 10:23:54 2006 red 0:00:21
Is this a bug or a is something wrong with the exclusion specification?
Thanks
Dominique
UNIL - University of Lausanne