Xymon alert DURATION fail
list Chris Pretorius
Good day
I have a test that determines I a remote web service is responding. The
webservice is managed by someone outside of my environment.
The test fails a couple of times during the day, manual tests confirm that
the xymon test is ok. The webservice will fail for maybe 30 to 60 minutes
then will restore.
If the Xymon test runs while the service is unavailable, I will receive a
alert and by the time I can investigate all is ok.
I added the DURATION>30 or DURATION>1h to the alerts.cnf to only send me
alerts if the service has been down for a period of time, but I still
receive the alert as soon as Xymon detects the problem.
Is there another way to delay alerts or do I have some error in my configs?
My alerts.cfg
==============
PAGE=%prod SERVICE=institution,wsdl
MAIL user-54f379dcb70f@xymon.invalid DURATION>1h REPEAT 30m COLOR=red
FORMAT=SMS
SCRIPT /usr/lib/xymon/server/ext/smsplus.sh
user-7d9bfaba6b0c@xymon.invalid DURATION>1h REPEAT 30m COLOR=red
Set debug option in tasks.cfg
CMD xymond_channel --channel=page --log=$XYMONSERVERLOGS/alert.log
xymond_alert --checkpoint-file=$XYMONTMP/alert.chk
--checkpoint-interval=600 –debug
Alert.log
=======
10687 2013-03-07 09:29:30 xymond_alert: Got message 237 @@page#237/
domain.co.za|1362641370.822859|196.14.77.34|domain.co.za
|wsdl|196.14.77.34|1362643170|red|green|1362641370|prod|555815|linux|linux||
10687 2013-03-07 09:29:30 startpos 2171, fillpos 2171, endpos -1
10687 2013-03-07 09:29:30 Got page message from domain.co.za:wsdl
10687 2013-03-07 09:29:30 Alert status changed from 0 to 1
10687 2013-03-07 09:29:30 Found a first matching rule
10687 2013-03-07 09:29:30 No more secondary matching rule
10687 2013-03-07 09:29:30 No files modified, skipping reload of
/usr/lib/xymon/server/etc/alerts.cfg
10687 2013-03-07 09:29:30 No files modified, skipping reload of
/usr/lib/xymon/server/etc/holidays.cfg
10687 2013-03-07 09:29:30 Found a first matching rule
10687 2013-03-07 09:29:30 No more secondary matching rule
10687 2013-03-07 09:29:30 Found a first matching rule
10687 2013-03-07 09:29:30 1 alerts to go
12977 2013-03-07 09:29:30 send_alert domain.co.za:wsdl state 0
12977 2013-03-07 09:29:30 Found a first matching rule
12977 2013-03-07 09:29:30 repeat domain.co.za|wsdl|mail|
user-84c190bb76be@xymon.invalid at 0
12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to
user-84c190bb76be@xymon.invalid
10687 2013-03-07 09:29:30 Found a first matching rule
10687 2013-03-07 09:29:30 No more secondary matching rule
10687 2013-03-07 09:29:30 Want msg 238, startpos 2171, fillpos 2171, endpos
-1, usedbytes=0, bufleft=264068
12977 2013-03-07 09:29:30 repeat domain.co.za|wsdl|mail|REPEAT at 0
12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to REPEAT
12977 2013-03-07 09:29:30 repeat domain.co.za|wsdl|mail|30m at 0
12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to 30m
12977 2013-03-07 09:29:30 repeat domain.co.za|wsdl|mail|
user-b3d2309a9720@xymon.invalid at 0
12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to
user-b3d2309a9720@xymon.invalid
12977 2013-03-07 09:29:30 repeat domain.co.za|wsdl|mail|REPEAT at 0
12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to REPEAT
12977 2013-03-07 09:29:30 repeat domain.co.za|wsdl|mail|30m at 0
12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to 30m
12977 2013-03-07 09:29:30 repeat domain.co.za|wsdl|script|
user-b3d2309a9720@xymon.invalid at 0
12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to
user-b3d2309a9720@xymon.invalid
12977 2013-03-07 09:29:30 repeat domain.co.za|wsdl|script|REPEAT at 0
12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to REPEAT
12977 2013-03-07 09:29:30 repeat domain.co.za|wsdl|script|30m at 0
12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to 30m
12977 2013-03-07 09:29:30 No more secondary matching rule
Kind Regards,
Chris Pretorius
Systems Administrator
[image: logo small]
*Your market gateway*
* *
Southdowns Office Park, Block B 2nd Floor SF02, Cnr John Vorster Drive &
Karee Street, Highveld, Centurion
Tel: XXX XXX XXXX [image: dot red] Fax: XXX XXX XXXX [image: dot red]
Email: user-9c2042f5f91b@xymon.invalid [image: dot
red] Web:
www.signio.co.za
[image: 3]
This email and any files transmitted with it are confidential and intended
solely for the use of the individual or entity to whom they are addressed.
If you have received this email in error please notify us via email :
user-353b0f2c6a10@xymon.invalid .
Please note that any views or opinions presented in this email are solely
those of the author and do not necessarily represent those of the company.
Finally, the recipient should check this email and any attachments for the
presence of viruses. The company accepts no liability for any damage caused
by any virus transmitted by this email.
Signio (PTY) LTD, Reg Nr: 2003/001508/07
list David Baldwin
Chris,
▸
Good day I have a test that determines I a remote web service is responding. The webservice is managed by someone outside of my environment. The test fails a couple of times during the day, manual tests confirm that the xymon test is ok. The webservice will fail for maybe 30 to 60 minutes then will restore. If the Xymon test runs while the service is unavailable, I will receive a alert and by the time I can investigate all is ok. I added the DURATION>30 or DURATION>1h to the alerts.cnf to only send me alerts if the service has been down for a period of time, but I still receive the alert as soon as Xymon detects the problem. Is there another way to delay alerts or do I have some error in my configs?
My alerts.cfg uses IGNORE
My alerts.cfg
==============
PAGE=%prod SERVICE=institution,wsdl
MAIL user-54f379dcb70f@xymon.invalid
<mailto:user-54f379dcb70f@xymon.invalid> DURATION>1h REPEAT 30m
COLOR=red FORMAT=SMS
SCRIPT /usr/lib/xymon/server/ext/smsplus.sh
user-7d9bfaba6b0c@xymon.invalid <mailto:user-7d9bfaba6b0c@xymon.invalid>
DURATION>1h REPEAT 30m COLOR=red
Try:
PAGE=%prod SERVICE=institution,wsdl
IGNORE DURATION<1h
MAIL user-54f379dcb70f@xymon.invalid REPEAT=30m COLOR=red FORMAT=SMS
SCRIPT /usr/lib/xymon/server/ext/smsplus.sh
user-7d9bfaba6b0c@xymon.invalid REPEAT=30m COLOR=red
David.▸
Set debug option in tasks.cfg CMD xymond_channel --channel=page --log=$XYMONSERVERLOGS/alert.log xymond_alert --checkpoint-file=$XYMONTMP/alert.chk --checkpoint-interval=600 –debug Alert.log ======= 10687 2013-03-07 09:29:30 xymond_alert: Got message 237
@@page#237/domain.co.za <http://domain.co.za>|1362641370.822859|196.14.77.34|domain.co.za <http://domain.co.za>|wsdl|196.14.77.34|1362643170|red|green|1362641370|prod|555815|linux|linux||
▸
10687 2013-03-07 09:29:30 startpos 2171, fillpos 2171, endpos -1
10687 2013-03-07 09:29:30 Got page message from domain.co.za:wsdl
10687 2013-03-07 09:29:30 Alert status changed from 0 to 1
10687 2013-03-07 09:29:30 Found a first matching rule
10687 2013-03-07 09:29:30 No more secondary matching rule
10687 2013-03-07 09:29:30 No files modified, skipping reload of
/usr/lib/xymon/server/etc/alerts.cfg
10687 2013-03-07 09:29:30 No files modified, skipping reload of
/usr/lib/xymon/server/etc/holidays.cfg
10687 2013-03-07 09:29:30 Found a first matching rule
10687 2013-03-07 09:29:30 No more secondary matching rule
10687 2013-03-07 09:29:30 Found a first matching rule
10687 2013-03-07 09:29:30 1 alerts to go
12977 2013-03-07 09:29:30 send_alert domain.co.za:wsdl state 0
12977 2013-03-07 09:29:30 Found a first matching rule
12977 2013-03-07 09:29:30 repeat domain.co.za <http://domain.co.za>|wsdl|mail|user-84c190bb76be@xymon.invalid <mailto:user-84c190bb76be@xymon.invalid> at 0 12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to user-84c190bb76be@xymon.invalid <mailto:user-84c190bb76be@xymon.invalid>
▸
10687 2013-03-07 09:29:30 Found a first matching rule
10687 2013-03-07 09:29:30 No more secondary matching rule
10687 2013-03-07 09:29:30 Want msg 238, startpos 2171, fillpos 2171,
endpos -1, usedbytes=0, bufleft=264068
12977 2013-03-07 09:29:30 repeat domain.co.za <http://domain.co.za>|wsdl|mail|REPEAT at 0 12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to REPEAT 12977 2013-03-07 09:29:30 repeat domain.co.za <http://domain.co.za>|wsdl|mail|30m at 0 12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to 30m 12977 2013-03-07 09:29:30 repeat domain.co.za <http://domain.co.za>|wsdl|mail|user-b3d2309a9720@xymon.invalid <mailto:user-b3d2309a9720@xymon.invalid> at 0 12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to user-b3d2309a9720@xymon.invalid <mailto:user-b3d2309a9720@xymon.invalid> 12977 2013-03-07 09:29:30 repeat domain.co.za <http://domain.co.za>|wsdl|mail|REPEAT at 0 12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to REPEAT 12977 2013-03-07 09:29:30 repeat domain.co.za <http://domain.co.za>|wsdl|mail|30m at 0 12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to 30m 12977 2013-03-07 09:29:30 repeat domain.co.za <http://domain.co.za>|wsdl|script|user-b3d2309a9720@xymon.invalid <mailto:user-b3d2309a9720@xymon.invalid> at 0 12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to user-b3d2309a9720@xymon.invalid <mailto:user-b3d2309a9720@xymon.invalid> 12977 2013-03-07 09:29:30 repeat domain.co.za <http://domain.co.za>|wsdl|script|REPEAT at 0 12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to REPEAT 12977 2013-03-07 09:29:30 repeat domain.co.za <http://domain.co.za>|wsdl|script|30m at 0
▸
12977 2013-03-07 09:29:30 Alert for domain.co.za:wsdl to 30m
12977 2013-03-07 09:29:30 No more secondary matching rule
Kind Regards,
Chris Pretorius
Systems Administrator
logo small
*/Your market gateway/*
*/ /*
Southdowns Office Park, Block B 2^nd Floor SF02, Cnr John Vorster
Drive & Karee Street, Highveld, Centurion
Tel:XXX XXX XXXX dot red Fax: XXX XXX XXXX dot red Email: user-9c2042f5f91b@xymon.invalid <mailto:user-9c2042f5f91b@xymon.invalid> dot red Web: www.signio.co.za <http://www.signio.co.za/>;
▸
3
This email and any files transmitted with it are confidential and
intended solely for the use of the individual or entity to whom they
are addressed. If you have received this email in error please notify
us via email : user-353b0f2c6a10@xymon.invalid <mailto:user-353b0f2c6a10@xymon.invalid> .
▸
Please note that any views or opinions presented in this email are
solely those of the author and do not necessarily represent those of
the company.
Finally, the recipient should check this email and any attachments for
the presence of viruses. The company accepts no liability for any
damage caused by any virus transmitted by this email.
Signio (PTY) LTD, Reg Nr: 2003/001508/07
--
David Baldwin - Senior Systems Administrator (Datacentres + Networks) Information and Communication Technology Services Australian Sports Commission http://ausport.gov.au Tel 02 62147830 Fax 02 62141830 PO Box 176 Belconnen ACT 2616 user-cbbf693f2c89@xymon.invalid Leverrier Street Bruce ACT 2617 Keep up to date with what's happening in Australian sport visit http://www.ausport.gov.au This message is intended for the addressee named and may contain confidential and privileged information. If you are not the intended recipient please note that any form of distribution, copying or use of this communication or the information in it is strictly prohibited and may be unlawful. If you receive this message in error, please delete it and notify the sender.
list Jeremy Laidman
▸
On 7 March 2013 19:56, Chris Pretorius <user-9c2042f5f91b@xymon.invalid> wrote:
Is there another way to delay alerts or do I have some error in my configs?
Perhaps the config is broken.
PAGE=%prod SERVICE=institution,wsdl
MAIL user-54f379dcb70f@xymon.invalid DURATION>1h REPEAT 30m
COLOR=red FORMAT=SMS
Shouldn't "REPEAT 30m" have an equals sign, like "REPEAT=30m"? J
list Chris Pretorius
Hi Jeremy Thanx for pointing out my typo, Did not receive alerts for this test today, even though the web services I monitor was unavailable for brief periods of time. Just want to mention that with the “=” omitted, I still received my alerts every X minutes.
▸
Kind Regards,
Chris Pretorius
Systems Administrator
[image: logo small]
*Your market gateway*
* *
Southdowns Office Park, Block B 2nd Floor SF02, Cnr John Vorster Drive &
Karee Street, Highveld, Centurion
Tel: XXX XXX XXXX [image: dot red] Fax: XXX XXX XXXX [image: dot red]
Email: user-9c2042f5f91b@xymon.invalid [image: dot
red] Web:
www.signio.co.za
[image: 3]
This email and any files transmitted with it are confidential and intended
solely for the use of the individual or entity to whom they are addressed.
If you have received this email in error please notify us via email :
user-353b0f2c6a10@xymon.invalid .
Please note that any views or opinions presented in this email are solely
those of the author and do not necessarily represent those of the company.
Finally, the recipient should check this email and any attachments for the
presence of viruses. The company accepts no liability for any damage caused
by any virus transmitted by this email.
Signio (PTY) LTD, Reg Nr: 2003/001508/07
*From:* Jeremy Laidman [mailto:user-71895fb2e44c@xymon.invalid]
*Sent:* 08 March 2013 01:55 AM
*To:* Chris Pretorius
*Cc:* xymon at xymon.com
*Subject:* Re: [Xymon] Xymon alert DURATION fail
▸
On 7 March 2013 19:56, Chris Pretorius <user-9c2042f5f91b@xymon.invalid> wrote:
Is there another way to delay alerts or do I have some error in my configs?
Perhaps the config is broken.
PAGE=%prod SERVICE=institution,wsdl
MAIL user-54f379dcb70f@xymon.invalid DURATION>1h REPEAT 30m COLOR=red
FORMAT=SMS
Shouldn't "REPEAT 30m" have an equals sign, like "REPEAT=30m"?
J