Xymon Mailing List Archive

Xymon alert DURATION fail

4 messages in this thread

list Chris Pretorius · Thu, 7 Mar 2013 10:56:37 +0200 ·
Good day


I have a test that determines if a remote web service is responding. The
web service is managed by someone outside of my environment.


The test fails a couple of times during the day, and manual tests confirm
that the Xymon test is correct. The web service fails for maybe 30 to 60
minutes and then recovers.

If the Xymon test runs while the service is unavailable, I receive an
alert, and by the time I can investigate everything is OK again.


I added DURATION>30 or DURATION>1h to alerts.cfg so that I only receive
alerts if the service has been down for a period of time, but I still
receive the alert as soon as Xymon detects the problem.


Is there another way to delay alerts or do I have some error in my configs?


My alerts.cfg

==============

PAGE=%prod SERVICE=institution,wsdl
        MAIL user-54f379dcb70f@xymon.invalid DURATION>1h REPEAT 30m COLOR=red FORMAT=SMS
        SCRIPT /usr/lib/xymon/server/ext/smsplus.sh user-7d9bfaba6b0c@xymon.invalid DURATION>1h REPEAT 30m COLOR=red


Set debug option in tasks.cfg

CMD xymond_channel --channel=page --log=$XYMONSERVERLOGS/alert.log xymond_alert --checkpoint-file=$XYMONTMP/alert.chk --checkpoint-interval=600 --debug


Alert.log

=======

10687 2013-03-07 09:29:30 xymond_alert: Got message 237 @@page#237/domain.co.za|1362641370.822859|196.14.77.34|domain.co.za|wsdl|196.14.77.34|1362643170|red|green|1362641370|prod|555815|linux|linux||
10687 2013-03-07 09:29:30 startpos 2171, fillpos 2171, endpos -1
10687 2013-03-07 09:29:30 Got page message from domain.co.za:wsdl
10687 2013-03-07 09:29:30 Alert status changed from 0 to 1
10687 2013-03-07 09:29:30 Found a first matching rule
10687 2013-03-07 09:29:30 No more secondary matching rule
10687 2013-03-07 09:29:30 No files modified, skipping reload of /usr/lib/xymon/server/etc/alerts.cfg
10687 2013-03-07 09:29:30 No files modified, skipping reload of /usr/lib/xymon/server/etc/holidays.cfg
10687 2013-03-07 09:29:30 Found a first matching rule
10687 2013-03-07 09:29:30 No more secondary matching rule
10687 2013-03-07 09:29:30 Found a first matching rule
10687 2013-03-07 09:29:30 1 alerts to go
12977 2013-03-07 09:29:30 send_alert domain.co.za:wsdl state 0
12977 2013-03-07 09:29:30 Found a first matching rule
12977 2013-03-07 09:29:30   repeat domain.co.za|wsdl|mail|user-84c190bb76be@xymon.invalid at 0
12977 2013-03-07 09:29:30   Alert for domain.co.za:wsdl to user-84c190bb76be@xymon.invalid
10687 2013-03-07 09:29:30 Found a first matching rule
10687 2013-03-07 09:29:30 No more secondary matching rule
10687 2013-03-07 09:29:30 Want msg 238, startpos 2171, fillpos 2171, endpos -1, usedbytes=0, bufleft=264068
12977 2013-03-07 09:29:30   repeat domain.co.za|wsdl|mail|REPEAT at 0
12977 2013-03-07 09:29:30   Alert for domain.co.za:wsdl to REPEAT
12977 2013-03-07 09:29:30   repeat domain.co.za|wsdl|mail|30m at 0
12977 2013-03-07 09:29:30   Alert for domain.co.za:wsdl to 30m
12977 2013-03-07 09:29:30   repeat domain.co.za|wsdl|mail|user-b3d2309a9720@xymon.invalid at 0
12977 2013-03-07 09:29:30   Alert for domain.co.za:wsdl to user-b3d2309a9720@xymon.invalid
12977 2013-03-07 09:29:30   repeat domain.co.za|wsdl|mail|REPEAT at 0
12977 2013-03-07 09:29:30   Alert for domain.co.za:wsdl to REPEAT
12977 2013-03-07 09:29:30   repeat domain.co.za|wsdl|mail|30m at 0
12977 2013-03-07 09:29:30   Alert for domain.co.za:wsdl to 30m
12977 2013-03-07 09:29:30   repeat domain.co.za|wsdl|script|user-b3d2309a9720@xymon.invalid at 0
12977 2013-03-07 09:29:30   Alert for domain.co.za:wsdl to user-b3d2309a9720@xymon.invalid
12977 2013-03-07 09:29:30   repeat domain.co.za|wsdl|script|REPEAT at 0
12977 2013-03-07 09:29:30   Alert for domain.co.za:wsdl to REPEAT
12977 2013-03-07 09:29:30   repeat domain.co.za|wsdl|script|30m at 0
12977 2013-03-07 09:29:30   Alert for domain.co.za:wsdl to 30m
12977 2013-03-07 09:29:30 No more secondary matching rule


Kind Regards,


Chris Pretorius

Systems Administrator


*Your market gateway*

  Southdowns Office Park, Block B 2nd Floor SF02, Cnr John Vorster Drive &
Karee Street, Highveld, Centurion

  Tel: XXX XXX XXXX  Fax: XXX XXX XXXX

  Email: user-9c2042f5f91b@xymon.invalid  Web: www.signio.co.za


This email and any files transmitted with it are confidential and intended
solely for the use of the individual or entity to whom they are addressed.
If you have received this email in error please notify us via email :
user-353b0f2c6a10@xymon.invalid .

Please note that any views or opinions presented in this email are solely
those of the author and do not necessarily represent those of the company.

Finally, the recipient should check this email and any attachments for the
presence of viruses. The company accepts no liability for any damage caused
by any virus transmitted by this email.


Signio (PTY) LTD, Reg Nr: 2003/001508/07
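
Editor's note: in the debug log above, xymond_alert reports alerts going "to REPEAT" and "to 30m", which suggests those words were read as recipients rather than as a repeat setting. The following is a minimal sketch, not part of Xymon, of how such lines could be picked out of alert.log. The function name and the "looks like a recipient" heuristic are assumptions made for illustration, and it assumes each log entry sits on a single line.

```python
import re

# Matches the "Alert for HOST:SERVICE to RECIPIENT" debug lines seen in the log.
ALERT_LINE = re.compile(r"Alert for (\S+):(\S+) to (\S+)\s*$")


def suspicious_recipients(log_lines):
    """Return recipients that look like neither a mail address nor a path."""
    found = []
    for line in log_lines:
        match = ALERT_LINE.search(line)
        if not match:
            continue
        recipient = match.group(3)
        # Heuristic (an assumption): real recipients contain "@" or start with "/".
        if "@" not in recipient and not recipient.startswith("/"):
            if recipient not in found:
                found.append(recipient)
    return found


# Sample lines modelled on the log in this message (address is a placeholder).
sample = [
    "12977 2013-03-07 09:29:30   Alert for domain.co.za:wsdl to user@example.invalid",
    "12977 2013-03-07 09:29:30   Alert for domain.co.za:wsdl to REPEAT",
    "12977 2013-03-07 09:29:30   Alert for domain.co.za:wsdl to 30m",
]
print(suspicious_recipients(sample))  # ['REPEAT', '30m']
```

Any name this reports is worth checking against the matching rule in alerts.cfg.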
list David Baldwin · Fri, 8 Mar 2013 09:43:18 +1100 ·
Chris,
My alerts.cfg uses IGNORE.
quoted from Chris Pretorius
My alerts.cfg

==============

PAGE=%prod SERVICE=institution,wsdl
        MAIL user-54f379dcb70f@xymon.invalid DURATION>1h REPEAT 30m COLOR=red FORMAT=SMS
        SCRIPT /usr/lib/xymon/server/ext/smsplus.sh user-7d9bfaba6b0c@xymon.invalid DURATION>1h REPEAT 30m COLOR=red
Try:

PAGE=%prod SERVICE=institution,wsdl
        IGNORE DURATION<1h
        MAIL user-54f379dcb70f@xymon.invalid REPEAT=30m COLOR=red FORMAT=SMS
        SCRIPT /usr/lib/xymon/server/ext/smsplus.sh user-7d9bfaba6b0c@xymon.invalid REPEAT=30m COLOR=red

David.
-- 

David Baldwin - Senior Systems Administrator (Datacentres + Networks)
Information and Communication Technology Services
Australian Sports Commission          http://ausport.gov.au
Tel 02 62147830 Fax 02 62141830       PO Box 176 Belconnen ACT 2616
user-cbbf693f2c89@xymon.invalid          Leverrier Street Bruce ACT 2617


Keep up to date with what's happening in Australian sport visit http://www.ausport.gov.au

This message is intended for the addressee named and may contain confidential and privileged information. If you are not the intended recipient please note that any form of distribution, copying or use of this communication or the information in it is strictly prohibited and may be unlawful. If you receive this message in error, please delete it and notify the sender.
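
Editor's note: David's suggestion places an "IGNORE DURATION<1h" recipient ahead of the MAIL and SCRIPT recipients. The sketch below is a simplified model of that idea, not xymond_alert's actual code: recipients are checked in order, and a matching IGNORE stops the search so nothing after it is notified. The function names, the tuple layout, and the unit handling (bare numbers treated as minutes) are assumptions made for illustration.

```python
def parse_duration(text):
    """Convert '30', '30m', '1h' or '2d' to minutes (bare numbers assumed minutes)."""
    units = {"m": 1, "h": 60, "d": 1440}
    if text[-1] in units:
        return int(text[:-1]) * units[text[-1]]
    return int(text)


def recipients_to_notify(recipients, down_minutes):
    """Walk recipients in order; a matching IGNORE ends the search."""
    notified = []
    for kind, target, criteria in recipients:
        op, limit = criteria if criteria else (None, None)
        matches = (
            op is None
            or (op == "<" and down_minutes < parse_duration(limit))
            or (op == ">" and down_minutes > parse_duration(limit))
        )
        if not matches:
            continue
        if kind == "IGNORE":
            break  # no later recipient is considered
        notified.append(target)
    return notified


# Hypothetical rule list mirroring the suggested config (address is a placeholder).
rules = [
    ("IGNORE", None, ("<", "1h")),
    ("MAIL", "user@example.invalid", None),
    ("SCRIPT", "/usr/lib/xymon/server/ext/smsplus.sh", None),
]
print(recipients_to_notify(rules, 10))  # outage of 10 minutes: nobody is notified
print(recipients_to_notify(rules, 90))  # outage of 90 minutes: both recipients
```

Because the IGNORE line ends the search when it matches, its position relative to the other recipients matters in this model.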
list Jeremy Laidman · Fri, 8 Mar 2013 10:55:26 +1100 ·
quoted from Chris Pretorius
On 7 March 2013 19:56, Chris Pretorius <user-9c2042f5f91b@xymon.invalid> wrote:
Is there another way to delay alerts or do I have some error in my configs?
Perhaps the config is broken.

PAGE=%prod SERVICE=institution,wsdl

        MAIL user-54f379dcb70f@xymon.invalid DURATION>1h REPEAT 30m COLOR=red FORMAT=SMS

Shouldn't "REPEAT 30m" have an equals sign, like "REPEAT=30m"?

J
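
Editor's note: the missing equals sign Jeremy points out is easy to overlook by eye. As a rough sketch, a rule line can be split on whitespace and checked for keywords that stand alone with no operator attached. The keyword list is taken only from the rules quoted in this thread and is an assumption, not the full alerts.cfg grammar; the function name is hypothetical.

```python
# Keywords seen in this thread's rules; not a complete list of alerts.cfg keywords.
KEYWORDS = ("REPEAT", "DURATION", "COLOR", "FORMAT")


def bare_keywords(rule_line):
    """Return keywords that appear as a standalone token, e.g. 'REPEAT 30m'."""
    return [token for token in rule_line.split() if token in KEYWORDS]


# Rule modelled on the one quoted above (address is a placeholder).
line = "MAIL user@example.invalid DURATION>1h REPEAT 30m COLOR=red FORMAT=SMS"
print(bare_keywords(line))  # ['REPEAT']
```

A correctly written setting such as "REPEAT=30m" is a single token containing the operator, so it is not reported.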
list Chris Pretorius · Fri, 8 Mar 2013 16:30:32 +0200 ·
Hi Jeremy


Thanks for pointing out my typo. I did not receive alerts for this test
today, even though the web service I monitor was unavailable for brief
periods of time.


Just want to mention that with the “=” omitted, I still received my alerts
every X minutes.


Kind Regards,


Chris Pretorius

Systems Administrator

