Xymon Mailing List Archive search

xymond_alert --test

3 messages in this thread

list Daniel Beruben · Wed, 27 Jun 2012 16:55:19 -0400 ·
Hello list,

I have the new version Xymon Version 4.3.7,
I have a strange error … i'm not able to send the mail alert …

I found no information about this error ! ( code 5 )

#./xymond_alert --test w10 disk
2012-06-27 16:34:58 Whoops ! Failed to send message (Connection failed)
2012-06-27 16:34:58 ->  Could not connect to Xymon daemon at 10.10.3.153:1984 (Connection refused)
2012-06-27 16:34:58 ->  Recipient '10.10.3.153', timeout 15
2012-06-27 16:34:58 ->  1st line: 'config hosts.cfg'
2012-06-27 16:34:58 Cannot load hosts.cfg from xymond, code 5
2012-06-27 16:34:58 Failed to load from xymond, reverting to file-load
00012367 2012-06-27 16:34:58 send_alert w10:disk state Paging
00012367 2012-06-27 16:34:58 Matching host:service:dgroup:page 'w10:disk:NONE:Unix/UNIX_ADSDATA' against rule line 119
00012367 2012-06-27 16:34:58 Failed 'HOST=dc0 SERVICE=cpu,disk' (hostname not in include list)
00012367 2012-06-27 16:34:58 Matching host:service:dgroup:page 'w10:disk:NONE:Unix/UNIX_ADSDATA' against rule line 122
00012367 2012-06-27 16:34:58 *** Match with 'HOST=w10 SERVICE=cpu,disk' ***
00012367 2012-06-27 16:34:58 Matching host:service:dgroup:page 'w10.adsdata.ca:disk:NONE:Unix/UNIX_ADSDATA' against rule line 122
00012367 2012-06-27 16:34:58 *** Match with 'HOST=w10 SERVICE=cpu,disk' ***
00012367 2012-06-27 16:34:58 Mail alert with command '/var/mail/root "Xymon [12345] w10:disk CRITICAL (RED)" user-349c8b888118@xymon.invalid'


If a tried manually it's working !
echo ' test' | mail -s "Xymon [12345] w10:cpu CRITICAL (RED)" user-349c8b888118@xymon.invalid


There is my hosts.cfg

10.10.3.81   monitoring      # bbd http://monitoring/

# / Hypervision Page

title Production
include /usr/local/xymon/server/etc/os/unix/unix
list Jeremy Laidman · Thu, 28 Jun 2012 18:51:11 +1000 ·
The error code for this message comes from the fopen()/open() system calls,
and is defined in errno.h as EIO.  The man page for open() shows that this
indicates a problem accessing a STREAMS file, which was probably the socket
to the Xymon server.

But the real problem appears to be that you can't connect to the Xymon
server as indicated in the earlier error saying "Connection refused".  This
means that the Xymon server at 10.10.3.153 is not listening for connections
on port 1984.  All errors after this are most likely a result of this
failure to connect.

Try doing "telnet 10.10.3.153 1984" and see if you get a connection.  If
not, then make sure xymond is running on the Xymon server.

J

On Thu, Jun 28, 2012 at 6:55 AM, Daniel Beruben
quoted from Daniel Beruben
<user-5b6e30f25c5b@xymon.invalid>wrote:
Hello list,

I have the new version Xymon Version 4.3.7,
I have a strange error … i'm not able to send the mail alert …
• *
I found no information about this error ! ( code 5 )

#./xymond_alert --test w10 disk
2012-06-27 16:34:58 Whoops ! Failed to send message (Connection failed)

2012-06-27 16:34:58 ->  Could not connect to Xymon daemon at 10.10.3.153:1984(Connection refused)
quoted from Daniel Beruben
2012-06-27 16:34:58 ->  Recipient '10.10.3.153', timeout 15
2012-06-27 16:34:58 ->  1st line: 'config hosts.cfg'
2012-06-27 16:34:58 *Cannot load hosts.cfg from xymond, code 5*
2012-06-27 16:34:58 Failed to load from xymond, reverting to file-load
00012367 2012-06-27 16:34:58 send_alert w10:disk state Paging
00012367 2012-06-27 16:34:58 Matching host:service:dgroup:page
'w10:disk:NONE:Unix/UNIX_ADSDATA' against rule line 119
00012367 2012-06-27 16:34:58 Failed 'HOST=dc0 SERVICE=cpu,disk' (hostname
not in include list)
00012367 2012-06-27 16:34:58 Matching host:service:dgroup:page
'w10:disk:NONE:Unix/UNIX_ADSDATA' against rule line 122
00012367 2012-06-27 16:34:58 *** Match with 'HOST=w10 SERVICE=cpu,disk' ***
00012367 2012-06-27 16:34:58 Matching host:service:dgroup:page
'w10.adsdata.ca:disk:NONE:Unix/UNIX_ADSDATA' against rule line 122
00012367 2012-06-27 16:34:58 *** Match with 'HOST=w10 SERVICE=cpu,disk' ***
00012367 2012-06-27 16:34:58 Mail alert with command '/var/mail/root
"Xymon [12345] w10:disk CRITICAL (RED)" user-349c8b888118@xymon.invalid'


If a tried manually it's working !
echo ' test' | mail -s "Xymon [12345] w10:cpu CRITICAL (RED)"
user-349c8b888118@xymon.invalid


There is my hosts.cfg

10.10.3.81   monitoring      # bbd http://monitoring/

# / Hypervision Page

title Production
include /usr/local/xymon/server/etc/os/unix/unix

list Daniel Beruben · Thu, 28 Jun 2012 08:07:57 -0400 ·
Arf the more wired the ip was change ... Look at my hosts files ..
I don t know where this error
quoted from Jeremy Laidman


Le 2012-06-28 à 04:51, "Jeremy Laidman" <user-71895fb2e44c@xymon.invalid<mailto:user-71895fb2e44c@xymon.invalid>> a écrit :

The error code for this message comes from the fopen()/open() system calls, and is defined in errno.h as EIO.  The man page for open() shows that this indicates a problem accessing a STREAMS file, which was probably the socket to the Xymon server.

But the real problem appears to be that you can't connect to the Xymon server as indicated in the earlier error saying "Connection refused".  This means that the Xymon server at 10.10.3.153 is not listening for connections on port 1984.  All errors after this are most likely a result of this failure to connect.

Try doing "telnet 10.10.3.153 1984" and see if you get a connection.  If not, then make sure xymond is running on the Xymon server.

J

On Thu, Jun 28, 2012 at 6:55 AM, Daniel Beruben <user-5b6e30f25c5b@xymon.invalid<mailto:user-5b6e30f25c5b@xymon.invalid>> wrote:
Hello list,

I have the new version Xymon Version 4.3.7,
I have a strange error … i'm not able to send the mail alert …

I found no information about this error ! ( code 5 )

#./xymond_alert --test w10 disk
2012-06-27 16:34:58 Whoops ! Failed to send message (Connection failed)

2012-06-27 16:34:58 ->  Could not connect to Xymon daemon at 10.10.3.153:1984<http://daemon at 10.10.3.153:1984> (Connection refused)
quoted from Jeremy Laidman
2012-06-27 16:34:58 ->  Recipient '10.10.3.153', timeout 15
2012-06-27 16:34:58 ->  1st line: 'config hosts.cfg'
2012-06-27 16:34:58 Cannot load hosts.cfg from xymond, code 5
2012-06-27 16:34:58 Failed to load from xymond, reverting to file-load
00012367 2012-06-27 16:34:58 send_alert w10:disk state Paging
00012367 2012-06-27 16:34:58 Matching host:service:dgroup:page 'w10:disk:NONE:Unix/UNIX_ADSDATA' against rule line 119
00012367 2012-06-27 16:34:58 Failed 'HOST=dc0 SERVICE=cpu,disk' (hostname not in include list)
00012367 2012-06-27 16:34:58 Matching host:service:dgroup:page 'w10:disk:NONE:Unix/UNIX_ADSDATA' against rule line 122
00012367 2012-06-27 16:34:58 *** Match with 'HOST=w10 SERVICE=cpu,disk' ***
00012367 2012-06-27 16:34:58 Matching host:service:dgroup:page 'w10.adsdata.ca:disk:NONE:Unix/UNIX_ADSDATA' against rule line 122
00012367 2012-06-27 16:34:58 *** Match with 'HOST=w10 SERVICE=cpu,disk' ***

00012367 2012-06-27 16:34:58 Mail alert with command '/var/mail/root "Xymon [12345] w10:disk CRITICAL (RED)" user-349c8b888118@xymon.invalid<mailto:user-349c8b888118@xymon.invalid>'


If a tried manually it's working !
echo ' test' | mail -s "Xymon [12345] w10:cpu CRITICAL (RED)" user-349c8b888118@xymon.invalid<mailto:user-349c8b888118@xymon.invalid>
quoted from Jeremy Laidman


There is my hosts.cfg

10.10.3.81   monitoring      # bbd http://monitoring/

# / Hypervision Page

title Production
include /usr/local/xymon/server/etc/os/unix/unix