Xymon Mailing List Archive

PLEASE HELP: PAGE not sending alerts

Henrik Størner
Fri, 23 Jun 2006 23:01:53 +0200
Message-Id: <user-6617bc2ac3c2@xymon.invalid>

On Fri, Jun 23, 2006 at 10:25:18AM -0700, Williams, Jeffrey wrote:
> Hobbit 4.1.2p1 on Fedora4
> I have tried everything and can't get this to work.
> The PAGE=/ works but none of the PAGE tags for anything below that do.

> bb-hosts
>
> title Newport Corporation
> page            usirv                   USIRV
> title USIRV
> subpage         usirvpivotal            Pivotal Servers
> title USIRV - Pivotal Servers
> 10.25.1.35      usirv-piv01             # sql rdp DESCR:server:"Pivotal main"
> 10.25.1.14      usirv-piv02             # rdp DESCR:server:"Pivotal Irvine satellite"
> 10.25.1.9       usirv-piv03             # sql rdp DESCR:server:"Pivotal HTTP sync, web store"
> 10.25.1.59      usirv-pivdev01          # sql rdp DESCR:server:"Pivotal development"
All of your hosts here are on the usirv/usirvpivotal page.
> hobbit-alerts.cfg
>
> PAGE=/
>         MAIL $jw REPEAT=24h RECOVERED NOTICE
>         MAIL $dh RECOVERED COLOR=red
> PAGE=usirv
>         MAIL $td RECOVERED
> PAGE=usirv/usirvpivotal
>         MAIL $st RECOVERED
So all of these hosts should match the last rule only, i.e. alerts
should go to whatever is in the "$st" macro.
> [root@usirv-bb01 hobbit]# /usr/lib/hobbit/server/bin/bbcmd hobbitd_alert --test usirv-piv01 conn
> Using default environment file /usr/lib/hobbit/server/etc/hobbitserver.cfg
> send_alert usirv-piv01:conn state Paging
> Matching host:service:page 'usirv-piv01:conn:' against rule line 144
Now, this debug output doesn't match the bb-hosts snippet you included
here. The "host:service:page" appears as "usirv-piv01:conn:" - i.e. the
usirv-piv01 host is found to be located on the top-level page, not the
"usirv/usirvpivotal" page.
> *** Match with 'PAGE=/' ***
Which fits with this match.
> Matching host:service:page 'usirv-piv01:conn:' against rule line 145
> *** Match with 'MAIL $jw REPEAT=24h RECOVERED NOTICE' ***
> Mail alert with command 'mail -s "Hobbit [12345] usirv-piv01:conn CRITICAL (RED)" user-17399b280feb@xymon.invalid'
> Matching host:service:page 'usirv-piv01:conn:' against rule line 146
> *** Match with 'MAIL $dh RECOVERED COLOR=red' ***
> Mail alert with command 'mail -s "Hobbit [12345] usirv-piv01:conn CRITICAL (RED)" user-88ca097bfa4a@xymon.invalid'
So you get alerts to the $jw and $dh recipients.
> Matching host:service:page 'usirv-piv01:conn:' against rule line 147
> Failed 'PAGE=usirv' (pagename not in include list)
> Matching host:service:page 'usirv-piv01:conn:' against rule line 149
> Failed 'PAGE=usirv/usirvpivotal' (pagename not in include list)
And not with the pagename you expect.
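
To check quickly which page the alert module assigns to a host, you can
pull the third field out of the "Matching host:service:page" lines of the
--test output. The following is only a rough Python sketch, not part of
Hobbit: the function name pages_seen and the regular expression are my
own, and it assumes the debug line format stays as shown in this thread.

```python
import re

# Assumption: debug lines look like
#   Matching host:service:page 'HOST:SERVICE:PAGE' against rule line N
# as in the output quoted in this thread. Host and service names are
# assumed not to contain ':' or "'".
MATCH_LINE = re.compile(r"Matching host:service:page '([^:']*):([^:']*):([^']*)'")


def pages_seen(log_lines):
    """Return the set of (host, service, page) triples found in the output.

    An empty page string means the host was placed on the top-level page.
    """
    found = set()
    for line in log_lines:
        m = MATCH_LINE.search(line)
        if m:
            found.add(m.groups())
    return found


if __name__ == "__main__":
    import sys

    # Usage idea: pipe the hobbitd_alert --test output into this script.
    for host, service, page in sorted(pages_seen(sys.stdin)):
        print(f"{host}:{service} -> page '{page or '(top level)'}'")
```

An empty page field, as in your output, means the alert module has the
host on the top-level page.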


If I cut-and-paste your bb-hosts file and hobbit-alerts.cfg into my own
configuration, I get the correct result:

$ grep -v "^#" /etc/hobbit/bb-hosts
title Newport Corporation
page            usirv                   USIRV
title USIRV
subpage         usirvpivotal            Pivotal Servers
title USIRV - Pivotal Servers
10.25.1.35      usirv-piv01             # sql rdp DESCR:server:"Pivotal main"
10.25.1.14      usirv-piv02             # rdp DESCR:server:"Pivotal Irvine satellite"
10.25.1.9       usirv-piv03             # sql rdp DESCR:server:"Pivotal HTTP sync, web store"
10.25.1.59      usirv-pivdev01          # sql rdp DESCR:server:"Pivotal development"

$ grep -v "^#" /etc/hobbit/hobbit-alerts.cfg
$jw=user-a0f2385d0910@xymon.invalid
$dh=user-a7477abeff1c@xymon.invalid
$dt=user-17ef2e5a2fba@xymon.invalid
$st=user-9545a618124f@xymon.invalid
PAGE=/
	MAIL $jw REPEAT=24h RECOVERED NOTICE
	MAIL $dh RECOVERED COLOR=red
PAGE=usirv
	MAIL $td RECOVERED
PAGE=usirv/usirvpivotal
	MAIL $st RECOVERED

$ ./common/bbcmd ./hobbitd/hobbitd_alert --test usirv-piv01 conn
2006-06-23 22:59:06 Using default environment file /usr/lib/hobbit/server/etc/hobbitserver.cfg
00017896 2006-06-23 22:59:06 send_alert usirv-piv01:conn state Paging
00017896 2006-06-23 22:59:06 Matching host:service:page 'usirv-piv01:conn:usirv/usirvpivotal' against rule line 121
00017896 2006-06-23 22:59:06 Failed 'PAGE=/' (pagename not in include list)
00017896 2006-06-23 22:59:06 Matching host:service:page 'usirv-piv01:conn:usirv/usirvpivotal' against rule line 124
00017896 2006-06-23 22:59:06 Failed 'PAGE=usirv' (pagename not in include list)
00017896 2006-06-23 22:59:06 Matching host:service:page 'usirv-piv01:conn:usirv/usirvpivotal' against rule line 126
00017896 2006-06-23 22:59:06 *** Match with 'PAGE=usirv/usirvpivotal' ***
00017896 2006-06-23 22:59:06 Matching host:service:page 'usirv-piv01:conn:usirv/usirvpivotal' against rule line 127
00017896 2006-06-23 22:59:06 *** Match with 'MAIL $st RECOVERED' ***
00017896 2006-06-23 22:59:06 Mail alert with command 'mail -s "Hobbit [12345] usirv-piv01:conn CRITICAL (RED)" user-9545a618124f@xymon.invalid'


So my immediate reaction is that these hosts must be listed twice in
your bb-hosts file, with a second entry outside the section shown here.
If that is not the case, pack your bb-hosts and hobbit-alerts.cfg files
in a tar-file and send them to me for inspection.
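
A duplicate entry is easy to overlook in a long bb-hosts file, so here
is a rough Python sketch for finding one. It is not a Hobbit tool, and it
makes simplifying assumptions: only "page" and "subpage" lines change the
current page, hosts have IPv4 addresses, and "include" directives and
other layout keywords are ignored. The names hosts_by_page, duplicates
and SAMPLE_BBHOSTS are made up, and the sample data (with an extra
top-level entry for usirv-piv01) is hypothetical.

```python
import re
from collections import defaultdict

# An IPv4 address followed by a hostname, as in the bb-hosts lines above.
HOST_LINE = re.compile(r"^(\d{1,3}(?:\.\d{1,3}){3})\s+(\S+)")


def hosts_by_page(lines):
    """Map each hostname to the list of page paths it is listed under."""
    page = subpage = ""
    seen = defaultdict(list)
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split()
        if fields[0] == "page":
            # A new page resets any subpage that was in effect.
            page = fields[1] if len(fields) > 1 else ""
            subpage = ""
        elif fields[0] == "subpage":
            subpage = fields[1] if len(fields) > 1 else ""
        else:
            m = HOST_LINE.match(line)
            if m:
                path = "/".join(p for p in (page, subpage) if p)
                seen[m.group(2)].append(path)
    return dict(seen)


def duplicates(lines):
    """Return only the hosts that appear more than once."""
    return {h: p for h, p in hosts_by_page(lines).items() if len(p) > 1}


# Hypothetical sample: usirv-piv01 is listed once before any "page" line
# (top level, shown as an empty path) and once on usirv/usirvpivotal.
SAMPLE_BBHOSTS = """\
10.25.1.35      usirv-piv01             # conn
page            usirv                   USIRV
subpage         usirvpivotal            Pivotal Servers
10.25.1.35      usirv-piv01             # sql rdp
10.25.1.14      usirv-piv02             # rdp
""".splitlines()

if __name__ == "__main__":
    print(duplicates(SAMPLE_BBHOSTS))
```

Any host this reports with an empty path next to "usirv/usirvpivotal"
would be a candidate for the behaviour you are seeing.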


Regards,
Henrik