Help with GROUP and alerting
list Greg Shea
Hi all,
I'm having some trouble either understanding how to use groups or my
syntax is wrong, or
maybe even a combo of both. Like most other organizations, the
sysadmins don't want to be
alerted for disk usage of database and application groups. Some of our
larger applications
can have DBAs responsible for their disks, 2 different apps-dev groups
responsible for their
disks, and finally the sysadmins. I tried a 2 disk scenario and a
couple of different ways
to see if I could notify on separate disks and I'm not getting the
desired results.
Is anyone else doing this??
Thanks
-Grs-
Gregory R Shea
EMC Corporation
Hobbit-clients.cfg
HOST=raditz
LOAD 0.0 0.1
DISK / 52 60 GROUP=rootdisk
DISK /data 35 40 GROUP=datadisk
PROC sm_serviced
Hobbit-alerts.cfg
HOST=raditz
## MAIL user-762ee872a5a4@xymon.invalid RECOVERED SERVICE=disk GROUP=rootdisk
MAIL user-762ee872a5a4@xymon.invalid RECOVERED GROUP=rootdisk
SCRIPT /apps/hobbit/server/etc/disk-alert.sh gshea RECOVERED
SERVICE=disk EXGROUP=rootdisk
SCRIPT /apps/hobbit/server/etc/script2 gshea RECOVERED
SERVICE=msgs
[hobbit at hobbitmon etc]$ ../bin/bbcmd ../bin/hobbitd_alert --debug --test
raditz rootdisk
2007-07-23 10:51:03 Using default environment file
/apps/hobbit/server/etc/hobbitserver.cfg
2007-07-23 10:51:03 Opening file /apps/hobbit/server/etc/bb-hosts
2007-07-23 10:51:03 Opening file
/apps/hobbit/server/etc/hobbit-alerts.cfg
2007-07-23 10:51:03 Compiling regex .*
2007-07-23 10:51:03 send_alert raditz:rootdisk state 0
00027629 2007-07-23 10:51:03 send_alert raditz:rootdisk state Paging
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 160
00027629 2007-07-23 10:51:03 Failed 'HOST=almoraprd01 SERVICE=oratnsP'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 164
00027629 2007-07-23 10:51:03 Failed 'HOST=almoradr01
SERVICE=oratns1T,oratns2T,oratnsD' (hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 168
00027629 2007-07-23 10:51:03 Failed 'HOST=almorarpt01 SERVICE=oratnsD'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 180
00027629 2007-07-23 10:51:03 Failed 'HOST=rangers,redwings
SERVICE=disk,memory,msgs,network,procs' (hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 184
00027629 2007-07-23 10:51:03 Failed 'HOST=bruins,blackhawks
SERVICE=disk,memory,msgs,network,procs' (hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 187
00027629 2007-07-23 10:51:03 Failed 'HOST=vhmaster SERVICE=disk'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 190
00027629 2007-07-23 10:51:03 Failed 'HOST=emslabicsam01 SERVICE=disk'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 194
00027629 2007-07-23 10:51:03 Failed 'HOST=emcov1 SERVICE=procs'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 198
00027629 2007-07-23 10:51:03 *** Match with 'HOST=raditz' ***
2007-07-23 10:51:03 Found a first matching rule
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 199
00027629 2007-07-23 10:51:03 Failed 'MAIL user-762ee872a5a4@xymon.invalid RECOVERED
GROUP=rootdisk' (group not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 202
00027629 2007-07-23 10:51:03 Failed 'SCRIPT
/apps/hobbit/server/etc/disk-alert.sh gshea RECOVERED SERVICE=disk
EXGROUP=rootdisk' (service not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 203
00027629 2007-07-23 10:51:03 Failed 'SCRIPT
/apps/hobbit/server/etc/script2 gshea RECOVERED SERVICE=msgs' (service
not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 205
00027629 2007-07-23 10:51:03 Failed 'HOST=doctorj SERVICE=disk'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 209
00027629 2007-07-23 10:51:03 Failed 'HOST=emslabicip01 SERVICE=msgs'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 213
00027629 2007-07-23 10:51:03 Failed 'HOST=vapoll2 SERVICE=logs'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 216
00027629 2007-07-23 10:51:03 Failed 'HOST=got01-automon01' (hostname not
in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 219
00027629 2007-07-23 10:51:03 Failed
'HOST=CORPUSWEB190,CORPUSWEB191,CORPUSWEB196,CORPUSWEB198,CORPUSWEB199
SERVICE=procs' (hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 222
00027629 2007-07-23 10:51:03 Failed 'HOST=%.*
SERVICE=ntp,emc,nfs,temp,network' (service not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 224
00027629 2007-07-23 10:51:03 *** Match with
'EXSERVICE=cpu,files,netstat,ports,vmstat' ***
2007-07-23 10:51:03 Found a secondary matching rule
[hobbit at hobbitmon etc]$
list T.J. Yang
▸
From: user-762ee872a5a4@xymon.invalid Reply-To: user-ae9b8668bcde@xymon.invalid To: <user-ae9b8668bcde@xymon.invalid> CC: <user-762ee872a5a4@xymon.invalid> Subject: [hobbit] Help with GROUP and alerting Date: Mon, 23 Jul 2007 11:22:59 -0400 Hi all, I'm having some trouble either understanding how to use groups or my syntax is wrong, or maybe even a combo of both. Like most other organizations, the sysadmins don't want to be alerted for disk usage of database and application groups. Some of our larger applications can have DBAs responsible for their disks, 2 different apps-dev groups responsible for their disks, and finally the sysadmins. I tried a 2 disk scenario and a couple of different ways to see if I could notify on separate disks and I'm not getting the desired results. Is anyone else doing this??
Yes, I can confirm GROUP(process and disk partition) is working. I will post my test case that works on wiki FAQ. Need edit mine for public view. tj
▸
Thanks
-Grs-
Gregory R Shea
EMC Corporation
Hobbit-clients.cfg
HOST=raditz
LOAD 0.0 0.1
DISK / 52 60 GROUP=rootdisk
DISK /data 35 40 GROUP=datadisk
PROC sm_serviced
Hobbit-alerts.cfg
HOST=raditz
## MAIL user-762ee872a5a4@xymon.invalid RECOVERED SERVICE=disk GROUP=rootdisk
MAIL user-762ee872a5a4@xymon.invalid RECOVERED GROUP=rootdisk
SCRIPT /apps/hobbit/server/etc/disk-alert.sh gshea RECOVERED
SERVICE=disk EXGROUP=rootdisk
SCRIPT /apps/hobbit/server/etc/script2 gshea RECOVERED
SERVICE=msgs
[hobbit at hobbitmon etc]$ ../bin/bbcmd ../bin/hobbitd_alert --debug --test
raditz rootdisk
2007-07-23 10:51:03 Using default environment file
/apps/hobbit/server/etc/hobbitserver.cfg
2007-07-23 10:51:03 Opening file /apps/hobbit/server/etc/bb-hosts
2007-07-23 10:51:03 Opening file
/apps/hobbit/server/etc/hobbit-alerts.cfg
2007-07-23 10:51:03 Compiling regex .*
2007-07-23 10:51:03 send_alert raditz:rootdisk state 0
00027629 2007-07-23 10:51:03 send_alert raditz:rootdisk state Paging
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 160
00027629 2007-07-23 10:51:03 Failed 'HOST=almoraprd01 SERVICE=oratnsP'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 164
00027629 2007-07-23 10:51:03 Failed 'HOST=almoradr01
SERVICE=oratns1T,oratns2T,oratnsD' (hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 168
00027629 2007-07-23 10:51:03 Failed 'HOST=almorarpt01 SERVICE=oratnsD'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 180
00027629 2007-07-23 10:51:03 Failed 'HOST=rangers,redwings
SERVICE=disk,memory,msgs,network,procs' (hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 184
00027629 2007-07-23 10:51:03 Failed 'HOST=bruins,blackhawks
SERVICE=disk,memory,msgs,network,procs' (hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 187
00027629 2007-07-23 10:51:03 Failed 'HOST=vhmaster SERVICE=disk'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 190
00027629 2007-07-23 10:51:03 Failed 'HOST=emslabicsam01 SERVICE=disk'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 194
00027629 2007-07-23 10:51:03 Failed 'HOST=emcov1 SERVICE=procs'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 198
00027629 2007-07-23 10:51:03 *** Match with 'HOST=raditz' ***
2007-07-23 10:51:03 Found a first matching rule
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 199
00027629 2007-07-23 10:51:03 Failed 'MAIL user-762ee872a5a4@xymon.invalid RECOVERED
GROUP=rootdisk' (group not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 202
00027629 2007-07-23 10:51:03 Failed 'SCRIPT
/apps/hobbit/server/etc/disk-alert.sh gshea RECOVERED SERVICE=disk
EXGROUP=rootdisk' (service not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 203
00027629 2007-07-23 10:51:03 Failed 'SCRIPT
/apps/hobbit/server/etc/script2 gshea RECOVERED SERVICE=msgs' (service
not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 205
00027629 2007-07-23 10:51:03 Failed 'HOST=doctorj SERVICE=disk'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 209
00027629 2007-07-23 10:51:03 Failed 'HOST=emslabicip01 SERVICE=msgs'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 213
00027629 2007-07-23 10:51:03 Failed 'HOST=vapoll2 SERVICE=logs'
(hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 216
00027629 2007-07-23 10:51:03 Failed 'HOST=got01-automon01' (hostname not
in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 219
00027629 2007-07-23 10:51:03 Failed
'HOST=CORPUSWEB190,CORPUSWEB191,CORPUSWEB196,CORPUSWEB198,CORPUSWEB199
SERVICE=procs' (hostname not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 222
00027629 2007-07-23 10:51:03 Failed 'HOST=%.*
SERVICE=ntp,emc,nfs,temp,network' (service not in include list)
00027629 2007-07-23 10:51:03 Matching host:service:page
'raditz:rootdisk:hobbit-uclients' against rule line 224
00027629 2007-07-23 10:51:03 *** Match with
'EXSERVICE=cpu,files,netstat,ports,vmstat' ***
2007-07-23 10:51:03 Found a secondary matching rule
[hobbit at hobbitmon etc]$
http://liveearth.msn.com
list T.J. Yang
This FAQ is my first draft. edit this FAQ where you see fit. http://en.wikibooks.org/wiki/System_Monitoring_with_Hobbit/Other_Docs/FAQ#Q._How_do_I_configure_.22GROUP.22_alerts_.3F T.J. Yang
▸
From: "T.J. Yang" <user-8e841282cda5@xymon.invalid> Reply-To: user-ae9b8668bcde@xymon.invalid To: user-ae9b8668bcde@xymon.invalid CC: user-762ee872a5a4@xymon.invalid Subject: RE: [hobbit] Help with GROUP and alerting Date: Mon, 23 Jul 2007 10:31:25 -0500From: user-762ee872a5a4@xymon.invalid Reply-To: user-ae9b8668bcde@xymon.invalid To: <user-ae9b8668bcde@xymon.invalid> CC: <user-762ee872a5a4@xymon.invalid> Subject: [hobbit] Help with GROUP and alerting Date: Mon, 23 Jul 2007 11:22:59 -0400 Hi all, I'm having some trouble either understanding how to use groups or my syntax is wrong, or maybe even a combo of both. Like most other organizations, the sysadmins don't want to be alerted for disk usage of database and application groups. Some of our larger applications can have DBAs responsible for their disks, 2 different apps-dev groups responsible for their disks, and finally the sysadmins. I tried a 2 disk scenario and a couple of different ways to see if I could notify on separate disks and I'm not getting the desired results. Is anyone else doing this??Yes, I can confirm GROUP(process and disk partition) is working. I will post my test case that works on wiki FAQ. Need edit mine for public view. tjThanks -Grs- Gregory R Shea EMC Corporation Hobbit-clients.cfg HOST=raditz LOAD 0.0 0.1 DISK / 52 60 GROUP=rootdisk DISK /data 35 40 GROUP=datadisk PROC sm_serviced Hobbit-alerts.cfg HOST=raditz ## MAIL user-762ee872a5a4@xymon.invalid RECOVERED SERVICE=disk GROUP=rootdisk MAIL user-762ee872a5a4@xymon.invalid RECOVERED GROUP=rootdisk SCRIPT /apps/hobbit/server/etc/disk-alert.sh gshea RECOVERED SERVICE=disk EXGROUP=rootdisk SCRIPT /apps/hobbit/server/etc/script2 gshea RECOVERED SERVICE=msgs [hobbit at hobbitmon etc]$ ../bin/bbcmd ../bin/hobbitd_alert --debug --test raditz rootdisk 2007-07-23 10:51:03 Using default environment file /apps/hobbit/server/etc/hobbitserver.cfg 2007-07-23 10:51:03 Opening file /apps/hobbit/server/etc/bb-hosts 2007-07-23 10:51:03 Opening file /apps/hobbit/server/etc/hobbit-alerts.cfg 2007-07-23 10:51:03 Compiling regex .* 2007-07-23 10:51:03 send_alert raditz:rootdisk state 0 00027629 2007-07-23 10:51:03 send_alert raditz:rootdisk state Paging 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 160 00027629 2007-07-23 10:51:03 Failed 'HOST=almoraprd01 SERVICE=oratnsP' (hostname not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 164 00027629 2007-07-23 10:51:03 Failed 'HOST=almoradr01 SERVICE=oratns1T,oratns2T,oratnsD' (hostname not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 168 00027629 2007-07-23 10:51:03 Failed 'HOST=almorarpt01 SERVICE=oratnsD' (hostname not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 180 00027629 2007-07-23 10:51:03 Failed 'HOST=rangers,redwings SERVICE=disk,memory,msgs,network,procs' (hostname not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 184 00027629 2007-07-23 10:51:03 Failed 'HOST=bruins,blackhawks SERVICE=disk,memory,msgs,network,procs' (hostname not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 187 00027629 2007-07-23 10:51:03 Failed 'HOST=vhmaster SERVICE=disk' (hostname not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 190 00027629 2007-07-23 10:51:03 Failed 'HOST=emslabicsam01 SERVICE=disk' (hostname not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 194 00027629 2007-07-23 10:51:03 Failed 'HOST=emcov1 SERVICE=procs' (hostname not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 198 00027629 2007-07-23 10:51:03 *** Match with 'HOST=raditz' *** 2007-07-23 10:51:03 Found a first matching rule 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 199 00027629 2007-07-23 10:51:03 Failed 'MAIL user-762ee872a5a4@xymon.invalid RECOVERED GROUP=rootdisk' (group not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 202 00027629 2007-07-23 10:51:03 Failed 'SCRIPT /apps/hobbit/server/etc/disk-alert.sh gshea RECOVERED SERVICE=disk EXGROUP=rootdisk' (service not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 203 00027629 2007-07-23 10:51:03 Failed 'SCRIPT /apps/hobbit/server/etc/script2 gshea RECOVERED SERVICE=msgs' (service not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 205 00027629 2007-07-23 10:51:03 Failed 'HOST=doctorj SERVICE=disk' (hostname not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 209 00027629 2007-07-23 10:51:03 Failed 'HOST=emslabicip01 SERVICE=msgs' (hostname not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 213 00027629 2007-07-23 10:51:03 Failed 'HOST=vapoll2 SERVICE=logs' (hostname not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 216 00027629 2007-07-23 10:51:03 Failed 'HOST=got01-automon01' (hostname not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 219 00027629 2007-07-23 10:51:03 Failed 'HOST=CORPUSWEB190,CORPUSWEB191,CORPUSWEB196,CORPUSWEB198,CORPUSWEB199 SERVICE=procs' (hostname not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 222 00027629 2007-07-23 10:51:03 Failed 'HOST=%.* SERVICE=ntp,emc,nfs,temp,network' (service not in include list) 00027629 2007-07-23 10:51:03 Matching host:service:page 'raditz:rootdisk:hobbit-uclients' against rule line 224 00027629 2007-07-23 10:51:03 *** Match with 'EXSERVICE=cpu,files,netstat,ports,vmstat' *** 2007-07-23 10:51:03 Found a secondary matching rule [hobbit at hobbitmon etc]$http://liveearth.msn.com
http://imagine-windowslive.com/hotmail/?locale=en-us&ocid=TXT_TAGHM_migration_HM_mini_pcmag_0507