Xymon Mailing List Archive search

alert collections

4 messages in this thread

list Gsimpson · Wed, 15 Jun 2011 20:05:26 +0000 (UTC) ·
Hello, 
I have a collection of pages and subpages that compose a system. There are several systems in my xymon hosts file. Each system looks at a set of machines, VMs and http targets. And each system is listed on an 'Environments' page, which only has links (and icons) to the subpages for the systems (i.e. no URL or http info is connected to that Environments page) 
Periodically, each system is brought down for maintenance. But when that happens, I get swamped with alerts about every component of that system. 
Does anyone know of a way to get one, high-level alert, say from the subpages under my ovmEnvironments page instead of 100+ alerts from my monitored elements ? 
I have not seen any discussions like this in the archives, but I could have missed them. 

thanks Greg 


Here is a modified snippet of my hosts.cfg file ---------------------------------------------------------------- 

page ovmEnvironments OVMEnvironments title OVM Environments 

################################################################################################## # Env1 starts here ################################################################################################## subpage b16Env1 B16Env1 title B16 Env1 subparent b16Env1 b16Env1Hypers Hypervisors title B16 Env1 Hypervisors group-compress <font size="+1">Hypervisors</font> 0.0.0.6 appohs-crm-machine06 0.0.0.7 authohs-crm-machine07 ... 
subparent b16Env1 b16Env1Hosts Hosts title B16 Env1 Hosts group-compress <font size="+1">OVM Hosts</font> 0.0.0.8 v_resourceWebgate_machine08 0.0.0.9 v_challengWebgate_machine09 ... 
subparent b16Env1 b16Env1Admin Weblogic Admin/EM Pages title B16 Env1 Weblogic Administration group-compress <font size="+1">All Admin Consoles</font> 0.0.0.91 v_BI_ADMIN # http://user:pswd at machine:port/console 0.0.0.90 v_CRM_ADMIN # http://user:pswd at machine:port/console ... 

and here is a snippet from alerts.cfg which DO NOT fire when any of the subpage indicators turn red: 
PAGE=ovmEnvironments COLOR=red RECOVERED SCRIPT /home/msmith/xymon/xymonServer/server/ext/smsplus.sh user-8b93a8749c44@xymon.invalid REPEAT=24h 
PAGE=ovmEnvironments/b16Env1 COLOR=red RECOVERED SCRIPT /home/msmith/xymon/xymonServer/server/ext/smsplus.sh user-8b93a8749c44@xymon.invalid REPEAT=24h
list Ken Connell · Thu, 16 Jun 2011 11:52:11 +0000 ·
We currently monitor (ping) our AP's and it would really suck to get over 1000 emails if/when something goes down, so I exclude them as follows:

You can exclude pages from you alerts in two ways:

1.) above all other alerts add:
PAGE=<yourpage/yoursubpage> 
Add nothing for action, and that's it. 

2.) On your bottom (default) alert action append the following:
EXPAGE=<yourpage/yoursubpage> 


Ken Connell
Intermediate Network Engineer
Computer & Communication Services
Ryerson University
XXX Victoria St
RM AB50
Toronto, Ont
M5B 2K3
XXX-XXX-XXXX x6709


-----Original Message-----
From: user-cddc978aa440@xymon.invalid
Sender: xymon-bounces at xymon.com
Date: Wed, 15 Jun 2011 20:05:26 
To: <xymon at xymon.com>
Subject: [Xymon] alert collections
list Scot Kreienkamp · Thu, 16 Jun 2011 08:45:24 -0400 ·
I just removed the default alert section.  That way I only get notified
for systems/pages that I specifically call out in the alerts.cfg file.

Scot Kreienkamp


-----Original Message-----
From: xymon-bounces at xymon.com [mailto:xymon-bounces at xymon.com] On Behalf
Of user-7cb0f5662626@xymon.invalid
Sent: Thursday, June 16, 2011 7:52 AM
To: user-cddc978aa440@xymon.invalid; xymon-bounces at xymon.com; Xymon Email List
Subject: Re: [Xymon] alert collectionsif I understand
quoted from Ken Connell

We currently monitor (ping) our AP's and it would really suck to get
over 1000 emails if/when something goes down, so I exclude them as
follows:

You can exclude pages from you alerts in two ways:

1.) above all other alerts add:
PAGE=<yourpage/yoursubpage> 
Add nothing for action, and that's it. 

2.) On your bottom (default) alert action append the following:
EXPAGE=<yourpage/yoursubpage> 


Ken Connell
Intermediate Network Engineer
Computer & Communication Services
Ryerson University
XXX Victoria St
RM AB50
Toronto, Ont
M5B 2K3
XXX-XXX-XXXX x6709


-----Original Message-----
From: user-cddc978aa440@xymon.invalid
Sender: xymon-bounces at xymon.com
Date: Wed, 15 Jun 2011 20:05:26 
To: <xymon at xymon.com>
Subject: [Xymon] alert collections


This message is intended only for the individual or entity to which it is addressed.  It may contain privileged, confidential information which is exempt from disclosure under applicable laws.  If you are not the intended recipient, please note that you are strictly prohibited from disseminating or distributing this information (other than to the intended recipient) or copying this information.  If you have received this communication in error, please notify us immediately by e-mail or by telephone at the above number. Thank you.
list Gsimpson · Thu, 16 Jun 2011 15:39:43 +0000 (UTC) ·
Thanks for the responses guys. I am making things different now, but I'm not sure that it is progress. 
so now, my alerts.cfg only has the entries listed below. I am not certain of how to designate a default alert. 
But now, when I run a test, I don't get ANY email notices sent. So, I believe I am excluding the subpages correctly, which is good for me. 
But I am not generating any alerts when my 'System' page turns red. I think it is because I have nothing but subpages on each of my 'System' pages. 
Do you have any advice on generating alerts from a page like that ? I want to get an alert from my "b16Env1" subpage (see the hosts snippet below). 
thanks! 

Here is my alerts.cfg file ------------------------------------------ PAGE=ovmEnvironments/b16Env1 COLOR=red RECOVERED PAGE=ovmEnvironments/b16Env1/b16Env1Hypers COLOR=red RECOVERED PAGE=ovmEnvironments/b17Env1/b16Env1Hosts COLOR=red RECOVERED PAGE=ovmEnvironments/b16Env1/b16Env1Admin COLOR=red RECOVERED PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1IcApps COLOR=red RECOVERED PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps COLOR=red RECOVERED PAGE=ovmEnvironments COLOR=red RECOVERED 
MAIL user-e45015a569dc@xymon.invalid REPEAT=5 EXPAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1IcApps EXPAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps #------------------- 

and here is the test output from sending one http error from a low-level subpage #------------------- 2011-06-16 08:18:51 Using default environment file /home/gjsimpso/xymon/xymonServer/server/etc/xymonserver.cfg 00026390 2011-06-16 08:18:51 send_alert v_TS_ODSM:http state Paging 00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 66 00026390 2011-06-16 08:18:51 Failed 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1FSApps COLOR=red RECOVERED' (pagename not in include list) 00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 67 00026390 2011-06-16 08:18:51 Failed 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1CrmApps COLOR=red RECOVERED' (pagename not in include list) 00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 68 00026390 2011-06-16 08:18:51 Failed 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1FinsApps COLOR=red RECOVERED' (pagename not in include list) 00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 69 00026390 2011-06-16 08:18:51 Failed 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1HcmApps COLOR=red RECOVERED' (pagename not in include list) 00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 70 00026390 2011-06-16 08:18:51 Failed 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1ScmApps COLOR=red RECOVERED' (pagename not in include list) 00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 71 00026390 2011-06-16 08:18:51 Failed 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1IcApps COLOR=red RECOVERED' (pagename not in include list) 00026390 2011-06-16 08:18:51 Matching host:service:page 'v_TS_ODSM:http:ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps' against rule line 72 00026390 2011-06-16 08:18:51 *** Match with 'PAGE=ovmEnvironments/b16Env1/b16Env1Apps/b16Env1TechApps COLOR=red RECOVERED' *** 

I have not changed my hosts.cfg file (this is a snippet) #------------------- page ovmEnvironments OVMEnvironments title OVM Environments 
quoted from Gsimpson

################################################################################################## # Env1 starts here ################################################################################################## subpage b16Env1 B16Env1 title B16 Env1 subparent b16Env1 b16Env1Hypers Hypervisors title B16 Env1 Hypervisors group-compress <font size="+1">Hypervisors</font> 0.0.0.6 appohs-crm-machine06 0.0.0.7 authohs-crm-machine07 ... 
subparent b16Env1 b16Env1Hosts Hosts title B16 Env1 Hosts group-compress <font size="+1">OVM Hosts</font> 0.0.0.8 v_resourceWebgate_machine08 0.0.0.9 v_challengWebgate_machine09 ... 
subparent b16Env1 b16Env1Admin Weblogic Admin/EM Pages title B16 Env1 Weblogic Administration group-compress <font size="+1">All Admin Consoles</font> 0.0.0.91 v_BI_ADMIN # http://user:pswd at machine:port/console 0.0.0.90 v_CRM_ADMIN # http://user:pswd at machine:port/console ... 

################################################################################################## # Env2 starts here ################################################################################################## subpage b16Env2 B16Env2 title B16 Env2 ... 


-- 
Greg 

----- Original Message ----- From: "Scot Kreienkamp" <user-462cf0b6d846@xymon.invalid> To: user-7cb0f5662626@xymon.invalid, user-cddc978aa440@xymon.invalid, xymon-bounces at xymon.com, "Xymon Email List" <xymon at xymon.com> Sent: Thursday, June 16, 2011 6:45:24 AM Subject: RE: [Xymon] alert collectionsif I understand 
quoted from Scot Kreienkamp
I just removed the default alert section. That way I only get notified for systems/pages that I specifically call out in the alerts.cfg file. 
Scot Kreienkamp 

-----Original Message----- From: xymon-bounces at xymon.com [mailto:xymon-bounces at xymon.com] On Behalf Of user-7cb0f5662626@xymon.invalid Sent: Thursday, June 16, 2011 7:52 AM To: user-cddc978aa440@xymon.invalid; xymon-bounces at xymon.com; Xymon Email List Subject: Re: [Xymon] alert collectionsif I understand 
We currently monitor (ping) our AP's and it would really suck to get over 1000 emails if/when something goes down, so I exclude them as follows: 
You can exclude pages from you alerts in two ways: 
1.) above all other alerts add: PAGE=<yourpage/yoursubpage> Add nothing for action, and that's it. 
2.) On your bottom (default) alert action append the following: EXPAGE=<yourpage/yoursubpage> 


Ken Connell Intermediate Network Engineer Computer & Communication Services Ryerson University XXX Victoria St RM AB50 Toronto, Ont M5B 2K3 XXX-XXX-XXXX x6709