Xymon Mailing List Archive search

Monitoring multiple Linux Processes

11 messages in this thread

list Steve Mbs Newton · Tue, 1 Feb 2011 10:04:38 -0500 ·
Hello,

                I'm trying to monitor multiple processes on Linux servers. I would like to use Page/subpage like described in the
Hobbit-clients.cfg file, but it will only show one processes on the page for each server. Here is a what I have in the file.

PAGE=infrastructure/noc
       PROC batchman
       PROC jobman
       PROC netman

Infrastructure is the page
Noc is the subpage.

XYMON only finds the first PROC.

Steve Newton
PHH Corporation
Desk XXX-XXX-XXXX
Cell XXX-XXX-XXXX
list Henrik Størner · Tue, 1 Feb 2011 21:04:55 +0000 (UTC) ·
quoted from Steve Mbs Newton
In <user-f2c3b6b76d3f@xymon.invalid> "Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:
               I'm trying to monitor multiple processes on Linux servers.
I would like to use Page/subpage like described in the
Hobbit-clients.cfg file, but it will only show one processes on the page for
each server. Here is a what I have in the file.
PAGE=infrastructure/noc
      PROC batchman
      PROC jobman
      PROC netman
Infrastructure is the page
Noc is the subpage.
XYMON only finds the first PROC.

I'm not sure what you mean by that:

* Does it only check for the first of those processes, so you have
  only one red/green "dot" on the detailed "procs" status display?
  There should be one red/green dot for each of the processes
  listed, e.g. like that on the demo site - see
  http://www.xymon.com/xymon-cgi/svcstatus.sh?HOST=voodoo.hswn.dk&SERVICE=procs

* Or does it check all of the processes, but only on the first
  host on the page ?


Regards,
Henrik
list Josh Luthman · Tue, 1 Feb 2011 16:08:32 -0500 ·
What he is asking for is if there is one proc column for each proc.  There
is not.

The proc column for the host includes EVERY ps.  If any one ps goes down the
color changes for that one column (proc).  You would need to make a script
to monitor a ps and create another column.

Josh Luthman
Office: XXX-XXX-XXXX
Direct: XXX-XXX-XXXX
XXXX Wayne St
Suite XXXX
Troy, OH XXXXX
quoted from Henrik Størner


On Tue, Feb 1, 2011 at 4:04 PM, Henrik Størner <user-ce4a2c883f75@xymon.invalid> wrote:
In <user-f2c3b6b76d3f@xymon.invalid>
"Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:
               I'm trying to monitor multiple processes on Linux servers.
I would like to use Page/subpage like described in the
Hobbit-clients.cfg file, but it will only show one processes on the page
for
each server. Here is a what I have in the file.
PAGE=infrastructure/noc
      PROC batchman
      PROC jobman
      PROC netman
Infrastructure is the page
Noc is the subpage.
XYMON only finds the first PROC.

I'm not sure what you mean by that:

* Does it only check for the first of those processes, so you have
 only one red/green "dot" on the detailed "procs" status display?
 There should be one red/green dot for each of the processes
 listed, e.g. like that on the demo site - see

http://www.xymon.com/xymon-cgi/svcstatus.sh?HOST=voodoo.hswn.dk&SERVICE=procs

* Or does it check all of the processes, but only on the first
 host on the page ?


Regards,
Henrik

list Steve Mbs Newton · Wed, 2 Feb 2011 08:28:37 -0500 ·
This is what I see when I list the host servers:

HOST=Hostname1,Hostname2,Hostname3,Hostname4,Hostname5,Hostname6
       PROC netman
       PROC batchman
       PROC jobman

Results:

green netman (found 1, req. 1 or more)
green batchman (found 1, req. 1 or more)
green jobman (found 1, req. 1 or more)

This is what I see when I use PAGE/SUBPAGE

PAGE=infrastructure/noc
       PROC batchman
       PROC jobman
       PROC netman

Results:

green batchman (found 1, req. 1 or more)
quoted from Josh Luthman

From: Josh Luthman [mailto:user-4c45a83f15cb@xymon.invalid]
Sent: Tuesday, February 01, 2011 4:09 PM
To: xymon at xymon.com
Subject: Re: [xymon] Monitoring multiple Linux Processes

What he is asking for is if there is one proc column for each proc.  There is not.

The proc column for the host includes EVERY ps.  If any one ps goes down the color changes for that one column (proc).  You would need to make a script to monitor a ps and create another column.

Josh Luthman
Office: XXX-XXX-XXXX
Direct: XXX-XXX-XXXX
XXXX Wayne St
Suite XXXX
Troy, OH XXXXX

On Tue, Feb 1, 2011 at 4:04 PM, Henrik Størner <user-ce4a2c883f75@xymon.invalid<mailto:user-ce4a2c883f75@xymon.invalid>> wrote:
In <user-f2c3b6b76d3f@xymon.invalid> "Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:
               I'm trying to monitor multiple processes on Linux servers.
I would like to use Page/subpage like described in the
Hobbit-clients.cfg file, but it will only show one processes on the page for
each server. Here is a what I have in the file.
PAGE=infrastructure/noc
      PROC batchman
      PROC jobman
      PROC netman
Infrastructure is the page
Noc is the subpage.
XYMON only finds the first PROC.
I'm not sure what you mean by that:

* Does it only check for the first of those processes, so you have
 only one red/green "dot" on the detailed "procs" status display?
 There should be one red/green dot for each of the processes
 listed, e.g. like that on the demo site - see
 http://www.xymon.com/xymon-cgi/svcstatus.sh?HOST=voodoo.hswn.dk&SERVICE=procs

* Or does it check all of the processes, but only on the first
 host on the page ?


Regards,
Henrik
list Henrik Størner · Wed, 2 Feb 2011 14:14:15 +0000 (UTC) ·
quoted from Steve Mbs Newton
In <user-945d5422fd1a@xymon.invalid> "Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:
This is what I see when I list the host servers:
HOST=Hostname1,Hostname2,Hostname3,Hostname4,Hostname5,Hostname6
      PROC netman
      PROC batchman
      PROC jobman
Results:
green netman (found 1, req. 1 or more)
green batchman (found 1, req. 1 or more)
green jobman (found 1, req. 1 or more)

OK, this is fine and what you should expect.
quoted from Steve Mbs Newton

This is what I see when I use PAGE/SUBPAGE
PAGE=infrastructure/noc
      PROC batchman
      PROC jobman
      PROC netman
Results:
green batchman (found 1, req. 1 or more)

And this is obviously not correct.


I've tried reproducing your setup on my 4.3.0-RC1 installation,
but I cannot make it fail this way. So I need some more information,
and I'd like you to do some tests with the configuration that is
NOT working (the one where you have PAGE=infrastructure/noc).

1) What Xymon version are you running ?

2) Please run this command and send me the result:
   xymon 127.0.0.1 "xymondboard host=Hostname1 test=info fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
   (If you are running a Xymon version prior to 4.3.0-beta3, the
   command will be "bb" instead of "xymon", and the first word
   inside the double quotes must be "hobbitdboard" instead of
   "xymondboard").

3) If you look at the "info" status for one of these hosts, you'll
   see a "Page/Subpage" entry which is a link to a webpage. What
   does the URL in this link look like - specifically, does it
   include "infrastructure/noc" or is it something else ?

4) Try running the command
      xymoncmd xymond_client --test
   (that's "bbcmd hobbitd_client --test" in older versions).
   At the Hostname prompt, enter one of the failing hosts.
   Hit <enter> at the Hosttype prompt, enter "proc" as
   the Test, and just hit <enter> at the "ps command string"
   prompt. This should result in the three processes being listed
   as "red" - does it ?


Regards,
Henrik
list Steve Mbs Newton · Wed, 2 Feb 2011 12:54:34 -0500 ·
1) What Xymon version are you running ?
	4.3.0.3.beta2
quoted from Henrik Størner

2) Please run this command and send me the result:
   xymon 127.0.0.1 "xymondboard host=Hostname1 test=info fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
   (If you are running a Xymon version prior to 4.3.0-beta3, the
   command will be "bb" instead of "xymon", and the first word
   inside the double quotes must be "hobbitdboard" instead of
   "xymondboard").

	[root at vmlb1mon1p bin]# ./bb 127.0.0.1 "hobbitdboard host=lb1etl1p test=info field=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
lb1etl1p.mortgagesvcs.com|info|green||0|0|0|0|0||0|
quoted from Henrik Størner

3) If you look at the "info" status for one of these hosts, you'll
   see a "Page/Subpage" entry which is a link to a webpage. What
   does the URL in this link look like - specifically, does it
   include "infrastructure/noc" or is it something else ?
	

	Infrastructure Services/Enterprise Operations
quoted from Henrik Størner


4) Try running the command
      xymoncmd xymond_client --test
   (that's "bbcmd hobbitd_client --test" in older versions).
   At the Hostname prompt, enter one of the failing hosts.
   Hit <enter> at the Hosttype prompt, enter "proc" as
   the Test, and just hit <enter> at the "ps command string"
   prompt. This should result in the three processes being listed
   as "red" - does it ?

	[root at vmlb1mon1p bin]# ./bbcmd hobbitd_client --test
2011-02-02 12:50:54 Using default environment file /home/xymon/server/etc/hobbitserver.cfg
Hostname (.=end, ?=dump, !=reload) []: lb1etl1p.mortgagesvcs.com Hosttype []:
Test (cpu, mem, disk, proc, log, port): proc To read 'ps' data from a file, enter '@FILENAME' at the prompt ps command string:
Process batchman color red: Count=0, min=1, max=-1 Hostname (.=end, ?=dump, !=reload) [lb1etl1p.mortgagesvcs.com]:	

Thanks

Steve Newton
quoted from Henrik Størner

-----Original Message-----
From: Henrik Størner [mailto:user-ce4a2c883f75@xymon.invalid] 
Sent: Wednesday, February 02, 2011 9:14 AM
To: xymon at xymon.com
Subject: Re: [xymon] Monitoring multiple Linux Processes

In <user-945d5422fd1a@xymon.invalid> "Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:
This is what I see when I list the host servers:
HOST=Hostname1,Hostname2,Hostname3,Hostname4,Hostname5,Hostname6
      PROC netman
      PROC batchman
      PROC jobman
Results:
green netman (found 1, req. 1 or more)
green batchman (found 1, req. 1 or more)
green jobman (found 1, req. 1 or more)

OK, this is fine and what you should expect.

This is what I see when I use PAGE/SUBPAGE
PAGE=infrastructure/noc
      PROC batchman
      PROC jobman
      PROC netman
Results:
green batchman (found 1, req. 1 or more)

And this is obviously not correct.


I've tried reproducing your setup on my 4.3.0-RC1 installation,
but I cannot make it fail this way. So I need some more information,
and I'd like you to do some tests with the configuration that is
NOT working (the one where you have PAGE=infrastructure/noc).

1) What Xymon version are you running ?

2) Please run this command and send me the result:
   xymon 127.0.0.1 "xymondboard host=Hostname1 test=info fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
   (If you are running a Xymon version prior to 4.3.0-beta3, the
   command will be "bb" instead of "xymon", and the first word
   inside the double quotes must be "hobbitdboard" instead of
   "xymondboard").

3) If you look at the "info" status for one of these hosts, you'll
   see a "Page/Subpage" entry which is a link to a webpage. What
   does the URL in this link look like - specifically, does it
   include "infrastructure/noc" or is it something else ?

4) Try running the command
      xymoncmd xymond_client --test
   (that's "bbcmd hobbitd_client --test" in older versions).
   At the Hostname prompt, enter one of the failing hosts.
   Hit <enter> at the Hosttype prompt, enter "proc" as
   the Test, and just hit <enter> at the "ps command string"
   prompt. This should result in the three processes being listed
   as "red" - does it ?


Regards,
Henrik
list Buchan Milne · Wed, 2 Feb 2011 23:01:34 +0200 ·
quoted from Henrik Størner
On Wednesday, 2 February 2011 16:14:15 Henrik "Størner" wrote:
2) Please run this command and send me the result:
   xymon 127.0.0.1 "xymondboard host=Hostname1 test=info
fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS
These fields aren't documented anywhere! A project I was working on a while 
back would have progressed much further if these were documented (I didn't get 
around to hacking source ...).

Regards,
Buchan
list Henrik Størner · Thu, 3 Feb 2011 22:09:17 +0000 (UTC) ·
quoted from Buchan Milne
In <user-32d1ec8a2490@xymon.invalid> Buchan Milne <user-9b139aff4dec@xymon.invalid> writes:
On Wednesday, 2 February 2011 16:14:15 Henrik "St=F8rner" wrote:
2) Please run this command and send me the result:

   xymon 127.0.0.1 "xymondboard host=3DHostname1 test=3Dinfo
fields=3Dhostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS
quoted from Buchan Milne
These fields aren't documented anywhere! A project I was working on a 
while back would have progressed much further if these were documented 
(I didn't get around to hacking source ...).
OK, how about this? Will add it to the 4.3.0 docs.

http://www.xymon.com/~henrik/xymon-xmh.5.html


Regards,
Henrik
list Buchan Milne · Fri, 4 Feb 2011 10:31:51 +0200 ·
quoted from Henrik Størner
On Friday, 4 February 2011 00:09:17 Henrik "Størner" wrote:
In <user-32d1ec8a2490@xymon.invalid> Buchan Milne 
<user-9b139aff4dec@xymon.invalid> writes:
On Wednesday, 2 February 2011 16:14:15 Henrik "St=F8rner" wrote:
2) Please run this command and send me the result:
   xymon 127.0.0.1 "xymondboard host=3DHostname1 test=3Dinfo

fields=3Dhostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS
These fields aren't documented anywhere! A project I was working on a
while back would have progressed much further if these were documented
(I didn't get around to hacking source ...).
OK, how about this? Will add it to the 4.3.0 docs.

http://www.xymon.com/~henrik/xymon-xmh.5.html
Some very useful info there, thanks.

Regards,
Buchan
list Steve Mbs Newton · Fri, 4 Feb 2011 09:54:52 -0500 ·
Hello,

	I was wondering if anyone has any luck fixing this? Thanks
quoted from Steve Mbs Newton

-----Original Message-----
From: Newton, Steve (MBS) 
Sent: Wednesday, February 02, 2011 12:55 PM
To: 'xymon at xymon.com'
Subject: RE: [xymon] Monitoring multiple Linux Processes

1) What Xymon version are you running ?
	4.3.0.3.beta2

2) Please run this command and send me the result:
   xymon 127.0.0.1 "xymondboard host=Hostname1 test=info fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
   (If you are running a Xymon version prior to 4.3.0-beta3, the
   command will be "bb" instead of "xymon", and the first word
   inside the double quotes must be "hobbitdboard" instead of
   "xymondboard").

	[root at vmlb1mon1p bin]# ./bb 127.0.0.1 "hobbitdboard host=lb1etl1p test=info field=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
lb1etl1p.mortgagesvcs.com|info|green||0|0|0|0|0||0|

3) If you look at the "info" status for one of these hosts, you'll
   see a "Page/Subpage" entry which is a link to a webpage. What
   does the URL in this link look like - specifically, does it
   include "infrastructure/noc" or is it something else ?
	
	Infrastructure Services/Enterprise Operations


4) Try running the command
      xymoncmd xymond_client --test
   (that's "bbcmd hobbitd_client --test" in older versions).
   At the Hostname prompt, enter one of the failing hosts.
   Hit <enter> at the Hosttype prompt, enter "proc" as
   the Test, and just hit <enter> at the "ps command string"
   prompt. This should result in the three processes being listed
   as "red" - does it ?

	[root at vmlb1mon1p bin]# ./bbcmd hobbitd_client --test
2011-02-02 12:50:54 Using default environment file /home/xymon/server/etc/hobbitserver.cfg
Hostname (.=end, ?=dump, !=reload) []: lb1etl1p.mortgagesvcs.com Hosttype []:
Test (cpu, mem, disk, proc, log, port): proc To read 'ps' data from a file, enter '@FILENAME' at the prompt ps command string:
Process batchman color red: Count=0, min=1, max=-1 Hostname (.=end, ?=dump, !=reload) [lb1etl1p.mortgagesvcs.com]:	

Thanks

Steve Newton

-----Original Message-----
From: Henrik Størner [mailto:user-ce4a2c883f75@xymon.invalid] 
Sent: Wednesday, February 02, 2011 9:14 AM
To: xymon at xymon.com
Subject: Re: [xymon] Monitoring multiple Linux Processes

In <user-945d5422fd1a@xymon.invalid> "Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:
This is what I see when I list the host servers:
HOST=Hostname1,Hostname2,Hostname3,Hostname4,Hostname5,Hostname6
      PROC netman
      PROC batchman
      PROC jobman
Results:
green netman (found 1, req. 1 or more)
green batchman (found 1, req. 1 or more)
green jobman (found 1, req. 1 or more)

OK, this is fine and what you should expect.

This is what I see when I use PAGE/SUBPAGE
PAGE=infrastructure/noc
      PROC batchman
      PROC jobman
      PROC netman
Results:
green batchman (found 1, req. 1 or more)

And this is obviously not correct.


I've tried reproducing your setup on my 4.3.0-RC1 installation,
but I cannot make it fail this way. So I need some more information,
and I'd like you to do some tests with the configuration that is
NOT working (the one where you have PAGE=infrastructure/noc).

1) What Xymon version are you running ?

2) Please run this command and send me the result:
   xymon 127.0.0.1 "xymondboard host=Hostname1 test=info fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
   (If you are running a Xymon version prior to 4.3.0-beta3, the
   command will be "bb" instead of "xymon", and the first word
   inside the double quotes must be "hobbitdboard" instead of
   "xymondboard").

3) If you look at the "info" status for one of these hosts, you'll
   see a "Page/Subpage" entry which is a link to a webpage. What
   does the URL in this link look like - specifically, does it
   include "infrastructure/noc" or is it something else ?

4) Try running the command
      xymoncmd xymond_client --test
   (that's "bbcmd hobbitd_client --test" in older versions).
   At the Hostname prompt, enter one of the failing hosts.
   Hit <enter> at the Hosttype prompt, enter "proc" as
   the Test, and just hit <enter> at the "ps command string"
   prompt. This should result in the three processes being listed
   as "red" - does it ?


Regards,
Henrik
list Henrik Størner · Sat, 5 Feb 2011 10:50:41 +0000 (UTC) ·
In <user-454528627302@xymon.invalid> "Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:
1) What Xymon version are you running ?
4.3.0.3.beta2
It would be interesting to know if this problem also
happens with 4.3.0 RC1, but I realize this upgrade
might require some planning if it is a production
system.
quoted from Steve Mbs Newton
2) Please run this command and send me the result:
  xymon 127.0.0.1 "xymondboard host=Hostname1 test=info 
fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
[root at vmlb1mon1p bin]# ./bb 127.0.0.1 "hobbitdboard host=lb1etl1p 
test=info field=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
lb1etl1p.mortgagesvcs.com|info|green||0|0|0|0|0||0|
Not quite what I was looking for - you've written "field" instead
of "fields" (plural) in the command. What I was after was to see
if these hosts are present on more than one page in your bb-hosts
file.
quoted from Steve Mbs Newton
4) Try running the command
     xymoncmd xymond_client --test
[root at vmlb1mon1p bin]# ./bbcmd hobbitd_client --test
Process batchman color red: Count=0, min=1, max=-1 
Hostname (.=end,= > ?=dump, !=reload)
OK, so at least this is consistent with the "procs" status you see.


The only reason I can think of is if the hosts are present on
more than one page in the bb-hosts file - then they might 
have a different "PAGE" value than you think. There was a 
change done so that PAGE will now match any of the pages
a host is listed on, and if memory serves me right then
this was done between beta-2 and beta-3. So that could be
the problem you're having.


Regards,
Henrik