Monitoring multiple Linux Processes
list Steve Mbs Newton
Hello,
I'm trying to monitor multiple processes on Linux servers. I would like to use Page/subpage like described in the
Hobbit-clients.cfg file, but it will only show one processes on the page for each server. Here is a what I have in the file.
PAGE=infrastructure/noc
PROC batchman
PROC jobman
PROC netman
Infrastructure is the page
Noc is the subpage.
XYMON only finds the first PROC.
Steve Newton
PHH Corporation
Desk XXX-XXX-XXXX
Cell XXX-XXX-XXXX
list Henrik Størner
▸
In <user-f2c3b6b76d3f@xymon.invalid> "Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:
I'm trying to monitor multiple processes on Linux servers. I would like to use Page/subpage like described in the Hobbit-clients.cfg file, but it will only show one processes on the page for each server. Here is a what I have in the file.
PAGE=infrastructure/noc
PROC batchman
PROC jobman
PROC netmanInfrastructure is the page Noc is the subpage.
XYMON only finds the first PROC.
I'm not sure what you mean by that: * Does it only check for the first of those processes, so you have only one red/green "dot" on the detailed "procs" status display? There should be one red/green dot for each of the processes listed, e.g. like that on the demo site - see http://www.xymon.com/xymon-cgi/svcstatus.sh?HOST=voodoo.hswn.dk&SERVICE=procs * Or does it check all of the processes, but only on the first host on the page ? Regards, Henrik
list Josh Luthman
What he is asking for is if there is one proc column for each proc. There is not. The proc column for the host includes EVERY ps. If any one ps goes down the color changes for that one column (proc). You would need to make a script to monitor a ps and create another column. Josh Luthman Office: XXX-XXX-XXXX Direct: XXX-XXX-XXXX XXXX Wayne St Suite XXXX Troy, OH XXXXX
▸
On Tue, Feb 1, 2011 at 4:04 PM, Henrik Størner <user-ce4a2c883f75@xymon.invalid> wrote:
In <user-f2c3b6b76d3f@xymon.invalid> "Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:I'm trying to monitor multiple processes on Linux servers. I would like to use Page/subpage like described in the Hobbit-clients.cfg file, but it will only show one processes on the page for each server. Here is a what I have in the file.PAGE=infrastructure/noc PROC batchman PROC jobman PROC netmanInfrastructure is the page Noc is the subpage.XYMON only finds the first PROC.I'm not sure what you mean by that: * Does it only check for the first of those processes, so you have only one red/green "dot" on the detailed "procs" status display? There should be one red/green dot for each of the processes listed, e.g. like that on the demo site - see http://www.xymon.com/xymon-cgi/svcstatus.sh?HOST=voodoo.hswn.dk&SERVICE=procs * Or does it check all of the processes, but only on the first host on the page ? Regards, Henrik
list Steve Mbs Newton
This is what I see when I list the host servers:
HOST=Hostname1,Hostname2,Hostname3,Hostname4,Hostname5,Hostname6
PROC netman
PROC batchman
PROC jobman
Results:
green netman (found 1, req. 1 or more)
green batchman (found 1, req. 1 or more)
green jobman (found 1, req. 1 or more)
This is what I see when I use PAGE/SUBPAGE
PAGE=infrastructure/noc
PROC batchman
PROC jobman
PROC netman
Results:
green batchman (found 1, req. 1 or more)
▸
From: Josh Luthman [mailto:user-4c45a83f15cb@xymon.invalid]
Sent: Tuesday, February 01, 2011 4:09 PM
To: xymon at xymon.com
Subject: Re: [xymon] Monitoring multiple Linux Processes
What he is asking for is if there is one proc column for each proc. There is not.
The proc column for the host includes EVERY ps. If any one ps goes down the color changes for that one column (proc). You would need to make a script to monitor a ps and create another column.
Josh Luthman
Office: XXX-XXX-XXXX
Direct: XXX-XXX-XXXX
XXXX Wayne St
Suite XXXX
Troy, OH XXXXX
On Tue, Feb 1, 2011 at 4:04 PM, Henrik Størner <user-ce4a2c883f75@xymon.invalid<mailto:user-ce4a2c883f75@xymon.invalid>> wrote:
In <user-f2c3b6b76d3f@xymon.invalid> "Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:
I'm trying to monitor multiple processes on Linux servers. I would like to use Page/subpage like described in the Hobbit-clients.cfg file, but it will only show one processes on the page for each server. Here is a what I have in the file.
PAGE=infrastructure/noc
PROC batchman
PROC jobman
PROC netmanInfrastructure is the page Noc is the subpage.
XYMON only finds the first PROC.
I'm not sure what you mean by that: * Does it only check for the first of those processes, so you have only one red/green "dot" on the detailed "procs" status display? There should be one red/green dot for each of the processes listed, e.g. like that on the demo site - see http://www.xymon.com/xymon-cgi/svcstatus.sh?HOST=voodoo.hswn.dk&SERVICE=procs * Or does it check all of the processes, but only on the first host on the page ? Regards, Henrik
list Henrik Størner
▸
In <user-945d5422fd1a@xymon.invalid> "Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:
This is what I see when I list the host servers:
HOST=Hostname1,Hostname2,Hostname3,Hostname4,Hostname5,Hostname6
PROC netman
PROC batchman
PROC jobmanResults:
green netman (found 1, req. 1 or more) green batchman (found 1, req. 1 or more) green jobman (found 1, req. 1 or more)
OK, this is fine and what you should expect.
▸
This is what I see when I use PAGE/SUBPAGE
PAGE=infrastructure/noc
PROC batchman
PROC jobman
PROC netmanResults:
green batchman (found 1, req. 1 or more)
And this is obviously not correct.
I've tried reproducing your setup on my 4.3.0-RC1 installation,
but I cannot make it fail this way. So I need some more information,
and I'd like you to do some tests with the configuration that is
NOT working (the one where you have PAGE=infrastructure/noc).
1) What Xymon version are you running ?
2) Please run this command and send me the result:
xymon 127.0.0.1 "xymondboard host=Hostname1 test=info fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
(If you are running a Xymon version prior to 4.3.0-beta3, the
command will be "bb" instead of "xymon", and the first word
inside the double quotes must be "hobbitdboard" instead of
"xymondboard").
3) If you look at the "info" status for one of these hosts, you'll
see a "Page/Subpage" entry which is a link to a webpage. What
does the URL in this link look like - specifically, does it
include "infrastructure/noc" or is it something else ?
4) Try running the command
xymoncmd xymond_client --test
(that's "bbcmd hobbitd_client --test" in older versions).
At the Hostname prompt, enter one of the failing hosts.
Hit <enter> at the Hosttype prompt, enter "proc" as
the Test, and just hit <enter> at the "ps command string"
prompt. This should result in the three processes being listed
as "red" - does it ?
Regards,
Henrik
list Steve Mbs Newton
1) What Xymon version are you running ? 4.3.0.3.beta2
▸
2) Please run this command and send me the result:
xymon 127.0.0.1 "xymondboard host=Hostname1 test=info fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
(If you are running a Xymon version prior to 4.3.0-beta3, the
command will be "bb" instead of "xymon", and the first word
inside the double quotes must be "hobbitdboard" instead of
"xymondboard").
[root at vmlb1mon1p bin]# ./bb 127.0.0.1 "hobbitdboard host=lb1etl1p test=info field=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
lb1etl1p.mortgagesvcs.com|info|green||0|0|0|0|0||0|
▸
3) If you look at the "info" status for one of these hosts, you'll
see a "Page/Subpage" entry which is a link to a webpage. What
does the URL in this link look like - specifically, does it
include "infrastructure/noc" or is it something else ?
Infrastructure Services/Enterprise Operations
▸
4) Try running the command
xymoncmd xymond_client --test
(that's "bbcmd hobbitd_client --test" in older versions).
At the Hostname prompt, enter one of the failing hosts.
Hit <enter> at the Hosttype prompt, enter "proc" as
the Test, and just hit <enter> at the "ps command string"
prompt. This should result in the three processes being listed
as "red" - does it ?
[root at vmlb1mon1p bin]# ./bbcmd hobbitd_client --test
2011-02-02 12:50:54 Using default environment file /home/xymon/server/etc/hobbitserver.cfg
Hostname (.=end, ?=dump, !=reload) []: lb1etl1p.mortgagesvcs.com Hosttype []:
Test (cpu, mem, disk, proc, log, port): proc To read 'ps' data from a file, enter '@FILENAME' at the prompt ps command string:
Process batchman color red: Count=0, min=1, max=-1 Hostname (.=end, ?=dump, !=reload) [lb1etl1p.mortgagesvcs.com]:
Thanks
Steve Newton
▸
-----Original Message-----
From: Henrik Størner [mailto:user-ce4a2c883f75@xymon.invalid]
Sent: Wednesday, February 02, 2011 9:14 AM
To: xymon at xymon.com
Subject: Re: [xymon] Monitoring multiple Linux Processes
In <user-945d5422fd1a@xymon.invalid> "Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:
This is what I see when I list the host servers:
HOST=Hostname1,Hostname2,Hostname3,Hostname4,Hostname5,Hostname6
PROC netman
PROC batchman
PROC jobmanResults:
green netman (found 1, req. 1 or more) green batchman (found 1, req. 1 or more) green jobman (found 1, req. 1 or more)
OK, this is fine and what you should expect.
This is what I see when I use PAGE/SUBPAGE
PAGE=infrastructure/noc
PROC batchman
PROC jobman
PROC netmanResults:
green batchman (found 1, req. 1 or more)
And this is obviously not correct.
I've tried reproducing your setup on my 4.3.0-RC1 installation,
but I cannot make it fail this way. So I need some more information,
and I'd like you to do some tests with the configuration that is
NOT working (the one where you have PAGE=infrastructure/noc).
1) What Xymon version are you running ?
2) Please run this command and send me the result:
xymon 127.0.0.1 "xymondboard host=Hostname1 test=info fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
(If you are running a Xymon version prior to 4.3.0-beta3, the
command will be "bb" instead of "xymon", and the first word
inside the double quotes must be "hobbitdboard" instead of
"xymondboard").
3) If you look at the "info" status for one of these hosts, you'll
see a "Page/Subpage" entry which is a link to a webpage. What
does the URL in this link look like - specifically, does it
include "infrastructure/noc" or is it something else ?
4) Try running the command
xymoncmd xymond_client --test
(that's "bbcmd hobbitd_client --test" in older versions).
At the Hostname prompt, enter one of the failing hosts.
Hit <enter> at the Hosttype prompt, enter "proc" as
the Test, and just hit <enter> at the "ps command string"
prompt. This should result in the three processes being listed
as "red" - does it ?
Regards,
Henrik
list Buchan Milne
▸
On Wednesday, 2 February 2011 16:14:15 Henrik "Størner" wrote:
2) Please run this command and send me the result: xymon 127.0.0.1 "xymondboard host=Hostname1 test=info fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS
These fields aren't documented anywhere! A project I was working on a while back would have progressed much further if these were documented (I didn't get around to hacking source ...). Regards, Buchan
list Henrik Størner
▸
In <user-32d1ec8a2490@xymon.invalid> Buchan Milne <user-9b139aff4dec@xymon.invalid> writes:
On Wednesday, 2 February 2011 16:14:15 Henrik "St=F8rner" wrote:2) Please run this command and send me the result:
xymon 127.0.0.1 "xymondboard host=3DHostname1 test=3Dinfo fields=3Dhostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS
▸
These fields aren't documented anywhere! A project I was working on a while back would have progressed much further if these were documented (I didn't get around to hacking source ...).
OK, how about this? Will add it to the 4.3.0 docs. http://www.xymon.com/~henrik/xymon-xmh.5.html Regards, Henrik
list Buchan Milne
▸
On Friday, 4 February 2011 00:09:17 Henrik "Størner" wrote:
In <user-32d1ec8a2490@xymon.invalid> Buchan Milne
<user-9b139aff4dec@xymon.invalid> writes:
On Wednesday, 2 February 2011 16:14:15 Henrik "St=F8rner" wrote:2) Please run this command and send me the result: xymon 127.0.0.1 "xymondboard host=3DHostname1 test=3Dinfo fields=3Dhostname,BBH_PAGEPATH,BBH_ALLPAGEPATHSThese fields aren't documented anywhere! A project I was working on a while back would have progressed much further if these were documented (I didn't get around to hacking source ...).OK, how about this? Will add it to the 4.3.0 docs. http://www.xymon.com/~henrik/xymon-xmh.5.html
Some very useful info there, thanks. Regards, Buchan
list Steve Mbs Newton
Hello, I was wondering if anyone has any luck fixing this? Thanks
▸
-----Original Message-----
From: Newton, Steve (MBS)
Sent: Wednesday, February 02, 2011 12:55 PM
To: 'xymon at xymon.com'
Subject: RE: [xymon] Monitoring multiple Linux Processes
1) What Xymon version are you running ?
4.3.0.3.beta2
2) Please run this command and send me the result:
xymon 127.0.0.1 "xymondboard host=Hostname1 test=info fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
(If you are running a Xymon version prior to 4.3.0-beta3, the
command will be "bb" instead of "xymon", and the first word
inside the double quotes must be "hobbitdboard" instead of
"xymondboard").
[root at vmlb1mon1p bin]# ./bb 127.0.0.1 "hobbitdboard host=lb1etl1p test=info field=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
lb1etl1p.mortgagesvcs.com|info|green||0|0|0|0|0||0|
3) If you look at the "info" status for one of these hosts, you'll
see a "Page/Subpage" entry which is a link to a webpage. What
does the URL in this link look like - specifically, does it
include "infrastructure/noc" or is it something else ?
Infrastructure Services/Enterprise Operations
4) Try running the command
xymoncmd xymond_client --test
(that's "bbcmd hobbitd_client --test" in older versions).
At the Hostname prompt, enter one of the failing hosts.
Hit <enter> at the Hosttype prompt, enter "proc" as
the Test, and just hit <enter> at the "ps command string"
prompt. This should result in the three processes being listed
as "red" - does it ?
[root at vmlb1mon1p bin]# ./bbcmd hobbitd_client --test
2011-02-02 12:50:54 Using default environment file /home/xymon/server/etc/hobbitserver.cfg
Hostname (.=end, ?=dump, !=reload) []: lb1etl1p.mortgagesvcs.com Hosttype []:
Test (cpu, mem, disk, proc, log, port): proc To read 'ps' data from a file, enter '@FILENAME' at the prompt ps command string:
Process batchman color red: Count=0, min=1, max=-1 Hostname (.=end, ?=dump, !=reload) [lb1etl1p.mortgagesvcs.com]:
Thanks
Steve Newton
-----Original Message-----
From: Henrik Størner [mailto:user-ce4a2c883f75@xymon.invalid]
Sent: Wednesday, February 02, 2011 9:14 AM
To: xymon at xymon.com
Subject: Re: [xymon] Monitoring multiple Linux Processes
In <user-945d5422fd1a@xymon.invalid> "Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:
This is what I see when I list the host servers:
HOST=Hostname1,Hostname2,Hostname3,Hostname4,Hostname5,Hostname6
PROC netman
PROC batchman
PROC jobmanResults:
green netman (found 1, req. 1 or more) green batchman (found 1, req. 1 or more) green jobman (found 1, req. 1 or more)
OK, this is fine and what you should expect.
This is what I see when I use PAGE/SUBPAGE
PAGE=infrastructure/noc
PROC batchman
PROC jobman
PROC netmanResults:
green batchman (found 1, req. 1 or more)
And this is obviously not correct.
I've tried reproducing your setup on my 4.3.0-RC1 installation,
but I cannot make it fail this way. So I need some more information,
and I'd like you to do some tests with the configuration that is
NOT working (the one where you have PAGE=infrastructure/noc).
1) What Xymon version are you running ?
2) Please run this command and send me the result:
xymon 127.0.0.1 "xymondboard host=Hostname1 test=info fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
(If you are running a Xymon version prior to 4.3.0-beta3, the
command will be "bb" instead of "xymon", and the first word
inside the double quotes must be "hobbitdboard" instead of
"xymondboard").
3) If you look at the "info" status for one of these hosts, you'll
see a "Page/Subpage" entry which is a link to a webpage. What
does the URL in this link look like - specifically, does it
include "infrastructure/noc" or is it something else ?
4) Try running the command
xymoncmd xymond_client --test
(that's "bbcmd hobbitd_client --test" in older versions).
At the Hostname prompt, enter one of the failing hosts.
Hit <enter> at the Hosttype prompt, enter "proc" as
the Test, and just hit <enter> at the "ps command string"
prompt. This should result in the three processes being listed
as "red" - does it ?
Regards,
Henrik
list Henrik Størner
In <user-454528627302@xymon.invalid> "Newton, Steve (MBS)" <user-0f7bfdde23c1@xymon.invalid> writes:
1) What Xymon version are you running ? 4.3.0.3.beta2
It would be interesting to know if this problem also happens with 4.3.0 RC1, but I realize this upgrade might require some planning if it is a production system.
▸
2) Please run this command and send me the result: xymon 127.0.0.1 "xymondboard host=Hostname1 test=info fields=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS"
[root at vmlb1mon1p bin]# ./bb 127.0.0.1 "hobbitdboard host=lb1etl1p test=info field=hostname,BBH_PAGEPATH,BBH_ALLPAGEPATHS" lb1etl1p.mortgagesvcs.com|info|green||0|0|0|0|0||0|
Not quite what I was looking for - you've written "field" instead of "fields" (plural) in the command. What I was after was to see if these hosts are present on more than one page in your bb-hosts file.
▸
4) Try running the command
xymoncmd xymond_client --test
[root at vmlb1mon1p bin]# ./bbcmd hobbitd_client --test
Process batchman color red: Count=0, min=1, max=-1
Hostname (.=end,= > ?=dump, !=reload)OK, so at least this is consistent with the "procs" status you see. The only reason I can think of is if the hosts are present on more than one page in the bb-hosts file - then they might have a different "PAGE" value than you think. There was a change done so that PAGE will now match any of the pages a host is listed on, and if memory serves me right then this was done between beta-2 and beta-3. So that could be the problem you're having. Regards, Henrik