Xymon Mailing List Archive

Availability reports and maintenance mode

8 messages in this thread

list T.J. Yang · Fri, 18 Aug 2006 04:40:05 -0500 ·
When generating an Availability Report,
is it possible to deduct the downtime caused by system maintenance?


Regards

tj
list Henrik Størner · Fri, 18 Aug 2006 12:31:33 +0200 ·
quoted from T.J. Yang
On Fri, Aug 18, 2006 at 04:40:05AM -0500, T.J. Yang wrote:
> When doing Availability Report,
> is it possible to deduct the downtime caused by system maintenance ?
If you do it in advance, yes. That is, define DOWNTIME settings for
the hosts covering your regular outages (e.g. the scheduled reboot every
Saturday morning), or disable the host/service when doing maintenance.

Periods with a "blue" (disabled) status are not included in the
availability calculations.
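
To make the effect of leaving out blue periods concrete, here is a minimal sketch of the arithmetic. It is not Hobbit's own report code: the function name availability_pct is made up for illustration, and it assumes the report divides the non-red time by the part of the period that was not blue.

```shell
#!/bin/sh
# Hypothetical helper, not part of Hobbit: availability over a reporting
# period with "blue" (disabled) time removed before dividing.
# Arguments, all in seconds: total period, red (down) time, blue time.
availability_pct() {
    awk -v total="$1" -v red="$2" -v blue="$3" 'BEGIN {
        counted = total - blue      # blue time is left out entirely
        if (counted <= 0) { print "n/a"; exit }
        printf "%.2f\n", 100 * (counted - red) / counted
    }'
}

# One week, 2 hours of unplanned downtime, 4 hours disabled for maintenance.
availability_pct 604800 7200 14400
```

With these numbers the maintenance window shrinks the period that is counted, but it does not add to the downtime.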


Regards,
Henrik
list Thomas Seglard · Fri, 18 Aug 2006 14:38:53 +0200 ·
Hello,

I'm currently migrating to Hobbit 4.2. All is going OK, but I notice something strange under Solaris (6 through 10): the cpu column shows no output, only the load graph. I don't understand why, because it works fine with the previous version of the client (4.1.2p1). Looking at hobbitclient-sunos.sh, I find these lines:

# $TOP must be set, the install utility should do that for us if it exists.
if test "$TOP" != ""
then
    if test -x "$TOP"
    then
        echo "[top]"
        $TOP -b 20
    fi
fi

So I opened etc/hobbitclient.cfg and found this variable:

TOP="/usr/bin/prstat -can 20 1 1"

Running this command from a console works perfectly, as you can see:

root at psa129:/ # /usr/bin/prstat -can 20 1 1
   PID USERNAME  SIZE   RSS STATE  PRI NICE      TIME  CPU PROCESS/NLWP
 20555 root     4432K 4088K cpu0    39    0   0:00:00 0.4% prstat/1
 16570 oraadm   7712K 2192K sleep   59    0   0:00:00 0.1% sshd/1
[snip]
   454 root     1808K  528K sleep   59    0   0:00:00 0.0% ttymon/1
 NPROC USERNAME  SIZE   RSS MEMORY      TIME  CPU
   101 root      563M  227M    47%   4:09:40 0.4%
    77 oraadm    507M  189M    39%   0:05:18 0.2%
     1 smmsp    4352K  840K   0.2%   0:00:31 0.0%
    10 hobbit     11M 9336K   1.9%   0:00:23 0.0%
     1 daemon   2696K 1592K   0.3%   0:00:23 0.0%
Total: 221 processes, 363 lwps, load averages: 0.08, 0.14, 0.16

In my cpu test on the Hobbit display I only get this line: "System clock is 0 seconds off"
Is this the new behaviour for this test, or is something wrong in my configuration? What does "System clock is 0 seconds off" mean?

Once again, thanks for all your great work !

Best regards,

ThomaS


Ce message (et toutes ses pieces jointes eventuelles) est confidentiel et etabli a l'intention exclusive de ses destinataires.
Toute utilisation de ce message non conforme a sa destination, toute diffusion ou toute publication, totale ou partielle, est
interdite, sauf autorisation expresse.
L'internet ne permettant pas d'assurer l'integrite de ce message, CNP Assurances et ses filiales declinent toute responsabilite
au titre de ce message, s'il a ete altere, deforme ou falsifie.

*****

This message and any attachments (the "message") are confidential and intended solely for the addressees.
Any unauthorised use or dissemination is prohibited.
E-mails are susceptible to alteration.
Neither CNP Assurances nor any of its subsidiaries or affiliates shall be liable for the message if altered, changed or falsified.
list Colin Spargo · Fri, 18 Aug 2006 14:03:36 +0100 ·
I noticed this problem too. I changed the entry for TOP in hobbitclient-sunos.sh to:

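# $TOP must be set, the install utility should do that for us if it exists.
if test "$TOP" != ""
then
    # If TOP starts with /usr/bin/prstat it already carries its own
    # arguments, so run it as-is. expr prints the matched length (0 if none).
    if [ `expr "$TOP" : "/usr/bin/prstat"` -ne 0 ]
    then
        echo "[top]"
        $TOP
    elif test -x "$TOP"
    then
        echo "[top]"
        $TOP -b 20
    fi
fi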
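
For readers less familiar with the expr match used in this workaround, the following is a small stand-alone sketch of what it does. The helper name top_starts_with is hypothetical and not part of the Hobbit client. It relies on expr matching the pattern only at the start of the string and returning a non-zero exit status when nothing matches.

```shell
#!/bin/sh
# Hypothetical helper: succeed when the TOP setting ($1) begins with the
# given program path ($2). expr anchors the match at the start of $1,
# prints the number of characters matched, and exits non-zero when that
# number is 0. The pattern is a regular expression, so a "." in a path
# would match any character.
top_starts_with() {
    expr "$1" : "$2" >/dev/null
}

TOP="/usr/bin/prstat -can 20 1 1"
if top_starts_with "$TOP" "/usr/bin/prstat"
then
    echo "prstat setting: run it as-is"
else
    echo "other setting: append -b 20"
fi
```

Because the match is a prefix test, it succeeds even though TOP contains arguments, which is exactly the case that test -x cannot handle.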


list Rdeal · Fri, 18 Aug 2006 09:10:56 -0400 ·
The problem seems to be that $TOP is set to "/usr/bin/prstat -can 20 1 1".
This causes the check { if test -x "$TOP" } to fail, because the whole string, arguments included, is tested as a single file name.
I changed hobbitclient.cfg and hobbitserver.cfg so that:
TOP="/usr/bin/prstat"
TOPARGS="-can 20 1 1"

And then changed hobbitclient-sunos.sh to:
if test "$TOP" != ""
then
    if test -x "$TOP"
    then
        echo "[top]"
#        $TOP -b 40
        $TOP $TOPARGS
    fi
fi

This works well, except that the Hobbit server does not get a cpu load graph.
It only gets a graph if there is no [top] line in the data set.
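
The failure of test -x on a setting that includes arguments can be reproduced outside the client. The sketch below is only an illustration, not the official fix: the helper name top_is_executable is hypothetical, and /bin/sh stands in for prstat so that it runs on any system. It splits the setting into words and tests only the first one.

```shell
#!/bin/sh
# Hypothetical helper: check whether the program named in a TOP-style
# setting is executable, ignoring any arguments that follow it.
top_is_executable() {
    # Deliberately unquoted: word splitting turns "prog -a -b" into
    # separate positional parameters, so $1 becomes the program path.
    set -- $1
    test -x "$1"
}

SETTING="/bin/sh -c true"      # stand-in for "/usr/bin/prstat -can 20 1 1"

if test -x "$SETTING"
then
    echo "whole string is executable"
else
    echo "whole string is not a file name"
fi

if top_is_executable "$SETTING"
then
    echo "first word is executable"
fi
```

Keeping TOP and TOPARGS as separate variables, as done above, avoids the splitting step altogether.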
list Henrik Størner · Sat, 19 Aug 2006 11:06:23 +0200 ·
quoted from Rdeal
On Fri, Aug 18, 2006 at 02:38:53PM +0200, user-bb3e9041f07f@xymon.invalid wrote:
> I'm currently migrating to hobbit 4.2. All is going ok but I notice something strange under solaris (6->10)! Under the cpu column, I didn't get any output, only load graph.
It's a bug in the hobbitclient-sunos script.

There's a patch available on http://www.hswn.dk/hobbitsw/patches/ now.

> What does "System clock is 0 seconds off" mean ?
It means the client's system clock is in sync with the clock on the
server. It's a rough measure of how well your hosts are synchronized
against a common time-source. You can use the CLOCK setting in
hobbit-clients.cfg to warn you if a host clock drifts too far from
the norm.


Regards,
Henrik
list Thomas Seglard · Mon, 21 Aug 2006 11:44:06 +0200 ·
Thanks for this patch ! And thanks for your explanation about CLOCK. Which utility are you using to measure the drift ?
Best regards,

Thomas


list Charles Goyard · Mon, 21 Aug 2006 11:55:21 +0200 ·
quoted from Thomas Seglard
user-bb3e9041f07f@xymon.invalid wrote :
> And thanks for your explanation about CLOCK. Which utility are you
> using to measure the drift ?
A subtraction and a comparison. It's public domain :).

It just compares the hobbit server time against the client time, and
gives an error if the difference is too big. It lets one drop deadcat's
bb-ntp.sh for most hosts.
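
As a rough sketch of that subtraction and comparison (not the actual Hobbit code): the function name clock_check, the colour names, and the 30-second limit below are all assumptions chosen for illustration; only the wording "System clock is N seconds off" comes from the status message quoted earlier in this thread.

```shell
#!/bin/sh
# Hypothetical helper: compare a client timestamp with the server's and
# report the absolute difference against an allowed maximum.
# Arguments: client epoch seconds, server epoch seconds, max difference.
clock_check() {
    diff=$(( $1 - $2 ))
    if [ "$diff" -lt 0 ]
    then
        diff=$(( 0 - diff ))        # absolute value
    fi
    if [ "$diff" -gt "$3" ]
    then
        color=yellow
    else
        color=green
    fi
    echo "$color: System clock is $diff seconds off"
}

clock_check 1156150000 1156150000 30
clock_check 1156150000 1156150045 30
```

The first call reports a difference of 0 seconds, the second one a drift of 45 seconds, which exceeds the assumed limit.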


-- 
Charles Goyard - user-98f9625a7a59@xymon.invalid - (+33) 1 45 38 01 31
(lunch together today ?)