Xymon Mailing List Archive search

Sun Grid Engine monitoring

list Martin Flemming
Wed, 12 Nov 2008 00:53:57 +0100 (CET)
Message-Id: <user-24ed3b49c53b@xymon.invalid>

Nice, the update work's ... unfortunatley,
i've to change the command

qstat -s r

to

qhost -j -h

because i've got no output for qstat ... don't know why ..

.. for the mrtg- and graph-question i will be answer/ask again,
if i've got time test it ... :-(

 	martin

On Thu, 16 Oct 2008, Martin Flemming wrote:
Nice, that will be easier and quicker to test it, Thanks !

cheers,
	 martin

On Wed, 15 Oct 2008, rdeal wrote:
 I just put an upgrade to the test on deadcat yesterday to version 0.3e...
 Richard

 From: Martin Flemming <user-f286aaa49a76@xymon.invalid>
 Reply-To: <user-ae9b8668bcde@xymon.invalid>
 Date: Wed, 15 Oct 2008 22:58:46 +0200 (CEST)
 To: <user-ae9b8668bcde@xymon.invalid>
 Subject: Re: [hobbit] Sun Grid Engine monitoring
 Thanks, i will test it and give feedback .. :-)
 .. unfortunatley, i'm short in time for testing very short :-(
 cheers,
 martin
 On Fri, 10 Oct 2008, rdeal wrote:
 We have several queues, primary are default, fast, medium.
 The config file is pretty simple:
 Unscaled[_]: y
 Target[sge_1]: `/local/packages/IT/MRTG/bin/qgraph.sh both`
 MaxBytes[sge_1]: 625
 AbsMax[sge_1]: 100000000
 ShortLegend[sge_1]:  Jobs
 YLegend[sge_1]: jobs
 Options[sge_1]: absolute, gauge, noinfo, nopercent, growright, > >  integer,
 unknaszero
 Title[sge_1]: Current Jobs on JCVI GRID
 Legend1[sge_1]: Running
 Legend2[sge_1]: Queued
 Legend3[sge_1]: Peak running
 Legend4[sge_1]: Peak queued
 LegendI[sge_1]: running
 LegendO[sge_1]: queued
 Colours[sge_1]: GREEN#00eb0c,BLUE#0000ff,GRAY#AAAAAA,VIOLET#ff00ff
 PageTop[sge_1]: <H1>Current Jobs on JCVI GRID</H1>
 <TABLE>
   <TR><TD>Maintainer:</TD> <TD>user-f6f804cb0a50@xymon.invalid</TD></TR>
   <TR><TD>Description:</TD><TD>Number of current SGE Jobs running on > >    the
 JCVI SGE GRID</TD></TR>
 </TABLE>
 routers.cgi*Options[sge_1]: noabsmax nodetail
 Target[sge_7]: `/local/packages/IT/MRTG/bin/qgraph.sh medium`
 MaxBytes[sge_7]: 354
 AbsMax[sge_7]: 100000000
 ShortLegend[sge_7]:  Jobs
 YLegend[sge_7]: jobs
 Options[sge_7]: absolute, gauge, noinfo, nopercent, growright, > >  integer,
 unknaszero
 Title[sge_7]: Current Jobs in medium.q and fast.q
 Legend1[sge_7]: Medium
 Legend2[sge_7]: Fast
 Legend3[sge_7]: Peak medium
 Legend4[sge_7]: Peak fast
 LegendI[sge_7]: medium
 LegendO[sge_7]: fast
 Colours[sge_7]: GREEN#00eb0c,BLUE#0000ff,GRAY#AAAAAA,VIOLET#ff00ff
 PageTop[sge_7]: <H1>Current Jobs on JCVI GRID</H1>
 <TABLE>
   <TR><TD>Maintainer:</TD> <TD>user-f6f804cb0a50@xymon.invalid</TD></TR>
   <TR><TD>Description:</TD><TD>Number of current SGE Jobs in the > >    fast.q and
 medium.q</TD></TR>
 </TABLE>
 routers.cgi*Options[sge_7]: noabsmax nodetail
 The script is rather simple:
# !/bin/sh
# > > > >  case "$1" in
  test )
          echo 50
          echo 20 ;;
  total )
          echo $T
          echo 0 ;;
  running )
          echo $R
          echo 0 ;;
  msc )
    T=`/usr/local/bin/qstat -q fast.msc.q -s r | /usr/bin/tail +3 |
 /usr/bin/wc -l`
    R=`/usr/local/bin/qstat -q msc.q -s r | /usr/bin/tail +3 | > >  /usr/bin/wc
 -l`
    echo $R
    echo $T ;;
  medium )
    T=`/usr/local/bin/qstat -q fast.q -s r | /usr/bin/tail +3 | > >  /usr/bin/wc
 -l`
    R=`/usr/local/bin/qstat -q medium.q -s r | /usr/bin/tail +3 |
 /usr/bin/wc -l`
    echo $R
    echo $T ;;
  fast )
    T=`/usr/local/bin/qstat -q default.q -s r | /usr/bin/tail +3 |
 /usr/bin/wc -l`
    R=`/usr/local/bin/qstat -q fast.q -s r | /usr/bin/tail +3 | > >  /usr/bin/wc
 -l`
    echo $R
    echo $T ;;
  desktop )
    T=`/usr/local/bin/qstat -q desktop.q -s r | /usr/bin/tail +3 |
 /usr/bin/wc -l`
    R=`/usr/local/bin/qstat -q fast.q -s r | /usr/bin/tail +3 | > >  /usr/bin/wc
 -l`
    echo $R
    echo $T ;;
  jtc )
    T=`ssh wasp "use sgejcvi; qstat -g dt" | /usr/bin/tail +3 | > >  /usr/bin/wc
 -l`
    R=`ssh wasp "use sgejcvi; qstat -s r" | /usr/bin/tail +3 | > >  /usr/bin/wc
 -l`
    echo $R
    echo $T ;;
  * | both )
    T=`/usr/local/bin/qstat -g dt | /usr/bin/tail +3 | /usr/bin/wc -l`
    R=`/usr/local/bin/qstat -s r | /usr/bin/tail +3 | /usr/bin/wc -l`
    echo $R
    echo $T ;;
 esac
 echo foo
 echo SGE
 The two graphs configured above:
 From: Martin Flemming <user-f286aaa49a76@xymon.invalid>
 Reply-To: <user-ae9b8668bcde@xymon.invalid>
 Date: Fri, 10 Oct 2008 11:46:29 +0200 (CEST)
 To: <user-ae9b8668bcde@xymon.invalid>
 Subject: Re: [hobbit] Sun Grid Engine monitoring
 Hmm, i didn't run mrtg ...
 How can i do it for sge ?
 .. i want to graph e.g jobs in the queue for each node, project or > > >  queue ..
 ... everything i can do :-)
 Or is it maybe possible to create graphs with the data of sge.sh > > >  script ?
 thanks & cheers
 martin
 On Fri, 10 Oct 2008, michael nemeth wrote:
 Ok! well I run mrtg too so I can do that. Also I thought I try > > > >  creating
 graphs for  the sge.sh when
 it's implemented.   I've done custom graphs before (clearcase).
 As I said I know little about SGE and the SGE support person knows > > > >  little
 about hobbit.
 Deal, Richard wrote:
  What did you want to graph.  I use mrtg to graph global she > > > > >   stats
  Sent from a portable system with a small keyboard.
  On Oct 9, 2008, at 5:51 PM, "Martin Flemming" > > > > >   <user-f286aaa49a76@xymon.invalid>
  wrote:
  Yep, i use it too ... but unfortunatley it's without trending > > > > > >   graphs ..
  Got anybody a solution for it ?
  .. i'm very interested :-)
  cheers,
     martin
  On Thu, 9 Oct 2008, rdeal wrote:
  It is called sge.sh
  http://www.deadcat.net/viewfile.php?fileid=920
  And we run it under hobbit now.
  Richard
  From: Galen Johnson <user-87f955643e3d@xymon.invalid>
  Reply-To: <user-ae9b8668bcde@xymon.invalid>
  Date: Thu, 9 Oct 2008 14:57:10 -0400
  To: "user-ae9b8668bcde@xymon.invalid" <user-ae9b8668bcde@xymon.invalid>
  Subject: RE: [hobbit] Sun Grid Engine monitoring
  Where on deadcat?  I can't find anything that mentions it > > > > > > > >   (a search
  on grid on
  Deacat turns up something for the California power grid). > > > > > > > >   Most
  monitors from
  deadcat should work with little to no modification for > > > > > > > >   Hobbit.
  Also, Hobbit
  will monitor whatever you tell it so the sky's the limit > > > > > > > >   there...or
  in this
  case the scripter is the limit.
  =G=
  -----Original Message-----
  From: michael nemeth [mailto:user-609d3fab5b2d@xymon.invalid]
  Sent: Thursday, October 09, 2008 2:46 PM
  To: user-ae9b8668bcde@xymon.invalid
  Subject: [hobbit] Sun Grid Engine monitoring
  While I not been involved in the effort yet,we are about > > > > > > > >   to bring as
  "farm" under Sun Grid Engine in to production. Anyone > > > > > > > >   doing any
  monitoring like this?  I saw a script on deadcat.  I > > > > > > > >   really know
  little about it but said Id check into what monitoring > > > > > > > >   and metrics
  could
  be done by hobbit.
  Gruss
        Martin Flemming
  Martin Flemming
  DESY / IT          office : Building 2b / 008a
  Notkestr. 85       phone  : XXX - XXXX - XXXX
  22603 Hamburg      mail   : user-f286aaa49a76@xymon.invalid
 Gruss
         Martin Flemming
 Martin Flemming
 DESY / IT          office : Building 2b / 008a
 Notkestr. 85       phone  : XXX - XXXX - XXXX
 22603 Hamburg      mail   : user-f286aaa49a76@xymon.invalid
 Gruss
         Martin Flemming
 Martin Flemming
 DESY / IT          office : Building 2b / 008a
 Notkestr. 85       phone  : XXX - XXXX - XXXX
 22603 Hamburg      mail   : user-f286aaa49a76@xymon.invalid
Gruss

      Martin Flemming


Martin Flemming
DESY / IT          office : Building 2b / 008a
Notkestr. 85       phone  : XXX - XXXX - XXXX
22603 Hamburg      mail   : user-f286aaa49a76@xymon.invalid

Gruss

        Martin Flemming


Martin Flemming
DESY / IT          office : Building 2b / 008a
Notkestr. 85       phone  : XXX - XXXX - XXXX
22603 Hamburg      mail   : user-f286aaa49a76@xymon.invalid