Well, I think these tests are coded on the server, and then fetched by the
client and implemented at the client level. Under perfect
conditions it can take 10 or 15 minutes for the client to retrieve and
implement any changes. However, if you have coded everything
correctly (like there is no mismatch on your host name, etc.) then there
may be a problem with the client fetching mechanism.
Since none of your stuff is working, it is hard to say whether the problem
is in your configuration file (remember that Xymon silently
ignores errors) or in the client itself.
GLH
On Thu, Jan 14, 2010 at 12:44 PM, Jonathan B. Horen <user-12d4882938ba@xymon.invalid>wrote:
Yes, I correctly configured hobbit-clients.cfg on the Xymon server (Linux
host), 'cuz it works fine on the cluster's head-node, but not on any of the
compute nodes. And it's not a "cluster" thing, 'cuz cpu, disk, memory, and
messages all work fine on the compute nodes.
"Ports" says "No port checks defined", and "Procs" says "No process checks
defined"; but, here are the entries for the first compute node:
HOST=node13.cluster.private
PROC hobbitlaunch
PROC sge_execd
PORT "LOCAL=%([.:]22)$" state=LISTEN color=yellow TRACK=sshd
"TEXT=SSHD Server"
However, clicking on the blinking white "Ports" LED displays the following:
Thu Jan 14 09:38:57 AKST 2010 - Ports NOT ok
No port checks defined
tcp4 0 0 192.168.2.113.55604 192.168.2.100.2049 ESTABLISHED
tcp4 0 0 192.168.2.113.55602 199.165.76.54.2049 ESTABLISHED
tcp4 0 0 192.168.2.113.50066 199.165.76.54.6444 ESTABLISHED
tcp4 0 0 *.8649 *.* LISTEN
tcp4 0 0 *.6445 *.* LISTEN
tcp4 0 0 *.311 *.* LISTEN
tcp46 0 0 *.5900 *.* LISTEN
tcp4 0 0 *.88 *.* LISTEN
tcp6 0 0 *.88 *.* LISTEN
tcp4 0 0 *.22 *.* LISTEN
tcp6 0 0 *.22 *.* LISTEN
tcp4 0 0 *.625 *.* LISTEN
tcp4 0 0 127.0.0.1.631 *.* LISTEN
tcp6 0 0 ::1.631 *.* LISTEN
So we see that port 22 is being LISTENed on.
And, clicking on the blinking white "Ports" LED displays the following:
Thu Jan 14 09:43:57 AKST 2010 - Processes NOT ok
[image: clear] No process checks defined
PID PPID USER STARTED STAT PRI %CPU TIME %MEM RSS VSZ COMMAND
1 0 root Mon08AM Ss 31 0.0 0:46.13 0.0 588 76512 /sbin/launchd
25 1 root Mon08AM Ss 31 0.0 0:01.31 0.0 1292 75944 /usr/libexec/kextd
26 1 root Mon08AM Ss 31 1.8 32:04.05 0.1 5036 80040 /usr/sbin/DirectoryService
27 1 root Mon08AM Ss 31 0.0 0:04.26 0.0 484 75920 /usr/sbin/notifyd
28 1 root Mon08AM Ss 31 0.0 0:48.60 0.0 484 77012 /usr/sbin/syslogd
29 1 root Mon08AM Ss 31 0.0 6:40.76 0.0 1796 77508 /usr/sbin/configd
30 1 daemon Mon08AM Ss 31 0.0 0:03.21 0.0 656 75324 /usr/sbin/distnoted
31 1 _mdnsresponder Mon08AM Ss 31 0.0 0:01.01 0.0 1296 77360 /usr/sbin/mDNSResponder -launchd
35 1 root Mon08AM Ss 31 0.0 0:00.99 0.0 1704 77088 /usr/sbin/securityd -i
39 1 root Mon08AM Ss 31 0.0 0:04.95 0.0 752 76484 master
40 1 root Mon08AM Ss 31 0.0 0:15.51 0.0 856 75888 /usr/sbin/ntpd -c /private/etc/ntp-restrict.conf -n -g -p /var/run/ntpd.pid -f /var/db/ntp.drift
41 1 _amavisd Mon08AM Ss 31 0.0 0:08.86 0.4 37516 114284 clamd
42 1 root Mon08AM Ss 31 0.0 0:00.01 0.0 320 75320 getty serial.57600 tty.serial
43 1 root Mon08AM Ss 63 0.0 0:01.99 0.0 652 75576 watchdogtimerd
44 1 213 Mon08AM Ss 31 0.0 0:00.04 0.0 1028 77308 /System/Library/PrivateFrameworks/MobileDevice.framework/Versions/A/Resources/usbmuxd -launchd
45 1 root Mon08AM Ss 31 0.0 0:46.29 0.0 292 75300 /usr/sbin/update
46 1 root Mon08AM Ss 31 0.0 0:00.02 0.0 684 75344 /sbin/SystemStarter
49 1 root Mon08AM Ss 31 0.0 4:54.83 0.1 8892 99552 servermgrd -x
51 1 root Mon08AM Ss 31 0.0 0:00.02 0.0 1064 76368 /System/Library/CoreServices/RemoteManagement/AppleVNCServer.bundle/Contents/Support/RFBRegisterMDNS
52 1 root Mon08AM Ss 50 0.0 0:35.73 0.1 5668 119164 /System/Library/Frameworks/CoreServices.framework/Frameworks/Metadata.framework/Support/mds
53 1 root Mon08AM Ss 48 0.0 0:03.22 0.0 3844 99900 /System/Library/CoreServices/loginwindow.app/Contents/MacOS/loginwindow console
54 1 root Mon08AM Ss 31 0.0 0:00.04 0.0 652 75420 /usr/sbin/KernelEventAgent
56 1 root Mon08AM Ss 31 0.0 18:55.08 0.0 1840 75932 hwmond
57 1 root Mon08AM Ss 31 0.0 0:00.67 0.0 600 75864 /usr/libexec/hidd
59 1 root Mon08AM Ss 50 0.0 0:06.90 0.0 1176 80024 /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/CarbonCore.framework/Versions/A/Support/fseventsd
61 1 root Mon08AM Ss 31 0.0 1:58.65 0.0 1816 85404 /sbin/emond
62 1 root Mon08AM Ss 63 0.0 0:00.01 0.0 700 75348 /sbin/dynamic_pager -F /private/var/vm/swapfile
65 1 root Mon08AM Ss 31 0.0 0:00.24 0.0 940 75432 /usr/sbin/diskarbitrationd
69 1 root Mon08AM Ss 31 0.0 0:00.02 0.0 676 75360 autofsd
79 1 root Mon08AM Ss 31 0.0 0:14.39 0.0 996 75468 /usr/sbin/kdcmond -n -a
82 1 root Mon08AM Ss 31 0.0 0:00.79 0.0 2108 78684 /System/Library/CoreServices/coreservicesd
84 1 _windowserver Mon08AM Ss 63 0.0 3:01.21 0.2 16928 114260 /System/Library/Frameworks/ApplicationServices.framework/Frameworks/CoreGraphics.framework/Resources/WindowServer -daemon
87 79 root Mon08AM S 31 0.0 0:00.04 0.0 1208 75772 /usr/sbin/krb5kdc -n -r LKDC:SHA1.A84C8C2567E3A97141E7B5B705DB050E8D8F8D0E
90 39 _postfix Mon08AM S 31 0.0 0:00.60 0.0 836 75524 qmgr -l -t fifo -u
102 1 _atsserver Mon08AM Ss 31 0.0 0:00.25 0.0 1744 112100 /System/Library/Frameworks/ApplicationServices.framework/Frameworks/ATS.framework/Support/ATSServer
108 1 nobody Mon08AM Ss 97 0.0 0:00.07 0.0 1872 86948 /System/Library/CoreServices/RemoteManagement/ARDAgent.app/Contents/MacOS/ARDAgent
109 1 root Mon08AM Ss 31 0.0 0:00.02 0.0 872 75336 /usr/sbin/UserEventAgent -l LoginWindow
111 53 root Mon08AM Ss 31 0.0 0:05.27 0.0 3912 99100 /System/Library/CoreServices/ManagedClient.app/Contents/MacOS/ManagedClient -s
112 108 nobody Mon08AM S 31 0.0 0:00.05 0.0 1716 85020 /System/Library/CoreServices/RemoteManagement/AppleVNCServer.bundle/Contents/MacOS/AppleVNCServer
113 35 root Mon08AM S 31 0.0 0:00.11 0.0 1536 86656 /System/Library/CoreServices/SecurityAgent.app/Contents/Resources/authorizationhost
115 35 _securityagent Mon08AM S 47 0.7 50:36.10 0.1 11336 154704 /System/Library/CoreServices/SecurityAgent.app/Contents/MacOS/SecurityAgent
220 1 3000 Mon08AM S 31 0.0 0:34.61 0.0 912 76648 /usr/local/sge/bin/darwin/sge_execd
231 1 nobody Mon08AM Ss 31 0.0 0:40.63 0.0 2684 76092 /usr/sbin/gmond
346 1 root Mon09AM Ss 31 0.0 0:16.81 0.0 840 76676 /usr/sbin/serialnumberd
8374 1 _update_sharing 1:37PM Ss 31 0.0 0:00.01 0.0 300 67120 /System/Library/Frameworks/JavaVM.framework/Versions/A/Resources/bin/updateSharingD
25291 39 _postfix 8:23AM S 31 0.0 0:00.02 0.0 760 75468 pickup -l -t fifo -u -o content_filter
26058 1 root 9:08AM Ss 31 0.0 0:00.03 0.0 276 76400 /usr/local/xymon/client/bin/hobbitlaunch --config=/usr/local/xymon/client/etc/clientlaunch.cfg --log=/usr/local/xymon/client/logs/clientlaunch.log --pidfile=/usr/local/xymon/client/logs/clientlaunch.node13.cluster.private.pid
26558 1 root 9:43AM Ss 31 0.0 0:00.01 0.0 788 75440 /usr/libexec/samba/synchronize-preferences --linger
26563 26058 root 9:43AM S 31 3.0 0:00.01 0.0 728 75944 /bin/sh /usr/local/xymon/client/bin/hobbitclient.sh
26567 26563 root 9:43AM R 30 5.3 0:00.02 0.0 684 75944 /bin/sh /usr/local/xymon/client/bin/hobbitclient-darwin.sh
26583 26567 root 9:43AM R 31 0.0 0:00.00 0.0 360 75352 ps -ax -ww -o pid
So we see that both the hobbitlaunch and sge_execd processes are present.
Thoughts? Suggestions?
--
JONATHAN B. HOREN
Systems Administrator
UAF Life Science Informatics
Center for Research Services
user-12d4882938ba@xymon.invalid
http://biotech.inbre.alaska.edu
--
Disclaimer: 1) all opinions are my own, 2) I may be completely wrong, 3) my
advice is worth at least as much as what you are paying for it, or your
money cheerfully refunded.