If you're seeing a crash alert ("Signal received", etc) and it's purple,
it was just the one-time note that something internal to xymon crashed.
(That, of course, isn't supposed to happen, but... :/ )
xymond_rrd normally doesn't send in a test about itself (none of the
processors launched via xymond_channel do by default, only the
xymonlaunch-ed daemons and run-once commands), so it won't clear even if
the system is running fine now.
Also, I'd have to check, but I believe re-enabling of disables doesn't
take effect right away -- there's a part that may not update until the
next status message is received for it.
In either case, just drop the now-spurious "xymond_rrd" test using
something like:
./xymon 0.0.0.0 "drop xymonems xymond_rrd"
HTH,
-jc
On Tue, October 21, 2014 6:57 am, Neal, Jonathan W wrote:Anyone got any ideas about this? The test does not show as disabled on
the enable/disable page, but is still blue and doesnât seem to update at
all. No xymond_rrd files are being created in /export/home/xymon/data/*
anywhere. If I do a ./xymon 0.0.0.0 "enable xymonems.xymond_rrd" it
doesnât change it at all. It is like the status is stuck somewhere, but
I am not sure how or where.
From: Neal, Jonathan W [mailto:user-9e31f92d698c@xymon.invalid]
Sent: Monday, October 20, 2014 12:52 PM
To: Jeremy Laidman
Cc: xymon at xymon.com
Subject: RE: [Xymon] Multiple Issues with 4.3.17 install
No core file on the system. I think there is something else odd going
on. I removed all the data that belongs to the xymonems host from
/data/* . I restarted the system and xymond_rrd is still blue, even
though it isnât even disabled any longer. Itâs like it canât or
doesnât know how to update the status for it. I watched the xymond
status for from yellow to green after the restarted, but xymond_rrd never
changed.
From: Jeremy Laidman [mailto:user-71895fb2e44c@xymon.invalid]
Sent: Sunday, October 19, 2014 4:08 PM
To: Neal, Jonathan W
Subject: Re: [Xymon] Multiple Issues with 4.3.17 install
Look for a core file, then use gdb to get a backtrace. This will tell us
what it is doing when it crashes.
J
On 18/10/2014 7:31 AM, "Neal, Jonathan W via Xymon" <xymon at xymon.com>
wrote:
---------- Forwarded message ----------
From:Â "Neal, Jonathan W" <user-9e31f92d698c@xymon.invalid>
To:Â "xymon at xymon.com" <xymon at xymon.com>
Cc:Â
Date:Â Fri, 17 Oct 2014 16:31:06 -0400
Subject:Â RE: [Xymon] Multiple Issues with 4.3.17 install
I am unsure why it is even showing purple, but it definitely is and it
keeps alerting on it. If I drill down into a system I see data being
graphed that is valid. If I look at the processes on the system I see:
xymonems:xymon > ps -ef |grep xymond_rrd
  xymon 18720 18714  0 02:45:03 ?      0:00
xymond_channel --channel=data --log=/var/log/xymon/rrd-data.log xymond_rrd
--rr
  xymon 18806 18720  0 02:45:12 ?      0:02 xymond_rrd
--rrddir=/export/home/xymon/data/rrd
  xymon 18761 18719  0 02:45:04 ?      0:51 xymond_rrd
--rrddir=/export/home/xymon/data/rrd
  xymon 18719 18714  0 02:45:03 ?      0:14
xymond_channel --channel=status --log=/var/log/xymon/rrd-status.log
xymond_rrd
So to me it seems as if it is running. What am I missing here?
Wes
---------- Forwarded message ----------
From:Â "Neal, Jonathan W" <user-9e31f92d698c@xymon.invalid>
To:Â "xymon at xymon.com" <xymon at xymon.com>
Cc:Â
Date:Â Thu, 16 Oct 2014 18:40:46 -0400
Subject:Â Multiple Issues with 4.3.17 install
I am coming from an early 4.2 install. I merged my bb-hosts,
hobbit-alerts.cfg and hobbit-clients.cfg files into the proper files in
the new 4.3.17 install. I also copied over the entire histlogs directory
from data.  Currently xymon_rrd keeps dying and going purple with a
Fatal signal error.
Â
rrd-status log has this in it going back most of the day:
Â
2014-10-16 19:23:00 Peer at 0.0.0.0:0 failed: Broken pipe
2014-10-16 19:23:00 Peer not up, flushing message queue
2014-10-16 19:24:43 Shutting down, flushing cached updates to disk
2014-10-16 19:28:39 Peer not up, flushing message queue
2014-10-16 20:00:35 Shutting down, flushing cached updates to disk
2014-10-16 20:00:36 Cache flush completed
2014-10-16 21:58:19 Peer not up, flushing message queue
2014-10-16 22:30:58 Shutting down, flushing cached updates to disk
2014-10-16 22:30:59 Cache flush completed
2014-10-16 22:31:14 Peer not up, flushing message queue
Â
Xymond is also constantly going yellow and I again see that 0.0.0.0:1984
that is mentioned above:
Â
Statistics for Xymon daemon
Version: 4.3.17
Up since 16-Oct-2014 22:31:09 (0 days, 00:04:59)
Â
Incoming messages     :       937
- status              :       885
- combo               :         1
- extcombo            :        22
- page                :         0
- summary             :         0
- data                :         6
- client              :         2
- notes               :         0
- enable              :         0
- disable             :         0
- ack                 :         0
- config              :         4
- query               :         0
- xymondboard         :         6
- xymondlog           :         5
- drop                :         0
- rename              :         0
- dummy               :         1
- ping                :         0
- notify              :         0
- schedule            :         0
- download            :         0
- Bogus/Timeouts      :         5
Incoming messages/sec :         3 (average last 300 seconds)
Â
status channel messages:Â Â Â Â Â Â Â 885 (1 readers)
stachg channel messages:Â Â Â Â Â Â Â 877 (1 readers)
page  channel messages:        37 (1 readers)
data  channel messages:         6 (1 readers)
notes channel messages:         0 (0 readers)
enadis channel messages:Â Â Â Â Â Â Â Â Â 0 (0 readers)
client channel messages:Â Â Â Â Â Â Â Â Â 2 (1 readers)
clichg channel messages:Â Â Â Â Â Â Â Â Â 0 (1 readers)
user  channel messages:         0 (0 readers)
backfeed messages     :         0
Â
Â
Latest error messages:
Loading hostnames
Loading saved state
Setting up network listener on 0.0.0.0:1984
Setting up signal handlers
Setting up xymond channels
Setting up logfiles
Setup complete
Â
Can anyone tell me what might be going on?
Thanks in advance!
Â