I am unsure why it is even showing purple, but it definitely is, and it keeps alerting on it. If I drill down into a system, the data being graphed is valid. If I look at the processes on the system I see:
xymonems:xymon > ps -ef |grep xymond_rrd
xymon 18720 18714 0 02:45:03 ? 0:00 xymond_channel --channel=data --log=/var/log/xymon/rrd-data.log xymond_rrd --rr
xymon 18806 18720 0 02:45:12 ? 0:02 xymond_rrd --rrddir=/export/home/xymon/data/rrd
xymon 18761 18719 0 02:45:04 ? 0:51 xymond_rrd --rrddir=/export/home/xymon/data/rrd
xymon 18719 18714 0 02:45:03 ? 0:14 xymond_channel --channel=status --log=/var/log/xymon/rrd-status.log xymond_rrd
So to me it seems as if it is running. What am I missing here?
Wes
---------- Forwarded message ----------
From: "Neal, Jonathan W" <user-9e31f92d698c@xymon.invalid>
To: "xymon at xymon.com" <xymon at xymon.com>
Cc:
Date: Thu, 16 Oct 2014 18:40:46 -0400
Subject: Multiple Issues with 4.3.17 install
I am coming from an early 4.2 install. I merged my bb-hosts, hobbit-alerts.cfg and hobbit-clients.cfg files into the corresponding files in the new 4.3.17 install. I also copied over the entire histlogs directory from data. Currently xymond_rrd keeps dying and going purple with a Fatal signal error.
The rrd-status log has shown entries like these for most of the day:
2014-10-16 19:23:00 Peer at 0.0.0.0:0 failed: Broken pipe
2014-10-16 19:23:00 Peer not up, flushing message queue
2014-10-16 19:24:43 Shutting down, flushing cached updates to disk
2014-10-16 19:28:39 Peer not up, flushing message queue
2014-10-16 20:00:35 Shutting down, flushing cached updates to disk
2014-10-16 20:00:36 Cache flush completed
2014-10-16 21:58:19 Peer not up, flushing message queue
2014-10-16 22:30:58 Shutting down, flushing cached updates to disk
2014-10-16 22:30:59 Cache flush completed
2014-10-16 22:31:14 Peer not up, flushing message queue
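To see how often the worker is restarting, the entries in the log excerpt above can be tallied by message text. This is only a rough sketch, assuming every line has the "date time message" layout shown in the excerpt; the function name tally_messages and the sample list are made up for illustration and are not part of Xymon.

```python
import re
from collections import Counter

# Assumed line layout, taken from the excerpt: "YYYY-MM-DD HH:MM:SS <message>"
LINE_RE = re.compile(r"^(\d{4}-\d{2}-\d{2}) (\d{2}:\d{2}:\d{2}) (.+)$")


def tally_messages(lines):
    """Count how often each message text appears, ignoring the timestamp."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match:  # lines that do not fit the assumed layout are skipped
            counts[match.group(3)] += 1
    return counts


# Sample input: the rrd-status log lines quoted in the message.
sample = [
    "2014-10-16 19:23:00 Peer at 0.0.0.0:0 failed: Broken pipe",
    "2014-10-16 19:23:00 Peer not up, flushing message queue",
    "2014-10-16 19:24:43 Shutting down, flushing cached updates to disk",
    "2014-10-16 19:28:39 Peer not up, flushing message queue",
    "2014-10-16 20:00:35 Shutting down, flushing cached updates to disk",
    "2014-10-16 20:00:36 Cache flush completed",
    "2014-10-16 21:58:19 Peer not up, flushing message queue",
    "2014-10-16 22:30:58 Shutting down, flushing cached updates to disk",
    "2014-10-16 22:30:59 Cache flush completed",
    "2014-10-16 22:31:14 Peer not up, flushing message queue",
]

if __name__ == "__main__":
    for message, count in tally_messages(sample).most_common():
        print(f"{count:3d}  {message}")
```

Reading the real file instead of the sample would mean passing an open file handle for /var/log/xymon/rrd-status.log (the path from the ps output) to tally_messages.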
Xymond is also constantly going yellow, and I again see the 0.0.0.0 address that appears above, this time as 0.0.0.0:1984:
Statistics for Xymon daemon
Version: 4.3.17
Up since 16-Oct-2014 22:31:09 (0 days, 00:04:59)
Incoming messages : 937
- status : 885
- combo : 1
- extcombo : 22
- page : 0
- summary : 0
- data : 6
- client : 2
- notes : 0
- enable : 0
- disable : 0
- ack : 0
- config : 4
- query : 0
- xymondboard : 6
- xymondlog : 5
- drop : 0
- rename : 0
- dummy : 1
- ping : 0
- notify : 0
- schedule : 0
- download : 0
- Bogus/Timeouts : 5
Incoming messages/sec : 3 (average last 300 seconds)
status channel messages: 885 (1 readers)
stachg channel messages: 877 (1 readers)
page channel messages: 37 (1 readers)
data channel messages: 6 (1 readers)
notes channel messages: 0 (0 readers)
enadis channel messages: 0 (0 readers)
client channel messages: 2 (1 readers)
clichg channel messages: 0 (1 readers)
user channel messages: 0 (0 readers)
backfeed messages : 0
Latest error messages:
Loading hostnames
Loading saved state
Setting up network listener on 0.0.0.0:1984
Setting up signal handlers
Setting up xymond channels
Setting up logfiles
Setup complete
Can anyone tell me what might be going on?
Thanks in advance!