Xymon Mailing List Archive search

Multiple Issues with 4.3.17 install

list Wes Neal
Tue, 21 Oct 2014 09:57:48 -0400
Message-Id: <user-84bdf9244e60@xymon.invalid>

Anyone got any ideas about this?  The test does not show as disabled on the enable/disable page, but is still blue and doesn’t seem to update at all.  No xymond_rrd files are being created in /export/home/xymon/data/* anywhere.  If I do a ./xymon 0.0.0.0 "enable xymonems.xymond_rrd"  it doesn’t change it at all.  It is like the status is stuck somewhere, but I am not sure how or where.


From: Neal, Jonathan W [mailto:user-9e31f92d698c@xymon.invalid] 
Sent: Monday, October 20, 2014 12:52 PM
To: Jeremy Laidman
Cc: xymon at xymon.com
Subject: RE: [Xymon] Multiple Issues with 4.3.17 install

No core file on the system.  I think there is something else odd going on.  I removed all the data that belongs to the xymonems host from /data/* .  I restarted the system and xymond_rrd is still blue, even though it isn’t even disabled any longer.  It’s like it can’t or doesn’t know how to update the status for it.  I watched the xymond status for from yellow to green after the restarted, but xymond_rrd never changed.


From: Jeremy Laidman [mailto:user-71895fb2e44c@xymon.invalid] 
Sent: Sunday, October 19, 2014 4:08 PM
To: Neal, Jonathan W
Subject: Re: [Xymon] Multiple Issues with 4.3.17 install

Look for a core file, then use gdb to get a backtrace. This will tell us what it is doing when it crashes.
J
On 18/10/2014 7:31 AM, "Neal, Jonathan W via Xymon" <xymon at xymon.com> wrote:


---------- Forwarded message ----------
From: "Neal, Jonathan W" <user-9e31f92d698c@xymon.invalid>
To: "xymon at xymon.com" <xymon at xymon.com>
Cc: 
Date: Fri, 17 Oct 2014 16:31:06 -0400
Subject: RE: [Xymon] Multiple Issues with 4.3.17 install
I am unsure why it is even showing purple, but it definitely is and it keeps alerting on it.  If I drill down into a system I see data being graphed that is valid.  If I look at the processes on the system I see:

xymonems:xymon > ps -ef |grep xymond_rrd
   xymon 18720 18714   0 02:45:03 ?           0:00 xymond_channel --channel=data --log=/var/log/xymon/rrd-data.log xymond_rrd --rr
   xymon 18806 18720   0 02:45:12 ?           0:02 xymond_rrd --rrddir=/export/home/xymon/data/rrd
   xymon 18761 18719   0 02:45:04 ?           0:51 xymond_rrd --rrddir=/export/home/xymon/data/rrd
   xymon 18719 18714   0 02:45:03 ?           0:14 xymond_channel --channel=status --log=/var/log/xymon/rrd-status.log xymond_rrd

So to me it seems as if it is running.  What am I missing here?

Wes


---------- Forwarded message ----------
From: "Neal, Jonathan W" <user-9e31f92d698c@xymon.invalid>
To: "xymon at xymon.com" <xymon at xymon.com>
Cc: 
Date: Thu, 16 Oct 2014 18:40:46 -0400
Subject: Multiple Issues with 4.3.17 install
I am coming from an early 4.2 install.  I merged my bb-hosts, hobbit-alerts.cfg and hobbit-clients.cfg files into the proper files in the new 4.3.17 install.  I also copied over the entire histlogs directory from data.   Currently xymon_rrd keeps dying and going purple with a Fatal signal error.
 
rrd-status log has this in it going back most of the day:
 
2014-10-16 19:23:00 Peer at 0.0.0.0:0 failed: Broken pipe
2014-10-16 19:23:00 Peer not up, flushing message queue
2014-10-16 19:24:43 Shutting down, flushing cached updates to disk
2014-10-16 19:28:39 Peer not up, flushing message queue
2014-10-16 20:00:35 Shutting down, flushing cached updates to disk
2014-10-16 20:00:36 Cache flush completed
2014-10-16 21:58:19 Peer not up, flushing message queue
2014-10-16 22:30:58 Shutting down, flushing cached updates to disk
2014-10-16 22:30:59 Cache flush completed
2014-10-16 22:31:14 Peer not up, flushing message queue
 
Xymond is also constantly going yellow and I again see that 0.0.0.0:1984 that is mentioned above:
 
Statistics for Xymon daemon
Version: 4.3.17
Up since 16-Oct-2014 22:31:09 (0 days, 00:04:59)
 
Incoming messages      :        937
- status               :        885
- combo                :          1
- extcombo             :         22
- page                 :          0
- summary              :          0
- data                 :          6
- client               :          2
- notes                :          0
- enable               :          0
- disable              :          0
- ack                  :          0
- config               :          4
- query                :          0
- xymondboard          :          6
- xymondlog            :          5
- drop                 :          0
- rename               :          0
- dummy                :          1
- ping                 :          0
- notify               :          0
- schedule             :          0
- download             :          0
- Bogus/Timeouts       :          5
Incoming messages/sec  :          3 (average last 300 seconds)
 
status channel messages:        885 (1 readers)
stachg channel messages:        877 (1 readers)
page   channel messages:         37 (1 readers)
data   channel messages:          6 (1 readers)
notes  channel messages:          0 (0 readers)
enadis channel messages:          0 (0 readers)
client channel messages:          2 (1 readers)
clichg channel messages:          0 (1 readers)
user   channel messages:          0 (0 readers)
backfeed messages      :          0
 
 
Latest error messages:
Loading hostnames
Loading saved state
Setting up network listener on 0.0.0.0:1984
Setting up signal handlers
Setting up xymond channels
Setting up logfiles
Setup complete
 
Can anyone tell me what might be going on?
Thanks in advance!