Xymon Mailing List Archive search

xymond_rrd program crashed

list Japheth Cleaver
Wed, 7 Aug 2019 14:17:27 -0700
Message-Id: <user-9a5ce56c3d64@xymon.invalid>

Thanks,

I thought I had actually added this one in, but it appears to have gotten lost in an edit.

Added in https://sourceforge.net/p/xymon/code/8073

Regards,
-jc

On 8/6/2019 4:04 PM, Tom Schmidt (tschmidt) wrote:
Andreas,

??? I likewise am seeing xymond_rrd program crashing with 4.3.29 but it did not with 4.3.28.? If you look at the differences in the xymond/rrd/do_netstat.c between 4.3.28 and 4.3.29 you will see that the code was updated to handle newer Linux systems net-tools format.? Your logs also show that it is not recognizing the OS for your ?rackstation*? hosts.? I do not know what type of OS these systems are, but you may have to update your netstat monitor or the xymond/rrd/do_netstat.c code to recognize their netstat output format.

??? For me, xymond_rrd is crashing when trying to extract RRD data from some of our temperature reports.? Some of our older temperature monitors returned data in this format:

Device????????????????? Temp(C)? Temp(F) Threshold(C)


&green System Board Inlet Temp 21?????? 69???? ( 42)

&green CPU1 Temp?????????????? 32 89???? ( 93)

&green CPU2 Temp?????????????? 31 87???? ( 93)


Xymon 4.3.28 handled this properly by treating the data in the parenthesis as comments.? Attached is my patch to fix xymond/rrd/do_temperature.c to handle temperature reports like the above. ?Most of this code was already in 4.3.28 but I do not know why it was removed.

*Japheth*,

?? Please consider adding this patch for xymond/rrd/do_temperature.c ?to the next release.? It also includes the stripping off of any leading bold and italic HTML tags from sensor names that I submitted earlier.

Thanks?Tom


http://collab.micron.com/corp/brand/SiteAssets/Micron.png <http://www.micron.com/>;

	
Tom Schmidt
Sr Manager, IT, Product Engineering
IT ETD Eng Sites US
Micron Technology, Inc.

Office:?+X (XXX) XXX-XXXX ?Fax:?(208)368-2807

Email: user-48d3fa8908d4@xymon.invalid <mailto:user-48d3fa8908d4@xymon.invalid> Website: micron.com <http://www.micron.com/>;
Micron Technology, Inc., Confidential and Proprietary.

*From:* Xymon <xymon-bounces at xymon.com> *On Behalf Of *Andreas Kunberger
*Sent:* Monday, August 5, 2019 4:13 AM
*To:* 'Xymon at xymon.com' <Xymon at xymon.com>
*Subject:* [EXT] [Xymon] xymond_rrd program crashed

Since we ?have updated to Xymon 4.3.29-1.el7.terabithia

We get the xymond status:

????xymond_rrd program crashed

??? Fatal signal caught!

In the /var/log/messages we have:

Aug? 4 06:59:33 suse abrt-hook-ccpp[9966]: Process 8736 (xymond_rrd) of user 1000 killed by SIGABRT - dumping core

Aug? 4 06:59:34 suse abrt-server[9968]: Package 'xymon' isn't signed with proper key

Aug? 4 06:59:34 suse abrt-server[9968]: 'post-create' on '/var/spool/abrt/ccpp-2019-08-04-06:59:33-8736' exited with 1

Aug? 4 06:59:34 suse abrt-server[9968]: Deleting problem directory '/var/spool/abrt/ccpp-2019-08-04-06:59:33-8736'

and in /var/log/xymon/rrd-status

2019-08-04 06:58:43.164318 net-janus/ntpd.rrd: Bug - duplicate RRD data with same timestamp 1564894723, different data

2019-08-04 06:59:33.750946 Host 'rackstation' reports netstat for an unknown OS

2019-08-04 06:59:34.621163 xymond_channel: Child process 8736 died: Signal 6

2019-08-04 06:59:34.739280 xymond_channel: Peer at 0.0.0.0:0 failed: Broken pipe

2019-08-04 06:59:34.986178 xymond_channel: Peer not up, flushing message queue

2019-08-04 06:59:56.727302 Host 'rackstation3' reports netstat for an unknown OS

2019-08-04 06:59:57.603455 xymond_channel: Child process 9971 died: Signal 6

2019-08-04 06:59:58.136009 xymond_channel: Peer at 0.0.0.0:0 failed: Broken pipe

2019-08-04 06:59:58.136109 xymond_channel: Peer not up, flushing message queue

2019-08-04 06:59:58.136150 xymond_channel: Peer not up, flushing message queue

2019-08-04 06:59:58.136192 xymond_channel: Peer not up, flushing message queue

2019-08-04 06:59:58.136316 xymond_channel: Peer not up, flushing message queue

2019-08-04 06:59:58.136374 xymond_channel: Peer not up, flushing message queue

?.

The Server runs on CentOS Linux release 7.6.1810 (Core)

Thanks in advance!

i.A. Andreas Kunberger

-- 

Andreas Kunberger

ZD/IT