xymondboard error
list Daniel Lozovsky
I recently started seeing these error messages in my log file periodically. When these messages appeared, I noticed that some of the tests turned red and white status but not all. Could you point me in the right direction of how I can trouble shoot this error messages? 2020-07-29 12:30:13.241732 Whoops ! Failed to send message (timeout) 2020-07-29 12:30:13.456322 -> 2020-07-29 12:30:13.456362 -> Recipient x.x.x.x, timeout 15 2020-07-29 12:30:13.456375 -> 1st line: 'xymondboard fields=hostname,testname,color,flags,lastchange,logtime,validtime,acktime,disabletime,sender,cookie,line1,acklist ' 2020-07-29 12:30:13.456394 xymond status-board not available, code 7 2020-07-29 12:30:13.456409 Failed to load current Xymon status, aborting page-update
list Jeremy Laidman
Daniel Which logfile did you see this in? How often do you see these? How long ago did this start? On the face of it, it looks like a process (I'm guessing xymongen) is having a problem connecting to the xymond daemon, which would cause it to fail to construct a display page. The problem is most likely that xymond is unable to accept a connection, or respond quickly enough, hence the 15 second timeout. The red/white status you are seeing is possibly other messages being dropped by the xymond daemon. I would take a look at the xymond status page of the Xymon servers and see if there are any useful messages or counters there. Take a look at other logfiles (eg xymongen.log, xymonnet.log) to see if there are similar messages. Cheers Jeremy
▸
On Thu, 30 Jul 2020 at 08:19, LOZOVSKY, DANIEL <user-5085da3588ee@xymon.invalid> wrote:
I recently started seeing these error messages in my log file periodically. When these messages appeared, I noticed that some of the tests turned red and white status but not all. Could you point me in the right direction of how I can trouble shoot this error messages? 2020-07-29 12:30:13.241732 Whoops ! Failed to send message (timeout) 2020-07-29 12:30:13.456322 -> 2020-07-29 12:30:13.456362 -> Recipient x.x.x.x, timeout 15 2020-07-29 12:30:13.456375 -> 1st line: 'xymondboard fields=hostname,testname,color,flags,lastchange,logtime,validtime,acktime,disabletime,sender,cookie,line1,acklist ' 2020-07-29 12:30:13.456394 xymond status-board not available, code 7 2020-07-29 12:30:13.456409 Failed to load current Xymon status, aborting page-update
list Daniel Lozovsky
Hi Jeremy thank you for replying. Looks like my company is blocking some of the emails coming from xymon so I did not see your reply. Which logfile did you see this in? - xymongen.log How often do you see these? - This happened twice on 7/25 and 7/29 never saw this before. How long ago did this start? - Started 5 ago but only saw it twice. I did not see anything really strange in any other log. I am not sure why this started happening. I did see huge spike approx. 5x more than usual on xymongen and xymonnet graphs both spikes occurred at the same time on Wednesday which correlated with the 7/29 xymonboard error in the log. The spike quickly went back to normal. This is a bit wired since I never saw this type of issue before. Daniel
▸
On the face of it, it looks like a process (I'm guessing xymongen) is
having a problem connecting to the xymond daemon, which would cause it to
fail to construct a display page. The problem is most likely that xymond is
unable to accept a connection, or respond quickly enough, hence the 15
second timeout. The red/white status you are seeing is possibly other
messages being dropped by the xymond daemon.
I would take a look at the xymond status page of the Xymon servers and see
if there are any useful messages or counters there. Take a look at other
logfiles (eg xymongen.log, xymonnet.log) to see if there are similar
messages.
Cheers
Jeremy
From: LOZOVSKY, DANIEL
Sent: Wednesday, July 29, 2020 11:06 AM
To: 'xymon at xymon.com' <xymon at xymon.com>
Subject: xymondboard error
I recently started seeing these error messages in my log file periodically. When these messages appeared, I noticed that some of the tests turned red and white status but not all. Could you point me in the right direction of how I can trouble shoot this error messages?
2020-07-29 12:30:13.241732 Whoops ! Failed to send message (timeout)
2020-07-29 12:30:13.456322 ->
2020-07-29 12:30:13.456362 -> Recipient x.x.x.x, timeout 15
2020-07-29 12:30:13.456375 -> 1st line: 'xymondboard fields=hostname,testname,color,flags,lastchange,logtime,validtime,acktime,disabletime,sender,cookie,line1,acklist '
2020-07-29 12:30:13.456394 xymond status-board not available, code 7
2020-07-29 12:30:13.456409 Failed to load current Xymon status, aborting page-update