Xymon Mailing List Archive search

msgcache is unstable

list Rolf Masfelder
Thu, 24 Aug 2006 09:07:41 +0200
Message-Id: <user-fc755bcd352e@xymon.invalid>

Am Dienstag 22 August 2006 18:10 schrieb Rolf Masfelder:
Am Dienstag 22 August 2006 16:21 schrieb Henrik Stoerner:
On Tue, Aug 22, 2006 at 04:03:20PM +0200, Rolf Masfelder wrote:
i'm running msgcache on a remote machine. For some houres it
works fine, but at some point it stopps working.

Here is a part of msgcache.log (msgcache is running with --debug)
... as before ...
Finally, when msgcache is in this state, what happens if you run
I have to wait for this ...
- from the Hobbit server - the command

   bb IP.OF.CLIENT.HOST "pullclient"

It should dump the last status message to the screen.
on the server:
2006-08-24 08:50:55 Whoops ! bb failed to send message - timeout

on the client:
the last entries in msgcache.log are
2006-08-24 08:47:49 New connection
2006-08-24 08:49:08 New connection
2006-08-24 08:50:15 New connection
2006-08-24 08:50:40 New connection

the last entires in hobbitclient.log:
2006-08-24 08:29:19 Whoops ! bb failed to send message - timeout
2006-08-24 08:34:20 Whoops ! bb failed to send message - timeout
2006-08-24 08:39:21 Whoops ! bb failed to send message - timeout
2006-08-24 08:44:22 Whoops ! bb failed to send message - timeout
2006-08-24 08:49:23 Whoops ! bb failed to send message - timeout
2006-08-24 08:54:24 Whoops ! bb failed to send message - timeout

Time is correct on both machines.

What I have seen:
there are two types of processing-blocks:
--- first, occures every fourth or fifthes connection
2006-08-23 19:02:06 New connection
2006-08-23 19:02:06 Queuing outbound message
2006-08-23 19:02:15 New connection
2006-08-23 19:02:15 -> oksender
2006-08-23 19:02:15 <- oksender(1-a)
2006-08-23 19:02:15 Got pullclient request: pullclient 1
log:/var/log/messages:10240
ignore MARK


2006-08-23 19:02:15 Saved client response: log:/var/log/messages:10240
ignore MARK

there is a "Queuing outbound message" between two "New connection"

--- second, the 'normal' Version
2006-08-23 19:03:14 New connection
2006-08-23 19:03:14 -> oksender
2006-08-23 19:03:14 <- oksender(1-a)
2006-08-23 19:03:14 Got pullclient request: pullclient 1
log:/var/log/messages:10240
ignore MARK


2006-08-23 19:03:14 Saved client response: log:/var/log/messages:10240
ignore MARK

one "New connection" without "Queuing ..."

--- the last block before the 'problem' occures:
2006-08-23 19:04:16 New connection
2006-08-23 19:05:53 New connection
2006-08-23 19:05:53 -> oksender
2006-08-23 19:05:53 <- oksender(1-a)
2006-08-23 19:05:53 Got pullclient request: pullclient 1
log:/var/log/messages:10240
ignore MARK


2006-08-23 19:05:53 Saved client response: log:/var/log/messages:10240
ignore MARK

there are two "New connection" without the "Queuing ..."

after that there are only 
2006-08-23 19:06:39 New connection
2006-08-23 19:07:06 New connection
2006-08-23 19:09:35 New connection
2006-08-23 19:11:47 New connection
2006-08-23 19:12:07 New connection

in the log

i hope that helps

Regards,
Henrik

Thanks in advance
Greetings
-- 
Rolf Masfelder

Tel.:	XXXXX XXX XXX
FAX:	XXXXX XXX XXX
Mobil:	XXXX XX XX XXX
world:	0700 NECTORGMbh
EMail: 	user-7583fc084b48@xymon.invalid

http://www.nector.de