Xymon Mailing List Archive search

All network tests now purple - update

6 messages in this thread

list Rich Smrcina · Thu, 26 Oct 2006 13:43:02 -0500 ·
I rebooted the machine now after having installed the update and now I 
see this in bb-network.log:

*** glibc detected *** corrupted double-linked list: 0x0000000080051ae0 ***

No core file.
-- 
Rich Smrcina
VM Assist, Inc.
Phone: XXX-XXX-XXXX
Ans Service:  XXX-XXX-XXXX
user-61add9955ef9@xymon.invalid

Catch the WAVV!  http://www.wavv.org
WAVV 2007 - Green Bay, WI - May 18-22, 2007
list Henrik Størner · Fri, 27 Oct 2006 22:49:27 +0200 ·
quoted from Rich Smrcina
On Thu, Oct 26, 2006 at 01:43:02PM -0500, Rich Smrcina wrote:
I rebooted the machine now after having installed the update and now I 
see this in bb-network.log:

*** glibc detected *** corrupted double-linked list: 0x0000000080051ae0 ***

No core file.
This does indicate some sort of bad memory access by bbtest-net.
Unfortunately, it doesn't provide much clue about what caused this
problem.

Since it's glibc, I suppose it's on an Intel-based system ? In that
case, it would be helpful if you could pick up the "Valgrind" utility; 
it is included in some Linux distributions, but you can also grab it
from valgrind.org. Once you have it, you can run bbtest-net with the
command

   valgrind --log-file=vlg.log bbtest-net [any bbtest-net options]

If there's a memory access error, it should catch it right away.
I'd be interested to see the logfile it produces.


Regards,
Henrik
list Rich Smrcina · Fri, 27 Oct 2006 15:57:11 -0500 ·
This s390x architecture (64-bit Mainframe).  The web site makes no mention of the architecture, but I'm checking with the Linux for zSeries list to see if anyone has ever used it on the mainframe.

If it looks like it will work, I'll create the log file and send it.

Thanks.
quoted from Henrik Størner

Henrik Stoerner wrote:
On Thu, Oct 26, 2006 at 01:43:02PM -0500, Rich Smrcina wrote:
I rebooted the machine now after having installed the update and now I see this in bb-network.log:

*** glibc detected *** corrupted double-linked list: 0x0000000080051ae0 ***

No core file.
This does indicate some sort of bad memory access by bbtest-net.
Unfortunately, it doesn't provide much clue about what caused this
problem.

Since it's glibc, I suppose it's on an Intel-based system ? In that
case, it would be helpful if you could pick up the "Valgrind" utility; it is included in some Linux distributions, but you can also grab it
from valgrind.org. Once you have it, you can run bbtest-net with the
command

   valgrind --log-file=vlg.log bbtest-net [any bbtest-net options]

If there's a memory access error, it should catch it right away.
I'd be interested to see the logfile it produces.


Regards,
Henrik

-- 
Rich Smrcina
VM Assist, Inc.
Phone: XXX-XXX-XXXX
Ans Service:  XXX-XXX-XXXX
user-61add9955ef9@xymon.invalid

Catch the WAVV!  http://www.wavv.org
WAVV 2007 - Green Bay, WI - May 18-22, 2007
list Rich Smrcina · Fri, 27 Oct 2006 21:33:24 -0500 ·
:(  Any other ideas?  Valgrind is not supported on the s390/s390x architecture (configure fails with a message to that effect).
quoted from Rich Smrcina

Henrik Stoerner wrote:
On Thu, Oct 26, 2006 at 01:43:02PM -0500, Rich Smrcina wrote:
I rebooted the machine now after having installed the update and now I see this in bb-network.log:

*** glibc detected *** corrupted double-linked list: 0x0000000080051ae0 ***

No core file.
This does indicate some sort of bad memory access by bbtest-net.
Unfortunately, it doesn't provide much clue about what caused this
problem.

Since it's glibc, I suppose it's on an Intel-based system ? In that
case, it would be helpful if you could pick up the "Valgrind" utility; it is included in some Linux distributions, but you can also grab it
from valgrind.org. Once you have it, you can run bbtest-net with the
command

   valgrind --log-file=vlg.log bbtest-net [any bbtest-net options]

If there's a memory access error, it should catch it right away.
I'd be interested to see the logfile it produces.


Regards,
Henrik

-- 
Rich Smrcina
VM Assist, Inc.
Phone: XXX-XXX-XXXX
Ans Service:  XXX-XXX-XXXX
user-61add9955ef9@xymon.invalid

Catch the WAVV!  http://www.wavv.org
WAVV 2007 - Green Bay, WI - May 18-22, 2007
list Henrik Størner · Sat, 28 Oct 2006 09:07:41 +0200 ·
quoted from Rich Smrcina
On Fri, Oct 27, 2006 at 09:33:24PM -0500, Rich Smrcina wrote:
:(  Any other ideas?  Valgrind is not supported on the s390/s390x 
architecture (configure fails with a message to that effect).
Hmm ... perhaps you can run it inside gdb. su to the Hobbit user,
then run 

   $ gdb ~/server/bin/bbtest-net
   gdb> set follow-fork-mode parent
   gdb> run --debug [other bbtest-net options]

When it crashes, use the "bt" command to get a stacktrace.
Unfortunately that might not tell me much because it could
very well be that the problem happens some time before it is
detected.

Also, could you send me your bb-hosts, bb-services and 
hobbitserver.cfg files ?


Regards,
Henrik
list Rich Smrcina · Sat, 28 Oct 2006 07:21:34 -0500 ·
OK, the information has been collected and sent privately.

Thanks.
quoted from Henrik Størner

Henrik Stoerner wrote:
On Fri, Oct 27, 2006 at 09:33:24PM -0500, Rich Smrcina wrote:
:(  Any other ideas?  Valgrind is not supported on the s390/s390x architecture (configure fails with a message to that effect).
Hmm ... perhaps you can run it inside gdb. su to the Hobbit user,
then run 
   $ gdb ~/server/bin/bbtest-net
   gdb> set follow-fork-mode parent
   gdb> run --debug [other bbtest-net options]

When it crashes, use the "bt" command to get a stacktrace.
Unfortunately that might not tell me much because it could
very well be that the problem happens some time before it is
detected.

Also, could you send me your bb-hosts, bb-services and hobbitserver.cfg files ?


Regards,
Henrik

-- 
Rich Smrcina
VM Assist, Inc.
Phone: XXX-XXX-XXXX
Ans Service:  XXX-XXX-XXXX
user-61add9955ef9@xymon.invalid

Catch the WAVV!  http://www.wavv.org
WAVV 2007 - Green Bay, WI - May 18-22, 2007