Xymon Mailing List Archive search

Need help quick!

2 messages in this thread

list Winn Beutler · Tue, 28 Mar 2006 10:44:35 -0700 ·

This morning, all of a sudden, one of our HOBBIT servers lost the ability to ping other servers.

Everything on the network is up and working fine, but the file:
/var/log/hobbit/hobbitlaunch.log    says:

2006-03-28 10:19:50 Task bbnet started with PID 10586
2006-03-28 10:19:50 Task bbnet terminated by signal 6

and when I run bbtest-net as the user hobbit, I get the following:
$ ../bin/bbtest-net
2006-03-28 09:40:27 xgetenv: Cannot find value for variable MACHINE
Aborted (core dumped)

Help!  I have had to shut down paging because the network test is broken and thinks all the clients are dead!

Thanks for any help you can give - Winn

SPECIAL NOTICE

All information transmitted hereby is intended only for the use of the
addressee(s) named above and may contain confidential and privileged
information. Any unauthorized review, use, disclosure or distribution
of confidential and privileged information is prohibited. If the reader
of this message is not the intended recipient(s) or the employee or agent
responsible for delivering the message to the intended recipient, you are
hereby notified that you must not read this transmission and that disclosure,
copying, printing, distribution or use of any of the information contained
in or attached to this transmission is STRICTLY PROHIBITED.

Anyone who receives confidential and privileged information in error should
notify us immediately by telephone and mail the original message to us at
the above address and destroy all copies.  To the extent any portion of this
communication contains public information, no such restrictions apply to that
information. (gate01)
list Henrik Størner · Tue, 28 Mar 2006 22:17:00 +0200 ·
quoted from Winn Beutler
On Tue, Mar 28, 2006 at 10:44:35AM -0700, Winn Beutler wrote:
This morning, all of a sudden, one of our HOBBIT servers lost the ability to ping other servers.

Everything on the network is up and working fine, but the file:
/var/log/hobbit/hobbitlaunch.log    says:

2006-03-28 10:19:50 Task bbnet started with PID 10586
2006-03-28 10:19:50 Task bbnet terminated by signal 6
What version ?

What's in the bb-network.log file ?

This indicates a programming error in the bbtest-net tool, which
causes it to crash. There should be a core file in ~hobbit/server/tmp/
directory - please run it through gdb as described in
http://www.hswn.dk/hobbit/help/known-issues.html#bugreport
quoted from Winn Beutler
and when I run bbtest-net as the user hobbit, I get the following:
$ ../bin/bbtest-net
2006-03-28 09:40:27 xgetenv: Cannot find value for variable MACHINE
Aborted (core dumped)
And if you run it as "bbcmd bbtest-net --debug" ? bbcmd defines a bunch 
of environment settings that bbtest-net needs (hobbitlaunch also defines
them, when it launches tasks to run automatically).


Regards,
Henrik