Hello,
is the number of tests per host limited? I've cleaned my bb-hosts and as I add one host with many tests. I disabled all customized tests and tried to find out if its a single test. But all test run by itself didn't crash the hobbit server - running all together did!
The host runs about 20 tests.
Regards,
Stefan
<br><br><br>>From: "Stefan Loos" <user-dea24d965402@xymon.invalid><br>>Reply-To: user-ae9b8668bcde@xymon.invalid<br>>To: user-ae9b8668bcde@xymon.invalid<br>>Subject: Re: [hobbit] bbdisplay problems after adding some new hosts<br>>Date: Wed, 11 May 2005 09:49:30 +0000<br>><br>>Hi Henrik,<br>><br>>now the errors in the hobbitlaunch.log are gone but in <br>>bb-display.log are still there. And another strange thing - since I <br>>reenabled the bbdisplay this morning I didn't see any host at the <br>>hobbit server! Just the subpages and groups are there.<br>><br>>Regards,<br>><br>>Stefan Loos<br>><br>><br><br><br>&gt;From: user-ce4a2c883f75@xymon.invalid (Henrik <br>>Stoerner)<br>&gt;Reply-To: user-ae9b8668bcde@xymon.invalid<br>&gt;To: <br>>user-ae9b8668bcde@xymon.invalid<br>&gt;Subject: Re: [hobbit] bbdisplay problems after <br>>adding some new hosts<br>&gt;Date: Wed, 11 May 2005 11:06:13 <br>>+0200<br>&gt;<br>&gt;Could you try removing the <br>>&quot;HEARTBEAT&quot; line from hobbitlaunch.cfg and<br>&gt;see if <br>>things run OK after that <br>>?<br>&gt;<br>&gt;<br>&gt;Regards,<br>&gt;Henrik<br>&gt;<br>&gt;On <br>>Wed, May 11, 2005 at 07:41:37AM +0000, Stefan Loos wrote:<br>&gt; <br>>&gt; Hi,<br>&gt; &gt;<br>&gt; &gt; yesterday I add some new hosts to <br>>my hobbit-server and short after that<br>&gt; &gt; hobbit had some <br>>problems.<br>&gt; &gt; Here is what hobbitlauch.log says:<br>&gt; <br>>&gt;<br>&gt; &gt; 2005-05-11 09:11:18 Heartbeat lost for task <br>>hobbitd, bouncing it<br>&gt; &gt; 2005-05-11 09:11:18 Task bbretest <br>>started with PID 4523<br>&gt; &gt; 2005-05-11 09:11:23 Heartbeat <br>>lost for task hobbitd, killing it<br>&gt; &gt; 2005-05-11 09:11:23 <br>>Task bbdisplay started with PID 4524<br>&gt; &gt; 2005-05-11 <br>>09:11:23 Task hobbitd terminated by signal 9<br>&gt; &gt; 2005-05-11 <br>>09:11:23 Task hobbitd started with PID 4525<br>&gt; &gt; 2005-05-11 <br>>09:11:23 Loading hostnames<br>&gt; &gt; 2005-05-11 09:11:23 Loading <br>>saved state<br>&gt; &gt; 2005-05-11 09:11:23 Setting up network <br>>listener on 0.0.0.0:1984<br>&gt; &gt; 2005-05-11 09:11:23 Setting up <br>>signal handlers<br>&gt; &gt; 2005-05-11 09:11:23 Setting up hobbitd <br>>channels<br>&gt; &gt; 2005-05-11 09:11:23 Setting up <br>>logfiles<br>&gt; &gt; 2005-05-11 09:11:28 Task bbhistory started <br>>with PID 4527<br>&gt; &gt; 2005-05-11 09:11:28 Task bbenadis started <br>>with PID 4528<br>&gt; &gt; 2005-05-11 09:11:28 Task bbpage started <br>>with PID 4530<br>&gt; &gt; 2005-05-11 09:11:28 Task larrdstatus <br>>started with PID 4532<br>&gt; &gt; 2005-05-11 09:11:28 Task <br>>larrddata started with PID 4534<br>&gt; &gt; 2005-05-11 09:12:18 <br>>Task bbretest started with PID 4541<br>&gt; &gt; 2005-05-11 09:12:23 <br>>Task bbdisplay started with PID 4542<br>&gt; &gt; 2005-05-11 <br>>09:12:43 Heartbeat lost for task hobbitd, bouncing it<br>&gt; &gt; <br>>2005-05-11 09:12:48 Heartbeat lost for task hobbitd, killing <br>>it<br>&gt; &gt; 2005-05-11 09:12:48 Task hobbitd terminated by <br>>signal 9<br>&gt; &gt; 2005-05-11 09:12:48 Task bbdisplay terminated <br>>by signal 15<br>&gt; &gt;<br>&gt; &gt; So I tried to find out which <br>>component causes the problem and disabled<br>&gt; &gt; everything in <br>>hobbitlauch.cfg and reenabled one by one.<br>&gt; &gt; I found out <br>>that everytime I enabled bbdisplay those errors occour.<br>&gt; &gt; <br>>The bb-display.log looks like this:<br>&gt; &gt;<br>&gt; &gt; <br>>2005-05-11 09:09:48 Whoops ! bb failed to send message - <br>>timeout<br>&gt; &gt; 2005-05-11 09:09:48 hobbitd status-board not <br>>available<br>&gt; &gt; 2005-05-11 09:09:53 Whoops ! bb failed to <br>>send message - timeout<br>&gt; &gt; 2005-05-11 09:10:53 Whoops ! bb <br>>failed to send message - timeout<br>&gt; &gt; 2005-05-11 09:10:53 <br>>hobbitd status-board not available<br>&gt; &gt; 2005-05-11 09:11:23 <br>>Could not connect to bbd at 10.207.193.41:1984 -<br>&gt; &gt; <br>>Connection refused<br>&gt; &gt; 2005-05-11 09:11:23 Whoops ! bb <br>>failed to send message - Connection failed<br>&gt; &gt; 2005-05-11 <br>>09:11:23 hobbitd status-board not available<br>&gt; &gt; 2005-05-11 <br>>09:11:23 Could not connect to bbd at 10.207.193.41:1984 -<br>&gt; &gt; <br>>Connection refused<br>&gt; &gt; 2005-05-11 09:11:23 Whoops ! bb <br>>failed to send message - Connection failed<br>&gt; &gt;<br>&gt; &gt; <br>>I also found some core files in ~server/tmp but I'm pretty shure <br>>they came<br>&gt; &gt; from killing hobbit - nevertheless I've run <br>>the gdb util:<br>&gt; &gt;<br>&gt; &gt; GNU gdb Red Hat Linux <br>>(6.1post-1.20040607.52rh)<br>&gt; &gt; Copyright 2004 Free Software <br>>Foundation, Inc.<br>&gt; &gt; GDB is free software, covered by the <br>>GNU General Public License, and you are<br>&gt; &gt; welcome to <br>>change it and/or distribute copies of it under certain<br>&gt; &gt; <br>>conditions.<br>&gt; &gt; Type &quot;show copying&quot; to see the <br>>conditions.<br>&gt; &gt; There is absolutely no warranty for GDB. <br>>Type &quot;show warranty&quot; for details.<br>&gt; &gt; This GDB <br>>was configured as &quot;i386-redhat-linux-gnu&quot;...Using <br>>host<br>&gt; &gt; libthread_db library <br>>&quot;/lib/tls/libthread_db.so.1&quot;.<br>&gt; &gt;<br>&gt; &gt; <br>>Core was generated by `hobbitd --debug<br>&gt; &gt; <br>>--pidfile=/var/log/hobbit/hobbitd.pid <br>>--restart=/usr/local/hobb'.<br>&gt; &gt; Program terminated with <br>>signal 6, Aborted.<br>&gt; &gt; Reading symbols from <br>>/lib/tls/libc.so.6...done.<br>&gt; &gt; Loaded symbols for <br>>/lib/tls/libc.so.6<br>&gt; &gt; Reading symbols from <br>>/lib/ld-linux.so.2...done.<br>&gt; &gt; Loaded symbols for <br>>/lib/ld-linux.so.2<br>&gt; &gt; #0 0x00df4cef in raise () from <br>>/lib/tls/libc.so.6<br>&gt; &gt; (gdb) bt<br>&gt; &gt; #0 0x00df4cef <br>>in raise () from /lib/tls/libc.so.6<br>&gt; &gt; #1 0x00df64f5 in <br>>abort () from /lib/tls/libc.so.6<br>&gt; &gt; #2 0x08054126 in <br>>sigsegv_handler (signum=11) at sig.c:57<br>&gt; &gt; #3 &lt;signal <br>>handler called&gt;<br>&gt; &gt; #4 0x00e46cac in mempcpy () from <br>>/lib/tls/libc.so.6<br>&gt; &gt; #5 0x00e3a4d2 in <br>>_IO_default_xsputn_internal () from /lib/tls/libc.so.6<br>&gt; &gt; <br>>#6 0x00e13527 in vfprintf () from /lib/tls/libc.so.6<br>&gt; &gt; <br>>#7 0x00e2f3dc in vsprintf () from /lib/tls/libc.so.6<br>&gt; &gt; <br>>#8 0x00e1a03d in sprintf () from /lib/tls/libc.so.6<br>&gt; &gt; #9 <br>> 0x0804d7a4 in do_message (msg=0x9e0b3f8, origin=0x80554bb <br>>&quot;&quot;) at<br>&gt; &gt; hobbitd.c:1903<br>&gt; &gt; #10 <br>>0x0804fcb5 in main (argc=8, argv=0xbfff9084) at <br>>hobbitd.c:2944<br>&gt; &gt; (gdb)<br>&gt; &gt;<br>&gt; &gt; Now I <br>>try to find out which of the new hosts - and what test causes <br>>the<br>&gt; &gt; problems...<br>&gt; &gt;<br>&gt; &gt; <br>>Regards,<br>&gt; &gt;<br>&gt; &gt; Stefan Loos<br>&gt; &gt;<br>&gt; <br>>&gt;<br>&gt; &gt;<br>&gt; &gt; To unsubscribe from the hobbit list, <br>>send an e-mail to<br>&gt; &gt; user-095ef1c764a2@xymon.invalid<br>&gt; <br>>&gt;<br>&gt; &gt;<br>&gt;<br>&gt;--<br>&gt;Henrik <br>>Storner<br>&gt;<br>&gt;To unsubscribe from the hobbit list, send an <br>>e-mail to<br>&gt;user-095ef1c764a2@xymon.invalid<br>&gt;<br>&gt;<br><br>><br>><br>><br>>To unsubscribe from the hobbit list, send an e-mail to<br>>user-095ef1c764a2@xymon.invalid<br>><br>><br>