Patch for 4.2.0
list Francesco Duranti
This could be useful for other too so I'll post it on the list .. I've patched the 4.2.0 with latest patches and I've applied some little change I was doing in the last couple of days. Those are small change that will not influence the way hobbit is working for a normal setup. They're just a bunch of line for each modification. This is a small description on what is changed on each single file: - bbdisplay/bbgen.c : Don't disable rss and wml in pagesets. If the user don't want them for pagesets they can leave out the options to generate them while running bbgen - hobbitd/rrd/do_disk.c (bugfix): While using /group option of the bb command if the status is green the first line will not be skipped at all and will generate one more rrd files. - lib/htmllog.c : If the message has already a <!-- linecount=xx --> tag inside of the message don't count the line but get that value. This will also make the client that will use it work without having to modify the multigraphs settings - web/hobbitgraph.c : Modify FNPATTERN to have a default "^" in front of the file spec. If it's needed to get some file with pattern in the middle of the file you can put a +* in front of the pattern but at least for all the default graph it will not create any problem and it will allow for using rrd filename containing for example the word "disk" in the middle without distrupting the disk graph. The patch is against 4.2.0 with the allinone.patch of 3/10/2006. For Henrik, I've sent you some mails (also this one) in the last few days but I think I get some problems to send email directly to you... Francesco
Attachments (1)
list Henrik Størner
▸
On Thu, Oct 05, 2006 at 11:58:31AM +0200, Francesco Duranti wrote:
For Henrik, I've sent you some mails (also this one) in the last few days but I think I get some problems to send email directly to you...
I think your mails are getting through, I just haven't had time to respond to them. Henrik
list Jason Altrincham Jones
Hi all, I'm getting an unusual situation, I have just applied the allinone patch and bbgen is reporting: - Program crashed Fatal signal caught! But all webpages are still working, even the alternate pageset, the status of the alert is odd too it appears green but when you click on it the reports says it's been red for 35 minutes, do I need to be concerned about this? Thanks, Jason.
list Jason Altrincham Jones
Hi all, The answer to my question seems to be yes, propagation no longer works on any of the tests...help? Thanks, Jason.
▸
-----Original Message-----
From: Jones, Jason (Altrincham)
Sent: 05 October 2006 11:51
To: user-ae9b8668bcde@xymon.invalid
Subject: [hobbit] bbgen program crashed?
Hi all,
I'm getting an unusual situation, I have just applied the allinone patch
and bbgen is reporting:
- Program crashed
Fatal signal caught!
But all webpages are still working, even the alternate pageset, the
status of the alert is odd too it appears green but when you click on it
the reports says it's been red for 35 minutes, do I need to be concerned
about this?
Thanks,
Jason.
list Jason Altrincham Jones
Hi all, Got it all working now by cheating :) I coped the bbgen file from our local hobbit server and put it on the main one on our other site.
▸
Thanks,
Jason.
-----Original Message-----
From: Jones, Jason (Altrincham)
Sent: 05 October 2006 11:56
To: user-ae9b8668bcde@xymon.invalid
Subject: RE: [hobbit] bbgen program crashed?
Hi all,
The answer to my question seems to be yes, propagation no longer works
on any of the tests...help?
Thanks,
Jason.
-----Original Message-----
From: Jones, Jason (Altrincham)
Sent: 05 October 2006 11:51
To: user-ae9b8668bcde@xymon.invalid
Subject: [hobbit] bbgen program crashed?
Hi all,
I'm getting an unusual situation, I have just applied the allinone patch
and bbgen is reporting:
- Program crashed
Fatal signal caught!
But all webpages are still working, even the alternate pageset, the
status of the alert is odd too it appears green but when you click on it
the reports says it's been red for 35 minutes, do I need to be concerned
about this?
Thanks,
Jason.
list Jason Altrincham Jones
Or..not that one has failed too after reporting green for like 30 seconds, I just don't understand what has caused this, this is my only clue so far: ./bbgen 2006-10-05 12:07:54 hobbitd status-board not available Aborted (core dumped)
▸
Jason.
-----Original Message-----
From: Jones, Jason (Altrincham)
Sent: 05 October 2006 12:09
To: 'user-ae9b8668bcde@xymon.invalid'
Subject: RE: [hobbit] bbgen program crashed?
Hi all,
Got it all working now by cheating :) I coped the bbgen file from our
local hobbit server and put it on the main one on our other site.
Thanks,
Jason.
-----Original Message-----
From: Jones, Jason (Altrincham)
Sent: 05 October 2006 11:56
To: user-ae9b8668bcde@xymon.invalid
Subject: RE: [hobbit] bbgen program crashed?
Hi all,
The answer to my question seems to be yes, propagation no longer works
on any of the tests...help?
Thanks,
Jason.
-----Original Message-----
From: Jones, Jason (Altrincham)
Sent: 05 October 2006 11:51
To: user-ae9b8668bcde@xymon.invalid
Subject: [hobbit] bbgen program crashed?
Hi all,
I'm getting an unusual situation, I have just applied the allinone patch
and bbgen is reporting:
- Program crashed
Fatal signal caught!
But all webpages are still working, even the alternate pageset, the
status of the alert is odd too it appears green but when you click on it
the reports says it's been red for 35 minutes, do I need to be concerned
about this?
Thanks,
Jason.
list Henrik Størner
▸
On Thu, Oct 05, 2006 at 12:12:00PM +0100, Jones, Jason (Altrincham) wrote:
Or..not that one has failed too after reporting green for like 30 seconds, I just don't understand what has caused this, this is my only clue so far: ./bbgen 2006-10-05 12:07:54 hobbitd status-board not available Aborted (core dumped)
Which version of Hobbit are you running ? If 4.2.0, did you apply
any of the patches ?
There should be a core-file in the ~hobbit/server/tmp/ directory.
Please run this through gdb like this:
$ gdb ~hobbit/server/bin/bbgen ~hobbit/server/tmp/core
gdb> bt
and send me the output from that "bt" command.
A copy of your hobbitserver.cfg and bb-hosts file would also be
helpful, and the ~hobbit/server/tmp/hobbitd.chk file. Please send
them directly to me (user-ce4a2c883f75@xymon.invalid) - no reason to send them to the
entire mailing list.
Regards,
Henrik
list Henrik Størner
▸
On Thu, Oct 05, 2006 at 02:15:29PM +0200, Henrik Stoerner wrote:
A copy of your hobbitserver.cfg and bb-hosts file would also be helpful, and the ~hobbit/server/tmp/hobbitd.chk file. Please send them directly to me (user-ce4a2c883f75@xymon.invalid) - no reason to send them to the entire mailing list.
Also, check the hobbitlaunch.log for any signs that the "hobbitd"
program has crashed. What happens when you run the command
~hobbit/server/bin/bb 127.0.0.1 hobbitdboard
on the server ?
Henrik
list Jason Altrincham Jones
Hi Henrik, gdb /usr/local/hobbit/server/bin/bbgen /usr/local/hobbit/server/tmp/core.19948 GNU gdb Red Hat Linux (5.3.90-0.20030710.40rh) Copyright 2003 Free Software Foundation, Inc. GDB is free software, covered by the GNU General Public License, and you are welcome to change it and/or distribute copies of it under certain conditions. Type "show copying" to see the conditions. There is absolutely no warranty for GDB. Type "show warranty" for details. This GDB was configured as "i386-redhat-linux-gnu"...Using host libthread_db library "/lib/tls/libthread_db.so.1". warning: exec file is newer than core file. Core was generated by `/usr/local/hobbit/server/bin/bbgen --pageset=itnow --subpagecolumns=6 --nopropy'. Program terminated with signal 6, Aborted. Reading symbols from /lib/libpcre.so.0...done. Loaded symbols for /lib/libpcre.so.0 Reading symbols from /lib/tls/libc.so.6...done. Loaded symbols for /lib/tls/libc.so.6 Reading symbols from /lib/ld-linux.so.2...done. Loaded symbols for /lib/ld-linux.so.2 #0 0xb75ebc32 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2 (gdb) bt #0 0xb75ebc32 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2 #1 0xb74bea09 in raise () from /lib/tls/libc.so.6 #2 0xb74c0235 in abort () from /lib/tls/libc.so.6 #3 0x080626be in sigsegv_handler (signum=11) at sig.c:52 #4 <signal handler called> #5 calc_pagecolors (phead=0x8084e00) at process.c:106 #6 0x080512d0 in calc_pagecolors (phead=0x8084db0) at process.c:155 #7 0x08049e59 in main (argc=6, argv=0xbfff9aa4) at bbgen.c:594 (gdb) If that means anything to you, I am running 4.2.0 and applied the allinone patch when it happened, seems to work now I just reinstall using make clean and make check, still strange how it happened. As for the: /usr/local/hobbit/server/bin/bb 127.0.0.1 hobbitdboard That output is huge still want me to send it so maybe you can figure out how I messed up? Jason.
▸
-----Original Message-----
From: Henrik Stoerner [mailto:user-ce4a2c883f75@xymon.invalid]
Sent: 05 October 2006 13:17
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: FW: [hobbit] bbgen program crashed?
On Thu, Oct 05, 2006 at 02:15:29PM +0200, Henrik Stoerner wrote:A copy of your hobbitserver.cfg and bb-hosts file would also be helpful, and the ~hobbit/server/tmp/hobbitd.chk file. Please send them directly to me (user-ce4a2c883f75@xymon.invalid) - no reason to send them to the entire mailing list.
Also, check the hobbitlaunch.log for any signs that the "hobbitd"
program has crashed. What happens when you run the command
~hobbit/server/bin/bb 127.0.0.1 hobbitdboard
on the server ?
Henrik
list Michael A. Price
I had the same problem.
I think it was because I didnt stop the hobbit client/server when doing
the make install.
So I went back and killed all the hobbit processes then did a make install.
Then restarted everything and it worked.
good luck, michael
Michael A. Price
Performance Network Engineering
NASA/GSFC Code 440.8/LMB
Greenbelt, Maryland 20770
Phone: XXX-XXX-XXXX
Cell: XXX-XXX-XXXX
e-mail: user-2a9e3c790857@xymon.invalid
▸
Jones, Jason (Altrincham) wrote:Hi all, I'm getting an unusual situation, I have just applied the allinone patch and bbgen is reporting: - Program crashed Fatal signal caught! But all webpages are still working, even the alternate pageset, the status of the alert is odd too it appears green but when you click on it the reports says it's been red for 35 minutes, do I need to be concerned about this? Thanks, Jason.
list Dean Casey
I had to do the same - stop all Hobbit processes on the server while re-compiling and then re-start Hobbit afterward. That seemed to prevent the mysterious crashes Jason mentions. Fortunately, in our setup we have two failover production servers and a test box - so I was able to compile on the test server and run the patches there for a few days; then compile on the standby prod server; then failover to the standby system (invisibly to users); compile on primary; then fail back to primary. (Sounds more complicated than it actually was.) My guess would be that in a running server, some images or libraries in memory that won't match what's on disk if you do a recompile without first shutting down any running Hobbit processes. Dean Casey
▸
-----Original Message-----
From: Michael A. Price [mailto:user-2a9e3c790857@xymon.invalid]
Sent: Thursday, October 05, 2006 10:15 AM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] bbgen program crashed?
I had the same problem.
I think it was because I didnt stop the hobbit client/server when doing
the make install.
So I went back and killed all the hobbit processes then did a make
install.
Then restarted everything and it worked.
good luck, michael
Michael A. Price
Performance Network Engineering
NASA/GSFC Code 440.8/LMB
Greenbelt, Maryland 20770
Phone: XXX-XXX-XXXX
Cell: XXX-XXX-XXXX
e-mail: user-2a9e3c790857@xymon.invalid
Jones, Jason (Altrincham) wrote:Hi all, I'm getting an unusual situation, I have just applied the allinone patch and bbgen is reporting: - Program crashed Fatal signal caught! But all webpages are still working, even the alternate pageset, the status of the alert is odd too it appears green but when you click on it the reports says it's been red for 35 minutes, do I need to be concerned about this? Thanks, Jason.
list Henrik Størner
Hi Jason, it's fine that it is working now, but I would like to find out what happened. If you could pack the bbgen executable and the core file into a tar.gz archive and either mail it to me or ftp it to the /incoming directory on www.hswn.dk, I would appreciate it.
▸
On Thu, Oct 05, 2006 at 01:40:09PM +0100, Jones, Jason (Altrincham) wrote:#3 0x080626be in sigsegv_handler (signum=11) at sig.c:52 #4 <signal handler called> #5 calc_pagecolors (phead=0x8084e00) at process.c:106 #6 0x080512d0 in calc_pagecolors (phead=0x8084db0) at process.c:155 #7 0x08049e59 in main (argc=6, argv=0xbfff9aa4) at bbgen.c:594
I can see where it crashes, but not quite why. That's the reason I'd like to have a closer look at whatever caused this on your system. Regards, Henrik
list Henrik Størner
▸
On Thu, Oct 05, 2006 at 10:26:52AM -0400, Dean Casey wrote:
My guess would be that in a running server, some images or libraries in memory that won't match what's on disk if you do a recompile without first shutting down any running Hobbit processes.
It generally a good thing to shut down while running the "make install" that copies the executables into place - as you say, some systems are not quite fond of having the running executables disappear. But I cannot really see why it would affect the "bbgen" program, which only runs for a few seconds (at the most) every minute. Regards, Henrik
list Dean Casey
▸
On Thu, Oct 05, 2006 at 10:26:52AM -0400, Dean Casey wrote:
My guess would be that in a running server, some images or libraries in memory that won't match what's on disk if you do a recompile without first shutting down any running Hobbit processes.
It generally a good thing to shut down while running the "make install" that copies the executables into place - as you say, some systems are not quite fond of having the running executables disappear.
But I cannot really see why it would affect the "bbgen" program, which only runs for a few seconds (at the most) every minute.
Nor can I - unless maybe the system Jason is using (I forget which OS he said he was running) hasn't gotten around to flushing that image out of the particular pages in physical memory? Or, I could just be talking out of my hat & this specific point could be due to another issue entirely. =D
Regards, Henrik
Dean
list Jason Altrincham Jones
:P I didn't RH4 Enterprise for the record, and Henrik I'll E-mail you the tar.gz later today would you like the /usr/local/hobbit/server/bin/bb 127.0.0.1 hobbitdboard output too? Jason.
▸
-----Original Message-----
From: Dean Casey [mailto:user-99832ab50ffd@xymon.invalid]
Sent: 05 October 2006 21:44
To: user-ae9b8668bcde@xymon.invalid
Subject: RE: [hobbit] bbgen program crashed?
On Thu, Oct 05, 2006 at 10:26:52AM -0400, Dean Casey wrote:
My guess would be that in a running server, some images or libraries in memory that won't match what's on disk if you do a recompile without first shutting down any running Hobbit processes.
It generally a good thing to shut down while running the "make install" that copies the executables into place - as you say, some systems are not quite fond of having the running executables disappear.
But I cannot really see why it would affect the "bbgen" program, which only runs for a few seconds (at the most) every minute.
Nor can I - unless maybe the system Jason is using (I forget which OS he said he was running) hasn't gotten around to flushing that image out of the particular pages in physical memory? Or, I could just be talking out of my hat & this specific point could be due to another issue entirely. =D
Regards, Henrik
Dean
list Henrik Størner
▸
On Fri, Oct 06, 2006 at 09:37:57AM +0100, Jones, Jason (Altrincham) wrote:
:P I didn't RH4 Enterprise for the record, and Henrik I'll E-mail you the tar.gz later today would you like the /usr/local/hobbit/server/bin/bb 127.0.0.1 hobbitdboard output too?
No, that was really just needed to make sure the problem was with bbgen and not with hobbitd. The executable and the core-file will do fine. Regards, Henrik