Xymon Mailing List Archive

Patch for 4.2.0

16 messages in this thread

list Francesco Duranti · Thu, 5 Oct 2006 11:58:31 +0200 ·
This could be useful for others too, so I'll post it on the list.

I've patched 4.2.0 with the latest patches and applied a few small
changes I have been working on over the last couple of days.
These are small changes that will not affect the way Hobbit works
in a normal setup.
Each modification is just a few lines. Here is a short
description of what changed in each file:

- bbdisplay/bbgen.c : Don't disable rss and wml in pagesets. If users
don't want them for pagesets, they can leave out the options that
generate them when running bbgen.
- hobbitd/rrd/do_disk.c (bugfix): When using the /group option of the bb
command, if the status is green the first line is not skipped,
which generates one extra rrd file.
- lib/htmllog.c : If the message already contains a <!-- linecount=xx -->
tag, don't count the lines but use that value. This also lets a client
that uses the tag work without having to modify the multigraphs settings.
- web/hobbitgraph.c : Modify FNPATTERN to have a default "^" in front
of the file spec. If you need to match files with the pattern in the
middle of the file name you can put a +* in front of the pattern, but
for all the default graphs it will not create any problem, and it
allows rrd filenames containing, for example, the word "disk" in
the middle without disrupting the disk graph.
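
The effect of the default "^" described in the hobbitgraph.c item can be illustrated with a small sketch. It is written in Python rather than the C of the actual patch, and the pattern and the file names are made-up examples, not values taken from Hobbit's graph configuration. The ".*" prefix used at the end is the ordinary regular-expression way of allowing a match in the middle of a name; it is an assumption here and may not be the exact syntax the patch expects.

```python
import re


def anchor_pattern(spec: str) -> str:
    """Prefix the file pattern with '^' unless it is already anchored."""
    return spec if spec.startswith("^") else "^" + spec


def matching_files(spec: str, filenames: list[str]) -> list[str]:
    """Return the file names matched by the anchored pattern."""
    rx = re.compile(anchor_pattern(spec))
    return [name for name in filenames if rx.search(name)]


# Hypothetical rrd file names; the last one only contains "disk" in the middle.
files = ["disk,root.rrd", "disk,var.rrd", "raid_disk_usage.rrd"]

# With the default anchor, only names that start with "disk" match.
print(matching_files(r"disk(.*)\.rrd", files))
# ['disk,root.rrd', 'disk,var.rrd']

# An explicit leading ".*" brings back the match-anywhere behaviour.
print(matching_files(r".*disk(.*)\.rrd", files))
# ['disk,root.rrd', 'disk,var.rrd', 'raid_disk_usage.rrd']
```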

The patch is against 4.2.0 with the allinone.patch of 3/10/2006.
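
The htmllog.c change listed above (use an existing linecount tag instead of counting lines) could look roughly like the following. This is a Python sketch of the idea only, not the patched C code; the function name and the fallback of counting lines with `splitlines()` are assumptions made for illustration.

```python
import re

# Matches the tag described in the patch notes, e.g. <!-- linecount=12 -->
LINECOUNT_TAG = re.compile(r"<!--\s*linecount=(\d+)\s*-->")


def effective_linecount(message: str) -> int:
    """Use the embedded linecount value if present, else count the lines."""
    match = LINECOUNT_TAG.search(message)
    if match:
        return int(match.group(1))
    # Assumed fallback: one count per line of the status message.
    return len(message.splitlines())


print(effective_linecount("/ 40%\n/var 55%\n/home 70%"))        # 3
print(effective_linecount("<!-- linecount=12 -->\n/ 40%\n/var 55%"))  # 12
```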


For Henrik, I've sent you some mails (also this one) in the last few
days but I think I'm having problems sending email directly to you...

Francesco
Attachments (1)
list Henrik Størner · Thu, 5 Oct 2006 12:42:59 +0200 ·
quoted from Francesco Duranti
On Thu, Oct 05, 2006 at 11:58:31AM +0200, Francesco Duranti wrote:
> For Henrik, I've sent you some mails (also this one) in the last few
> days but I think I'm having problems sending email directly to you...

I think your mails are getting through, I just haven't had time to
respond to them.


Henrik
list Jason Altrincham Jones · Thu, 5 Oct 2006 11:50:44 +0100 ·
Hi all,

I'm seeing an unusual situation: I have just applied the allinone patch
and bbgen is reporting:

- Program crashed

Fatal signal caught!

But all web pages are still working, even the alternate pageset. The
status of the alert is odd too: it appears green, but when you click on
it the report says it's been red for 35 minutes. Do I need to be
concerned about this?

Thanks,
Jason.
list Jason Altrincham Jones · Thu, 5 Oct 2006 11:56:17 +0100 ·
Hi all,

The answer to my question seems to be yes, propagation no longer works
on any of the tests...help?

Thanks,
Jason.
list Jason Altrincham Jones · Thu, 5 Oct 2006 12:09:22 +0100 ·
Hi all,

Got it all working now by cheating :) I copied the bbgen file from our
local hobbit server and put it on the main one on our other site.

Thanks,
Jason.
list Jason Altrincham Jones · Thu, 5 Oct 2006 12:12:00 +0100 ·
Or not... that one has failed too, after reporting green for like 30
seconds. I just don't understand what has caused this; this is my only
clue so far:

./bbgen
2006-10-05 12:07:54 hobbitd status-board not available
Aborted (core dumped)

Jason.

list Henrik Størner · Thu, 5 Oct 2006 14:15:29 +0200 ·
quoted from Jason Altrincham Jones
On Thu, Oct 05, 2006 at 12:12:00PM +0100, Jones, Jason (Altrincham) wrote:
> Or not... that one has failed too, after reporting green for like 30
> seconds. I just don't understand what has caused this; this is my only
> clue so far:
>
> ./bbgen
> 2006-10-05 12:07:54 hobbitd status-board not available
> Aborted (core dumped)

Which version of Hobbit are you running? If 4.2.0, did you apply
any of the patches?

There should be a core-file in the ~hobbit/server/tmp/ directory.
Please run this through gdb like this:

    $ gdb ~hobbit/server/bin/bbgen ~hobbit/server/tmp/core
    gdb> bt

and send me the output from that "bt" command.
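
For anyone who wants to capture the same backtrace without an interactive session, the two gdb steps above can be run in one go. The sketch below assumes a gdb that supports the `-batch` and `-ex` options (older releases may not have `-ex`); the helper names are made up for illustration, and the paths in the usage comment are the ones used elsewhere in this thread.

```python
import os
import subprocess


def gdb_backtrace_command(executable: str, core: str) -> list[str]:
    """Build a non-interactive gdb command line that prints a backtrace."""
    return [
        "gdb", "-batch",      # exit after running the commands below
        "-ex", "bt",          # same as typing "bt" at the gdb prompt
        os.path.expanduser(executable),
        os.path.expanduser(core),
    ]


def collect_backtrace(executable: str, core: str) -> str:
    """Run gdb and return whatever it printed (requires gdb to be installed)."""
    result = subprocess.run(
        gdb_backtrace_command(executable, core),
        capture_output=True, text=True, check=False,
    )
    return result.stdout


# Example (not executed here):
# print(collect_backtrace("/usr/local/hobbit/server/bin/bbgen",
#                         "/usr/local/hobbit/server/tmp/core"))
```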

A copy of your hobbitserver.cfg and bb-hosts file would also be
helpful, and the ~hobbit/server/tmp/hobbitd.chk file. Please send
them directly to me (user-ce4a2c883f75@xymon.invalid) - no reason to send them to the
entire mailing list.


Regards,
Henrik
list Henrik Størner · Thu, 5 Oct 2006 14:16:55 +0200 ·
quoted from Henrik Størner
On Thu, Oct 05, 2006 at 02:15:29PM +0200, Henrik Stoerner wrote:
> A copy of your hobbitserver.cfg and bb-hosts file would also be
> helpful, and the ~hobbit/server/tmp/hobbitd.chk file. Please send
> them directly to me (user-ce4a2c883f75@xymon.invalid) - no reason to send them to the
> entire mailing list.

Also, check the hobbitlaunch.log for any signs that the "hobbitd"
program has crashed. What happens when you run the command

    ~hobbit/server/bin/bb 127.0.0.1 hobbitdboard

on the server ?
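
The same check can be scripted. The sketch below is an assumption-laden stand-in for the bb client, not its actual implementation: it assumes hobbitd listens on TCP port 1984 and answers a "hobbitdboard" request once the sender has finished writing. The function name and defaults are made up for illustration. If hobbitd has crashed, the connection attempt should fail with an error instead of returning the status board.

```python
import socket


def query_hobbitd(message: str, host: str = "127.0.0.1",
                  port: int = 1984, timeout: float = 10.0) -> str:
    """Send one message to the daemon and return its complete reply."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(message.encode("utf-8"))
        # Signal end of request; the reply is read until the peer closes.
        sock.shutdown(socket.SHUT_WR)
        chunks = []
        while True:
            data = sock.recv(65536)
            if not data:
                break
            chunks.append(data)
    return b"".join(chunks).decode("utf-8", errors="replace")


# Example (needs a running hobbitd):
# board = query_hobbitd("hobbitdboard")
# print(len(board.splitlines()), "status lines received")
```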


Henrik
list Jason Altrincham Jones · Thu, 5 Oct 2006 13:40:09 +0100 ·
Hi Henrik,

gdb /usr/local/hobbit/server/bin/bbgen /usr/local/hobbit/server/tmp/core.19948

GNU gdb Red Hat Linux (5.3.90-0.20030710.40rh)
Copyright 2003 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "i386-redhat-linux-gnu"...Using host libthread_db library "/lib/tls/libthread_db.so.1".


warning: exec file is newer than core file.
Core was generated by `/usr/local/hobbit/server/bin/bbgen --pageset=itnow --subpagecolumns=6 --nopropy'.
Program terminated with signal 6, Aborted.
Reading symbols from /lib/libpcre.so.0...done.
Loaded symbols for /lib/libpcre.so.0
Reading symbols from /lib/tls/libc.so.6...done.
Loaded symbols for /lib/tls/libc.so.6
Reading symbols from /lib/ld-linux.so.2...done.
Loaded symbols for /lib/ld-linux.so.2
#0  0xb75ebc32 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2

(gdb) bt
#0  0xb75ebc32 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
#1  0xb74bea09 in raise () from /lib/tls/libc.so.6
#2  0xb74c0235 in abort () from /lib/tls/libc.so.6
#3  0x080626be in sigsegv_handler (signum=11) at sig.c:52
#4  <signal handler called>
#5  calc_pagecolors (phead=0x8084e00) at process.c:106
#6  0x080512d0 in calc_pagecolors (phead=0x8084db0) at process.c:155
#7  0x08049e59 in main (argc=6, argv=0xbfff9aa4) at bbgen.c:594
(gdb)

If that means anything to you: I am running 4.2.0 and had applied the
allinone patch when it happened. It seems to work now that I have
reinstalled using make clean and make check; still strange how it
happened. As for the:

/usr/local/hobbit/server/bin/bb 127.0.0.1 hobbitdboard

That output is huge. Do you still want me to send it, so maybe you can
figure out how I messed up?

Jason.
list Michael A. Price · Thu, 05 Oct 2006 10:15:28 -0400 ·
I had the same problem.

I think it was because I didn't stop the hobbit client/server when doing
the make install.

So I went back and killed all the hobbit processes then did a make install.
Then restarted everything and it worked.
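
The order of operations described above (stop, install, restart) can be written down as a small dry-run script. This is only a sketch: the `hobbit.sh` control script location and the install prefix are assumptions about a typical setup and should be adjusted to the local installation, and the function names are made up.

```python
import subprocess


def reinstall_steps(hobbit_home: str = "/usr/local/hobbit",
                    source_dir: str = ".") -> list[list[str]]:
    """Commands for a reinstall with all Hobbit processes stopped first."""
    control = f"{hobbit_home}/server/hobbit.sh"   # assumed control script path
    return [
        [control, "stop"],                     # stop the running processes
        ["make", "-C", source_dir, "install"],  # copy new executables into place
        [control, "start"],                    # start again with the new binaries
    ]


def run_steps(steps: list[list[str]], dry_run: bool = True) -> None:
    """Print each command; execute it only when dry_run is False."""
    for cmd in steps:
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


run_steps(reinstall_steps())
```

Running it as shown only prints the three commands; pass `dry_run=False` once the paths and the user accounts involved have been checked.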

good luck, michael

Michael A. Price
Performance Network Engineering
NASA/GSFC Code 440.8/LMB
Greenbelt, Maryland 20770
            Phone:  XXX-XXX-XXXX
            Cell:   XXX-XXX-XXXX
            e-mail: user-2a9e3c790857@xymon.invalid
list Dean Casey · Thu, 5 Oct 2006 10:26:52 -0400 ·
I had to do the same - stop all Hobbit processes on the server while
re-compiling and then re-start Hobbit afterward. That seemed to prevent
the mysterious crashes Jason mentions. 

Fortunately, in our setup we have two failover production servers and a
test box - so I was able to compile on the test server and run the
patches there for a few days; then compile on the standby prod server;
then failover to the standby system (invisibly to users); compile on
primary; then fail back to primary. (Sounds more complicated than it
actually was.) 

My guess would be that in a running server, there are some images or
libraries in memory that won't match what's on disk if you do a recompile
without first shutting down any running Hobbit processes.

Dean Casey
list Henrik Størner · Thu, 5 Oct 2006 22:34:38 +0200 ·
Hi Jason,

it's fine that it is working now, but I would like to find out what
happened. If you could pack the bbgen executable and the core file
into a tar.gz archive and either mail it to me or ftp it to the
/incoming directory on www.hswn.dk, I would appreciate it.
quoted from Jason Altrincham Jones

On Thu, Oct 05, 2006 at 01:40:09PM +0100, Jones, Jason (Altrincham) wrote:
> #3  0x080626be in sigsegv_handler (signum=11) at sig.c:52
> #4  <signal handler called>
> #5  calc_pagecolors (phead=0x8084e00) at process.c:106
> #6  0x080512d0 in calc_pagecolors (phead=0x8084db0) at process.c:155
> #7  0x08049e59 in main (argc=6, argv=0xbfff9aa4) at bbgen.c:594

I can see where it crashes, but not quite why. That's the reason I'd
like to have a closer look at whatever caused this on your system.
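
The crash location can be read off the quoted backtrace: frame #3 is the handler invoked for signal 11 (SIGSEGV), which then calls abort(), and the first frame after "<signal handler called>" is the code that was running when the fault occurred. A small Python sketch that extracts that frame from gdb's "bt" output follows; the regular expression is an assumption that covers the frame format shown in this thread, not every format gdb can print.

```python
import re
from typing import Optional, Tuple

# One backtrace frame: "#N  [0xADDR in ]function (args) at|from location"
FRAME = re.compile(
    r"^#(\d+)\s+(?:0x[0-9a-f]+ in )?(\S+) \(.*\) (?:at|from) (\S+)$"
)


def faulting_frame(backtrace: str) -> Optional[Tuple[str, str]]:
    """Return (function, location) of the first frame after the signal handler."""
    lines = [line.strip() for line in backtrace.strip().splitlines()]
    for index, line in enumerate(lines):
        if "<signal handler called>" in line:
            for later in lines[index + 1:]:
                match = FRAME.match(later)
                if match:
                    return match.group(2), match.group(3)
    return None


BT = """
#3  0x080626be in sigsegv_handler (signum=11) at sig.c:52
#4  <signal handler called>
#5  calc_pagecolors (phead=0x8084e00) at process.c:106
#6  0x080512d0 in calc_pagecolors (phead=0x8084db0) at process.c:155
#7  0x08049e59 in main (argc=6, argv=0xbfff9aa4) at bbgen.c:594
"""

print(faulting_frame(BT))  # ('calc_pagecolors', 'process.c:106')
```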


Regards,
Henrik
list Henrik Størner · Thu, 5 Oct 2006 22:37:27 +0200 ·
quoted from Dean Casey
On Thu, Oct 05, 2006 at 10:26:52AM -0400, Dean Casey wrote:
> My guess would be that in a running server, there are some images or
> libraries in memory that won't match what's on disk if you do a recompile
> without first shutting down any running Hobbit processes.

It is generally a good thing to shut down while running the "make install"
that copies the executables into place - as you say, some systems are
not quite fond of having the running executables disappear.

But I cannot really see why it would affect the "bbgen" program, which
only runs for a few seconds (at the most) every minute.


Regards,
Henrik
list Dean Casey · Thu, 5 Oct 2006 16:43:58 -0400 ·
quoted from Dean Casey
> On Thu, Oct 05, 2006 at 10:26:52AM -0400, Dean Casey wrote:
>> My guess would be that in a running server, there are some images or
>> libraries in memory that won't match what's on disk if you do a recompile
>> without first shutting down any running Hobbit processes.
> It is generally a good thing to shut down while running the "make install"
> that copies the executables into place - as you say, some systems are
> not quite fond of having the running executables disappear.
> But I cannot really see why it would affect the "bbgen" program, which
> only runs for a few seconds (at the most) every minute.

Nor can I - unless maybe the system Jason is using (I forget which OS he
said he was running) hasn't gotten around to flushing that image out of
the particular pages in physical memory? Or, I could just be talking out
of my hat & this specific point could be due to another issue entirely.
=D

> Regards,
> Henrik

Dean
list Jason Altrincham Jones · Fri, 6 Oct 2006 09:37:57 +0100 ·
:P I didn't. RH4 Enterprise, for the record. And Henrik, I'll e-mail you
the tar.gz later today. Would you like the
/usr/local/hobbit/server/bin/bb 127.0.0.1 hobbitdboard output too?

Jason.
list Henrik Størner · Fri, 6 Oct 2006 11:38:21 +0200 ·
quoted from Jason Altrincham Jones
On Fri, Oct 06, 2006 at 09:37:57AM +0100, Jones, Jason (Altrincham) wrote:
> :P I didn't. RH4 Enterprise, for the record. And Henrik, I'll e-mail you
> the tar.gz later today. Would you like the
> /usr/local/hobbit/server/bin/bb 127.0.0.1 hobbitdboard output too?

No, that was really just needed to make sure the problem was with bbgen
and not with hobbitd. The executable and the core-file will do fine.


Regards,
Henrik