Xymon Mailing List Archive search

trends columns all turned purple

19 messages in this thread

list Richard Deal · Thu, 31 Mar 2005 17:45:39 -0500 ·
Things looked for a little bit then all the trends columns turned
purple.
Any one else see this or better yet solve it?
Thanks
list David Gore · Thu, 31 Mar 2005 22:53:24 +0000 ·
quoted from Richard Deal
Deal, Richard wrote:
Things looked for a little bit then all the trends columns turned
purple.
Any one else see this or better yet solve it?
Thanks

I am seeing the same thing.  I do not know how to fix it.  Has only occurred since installing 4.0.1.  Running on solaris 9.

~David
list Richard Deal · Thu, 31 Mar 2005 18:04:14 -0500 ·
I am running 4.0 on solaris 9.
I did a hobbit.sh stop Then a rm data/hist/*.trends And restarted hobbit

It has cleared up the problem for now but it has only been 10 mins or
so.

Has anyone else notices maint.pl lists now hosts, only "ALL"?
quoted from David Gore


-----Original Message-----
From: David Gore [mailto:user-0ef10ea25696@xymon.invalid] Sent: Thursday, March 31, 2005 5:53 PM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] trends columns all turned purple

Deal, Richard wrote:
Things looked for a little bit then all the trends columns turned
purple.
Any one else see this or better yet solve it?
Thanks

I am seeing the same thing.  I do not know how to fix it.  Has only occurred since installing 4.0.1.  Running on solaris 9.

~David
list Lars Ebeling · Fri, 1 Apr 2005 05:41:44 +0200 (CEST) ·
When I had a look this morning everything looked well. No purple trends and "all" four boxes in maint.pl
 
Rergards
Lars

"Deal, Richard" <user-f6f804cb0a50@xymon.invalid> wrote:
Datum: Thu, 31 Mar 2005 18:04:14 -0500
Från: "Deal, Richard" <user-f6f804cb0a50@xymon.invalid>
Till: <user-ae9b8668bcde@xymon.invalid>
Ämne: RE: [hobbit] trends columns all turned purple

Från: Deal, Richard[SMTP:user-f6f804cb0a50@xymon.invalid]
Skickat: den 1 april 2005 01:04:14
Till: user-ae9b8668bcde@xymon.invalid
Ämne: RE: [hobbit] trends columns all turned purple
Automatiskt vidarebefordrat av en regel
quoted from Richard Deal
I am running 4.0 on solaris 9.
I did a hobbit.sh stop 
Then a rm data/hist/*.trends 
And restarted hobbit

It has cleared up the problem for now but it has only been 10 mins or
so.

Has anyone else notices maint.pl lists now hosts, only "ALL"?


-----Original Message-----
From: David Gore [mailto:user-0ef10ea25696@xymon.invalid] 
Sent: Thursday, March 31, 2005 5:53 PM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] trends columns all turned purple

Deal, Richard wrote:
Things looked for a little bit then all the trends columns turned
purple.
Any one else see this or better yet solve it?
Thanks

I am seeing the same thing. I do not know how to fix it. Has only 
occurred since installing 4.0.1. Running on solaris 9.

~David


Hobbithobbyist

The trouble with the world is that the stupid are cocksure and the intelligent are full of doubt.
--Bertrand Russell
list Henrik Størner · Fri, 1 Apr 2005 08:33:32 +0200 ·
On Thu, Mar 31, 2005 at 06:04:14PM -0500, Deal, Richard wrote:

[trends column goes purple]
Has anyone else notices maint.pl lists now hosts, only "ALL"?
Neither of these show up on any of the hosts I've tested. sounds
like there's some problem with the hobbit daemon on your box,
could you try stopping Hobbit, then do a "ps" to make sure all of the
tasks have stopped, then restarting it ?


Regards,
Henrik
list Richard Deal · Fri, 1 Apr 2005 08:07:31 -0500 ·
I tried but I still don't get an actual list of hosts just the option
for "ALL".
The trends column has remained green though.  It may have been from
starting and stopping a few times when I was testing at the beginning
that the history logs were created.  
quoted from Henrik Størner

-----Original Message-----
From: Henrik Stoerner [mailto:user-ce4a2c883f75@xymon.invalid] 
Sent: Friday, April 01, 2005 1:34 AM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] trends columns all turned purple

On Thu, Mar 31, 2005 at 06:04:14PM -0500, Deal, Richard wrote:

[trends column goes purple]
Has anyone else notices maint.pl lists now hosts, only "ALL"?
Neither of these show up on any of the hosts I've tested. sounds
like there's some problem with the hobbit daemon on your box,
could you try stopping Hobbit, then do a "ps" to make sure all of the
tasks have stopped, then restarting it ?

Regards,
Henrik
list David Gore · Fri, 01 Apr 2005 17:07:47 +0000 ·
I re-installed RC6 to get rid of the purple trends.  The maint.pl lists and 'ALL' may still be a problem, it does occur in both Firefox and IE.   It can be alleviated by changing views.  Typically we monitor bb2.html, but if we switch to the Main view and try enable/disable, it typically works, this only occurred since RC6, I think.

Richard, you are running Solaris 9 also?

~David
quoted from Richard Deal


Deal, Richard wrote:
I tried but I still don't get an actual list of hosts just the option
for "ALL".
The trends column has remained green though.  It may have been from
starting and stopping a few times when I was testing at the beginning
that the history logs were created.  
-----Original Message-----
From: Henrik Stoerner [mailto:user-ce4a2c883f75@xymon.invalid] Sent: Friday, April 01, 2005 1:34 AM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] trends columns all turned purple

On Thu, Mar 31, 2005 at 06:04:14PM -0500, Deal, Richard wrote:

[trends column goes purple]
Has anyone else notices maint.pl lists now hosts, only "ALL"?

Neither of these show up on any of the hosts I've tested. sounds
like there's some problem with the hobbit daemon on your box,
could you try stopping Hobbit, then do a "ps" to make sure all of the
tasks have stopped, then restarting it ?

Regards,
Henrik

list Henrik Størner · Fri, 1 Apr 2005 17:19:26 +0000 (UTC) ·
OK, I think there are two completely un-related issues here.


In <user-87577d67ef7d@xymon.invalid> David Gore <user-0ef10ea25696@xymon.invalid> writes:
I re-installed RC6 to get rid of the purple trends.  
The "trends" don't go purple by themselves, they go purple because the
bb-larrdcolumn tool doesn't update them.

I'd like you to check for unusual messages in the
/var/log/hobbit/bb-display.log file, and also for any core-files left
behind from bb-larrdcolumn. They should be in the ~/server/tmp/
directory, but please check in the ~/data/rrd/ directory also.
quoted from David Gore

The maint.pl lists and 'ALL' may still be a problem, it does occur in
both Firefox and IE.  It can be alleviated by changing views.
Typically we monitor bb2.html, but if we switch to the Main view and
try enable/disable, it typically works, this only occurred since RC6,
I think.
The missing hosts on the maint.pl display sounds like you're still
running the version of maint.pl that uses cookies to try and display
only the hosts from the page you were on. This was removed in the 4.0
release precisely because it was causing problems. Could you check
your maint.pl script and see if lines 432-433 look like this:

    432   # open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." \"hobbitdboard ".$filter."\" |");
    433   open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." hobbitdboard |");


Thanks,
Henrik
list Richard Deal · Fri, 1 Apr 2005 12:42:10 -0500 ·
Removing the history info for trends fixed my purple issue.  Yes I am running solaris 9 with 4.0
I tried changing views but same result for maint.pl
quoted from David Gore


-----Original Message-----
From: David Gore [mailto:user-0ef10ea25696@xymon.invalid] Sent: Friday, April 01, 2005 12:08 PM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] trends columns all turned purple

I re-installed RC6 to get rid of the purple trends.  The maint.pl lists and 'ALL' may still be a problem, it does occur in both Firefox and IE.   It can be alleviated by changing views.  Typically we monitor bb2.html, but if we switch to the Main view and try enable/disable, it typically works, this only occurred since RC6, I think.

Richard, you are running Solaris 9 also?

~David


Deal, Richard wrote:
I tried but I still don't get an actual list of hosts just the option
for "ALL".
The trends column has remained green though.  It may have been from
starting and stopping a few times when I was testing at the beginning
that the history logs were created.  
-----Original Message-----
From: Henrik Stoerner [mailto:user-ce4a2c883f75@xymon.invalid] Sent: Friday, April 01, 2005 1:34 AM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] trends columns all turned purple

On Thu, Mar 31, 2005 at 06:04:14PM -0500, Deal, Richard wrote:

[trends column goes purple]
Has anyone else notices maint.pl lists now hosts, only "ALL"?

Neither of these show up on any of the hosts I've tested. sounds
like there's some problem with the hobbit daemon on your box,
could you try stopping Hobbit, then do a "ps" to make sure all of the
tasks have stopped, then restarting it ?

Regards,
Henrik

list Richard Deal · Fri, 1 Apr 2005 12:56:24 -0500 ·
My maint.pl lines are:
 # open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}."
\"hobbitdboard".$filter."\" |");
 open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." hobbitdboard |");

I have a core from hobbitd in /tmp
file tmp/core
tmp/core:       ELF 32-bit MSB core file SPARC Version 1, from 'hobbitd'
more hobbitd.log 
2005-04-01 12:52:47 Setup complete
2005-04-01 12:52:54 Setup complete
more larrd-data.log 
more larrd-status.log
more bb-display.log
I had it stopped for about two hours while I copied history and rrd over
when I stared it back up trends and now info went purple.
Right now I am running on port 201984 for testing but other than that it
is pretty straight forward install.
quoted from Henrik Størner
 

-----Original Message-----
From: Henrik Storner [mailto:user-ce4a2c883f75@xymon.invalid] 
Sent: Friday, April 01, 2005 12:19 PM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] trends columns all turned purple

OK, I think there are two completely un-related issues here.


In <user-87577d67ef7d@xymon.invalid> David Gore <user-0ef10ea25696@xymon.invalid> writes:
I re-installed RC6 to get rid of the purple trends.  
The "trends" don't go purple by themselves, they go purple because the
bb-larrdcolumn tool doesn't update them.

I'd like you to check for unusual messages in the
/var/log/hobbit/bb-display.log file, and also for any core-files left
behind from bb-larrdcolumn. They should be in the ~/server/tmp/
directory, but please check in the ~/data/rrd/ directory also.

The maint.pl lists and 'ALL' may still be a problem, it does occur in
both Firefox and IE.  It can be alleviated by changing views.
Typically we monitor bb2.html, but if we switch to the Main view and
try enable/disable, it typically works, this only occurred since RC6,
I think.
The missing hosts on the maint.pl display sounds like you're still
running the version of maint.pl that uses cookies to try and display
only the hosts from the page you were on. This was removed in the 4.0
release precisely because it was causing problems. Could you check
your maint.pl script and see if lines 432-433 look like this:

    432   # open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." \"hobbitdboard
".$filter."\" |");
    433   open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." hobbitdboard |");


Thanks,
Henrik
list Richard Deal · Fri, 1 Apr 2005 13:10:26 -0500 ·
Ok I found the problem to maint.pl
It is hardcoded to use the default bb port of 1984
I changed line 433 from:
 open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." hobbitdboard |");
to :
 open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}.":".$BBENV{'BBPORT'}."
hobbitdboard |");

and now it works.
quoted from Richard Deal

-----Original Message-----
From: Henrik Storner [mailto:user-ce4a2c883f75@xymon.invalid] 
Sent: Friday, April 01, 2005 12:19 PM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] trends columns all turned purple

OK, I think there are two completely un-related issues here.


In <user-87577d67ef7d@xymon.invalid> David Gore <user-0ef10ea25696@xymon.invalid> writes:
I re-installed RC6 to get rid of the purple trends.  
The "trends" don't go purple by themselves, they go purple because the
bb-larrdcolumn tool doesn't update them.

I'd like you to check for unusual messages in the
/var/log/hobbit/bb-display.log file, and also for any core-files left
behind from bb-larrdcolumn. They should be in the ~/server/tmp/
directory, but please check in the ~/data/rrd/ directory also.

The maint.pl lists and 'ALL' may still be a problem, it does occur in
both Firefox and IE.  It can be alleviated by changing views.
Typically we monitor bb2.html, but if we switch to the Main view and
try enable/disable, it typically works, this only occurred since RC6,
I think.
The missing hosts on the maint.pl display sounds like you're still
running the version of maint.pl that uses cookies to try and display
only the hosts from the page you were on. This was removed in the 4.0
release precisely because it was causing problems. Could you check
your maint.pl script and see if lines 432-433 look like this:

    432   # open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." \"hobbitdboard
".$filter."\" |");
    433   open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." hobbitdboard |");


Thanks,
Henrik
list David Gore · Fri, 01 Apr 2005 21:51:45 +0000 ·
Yes one core file:

hobbit at hobbit ~/server> find . -name core
./tmp/core
hobbit at hobbit ~/server> file tmp/core
tmp/core:       ELF 32-bit MSB core file SPARC Version 1, from 'hobbitd'
hobbit at hobbit ~/server> ls -al tmp/core
-rw-------    1 hobbit   other     8322084 Apr  1 03:27 tmp/core

I checked to make sure the lines were correct in maint.pl.  I am reinstalling 4.0.1, again and will let you know how it goes.

~David
quoted from Richard Deal

Henrik Storner wrote:
OK, I think there are two completely un-related issues here.


In <user-87577d67ef7d@xymon.invalid> David Gore <user-0ef10ea25696@xymon.invalid> writes:

I re-installed RC6 to get rid of the purple trends.  

The "trends" don't go purple by themselves, they go purple because the
bb-larrdcolumn tool doesn't update them.

I'd like you to check for unusual messages in the
/var/log/hobbit/bb-display.log file, and also for any core-files left
behind from bb-larrdcolumn. They should be in the ~/server/tmp/
directory, but please check in the ~/data/rrd/ directory also.

The maint.pl lists and 'ALL' may still be a problem, it does occur in
both Firefox and IE.  It can be alleviated by changing views.
Typically we monitor bb2.html, but if we switch to the Main view and
try enable/disable, it typically works, this only occurred since RC6,
I think.

The missing hosts on the maint.pl display sounds like you're still
running the version of maint.pl that uses cookies to try and display
only the hosts from the page you were on. This was removed in the 4.0
release precisely because it was causing problems. Could you check
your maint.pl script and see if lines 432-433 look like this:

    432   # open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." \"hobbitdboard ".$filter."\" |");
    433   open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." hobbitdboard |");


Thanks,
Henrik

list David Gore · Fri, 01 Apr 2005 22:11:24 +0000 ·
quoted from David Gore
David Gore wrote:
Yes one core file:

hobbit at hobbit ~/server> find . -name core
./tmp/core
hobbit at hobbit ~/server> file tmp/core
tmp/core:       ELF 32-bit MSB core file SPARC Version 1, from 'hobbitd'
hobbit at hobbit ~/server> ls -al tmp/core
-rw-------    1 hobbit   other     8322084 Apr  1 03:27 tmp/core

I checked to make sure the lines were correct in maint.pl.  I am 
reinstalling 4.0.1, again and will let you know how it goes.

~David

Henrik Storner wrote:
OK, I think there are two completely un-related issues here.


In <user-87577d67ef7d@xymon.invalid> David Gore <user-0ef10ea25696@xymon.invalid> writes:

I re-installed RC6 to get rid of the purple trends.  

The "trends" don't go purple by themselves, they go purple because the
bb-larrdcolumn tool doesn't update them.

I'd like you to check for unusual messages in the
/var/log/hobbit/bb-display.log file, and also for any core-files left
behind from bb-larrdcolumn. They should be in the ~/server/tmp/
directory, but please check in the ~/data/rrd/ directory also.

The maint.pl lists and 'ALL' may still be a problem, it does occur in
both Firefox and IE.  It can be alleviated by changing views.
Typically we monitor bb2.html, but if we switch to the Main view and
try enable/disable, it typically works, this only occurred since RC6,
I think.

The missing hosts on the maint.pl display sounds like you're still
running the version of maint.pl that uses cookies to try and display
only the hosts from the page you were on. This was removed in the 4.0
release precisely because it was causing problems. Could you check
your maint.pl script and see if lines 432-433 look like this:

    432   # open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." \"hobbitdboard 
".$filter."\" |");
    433   open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." hobbitdboard |");


Thanks,
Henrik

New core and the purples are back:

find -namehobbit at hobbit ~/server> find -name core -ls
141454 8024 -rw-------   1 hobbit   other     8207396 Apr  1 22:08 
quoted from David Gore
./tmp/core
hobbit at hobbit ~/server> file ./tmp/core
./tmp/core:     ELF 32-bit MSB core file SPARC Version 1, from 'hobbitd'
list David Gore · Fri, 01 Apr 2005 22:17:58 +0000 ·
David Gore (v965-3670)
Enhanced Technology Support (ETS)
Network Management Systems (NMS)
IMPACT Transport Team Lead - SCSA, SCNA
Page: 1-800-PAG-eMCI pin 1406090
Vnet: 965-3676

David Gore (v965-3670)
Enhanced Technology Support (ETS)
Network Management Systems (NMS)
IMPACT Transport Team Lead - SCSA, SCNA
Page: 1-800-PAG-eMCI pin 1406090
Vnet: 965-3676
quoted from David Gore


David Gore wrote:
David Gore wrote:
Yes one core file:

hobbit at hobbit ~/server> find . -name core
./tmp/core
hobbit at hobbit ~/server> file tmp/core
tmp/core:       ELF 32-bit MSB core file SPARC Version 1, from 'hobbitd'
hobbit at hobbit ~/server> ls -al tmp/core
-rw-------    1 hobbit   other     8322084 Apr  1 03:27 tmp/core

I checked to make sure the lines were correct in maint.pl.  I am 
reinstalling 4.0.1, again and will let you know how it goes.

~David

Henrik Storner wrote:
OK, I think there are two completely un-related issues here.


In <user-87577d67ef7d@xymon.invalid> David Gore <user-0ef10ea25696@xymon.invalid> writes:

I re-installed RC6 to get rid of the purple trends.  

The "trends" don't go purple by themselves, they go purple because the
bb-larrdcolumn tool doesn't update them.

I'd like you to check for unusual messages in the
/var/log/hobbit/bb-display.log file, and also for any core-files left
behind from bb-larrdcolumn. They should be in the ~/server/tmp/
directory, but please check in the ~/data/rrd/ directory also.

The maint.pl lists and 'ALL' may still be a problem, it does occur in
both Firefox and IE.  It can be alleviated by changing views.
Typically we monitor bb2.html, but if we switch to the Main view and
try enable/disable, it typically works, this only occurred since RC6,
I think.

The missing hosts on the maint.pl display sounds like you're still
running the version of maint.pl that uses cookies to try and display
only the hosts from the page you were on. This was removed in the 4.0
release precisely because it was causing problems. Could you check
your maint.pl script and see if lines 432-433 look like this:

    432   # open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." 
\"hobbitdboard ".$filter."\" |");
    433   open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." hobbitdboard |");


Thanks,
Henrik

New core and the purples are back:

find -namehobbit at hobbit ~/server> find -name core -ls
141454 8024 -rw-------   1 hobbit   other     8207396 Apr  1 22:08 
./tmp/core
hobbit at hobbit ~/server> file ./tmp/core
./tmp/core:     ELF 32-bit MSB core file SPARC Version 1, from 'hobbitd'
One message in bb-display.log:

2005-04-01 22:07:55 Whoops ! bb failed to send message - timeout

We do have close to a 1000 entries in bb-hosts, if it matters.
list David Gore · Fri, 01 Apr 2005 22:53:53 +0000 ·
quoted from David Gore
David Gore wrote:
David Gore wrote:
Yes one core file:

hobbit at hobbit ~/server> find . -name core
./tmp/core
hobbit at hobbit ~/server> file tmp/core
tmp/core:       ELF 32-bit MSB core file SPARC Version 1, from 'hobbitd'
hobbit at hobbit ~/server> ls -al tmp/core
-rw-------    1 hobbit   other     8322084 Apr  1 03:27 tmp/core

I checked to make sure the lines were correct in maint.pl.  I am 
reinstalling 4.0.1, again and will let you know how it goes.

~David

Henrik Storner wrote:
OK, I think there are two completely un-related issues here.


In <user-87577d67ef7d@xymon.invalid> David Gore <user-0ef10ea25696@xymon.invalid> writes:

I re-installed RC6 to get rid of the purple trends.  

The "trends" don't go purple by themselves, they go purple because the
bb-larrdcolumn tool doesn't update them.

I'd like you to check for unusual messages in the
/var/log/hobbit/bb-display.log file, and also for any core-files left
behind from bb-larrdcolumn. They should be in the ~/server/tmp/
directory, but please check in the ~/data/rrd/ directory also.

The maint.pl lists and 'ALL' may still be a problem, it does occur in
both Firefox and IE.  It can be alleviated by changing views.
Typically we monitor bb2.html, but if we switch to the Main view and
try enable/disable, it typically works, this only occurred since RC6,
I think.

The missing hosts on the maint.pl display sounds like you're still
running the version of maint.pl that uses cookies to try and display
only the hosts from the page you were on. This was removed in the 4.0
release precisely because it was causing problems. Could you check
your maint.pl script and see if lines 432-433 look like this:

    432   # open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." 
\"hobbitdboard ".$filter."\" |");
    433   open (HOBBITDLIST, "bb ".$BBENV{'BBDISP'}." hobbitdboard |");


Thanks,
Henrik

New core and the purples are back:

find -namehobbit at hobbit ~/server> find -name core -ls
141454 8024 -rw-------   1 hobbit   other     8207396 Apr  1 22:08 
./tmp/core
hobbit at hobbit ~/server> file ./tmp/core
./tmp/core:     ELF 32-bit MSB core file SPARC Version 1, from 'hobbitd'
More cores...

hobbit at hobbit ~> find -name core -ls
141462 9000 -rw-------   1 hobbit   other     9206820 Apr  1 22:37 
./server/tmp/core
843033 1784 -rw-------   1 hobbit   other     1817864 Mar  2 08:20 
./data/acks/core
910789  464 -rw-------   1 hobbit   other      465764 Apr  1 21:52 
./data/logs/core
hobbit at hobbit ~> file ./data/logs/core
./data/logs/core:       ELF 32-bit MSB core file SPARC Version 1, from 
'bb-larrdcolumn'
hobbit at hobbit ~> file ./server/tmp/core
./server/tmp/core:      ELF 32-bit MSB core file SPARC Version 1, from 
'hobbitd'
list Henrik Størner · Sat, 2 Apr 2005 13:39:39 +0200 ·
quoted from David Gore
On Fri, Apr 01, 2005 at 09:51:45PM +0000, David Gore wrote:
Yes one core file:

hobbit at hobbit ~/server> find . -name core
./tmp/core
hobbit at hobbit ~/server> file tmp/core
tmp/core:       ELF 32-bit MSB core file SPARC Version 1, from 'hobbitd'
hobbit at hobbit ~/server> ls -al tmp/core
-rw-------    1 hobbit   other     8322084 Apr  1 03:27 tmp/core

I checked to make sure the lines were correct in maint.pl.  I am 
reinstalling 4.0.1, again and will let you know how it goes.
OK, so it does dump core.

To get some more info about this, you need the core file *and* the
hobbitd binary that generated it. Then run

   $ gdb bin/hobbitd tmp/core
   [messages from gdb]
   gdb> bt

to load the core file and the hobbitd binary into gdb (the GNU
debugger), and the the "bt" command will provide a call trace of what
happened when the program crashed, that is the first piece of
information that is needed to find the bug.


Regards,
Henrik
list David Gore · Sat, 02 Apr 2005 16:09:40 +0100 ·
Here is the first core trace after the re-install and start 4.0.1:

hobbit at hobbit ~/server> gdb bin/hobbitd tmp/core
GNU gdb 6.0
Copyright 2003 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain 
conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "sparc-sun-solaris2.9"...
Core was generated by `hobbitd 
--restart=/export/home/hobbit/server/tmp/hobbitd.chk --checkpoint-file='.
Program terminated with signal 6, Aborted.
Reading symbols from /usr/lib/libresolv.so.2...done.
Loaded symbols for /usr/lib/libresolv.so.2
Reading symbols from /usr/lib/libsocket.so.1...done.
Loaded symbols for /usr/lib/libsocket.so.1
Reading symbols from /usr/lib/libnsl.so.1...done.
Loaded symbols for /usr/lib/libnsl.so.1
Reading symbols from /usr/lib/libc.so.1...done.
Loaded symbols for /usr/lib/libc.so.1
Reading symbols from /usr/lib/libdl.so.1...done.
Loaded symbols for /usr/lib/libdl.so.1
Reading symbols from /usr/lib/libmp.so.2...done.
Loaded symbols for /usr/lib/libmp.so.2
Reading symbols from /usr/platform/SUNW,Ultra-4/lib/libc_psr.so.1...done.
Loaded symbols for /usr/platform/SUNW,Ultra-4/lib/libc_psr.so.1
#0  0xff19d3d4 in _libc_kill () from /usr/lib/libc.so.1
(gdb) bt
#0  0xff19d3d4 in _libc_kill () from /usr/lib/libc.so.1
#1  0xff135698 in abort () from /usr/lib/libc.so.1
#2  0x0001db68 in sigsegv_handler (signum=10) at sig.c:57
#3  <signal handler called>
signature

David Gore (v965-3670)
Enhanced Technology Support (ETS)
Network Management Systems (NMS)
IMPACT Transport Team Lead - SCSA, SCNA
Page: 1-800-PAG-eMCI pin 1406090
Vnet: 965-3676


quoted from Henrik Størner
Henrik Stoerner wrote:
On Fri, Apr 01, 2005 at 09:51:45PM +0000, David Gore wrote:
 
Yes one core file:

hobbit at hobbit ~/server> find . -name core
./tmp/core
hobbit at hobbit ~/server> file tmp/core
tmp/core:       ELF 32-bit MSB core file SPARC Version 1, from 'hobbitd'
hobbit at hobbit ~/server> ls -al tmp/core
-rw-------    1 hobbit   other     8322084 Apr  1 03:27 tmp/core

I checked to make sure the lines were correct in maint.pl.  I am 
reinstalling 4.0.1, again and will let you know how it goes.
   
OK, so it does dump core.

To get some more info about this, you need the core file *and* the
hobbitd binary that generated it. Then run

  $ gdb bin/hobbitd tmp/core
  [messages from gdb]
  gdb> bt

to load the core file and the hobbitd binary into gdb (the GNU
debugger), and the the "bt" command will provide a call trace of what
happened when the program crashed, that is the first piece of
information that is needed to find the bug.


Regards,
Henrik

list David Gore · Sat, 02 Apr 2005 17:30:02 +0100 ·
quoted from David Gore
David Gore wrote:
Here is the first core trace after the re-install and start 4.0.1:

hobbit at hobbit ~/server> gdb bin/hobbitd tmp/core
GNU gdb 6.0
Copyright 2003 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "sparc-sun-solaris2.9"...
Core was generated by `hobbitd --restart=/export/home/hobbit/server/tmp/hobbitd.chk --checkpoint-file='.
Program terminated with signal 6, Aborted.
Reading symbols from /usr/lib/libresolv.so.2...done.
Loaded symbols for /usr/lib/libresolv.so.2
Reading symbols from /usr/lib/libsocket.so.1...done.
Loaded symbols for /usr/lib/libsocket.so.1
Reading symbols from /usr/lib/libnsl.so.1...done.
Loaded symbols for /usr/lib/libnsl.so.1
Reading symbols from /usr/lib/libc.so.1...done.
Loaded symbols for /usr/lib/libc.so.1
Reading symbols from /usr/lib/libdl.so.1...done.
Loaded symbols for /usr/lib/libdl.so.1
Reading symbols from /usr/lib/libmp.so.2...done.
Loaded symbols for /usr/lib/libmp.so.2
Reading symbols from /usr/platform/SUNW,Ultra-4/lib/libc_psr.so.1...done.
Loaded symbols for /usr/platform/SUNW,Ultra-4/lib/libc_psr.so.1
#0  0xff19d3d4 in _libc_kill () from /usr/lib/libc.so.1
(gdb) bt
#0  0xff19d3d4 in _libc_kill () from /usr/lib/libc.so.1
#1  0xff135698 in abort () from /usr/lib/libc.so.1
#2  0x0001db68 in sigsegv_handler (signum=10) at sig.c:57
#3  <signal handler called>


Henrik Stoerner wrote:
On Fri, Apr 01, 2005 at 09:51:45PM +0000, David Gore wrote:
 
Yes one core file:

hobbit at hobbit ~/server> find . -name core
./tmp/core
hobbit at hobbit ~/server> file tmp/core
tmp/core:       ELF 32-bit MSB core file SPARC Version 1, from 'hobbitd'
hobbit at hobbit ~/server> ls -al tmp/core
-rw-------    1 hobbit   other     8322084 Apr  1 03:27 tmp/core

I checked to make sure the lines were correct in maint.pl.  I am reinstalling 4.0.1, again and will let you know how it goes.
  
OK, so it does dump core.

To get some more info about this, you need the core file *and* the
hobbitd binary that generated it. Then run

  $ gdb bin/hobbitd tmp/core
  [messages from gdb]
  gdb> bt

to load the core file and the hobbitd binary into gdb (the GNU
debugger), and the the "bt" command will provide a call trace of what
happened when the program crashed, that is the first piece of
information that is needed to find the bug.


Regards,
Henrik

Based on the core trace, I decided to remove all files from ~/server/tmp/.  I do not think we had removed any files from running RC1,2,4-6 from '~/server/tmp/' so perhaps it was choking on something (hobbitd.chk)?  It has been running clean for more than an hour.  It typically dumps core before an hour has passed and the purple trends occur before an hour of running too.  I did have to re-disable my disabled hosts of course.  So far it looks good!  Thanks for your help Henrik.

~David
list David Gore · Sat, 02 Apr 2005 18:35:12 +0100 ·
quoted from David Gore
David Gore wrote:
David Gore wrote:
Here is the first core trace after the re-install and start 4.0.1:

hobbit at hobbit ~/server> gdb bin/hobbitd tmp/core
GNU gdb 6.0
Copyright 2003 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "sparc-sun-solaris2.9"...
Core was generated by `hobbitd --restart=/export/home/hobbit/server/tmp/hobbitd.chk --checkpoint-file='.
Program terminated with signal 6, Aborted.
Reading symbols from /usr/lib/libresolv.so.2...done.
Loaded symbols for /usr/lib/libresolv.so.2
Reading symbols from /usr/lib/libsocket.so.1...done.
Loaded symbols for /usr/lib/libsocket.so.1
Reading symbols from /usr/lib/libnsl.so.1...done.
Loaded symbols for /usr/lib/libnsl.so.1
Reading symbols from /usr/lib/libc.so.1...done.
Loaded symbols for /usr/lib/libc.so.1
Reading symbols from /usr/lib/libdl.so.1...done.
Loaded symbols for /usr/lib/libdl.so.1
Reading symbols from /usr/lib/libmp.so.2...done.
Loaded symbols for /usr/lib/libmp.so.2
Reading symbols from /usr/platform/SUNW,Ultra-4/lib/libc_psr.so.1...done.
Loaded symbols for /usr/platform/SUNW,Ultra-4/lib/libc_psr.so.1
#0  0xff19d3d4 in _libc_kill () from /usr/lib/libc.so.1
(gdb) bt
#0  0xff19d3d4 in _libc_kill () from /usr/lib/libc.so.1
#1  0xff135698 in abort () from /usr/lib/libc.so.1
#2  0x0001db68 in sigsegv_handler (signum=10) at sig.c:57
#3  <signal handler called>


Henrik Stoerner wrote:
On Fri, Apr 01, 2005 at 09:51:45PM +0000, David Gore wrote:
 
Yes one core file:

hobbit at hobbit ~/server> find . -name core
./tmp/core
hobbit at hobbit ~/server> file tmp/core
tmp/core:       ELF 32-bit MSB core file SPARC Version 1, from 'hobbitd'
hobbit at hobbit ~/server> ls -al tmp/core
-rw-------    1 hobbit   other     8322084 Apr  1 03:27 tmp/core

I checked to make sure the lines were correct in maint.pl.  I am reinstalling 4.0.1, again and will let you know how it goes.
  
OK, so it does dump core.

To get some more info about this, you need the core file *and* the
hobbitd binary that generated it. Then run

  $ gdb bin/hobbitd tmp/core
  [messages from gdb]
  gdb> bt

to load the core file and the hobbitd binary into gdb (the GNU
debugger), and the the "bt" command will provide a call trace of what
happened when the program crashed, that is the first piece of
information that is needed to find the bug.


Regards,
Henrik

Based on the core trace, I decided to remove all files from ~/server/tmp/.  I do not think we had removed any files from running RC1,2,4-6 from '~/server/tmp/' so perhaps it was choking on something (hobbitd.chk)?  It has been running clean for more than an hour.  It typically dumps core before an hour has passed and the purple trends occur before an hour of running too.  I did have to re-disable my disabled hosts of course.  So far it looks good!  Thanks for your help Henrik.

~David

Well, it's back after a restart.   I will upload the cores, please delete them when you are finished.  I probably will try removing the check file, and restarting, but that is not a real solution, if it works.

~David