Xymon Mailing List Archive search

hobbitd_client crash

18 messages in this thread

list Al Jeffcoat · Fri, 19 Aug 2005 11:52:34 -0400 ·
Hello,

 
After using updating to the snapshot from 8/12, I started getting
red/purple for hobbitd_client.  I updated to the 8/19 snapshot, and it
is doing the same thing.  Running Fedora Core 4.

 
Also, I noticed that I'm getting the following in rrd-data.log:

 
2005-08-19 11:50:06 RRD error updating
/hobbit/data/rrd/sovera1/netstat.rrd from ###.###.###.###: expected 16
data source readings (got 15) from
1124466606:365339:67276:0:89917:318118:U:U:382851363:3633351626:3778183:
133445:3432584:12859462:10

2481:123:...

 
This error is occurring for 2 AIX and 1 Tru64 server, all three running
the 4.1.2 snapshot from varying dates.

 
Al

This e-mail message and any attached files are confidential and are intended solely for the use of the addressee(s) named above. If you are not the intended recipient, any review, use, or distribution of this e-mail message and any attached files is strictly prohibited. This communication may contain material protected by Federal privacy regulations, attorney-client work product, or other privileges. If you have received this confidential communication in error, please notify the sender immediately by reply e-mail message and permanently delete the original message.  To reply to our email administrator directly, send an email to:  user-ecde3bbc361d@xymon.invalid .  If this e-mail message concerns a contract matter, be advised that no employee or agent is authorized to conclude any binding agreement on behalf of Orlando Regional Healthcare by e-mail without express written confirmation by an officer of the corporation. Any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those of Orlando Regional Healthcare.
list Al Jeffcoat · Fri, 19 Aug 2005 11:55:41 -0400 ·
Sorry, in such a hurry, I forgot to post the details on the red/purplefor hobbitd_client...

 
On the web page, getting: 

 
Program crashed

 
Fatal signal caught!

 
I do not see anything in the logs directory for this.  It turns red on start, then after ½ hour, goes purple until the next restart.

 
Al
quoted from Al Jeffcoat

 
From: Jeffcoat, Al 
Sent: Friday, August 19, 2005 11:53 AM
To: 'user-ae9b8668bcde@xymon.invalid'
Subject: hobbitd_client crash

 
Hello,

 
After using updating to the snapshot from 8/12, I started getting red/purple for hobbitd_client.  I updated to the 8/19 snapshot, and it is doing the same thing.  Running Fedora Core 4.

 
Also, I noticed that I'm getting the following in rrd-data.log:

 

2005-08-19 11:50:06 RRD error updating /hobbit/data/rrd/sovera1/netstat.rrd from ###.###.###.###: expected 16 data source readings (got 15) from 1124466606:365339:67276:0:89917:318118:U:U:382851363:3633351626:3778183:133445:3432584:12859462:10
quoted from Al Jeffcoat

2481:123:...

 
This error is occurring for 2 AIX and 1 Tru64 server, all three running the 4.1.2 snapshot from varying dates.

 
Al

This e-mail message and any attached files are confidential and are intended solely for the use of the addressee(s) named above. If you are not the intended recipient, any review, use, or distribution of this e-mail message and any attached files is strictly prohibited. This communication may contain material protected by Federal privacy regulations, attorney-client work product, or other privileges. If you have received this confidential communication in error, please notify the sender immediately by reply e-mail message and permanently delete the original message.  To reply to our email administrator directly, send an email to:  user-ecde3bbc361d@xymon.invalid .  If this e-mail message concerns a contract matter, be advised that no employee or agent is authorized to conclude any binding agreement on behalf of Orlando Regional Healthcare by e-mail without express written confirmation by an officer of the corporation. Any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those of Orlando Regional Healthcare.
list Terry Rossi · 24 Aug 2005 23:55:24 GMT ·
Hello,

 
After using updating to the snapshot from 8/12, I started getting
red/purple for hobbitd_client.  I updated to the 8/19 snapshot, and it
is doing the same thing.  Running Fedora Core 4.

 
Also, I noticed that I'm getting the following in rrd-data.log:

 
2005-08-19 11:50:06 RRD error updating
/hobbit/data/rrd/sovera1/netstat.rrd from ###.###.###.###: expected 16
data source readings (got 15) from
1124466606:365339:67276:0:89917:318118:U:U:382851363:3633351626:3778183:
133445:3432584:12859462:10

2481:123:...

 
This error is occurring for 2 AIX and 1 Tru64 server, all three running
the 4.1.2 snapshot from varying dates.

 
Al

This e-mail message and any attached files are confidential and are intended solely for the use of the addressee(s) named above. If you are not the intended recipient, any review, use, or distribution of this e-mail message and any attached files is strictly prohibited. This communication may contain material protected by Federal privacy regulations, attorney-client work product, or other privileges. If you have received this confidential communication in error, please notify the sender immediately by reply e-mail message and permanently delete the original message.  To reply to our email administrator directly, send an email to:  user-ecde3bbc361d@xymon.invalid .  If this e-mail message concerns a contract matter, be advised that no employee or agent is authorized to conclude any binding agreement on behalf of Orlando Regional Healthcare by e-mail without express written confirmation by an officer of the corporation. Any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those of Orlando Regional Healthcare.
list Henrik Størner · Sat, 27 Aug 2005 11:23:04 +0200 ·
quoted from Al Jeffcoat
On Fri, Aug 19, 2005 at 11:55:41AM -0400, Jeffcoat, Al wrote:
Sorry, in such a hurry, I forgot to post the details on the red/purplefor hobbitd_client...

Program crashed
Fatal signal caught!
There ought to be a core file in the ~hobbit/server/tmp/ directory from
this. Could you run it through the procedure in
http://www.hswn.dk/hobbit/help/known-issues.html#bugreport

and let me know what the trace says ? Also, what version are you
running: The original 4.1.1, or one of the (many) snapshots ?


Regards,
Henrik
list Al Jeffcoat · Sat, 27 Aug 2005 10:32:06 -0400 ·
Well, after updating to todays snapshot, the hobbitd_client column is no
longer there...  If it occurs again, I will run it through gdb...  The
snapshot I was using was 08.11 and 08.19.

Al
quoted from Terry Rossi
 

-----Original Message-----
From: Henrik Stoerner [mailto:user-ce4a2c883f75@xymon.invalid] 
Sent: Saturday, August 27, 2005 5:23 AM
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] RE: hobbitd_client crash

On Fri, Aug 19, 2005 at 11:55:41AM -0400, Jeffcoat, Al wrote:
Sorry, in such a hurry, I forgot to post the details on the
red/purplefor hobbitd_client...

Program crashed
Fatal signal caught!
There ought to be a core file in the ~hobbit/server/tmp/ directory from
this. Could you run it through the procedure in
http://www.hswn.dk/hobbit/help/known-issues.html#bugreport

and let me know what the trace says ? Also, what version are you
running: The original 4.1.1, or one of the (many) snapshots ?


Regards,
Henrik


This e-mail message and any attached files are confidential and are intended solely for the use of the addressee(s) named above. If you are not the intended recipient, any review, use, or distribution of this e-mail message and any attached files is strictly prohibited. This communication may contain material protected by Federal privacy regulations, attorney-client work product, or other privileges. If you have received this confidential communication in error, please notify the sender immediately by reply e-mail message and permanently delete the original message.  To reply to our email administrator directly, send an email to:  user-ecde3bbc361d@xymon.invalid .  If this e-mail message concerns a contract matter, be advised that no employee or agent is authorized to conclude any binding agreement on behalf of Orlando Regional Healthcare by e-mail without express written confirmation by an officer of the corporation. Any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those of Orlando Regional Healthcare.
list Michael Dunne · Thu, 28 Sep 2006 17:39:32 -0400 ·
Greetings,

 The hobbitd_client module has been crashing on and off all day. Below is
the gdb output. Any advice would be greatly appreciated.

This GDB was configured as "powerpc-apple-darwin".
Reading symbols for shared libraries .. done
#0  0x9004312c in kill ()
(gdb) bt
#0  0x9004312c in kill ()
#1  0x9009fb9c in abort ()
#2  0x00016bdc in receive_samples ()
#3  <signal handler called>
#4  0x90007260 in strlen ()
#5  0x0000ec68 in error_message ()
#6  0x00002dc8 in main (argc=1192252, argv=0xa0005290) at bb.c:138
(gdb) quit


Thanks,

Mike


Michael Dunne
Network Support Specialist
AccuWeather, Inc. 

~HISTORY, n. An account mostly false, of events mostly unimportant, which
are brought about by rulers mostly knaves, and soldiers mostly fools.

(Ambrose Bierce, "The Devil's Dictionary")~
list Henrik Størner · Fri, 29 Sep 2006 08:41:54 +0200 ·
quoted from Michael Dunne
On Thu, Sep 28, 2006 at 05:39:32PM -0400, Michael Dunne wrote:
 The hobbitd_client module has been crashing on and off all day. Below is
the gdb output. Any advice would be greatly appreciated.

This GDB was configured as "powerpc-apple-darwin".
Reading symbols for shared libraries .. done
#3  <signal handler called>
#4  0x90007260 in strlen ()
#5  0x0000ec68 in error_message ()
#6  0x00002dc8 in main (argc=1192252, argv=0xa0005290) at bb.c:138
This trace doesn't make sense - the "bb.c" sourcefile is not part of
the hobbitd_client binary.

Are you sure you fed gdb the right binary ? If it is hobbitd_client that
is crashing, then the command should be

    $ cd ~hobbit/server
    $ gdb bin/hobbitd_client /path/to/corefile
    gdb> bt


Regards,
Henrik
list Jason Altrincham Jones · Fri, 29 Sep 2006 10:02:00 +0100 ·
Hi all,

I know I've asked this before and I'm sorry to repeat but could someone
tell me the steps to add a hour and days input field to the ack alert
form?  (this is the first time I've altered anything on the hobbit
webpage side of things), also would it be possible to not allow acks
longer than 1 week?

Thanks,
Jason.
list Henrik Størner · Fri, 29 Sep 2006 12:32:28 +0200 ·
quoted from Jason Altrincham Jones
On Fri, Sep 29, 2006 at 10:02:00AM +0100, Jones, Jason (Altrincham) wrote:
I know I've asked this before and I'm sorry to repeat but could someone
tell me the steps to add a hour and days input field to the ack alert
form?  (this is the first time I've altered anything on the hobbit
webpage side of things), 
It's not quite what you want, but the attached patch lets you enter the
time as "6h" (6 hours) or "2d8h" (2 days + 8 hours).

also would it be possible to not allow acks longer than 1 week?
That would need some more code in the bb-ack CGI to reject the too-long
acks. Some sort of configuration mechanism would also be needed to set
the maximum time - either globally, or per-host.


Regards,
Henrik

-------------- next part --------------
--- lib/timefunc.c	2006/07/20 16:06:41	1.31
+++ lib/timefunc.c	2006/09/29 10:21:23
@@ -245,6 +245,7 @@
 	return found;
 }
 
+#ifndef CLIENTONLY
 char *check_downtime(char *hostname, char *testname)
 {
 	namelist_t *hinfo = hostinfo(hostname);
@@ -300,6 +301,7 @@
 
 	return NULL;
 }
+#endif
 
 int periodcoversnow(char *tag)
 {
@@ -415,20 +417,30 @@
 	 */
 
 	int result = 0;
-	char *p;
-	char modifier;
+	char *startofval;
 
-	p = dur + strspn(dur, "0123456789");
-	modifier = *p;
-	*p = '\0';
-	result = atoi(dur);
-	*p = modifier;
+	startofval = dur;
• +	while (startofval && (isdigit((int)*startofval))) {
+		char *p;
+		char modifier;
+		int oneval = 0;
• +		p = startofval + strspn(startofval, "0123456789");
+		modifier = *p;
+		*p = '\0';
+		oneval = atoi(startofval);
+		*p = modifier;
• +		switch (modifier) {
+		  case 'm': break;			/* minutes */
+		  case 'h': oneval *= 60; break;	/* hours */
+		  case 'd': oneval *= 1440; break;	/* days */
+		  case 'w': oneval *= 10080; break;	/* weeks */
+		}
 
-	switch (modifier) {
-	  case 'm': break;			/* minutes */
-	  case 'h': result *= 60; break;	/* hours */
-	  case 'd': result *= 1440; break;	/* days */
-	  case 'w': result *= 10080; break;	/* weeks */
+		result += oneval;
+		if (*p) startofval = p+1;
 	}
 
 	return result;
--- web/bb-ack.c	2006/08/11 21:04:17	1.33
+++ web/bb-ack.c	2006/09/29 10:22:07
@@ -133,7 +133,7 @@
 				}
 
 				if (acknum) awalk->acknum = atoi(acknum);
-				if (validity) awalk->validity = atoi(validity);
+				if (validity) awalk->validity = durationvalue(validity);
 				if (ackmsg) awalk->ackmsg = strdup(ackmsg);
 				if (hostname) awalk->hostname = strdup(hostname);
 				if (testname) awalk->testname = strdup(testname);
@@ -161,7 +161,7 @@
 
 	fprintf(output, "    <td>%s</td>\n", (hname ? hname : "&nbsp;"));
 	fprintf(output, "    <td>%s</td>\n", (tname ? tname : "&nbsp;"));
-	fprintf(output, "    <TD><INPUT TYPE=TEXT NAME=\"DELAY_%s\" SIZE=4 MAXLENGTH=4></TD>\n", numstr);
+	fprintf(output, "    <TD><INPUT TYPE=TEXT NAME=\"DELAY_%s\" SIZE=8 MAXLENGTH=20></TD>\n", numstr);
 	fprintf(output, "    <TD><INPUT TYPE=TEXT NAME=\"MESSAGE_%s\" SIZE=60 MAXLENGTH=80></TD>\n", numstr);
 
 	fprintf(output, "    <TD>\n");
@@ -316,7 +316,7 @@
 
 			if (reqtype == ACK_MANY) {
 				if (!awalk->ackmsg) awalk->ackmsg = ackmsgall;
-				if (!awalk->validity && validityall) awalk->validity = atoi(validityall);
+				if (!awalk->validity && validityall) awalk->validity = durationvalue(validityall);
 			}
 
 			count++;
list Jason Altrincham Jones · Fri, 29 Sep 2006 12:33:59 +0100 ·
This may sound like a really stupid and obvious question but how do I
apply this patch? :$ the person who used to handle hobbit left about a
month ago and I never was never told how to

Jason.
quoted from Henrik Størner

-----Original Message-----
From: Henrik Stoerner [mailto:user-ce4a2c883f75@xymon.invalid] 
Sent: 29 September 2006 11:32
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] longer ack alerts

On Fri, Sep 29, 2006 at 10:02:00AM +0100, Jones, Jason (Altrincham)
wrote:
I know I've asked this before and I'm sorry to repeat but could
someone
tell me the steps to add a hour and days input field to the ack alert
form?  (this is the first time I've altered anything on the hobbit
webpage side of things), 
It's not quite what you want, but the attached patch lets you enter the
time as "6h" (6 hours) or "2d8h" (2 days + 8 hours).

also would it be possible to not allow acks longer than 1 week?
That would need some more code in the bb-ack CGI to reject the too-long
acks. Some sort of configuration mechanism would also be needed to set
the maximum time - either globally, or per-host.


Regards,
Henrik
list Henrik Størner · Fri, 29 Sep 2006 16:39:17 +0200 ·
quoted from Jason Altrincham Jones
On Fri, Sep 29, 2006 at 12:33:59PM +0100, Jones, Jason (Altrincham) wrote:
This may sound like a really stupid and obvious question but how do I
apply this patch? :$ the person who used to handle hobbit left about a
month ago and I never was never told how to
OK:

* Save the patch to a file, e.g. /tmp/somefile.patch
* Go to the hobbit-4.2.0 source directory
* Run the command "patch -p0 < /tmp/somefile.patch"
* Recompile hobbit with "make; make install"
* Restart Hobbit (not needed for this patch, but just in case).


Regards,
Henrik
list Michael Dunne · Fri, 29 Sep 2006 11:27:45 -0400 ·
Henrik,

Apologies, here is the correct output.
quoted from Michael Dunne

Reading symbols for shared libraries ... done
#0  0x9004312c in kill ()
(gdb) bt
#0  0x9004312c in kill ()
#1  0x9009fb9c in abort ()

#2  0x00016bdc in ucp_table ()
#3  <signal handler called>
#4  0x90007290 in strlen ()
#5  0x0000ec68 in ucp_table ()
#6  0x00002dc8 in find_fixedlength (code=0x0, options=-1610591600) at
pcre.c:1398
#7  0x000052b0 in handle_linux_client () at pcre.c:3391
#8  0x00008f90 in main () at pcre.c:6810
(gdb) 

Mike
quoted from Henrik Størner


On 9/29/06 2:41 AM, "Henrik Stoerner" <user-ce4a2c883f75@xymon.invalid> wrote:
On Thu, Sep 28, 2006 at 05:39:32PM -0400, Michael Dunne wrote:
 The hobbitd_client module has been crashing on and off all day. Below is
the gdb output. Any advice would be greatly appreciated.

This GDB was configured as "powerpc-apple-darwin".
Reading symbols for shared libraries .. done
#3  <signal handler called>
#4  0x90007260 in strlen ()
#5  0x0000ec68 in error_message ()
#6  0x00002dc8 in main (argc=1192252, argv=0xa0005290) at bb.c:138
This trace doesn't make sense - the "bb.c" sourcefile is not part of
the hobbitd_client binary.

Are you sure you fed gdb the right binary ? If it is hobbitd_client that
is crashing, then the command should be

    $ cd ~hobbit/server
    $ gdb bin/hobbitd_client /path/to/corefile
    gdb> bt


Regards,
Henrik

Michael Dunne
Network Support Specialist
AccuWeather, Inc. 

XXX-XXX-XXXX Ext. XXXX
quoted from Michael Dunne

~HISTORY, n. An account mostly false, of events mostly unimportant, which
are brought about by rulers mostly knaves, and soldiers mostly fools.

(Ambrose Bierce, "The Devil's Dictionary")~
list Henrik Størner · Tue, 3 Oct 2006 10:08:18 +0200 ·
quoted from Michael Dunne
On Fri, Sep 29, 2006 at 11:27:45AM -0400, Michael Dunne wrote:
Henrik,

Apologies, here is the correct output.
It still doesn't look right, but I had another report about a crash in
hobbitd_client this morning. Could you try the attached patch and let
me know if it makes any difference?


Regards,
Henrik

-------------- next part --------------
--- hobbitd/client_config.c	2006/08/06 21:03:46	1.50
+++ hobbitd/client_config.c	2006/10/03 05:25:53
@@ -1249,6 +1249,7 @@
 	*abswarn = 0;
 	*abspanic = 0;
 	*ignored = 0;
+	*group = NULL;
 
 	rule = getrule(hostname, pagename, classname, C_DISK);
 	while (rule && !namematch(fsname, rule->rule.disk.fsexp->pattern, rule->rule.disk.fsexp->exp)) {
list Jason Altrincham Jones · Tue, 3 Oct 2006 09:49:09 +0100 ·
Hi all,

I have recently applied the ack-longtime patch but now the info column
won't load, could the patch have caused this? Or is there some other
cause that I can fix?

Thanks,
Jason.
list Michael Dunne · Tue, 03 Oct 2006 17:55:44 -0400 ·
Henrik,

The patch seems to have done the trick. Thanks again for your help!

Mike
quoted from Henrik Størner


On 10/3/06 4:08 AM, "Henrik Stoerner" <user-ce4a2c883f75@xymon.invalid> wrote:
On Fri, Sep 29, 2006 at 11:27:45AM -0400, Michael Dunne wrote:
Henrik,

Apologies, here is the correct output.
It still doesn't look right, but I had another report about a crash in
hobbitd_client this morning. Could you try the attached patch and let
me know if it makes any difference?


Regards,
Henrik

Michael Dunne
Network Support Specialist
AccuWeather, Inc. 
XXX-XXX-XXXX Ext. XXXX

~HISTORY, n. An account mostly false, of events mostly unimportant, which
are brought about by rulers mostly knaves, and soldiers mostly fools.

(Ambrose Bierce, "The Devil's Dictionary")~
list Kevin Kelly · Mon, 15 Jun 2009 13:50:46 -0500 ·
Just wondering if there is a way to put extra information on this page.
Like what apps on this system, owner of the app, etc.

 
Kevin Kelly

Unix Administrator

Lifetouch Inc

XXX-XXX-XXXXw

XXX-XXX-XXXXc

 
This email has been scanned by the MessageLabs Email Security System.
For more information please visit http://www.messagelabs.com/email 
list Dan McDonald · Mon, 15 Jun 2009 14:40:34 -0500 ·
quoted from Kevin Kelly
On Mon, 2009-06-15 at 13:50 -0500, Kevin Kelly wrote:
Just wondering if there is a way to put extra information on this
page. Like what apps on this system, owner of the app, etc.
The xymon notes editor would probably be the best fit.
http://www.trantor.org/xymonton/doku.php/addons:hobbitnotes


-- 
Daniel J McDonald, CCIE # 2495, CISSP # 78281, CNX
www.austinenergy.com
list David Peters · Tue, 16 Jun 2009 13:00:07 +1000 ·
Another way is if you have the information elsewhere then import it into the INFO column being careful with the formatting.

David Peters

+61 2 63913565
+61 4 17863572


"McDonald, Dan" <user-290ce4e24e19@xymon.invalid> wrote on 16/06/2009 05:40:34 AM:
[image removed] 
Re: [hobbit] info column

McDonald, Dan 
to:

hobbit

16/06/2009 05:45 AM

Please respond to hobbit
quoted from Kevin Kelly

On Mon, 2009-06-15 at 13:50 -0500, Kevin Kelly wrote:
Just wondering if there is a way to put extra information on this
page. Like what apps on this system, owner of the app, etc.
The xymon notes editor would probably be the best fit.
http://www.trantor.org/xymonton/doku.php/addons:hobbitnotes


-- 
Daniel J McDonald, CCIE # 2495, CISSP # 78281, CNX
www.austinenergy.com

[attachment "signature.asc" deleted by David Peters/DPI/NSW] 
This message is intended for the addressee named and may contain confidential information. If you are not the intended recipient, please delete it and notify the sender. Views expressed in this message are those of the individual sender, and are not necessarily the views of their organisation.