Xymon Mailing List Archive

BBTest Error

9 messages in this thread

list David Gilmore · Fri, 30 Jun 2006 18:11:17 -0400 ·
Pardon my ignorance, but what does this mean?
 
Error output:
Timeout waiting for data from child, killing it
Timeout waiting for data from child, killing it
Child process terminated with signal 15

 
And can I apply it to my five year old when she is not doing what I ask?  :)
 
David Gilmore
Consultant
Stenhouse Consulting, LLC.
X Traverse St
Providence, RI   XXXXX
XXX.XXX.XXXX x12
XXX.XXX.XXXX (fax)
list Henrik Størner · Thu, 6 Jul 2006 12:53:23 +0200 ·
quoted from David Gilmore
On Fri, Jun 30, 2006 at 06:11:17PM -0400, David Gilmore wrote:
> Pardon my ignorance, but what does this mean?
>
> Error output:
> Timeout waiting for data from child, killing it
> Timeout waiting for data from child, killing it
> Child process terminated with signal 15
It is a message that can occur for the network tests (ntp, rpc and
traceroute) that rely on external programs rather than being implemented
natively by Hobbit. It means that the program did not finish testing in
30 seconds, so Hobbit assumes it's hung waiting for some network I/O,
and decides to treat this as a timeout error.

It's a safety measure to prevent e.g. a hung traceroute from stopping
all of the network tests.

If 30 seconds is not enough, you can raise that with the --cmdtimeout=N
option for bbtest-net.

> And can I apply it to my five year old when she is not doing what I ask?  :)
Perhaps ... but you'd better check with your local child-abuse
authorities first :-)


Regards,
Henrik
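
[Editor's sketch] The safety measure described in this reply can be illustrated roughly as follows: the parent waits a bounded time for an external test command and, if the command has not finished, terminates it with SIGTERM, which is signal 15 on POSIX systems. This is not Hobbit's own code (bbtest-net is a C program); the function name run_with_timeout and the way the messages are collected are assumptions made purely for illustration, with the message wording copied from the error output quoted above.

```python
import subprocess
import sys


def run_with_timeout(cmd, timeout_seconds=30):
    """Run cmd and give up on it after timeout_seconds.

    Returns (output, messages). The messages mimic the lines quoted in
    this thread; the real bbtest-net logic may differ in detail.
    """
    proc = subprocess.Popen(
        cmd,
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
        text=True,
    )
    messages = []
    try:
        output, _ = proc.communicate(timeout=timeout_seconds)
    except subprocess.TimeoutExpired:
        messages.append("Timeout waiting for data from child, killing it")
        proc.terminate()  # sends SIGTERM (signal 15) on POSIX
        output, _ = proc.communicate()  # reap the child, collect any output
    # On POSIX a negative return code means "killed by that signal number".
    if proc.returncode < 0:
        messages.append(
            f"Child process terminated with signal {-proc.returncode}"
        )
    return output, messages


if __name__ == "__main__":
    # Stand-in for a hung traceroute: a child that sleeps far too long.
    hung = [sys.executable, "-c", "import time; time.sleep(60)"]
    _, msgs = run_with_timeout(hung, timeout_seconds=1)
    print("\n".join(msgs))
```

Raising the limit with --cmdtimeout=N corresponds to passing a larger timeout_seconds in this sketch.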
list David Gilmore · Thu, 6 Jul 2006 13:16:04 -0400 ·
Henrik,

Ok.  I had implemented TRACEROUTE in the BBHOSTS file recently.  It was one
of those inconsistent errors that had me puzzled.

Dave
 

list Larry Barber · Mon, 20 Nov 2006 14:24:32 -0600 ·
I am occasionally seeing the following error report from bbtest:

IO error in SSL_connect to https (47873/tcp) on host x.y.z.a:
error:00000000:lib(0):func(0):reason(0)


Does anybody know what this signifies?

(my apologies for the duplicate post)

Thanks,
Larry Barber
list Dan Vande More · Mon, 20 Nov 2006 14:42:33 -0600 ·
I used to see this error w/ squid when using self-generated or chained
certs. I always figured it was something to do with the client not accepting
the cert or having an error checking the chain of trust.
list Larry Barber · Mon, 20 Nov 2006 16:13:16 -0600 ·
That doesn't seem to apply here: the cert is from Verisign and I don't
believe that it is chained. These are just intermittent failures that last
from one round of testing to several minutes, then go away. The websites
associated with the test also have problems when this error is reported, so I
don't think this is a Hobbit problem. I'm just trying to help with the
troubleshooting.

Thanks,
Larry Barber
list Henrik Størner · Mon, 20 Nov 2006 23:20:41 +0100 ·
quoted from Larry Barber
On Mon, Nov 20, 2006 at 04:13:16PM -0600, Larry Barber wrote:
> That doesn't seem to apply here: the cert is from Verisign and I don't
> believe that it is chained. These are just intermittent failures that last
> from one round of testing to several minutes, then go away. The websites
> associated with the test also have problems when this error is reported, so I
> don't think this is a Hobbit problem. I'm just trying to help with the
> troubleshooting.
I've also seen this occasionally with https tests. The frustrating thing
is that OpenSSL reports an error, but doesn't provide any information
whatsoever about what the error is (that's the "error:0...." message).
But since it happens when calling SSL_connect, I suppose there is some 
sort of problem with the SSL handshake (this error happens *after*
Hobbit managed to connect at the network - TCP/IP - layer, so the
connection to the webserver has been established). It could be a lack of
entropy (randomness) on the Hobbit server so it cannot generate a
session key, but that is just a wild guess - and it doesn't really sound
like something that would happen.


Regards,
Henrik
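
[Editor's sketch] To check the point made in this reply, that the failure happens in the SSL handshake and not at the TCP layer, the two steps can be probed separately. The sketch below is an assumption-laden illustration, not what bbtest-net does internally: probe_https is a hypothetical helper, and it uses Python's ssl module with default certificate verification. As far as I know, an all-zero "error:00000000" string usually means OpenSSL's error queue was empty, which is commonly seen when the peer closes or stalls the connection during the handshake; treat that as a hint, not a diagnosis.

```python
import socket
import ssl


def probe_https(host, port=443, timeout=10.0):
    """Report which stage of an HTTPS connection fails.

    Returns (stage, detail) where stage is "tcp", "tls", or "ok".
    """
    # Stage 1: plain TCP connect, as in the network-layer step above.
    try:
        sock = socket.create_connection((host, port), timeout=timeout)
    except OSError as exc:
        return "tcp", str(exc)

    # Stage 2: TLS handshake on the already-connected socket.
    context = ssl.create_default_context()
    try:
        with context.wrap_socket(sock, server_hostname=host) as tls:
            return "ok", tls.version()
    except (ssl.SSLError, OSError) as exc:
        # Includes certificate errors, resets, EOF and handshake timeouts.
        return "tls", str(exc)
    finally:
        sock.close()  # harmless if the TLS wrapper already closed it


if __name__ == "__main__":
    # "www.example.com" is a placeholder; substitute the host under test.
    print(probe_https("www.example.com"))
```

Running this in a loop while the website is misbehaving would show whether the server accepts TCP connections but stalls or drops them during the handshake, which would point at the webserver rather than at the monitoring host.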
list Larry Barber · Mon, 20 Nov 2006 16:30:35 -0600 ·
I don't think it has anything to do with Hobbit or the Hobbit server, as the
website goes down when we see this error as well.

Thanks,
Larry Barber
list Larry Barber · Tue, 21 Nov 2006 09:30:41 -0600 ·
Is it possible that this error might be caused by a lack of entropy on the
webserver being tested?

Thanks,
Larry Barber
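
[Editor's sketch] One way to test the entropy guess on a Linux webserver is to read the kernel's entropy estimate while the error is occurring. The path below is the standard Linux procfs location; the helper name read_entropy_avail is hypothetical, and on other operating systems the file will not exist. On recent Linux kernels the reported value may be essentially constant, so this is mainly useful on older systems such as those current when this thread was written.

```python
from pathlib import Path

# Linux-specific; assumed not to exist on other platforms.
ENTROPY_PATH = "/proc/sys/kernel/random/entropy_avail"


def read_entropy_avail(path=ENTROPY_PATH):
    """Return the kernel's entropy estimate in bits, or None if unavailable."""
    try:
        return int(Path(path).read_text().strip())
    except (OSError, ValueError):
        return None


if __name__ == "__main__":
    bits = read_entropy_avail()
    if bits is None:
        print("entropy estimate not available on this system")
    else:
        print(f"entropy_avail: {bits} bits")
```

A value that stays very low during the failures would make the entropy theory more plausible; a healthy value would suggest looking elsewhere.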