Xymon Mailing List Archive search

Conn test failing

4 messages in this thread

list Jeff Stuart · Mon, 07 Nov 2005 03:30:20 -0500 ·
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
 
Ok, I've got some WEIRD problems here.  My conn test (IE fping) is
failing on the BBNET server itself!  It's going red for no apparent
reason.  Here's the results of the page back to me:

red  Mon Nov  7 02:00:25 2005 conn NOT ok

Service conn on 123.123.123.123 is not OK : Host does not respond to ping


System unreachable for 70 poll periods (20867 seconds)

&red 123.123.123.123 is unreachable
Traceroute results:
traceroute to 123.123.123.123 (123.123.123.123), 30 hops max, 38 byte
packets
 1  server3 (123.123.123.123)  0.075 ms  0.026 ms  0.020 ms

And I have this problem with other servers too where the ping doesn't
apparently go through yet the traceroute that gets run shows it DOES
get through.  Here's an example of that:

red  Mon Nov  7 01:55:25 2005 conn NOT ok

Service conn on 124.124.124.124 is not OK : Host does not respond to ping


System unreachable for 64 poll periods (19054 seconds)

&red 124.124.124.124 is unreachable
Traceroute results:
traceroute to 124.124.124.124 (124.124.124.124), 30 hops max, 38 byte
packets
 1  69.45.152.2 (122.122.122.122)  0.459 ms  0.545 ms  0.600 ms
 2  server2.myinternetservices.com (124.124.124.124)  1.085 ms  1.450 ms *

As you can see, the traceroute DOES go through, if I hop onto the
bbnet server and immediately do an fping, the fping goes through AND I
can even ssh into the client server with no problems.  So I don't know
what's going on or how to fix it.  Any ideas anyone?

- --
Jeff Stuart
Network Admin
MyInternetServices
1-800-300-HOST
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.2 (MingW32)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org
 
iD8DBQFDbxCbjkq2AIfBlPgRAq/uAJ49XK3Xcq8gAWHvn3WYL66PV+dfcACgx3Up
p2hI853dCzma6yK50/Pgh34=
=ouoR
-----END PGP SIGNATURE-----


--
list Figaro Nicolas · Mon, 7 Nov 2005 09:44:52 +0100 ·
Hi, 
Did you check with the standard ping command that runs with your OS ? Could you also check to run the fping command on your server. 
N F
quoted from Jeff Stuart
-----Message d'origine-----
De : Jeff Stuart [mailto:user-2aec98d44f5e@xymon.invalid] Envoyé : lundi 7 novembre 2005 09:30
À : user-ae9b8668bcde@xymon.invalid
Objet : [hobbit] Conn test failing


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
 Ok, I've got some WEIRD problems here.  My conn test (IE fping) is failing on the BBNET server itself!  It's going red for no apparent reason.  Here's the results of the page back to me:

red  Mon Nov  7 02:00:25 2005 conn NOT ok

Service conn on 123.123.123.123 is not OK : Host does not respond to ping


System unreachable for 70 poll periods (20867 seconds)

&red 123.123.123.123 is unreachable
Traceroute results:
traceroute to 123.123.123.123 (123.123.123.123), 30 hops max, 38 byte packets  1  server3 (123.123.123.123)  0.075 ms  0.026 ms  0.020 ms

And I have this problem with other servers too where the ping doesn't apparently go through yet the traceroute that gets run shows it DOES get through.  Here's an example of that:

red  Mon Nov  7 01:55:25 2005 conn NOT ok

Service conn on 124.124.124.124 is not OK : Host does not respond to ping


System unreachable for 64 poll periods (19054 seconds)

&red 124.124.124.124 is unreachable
Traceroute results:
traceroute to 124.124.124.124 (124.124.124.124), 30 hops max, 38 byte packets  1  69.45.152.2 (122.122.122.122)  0.459 ms  0.545 ms  0.600 ms  2  server2.myinternetservices.com (124.124.124.124)  1.085 ms  1.450 ms *

As you can see, the traceroute DOES go through, if I hop onto the bbnet server and immediately do an fping, the fping goes through AND I can even ssh into the client server with no problems.  So I don't know what's going on or how to fix it.  Any ideas anyone?

- --
Jeff Stuart
Network Admin
MyInternetServices
1-800-300-HOST
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.2 (MingW32)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org
 iD8DBQFDbxCbjkq2AIfBlPgRAq/uAJ49XK3Xcq8gAWHvn3WYL66PV+dfcACgx3Up
p2hI853dCzma6yK50/Pgh34=
=ouoR
-----END PGP SIGNATURE-----


-- 

Date: 11/2/2005

list Henrik Størner · Mon, 7 Nov 2005 10:56:18 +0100 ·
quoted from Jeff Stuart
On Mon, Nov 07, 2005 at 03:30:20AM -0500, Jeff Stuart wrote:
 
Ok, I've got some WEIRD problems here.  My conn test (IE fping) is
failing on the BBNET server itself!  It's going red for no apparent
reason.  Here's the results of the page back to me:

red  Mon Nov  7 02:00:25 2005 conn NOT ok

Service conn on 123.123.123.123 is not OK : Host does not respond to ping
Have you tried running fping on the BBNET server using both the hostname
and the IP address ? Hobbit uses the hostname, and if that gets resolved
to something other than the IP address you expect, strange things can 
happen.

A telltale sign is if the IP address reported in the "conn" status is
something other than what you would expect.


Does it also fail if you login to the BBNET server and run

   bbcmd bbtest-net --debug --ping HOSTNAME.OF.BBNET.SERVER

The debug output includes any errors that Hobbit has when trying to run
fping. It also saves some fping-std* files in the 
~hobbit/server/tmp/ directory - those might be interesting 
to look at.


Regards,
Henrik
list Jeff Stuart · Mon, 07 Nov 2005 14:02:07 -0500 ·
quoted from Henrik Størner
Henrik Stoerner wrote:
Have you tried running fping on the BBNET server using both the hostname
and the IP address ? Hobbit uses the hostname, and if that gets resolved
to something other than the IP address you expect, strange things can 
happen.

A telltale sign is if the IP address reported in the "conn" status is
something other than what you would expect.


Does it also fail if you login to the BBNET server and run

  bbcmd bbtest-net --debug --ping HOSTNAME.OF.BBNET.SERVER

The debug output includes any errors that Hobbit has when trying to run
fping. It also saves some fping-std* files in the 
~hobbit/server/tmp/ directory - those might be interesting 
to look at.


Regards,
Henrik

I actually don't use the hostnames in my bb-hosts file.  IE all machines are listed with their IP for both IP and hostname.

I'll try the bbcmd debug and see what's going on.

As far (to answer the other email) as running fping, yes I am able to run fping.

-- 
Jeff Stuart
Network Admin 
MyInternetServices
1-800-300-HOST


--