Xymon Mailing List Archive search

purple status for one server and hobbitfetch?

5 messages in this thread

list Cade Robinson · Fri, 05 Feb 2010 11:17:28 -0600 ·
I have one server that keeps coming up purple and I can't get rid of it.
The ping it working and it is alerting when it goes red but it just
stays purple all the time.

When I look at the conn page it shows:
Status message received from hobbitd
rather than
Status message received from 172.16.225.176

Also as a test someone setup hobbit on this machine at one time to see
how hobbit worked without screwing up the current real hobbit server.
Since then the machine in question had hobbit server removed and is now
just a client.

I have tried to drop the conn column and also the whole server and also
removed all files with the server name in hist and histlogs.  It still
comes back purple and is the only one.

I can't find any file in server/etc that has its name or IP.

Any thoughts on why?

Also we have clients in a DMZ we want monitored and are using
hobbitfetch for that.  Once I fixed the core dumping of hobbitfetch and
it is staying up the status is still purple.
Anyone know how to get that not purple?
Is it not reporting a status to something? I was trying to figure out
how status of the daemons was determined and haven't found it yet but my
guess is that hobbitfetch isn't reporting something it should.

Thanks
Cade
list Wiskbroom · Fri, 5 Feb 2010 13:59:42 -0500 ·
quoted from Cade Robinson
I have one server that keeps coming up purple and I can't get rid of it.
The ping it working and it is alerting when it goes red but it just
stays purple all the time.

When I look at the conn page it shows:
Status message received from hobbitd
rather than
Status message received from 172.16.225.176
Can you access this machine and determine who it's server is? 

Since you'd stated below that you have nothing in your server etc directory with it, I am guessing you had a two-server setup at one time, the "other" server may be sharing its data with yours.  You may also want to see how you have BBDISP setup, whether its setup as one or more than one BBDISP.

list Cade Robinson · Fri, 05 Feb 2010 13:25:48 -0600 ·
We didn't have a "true" two server setup.
There was the main server and then this test one.

On the main server the test server was setup as a client but had server
software on it.
The main server picked up that the test server had server software on
it.

Since then the server software on the test server was removed and client
software installed.

I checked the BBDISP and that is set right.  It points to the main
server.

This test should be just the ping test.
I can ping the client just fine from the hobbit server and if the ping
fails we do get red alerts.
I found that out when a network port got changed on accident to a
different vlan.
So the ping is working it just isn't reporting the correct result.
quoted from Wiskbroom

On Fri, 2010-02-05 at 13:59 -0500, user-ddebaeecde97@xymon.invalid wrote:
I have one server that keeps coming up purple and I can't get rid of it.
The ping it working and it is alerting when it goes red but it just
stays purple all the time.

When I look at the conn page it shows:
Status message received from hobbitd
rather than
Status message received from 172.16.225.176
Can you access this machine and determine who it's server is? 

Since you'd stated below that you have nothing in your server etc directory with it, I am guessing you had a two-server setup at one time, the "other" server may be sharing its data with yours.  You may also want to see how you have BBDISP setup, whether its setup as one or more than one BBDISP.

list Wiskbroom · Fri, 5 Feb 2010 14:46:08 -0500 ·
quoted from Cade Robinson
We didn't have a "true" two server setup.

There was the main server and then this test one.
This is defined in your bb-hosts like this, no?

172.16.225.176   host-i-want-to-ping.org   # conn

I can tell you that I was getting similar results when I had two hosts defined as BBDISPLAYS, and BBDISP set to 0.0.0.0

Just to recap, you have a host, 172.16.225.176, that is NOT configured to send to your xymon server, but for some reason your Xymon server is reporting it, and it is purple, is this correct?
list Cade Robinson · Fri, 05 Feb 2010 15:26:43 -0600 ·
Lets start over... :)
I guess using machine name examples would help.

monitor is the main hobbit server - IP 172.16.225.176
28512dbtst is the client that keeps going purple.  IP 172.16.225.182

"monitor" is setup to monitor "28512dbtst".
At one point someone put the hobbit server software on "28512dbtst".
"monitor" figured out automatically that server software was running on
"28512dbtst".
"28512dbtst" and "monitor" were never setup in a dual server way.  They
were separate except that "monitor" was monitoring "28512dbtst".

We got the server software removed from "28512dbtst" and the client
software installed.
We dropped "28512dbtst" (bb 127.0.0.1 "drop 28512dbtst") on "monitor"
and let "monitor" rediscover "28512dbtst"

Now out of 30+ servers being monitored by "monitor", "28512dbtst" is the
only one that shows up purple.
Even though "28512dbtst" has the same client software as everything else
and is setup to push the data to "monitor"

I also just tried to drop "28512dbtst" and stop the client software on
it and let "monitor" just ping it.
Still purple, but conn is ok:

Fri Feb 5 15:20:44 2010 conn ok 

Service conn on 28512dbtst is OK (up)


green 172.16.225.182 is alive (4.31 ms)
   
Status unchanged in 0 hours, 4 minutes
Status message received from hobbitd
quoted from Wiskbroom


On Fri, 2010-02-05 at 14:46 -0500, user-ddebaeecde97@xymon.invalid wrote: 
We didn't have a "true" two server setup.

There was the main server and then this test one.
This is defined in your bb-hosts like this, no?

172.16.225.176   host-i-want-to-ping.org   # conn

I can tell you that I was getting similar results when I had two hosts defined as BBDISPLAYS, and BBDISP set to 0.0.0.0

Just to recap, you have a host, 172.16.225.176, that is NOT configured to send to your xymon server, but for some reason your Xymon server is reporting it, and it is purple, is this correct?