The hobbit server turns a status purple when it hasn't received an
update for it within a configurable period of time. Mine is set to 30
minutes, and I think the default is somewhere around that. Assuming the
default is 30 minutes, this means that if everything went purple 25
minutes ago, the tests haven't reported to the server in 55 minutes.
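To make the arithmetic explicit, here is a minimal Python sketch. The
function and parameter names are made up for illustration, and the
30-minute timeout is only the assumed default mentioned above.

```python
def minutes_since_last_report(purple_timeout_min, purple_for_min):
    """Estimate how long ago the last status update arrived, in minutes.

    purple_timeout_min: how long the server waits without an update
                        before turning a status purple (assumed value).
    purple_for_min:     how long the status has already been purple.
    """
    return purple_timeout_min + purple_for_min


# Assumed 30-minute timeout, purple for the last 25 minutes.
print(minutes_since_last_report(30, 25))  # prints 55
```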
I would suggest making sure the clients can connect to the server, e.g.:
telnet bbserver 1984
This tests TCP connectivity to port 1984, the port hobbit expects
client updates on. Of the millions of things that could cause this,
I'd say server connectivity (default gateway, IP address), firewall
ACLs, or the hobbit server no longer running are the most likely.
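If telnet isn't installed on a client, the same TCP check can be
sketched in Python. This is only a rough equivalent of the telnet test:
the helper name can_connect is made up, and it only shows that the port
accepts connections, not that hobbit itself is healthy.

```python
import socket


def can_connect(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        # create_connection resolves the name and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


# Example, using the host name and port from the telnet command above:
#   can_connect("bbserver", 1984)
```

A False result points at the routing, firewall, or "server not running"
causes listed above; a True result means the problem is elsewhere.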
Good luck
Dan
On 10/31/05, Rob Munsch <user-f39e4aae1456@xymon.invalid> wrote:
Consider the below. Approx. 25 minutes ago, across all monitored
systems, all net monitored services - ssh, ldaps and dns - went to
purple. They are still up, running, and just fine in every respect.
The status message is even the same as when it was showing green. But
now every ssh, ldaps and dns light is purple.
The last thing I was messing with when this happened was the alerts
config file; I hadn't touched bb-hosts. cpu, disk, memory etc. all
remain green.
I cannot find anything in the logs that indicated what changed 25
minutes ago. I have restarted the hobbitd. Something like this seemed
to happen yesterday; after a number of monitored services were green and
unchanging for a while, they went purple, yet report as "OK" across the
board. While tweaking other settings, everything went back to normal.
I don't understand what could cause this, or why it's displaying the
purple light when it "knows" it's fine.
Any ideas?
Mon Oct 31 16:30:06 2005 ssh ok
Service ssh on <machinename> is OK (up)
SSH-<ver>-OpenSSH_<ver>
Seconds: 0.00
Status unchanged in 0 hours, 25 minutes
Status message received from hobbitd
--
Rob Munsch
Systems Analyst, Solutions for Progress
http://www.solutionsforprogress.com