Xymon Mailing List Archive search

Page level conn status Purple, test level status Green

5 messages in this thread

list David Boldt · Thu, 13 Oct 2016 17:36:38 -0400 ·
On the Display page we have a server where the conn test is persistently Purple,
but if one clicks on the status, the detailed Status page shows Green.

The DNS configuration may be causing issues:

  % dig +noall +answer natwebvaii01.er.usgs.gov
  natwebvaii01.er.usgs.gov. 43200 IN      CNAME   igsaaahigsvaii1.er.usgs.gov.
  igsaaahigsvaii1.er.usgs.gov. 300 IN     A       130.11.9.89

  % dig +noall +answer -x 130.11.9.96
  96.9.11.130.in-addr.arpa. 43200 IN      PTR     natwebvaii04.er.usgs.gov.

Hoped to be able to address this with hosts.cfg entry:

  130.11.9.89     natwebvaii01.er.usgs.gov        #
CLIENT:igsaaahigsvaii1.er.usgs.gov

alas no.

I'm not clear enough on how the status that is seen for an individual test is
propagated up to a page view to know where to look when there is a conflict.


-- 
                                         -- David Boldt
                                            <user-945c7be1b8e4@xymon.invalid>


  "A complex system that works is invariably found to have evolved
from a simple system that works."
    --John Gaule
list John Thurston · Thu, 13 Oct 2016 13:46:51 -0800 ·
quoted from David Boldt
On 10/13/2016 1:36 PM, Boldt, David wrote:
On the Display page we have a server where the conn test is persistently Purple,
but if one clicks on the status, the detailed Status page shows Green.

The DNS configuration may be causing issues:

  % dig +noall +answer natwebvaii01.er.usgs.gov
  natwebvaii01.er.usgs.gov. 43200 IN      CNAME   igsaaahigsvaii1.er.usgs.gov.
  igsaaahigsvaii1.er.usgs.gov. 300 IN     A       130.11.9.89

  % dig +noall +answer -x 130.11.9.96
  96.9.11.130.in-addr.arpa. 43200 IN      PTR     natwebvaii04.er.usgs.gov.
I don't understand what the PTR record for 130.11.9.96 has to do with 
the A or CNAME resolving to 130.11.9.89
quoted from David Boldt
Hoped to be able to address this with hosts.cfg entry:

  130.11.9.89     natwebvaii01.er.usgs.gov        #
CLIENT:igsaaahigsvaii1.er.usgs.gov
If you want to use the IP address you've specified, you either need to 
tag the line with "testip" or run xymonnet with the appropriate option 
to disregard DNS resolution.

I don't think
   0.0.0.0  foo.bar.com #
behaves any differently than
   1.2.3.4  foo.bar.com #
until the tag "testip" is appended
   1.2.3.4  foo.bar.com # testip

-- 
    Do things because you should, not just because you can.

John Thurston    XXX-XXX-XXXX
user-ce4d79d99bab@xymon.invalid
Enterprise Technology Services
Department of Administration
State of Alaska
list Japheth Cleaver · Fri, 14 Oct 2016 17:57:28 -0700 ·
quoted from John Thurston

On Thu, October 13, 2016 2:36 pm, Boldt, David wrote:
On the Display page we have a server where the conn test is persistently
Purple,
but if one clicks on the status, the detailed Status page shows Green.
*snip*
quoted from David Boldt
I'm not clear enough on how the status that is seen for an individual test
is
propagated up to a page view to know where to look when there is a
conflict.

David,

This is typically a sign that there's a problem with xymongen not being
able to overwrite the display page with the most current status. Check the
timestamp on the top of the Display page and compare it with the current
status by clicking through to the Status page. If it's more than whatever
the INTERVAL is configured for xymongen in tasks.cfg, there's likely
something unusual going on.

As a sanity check, pull up the status history page to verify that the
status message actually *is* coming through properly.... If you see
alternating purple/green's, there's probably a different problem, but I
suspect it's more a xymongen issue.

Check out any errors in xymongen.log either in retrieving the current
board status from xymond or in writing updated pages out to your web
directory.

Are you seeing this on an assortment of pages? And are there any unusual
warnings on the 'xymongen' status page itself for this server?


Regards,
-jc
list David Boldt · Wed, 26 Oct 2016 16:18:00 -0400 ·
hosts.cfg entry (updated with "testip"):

130.11.9.89     natwebvaii01.er.usgs.gov        #
CLIENT:igsaaahigsvaii1.er.usgs.gov testip

forward lookup:
quoted from John Thurston

% dig +noall +answer natwebvaii01.er.usgs.gov
natwebvaii01.er.usgs.gov. 43200 IN      CNAME   igsaaahigsvaii1.er.usgs.gov.
igsaaahigsvaii1.er.usgs.gov. 300 IN     A       130.11.9.89

reverse lookup

% dig +noall +answer -x 130.11.9.89
89.9.11.130.in-addr.arpa. 300   IN      PTR     igsaaahigsvaii1.er.usgs.gov.


The test does seem to be occurring since the conn page shows green.
But this is not being reflected on the parent page; where a purple
icon is displayed.
quoted from John Thurston


On Thu, Oct 13, 2016 at 5:46 PM, John Thurston <user-ce4d79d99bab@xymon.invalid> wrote:
On 10/13/2016 1:36 PM, Boldt, David wrote:
On the Display page we have a server where the conn test is persistently
Purple,
but if one clicks on the status, the detailed Status page shows Green.

The DNS configuration may be causing issues:

  % dig +noall +answer natwebvaii01.er.usgs.gov
  natwebvaii01.er.usgs.gov. 43200 IN      CNAME
igsaaahigsvaii1.er.usgs.gov.
  igsaaahigsvaii1.er.usgs.gov. 300 IN     A       130.11.9.89

  % dig +noall +answer -x 130.11.9.96
  96.9.11.130.in-addr.arpa. 43200 IN      PTR
natwebvaii04.er.usgs.gov.

I don't understand what the PTR record for 130.11.9.96 has to do with the A
or CNAME resolving to 130.11.9.89
Hoped to be able to address this with hosts.cfg entry:

  130.11.9.89     natwebvaii01.er.usgs.gov        #
CLIENT:igsaaahigsvaii1.er.usgs.gov

If you want to use the IP address you've specified, you either need to tag
the line with "testip" or run xymonnet with the appropriate option to
disregard DNS resolution.

I don't think
  0.0.0.0  foo.bar.com #
behaves any differently than
  1.2.3.4  foo.bar.com #
until the tag "testip" is appended
  1.2.3.4  foo.bar.com # testip

--
   Do things because you should, not just because you can.

John Thurston    XXX-XXX-XXXX
user-ce4d79d99bab@xymon.invalid
Enterprise Technology Services
Department of Administration
State of Alaska
-- 

                                         -- David Boldt
                                            <user-945c7be1b8e4@xymon.invalid>


  "To die for an idea; it is unquestionably noble. But how much nobler
it would be if men died for ideas that were true!"
    --H. L. Mencken (1880 - 1956)
list David Boldt · Thu, 27 Oct 2016 17:41:20 -0400 ·
I might be able to help debug the problem myself if I knew how xymon
determines the alert levels for each test so that it can set icons
appropriately on the summary pages.
quoted from David Boldt


On Wed, Oct 26, 2016 at 4:18 PM, Boldt, David <user-945c7be1b8e4@xymon.invalid> wrote:
hosts.cfg entry (updated with "testip"):

130.11.9.89     natwebvaii01.er.usgs.gov        #
CLIENT:igsaaahigsvaii1.er.usgs.gov testip

forward lookup:

% dig +noall +answer natwebvaii01.er.usgs.gov
natwebvaii01.er.usgs.gov. 43200 IN      CNAME
igsaaahigsvaii1.er.usgs.gov.
igsaaahigsvaii1.er.usgs.gov. 300 IN     A       130.11.9.89

reverse lookup

% dig +noall +answer -x 130.11.9.89
89.9.11.130.in-addr.arpa. 300   IN      PTR
igsaaahigsvaii1.er.usgs.gov.


The test does seem to be occurring since the conn page shows green.
But this is not being reflected on the parent page; where a purple
icon is displayed.


On Thu, Oct 13, 2016 at 5:46 PM, John Thurston <user-ce4d79d99bab@xymon.invalid>
wrote:
On 10/13/2016 1:36 PM, Boldt, David wrote:
On the Display page we have a server where the conn test is persistently
Purple,
but if one clicks on the status, the detailed Status page shows Green.

The DNS configuration may be causing issues:

  % dig +noall +answer natwebvaii01.er.usgs.gov
  natwebvaii01.er.usgs.gov. 43200 IN      CNAME
igsaaahigsvaii1.er.usgs.gov.
  igsaaahigsvaii1.er.usgs.gov. 300 IN     A       130.11.9.89

  % dig +noall +answer -x 130.11.9.96
  96.9.11.130.in-addr.arpa. 43200 IN      PTR
natwebvaii04.er.usgs.gov.

I don't understand what the PTR record for 130.11.9.96 has to do with
the A
or CNAME resolving to 130.11.9.89
Hoped to be able to address this with hosts.cfg entry:

  130.11.9.89     natwebvaii01.er.usgs.gov        #
CLIENT:igsaaahigsvaii1.er.usgs.gov

If you want to use the IP address you've specified, you either need to
tag
the line with "testip" or run xymonnet with the appropriate option to
disregard DNS resolution.

I don't think
  0.0.0.0  foo.bar.com #
behaves any differently than
  1.2.3.4  foo.bar.com #
until the tag "testip" is appended
  1.2.3.4  foo.bar.com # testip

--
   Do things because you should, not just because you can.

John Thurston    XXX-XXX-XXXX
user-ce4d79d99bab@xymon.invalid
Enterprise Technology Services
Department of Administration
State of Alaska
--
                                         -- David Boldt
                                            <user-945c7be1b8e4@xymon.invalid>


  "To die for an idea; it is unquestionably noble. But how much nobler
it would be if men died for ideas that were true!"
    --H. L. Mencken (1880 - 1956)
-- 
                                         -- David Boldt
                                            <user-945c7be1b8e4@xymon.invalid>


  "To die for an idea; it is unquestionably noble. But how much nobler it
would be if men died for ideas that were true!"
    --H. L. Mencken (1880 - 1956)