Xymon Mailing List Archive search

Acked alerts occasionally continuing to email?

5 messages in this thread

list Betsy Schwartz · Mon, 27 Oct 2014 09:12:54 -0400 ·
Running xymon 4.3.17on rhel6

We are occasionally seeing custom test yellow alerts continue to email
after being ack'ed. It's been hard to pin down, because if I send a green
and then let them go yellow again I am not seeing the problem recur. They
appear as acked in the web interface.

Wondering if anyone else has seen anything like this?

(major boss-annoyance factor here)

thanks Betsy
list Mark Felder · Mon, 27 Oct 2014 09:32:04 -0500 ·
quoted from Betsy Schwartz

On Mon, Oct 27, 2014, at 08:12, Betsy Schwartz wrote:
Running xymon 4.3.17on rhel6

We are occasionally seeing custom test yellow alerts continue to email
after being ack'ed. It's been hard to pin down, because if I send a green
and then let them go yellow again I am not seeing the problem recur. They
appear as acked in the web interface.

Wondering if anyone else has seen anything like this?

(major boss-annoyance factor here)

thanks Betsy
Yes and unfortunately I find it hard to locate someone with the same
problem. I don't have access to that Xymon server anymore, so I can't
compare notes.
list Betsy Schwartz · Tue, 28 Oct 2014 09:14:10 -0400 ·
This is getting to be  a REALLY BIG problem for us.
I've got two alerts that keep emailing after ack, but not consistently
my boss's boss wants me to make fixing this my next #1 priority


Anyone else experiencing this or have any thoughts on what might be
triggering it?
quoted from Mark Felder


On Mon, Oct 27, 2014 at 10:32 AM, Mark Felder <user-db141d317836@xymon.invalid> wrote:
On Mon, Oct 27, 2014, at 08:12, Betsy Schwartz wrote:
Running xymon 4.3.17on rhel6

We are occasionally seeing custom test yellow alerts continue to email
after being ack'ed. It's been hard to pin down, because if I send a green
and then let them go yellow again I am not seeing the problem recur. They
appear as acked in the web interface.

Wondering if anyone else has seen anything like this?

(major boss-annoyance factor here)

thanks Betsy
Yes and unfortunately I find it hard to locate someone with the same
problem. I don't have access to that Xymon server anymore, so I can't
compare notes.

list Mark Felder · Tue, 28 Oct 2014 08:16:36 -0500 ·
quoted from Betsy Schwartz

On Tue, Oct 28, 2014, at 08:14, Betsy Schwartz wrote:
This is getting to be  a REALLY BIG problem for us.
I've got two alerts that keep emailing after ack, but not consistently
my boss's boss wants me to make fixing this my next #1 priority


Anyone else experiencing this or have any thoughts on what might be
triggering it?

I recall running into this once where the directory the ack database is
stored in wasn't writable or didn't exist, so the ack existed in memory
but the alerts couldn't read the database and just sent alerts anyway.

I'm pretty sure this is what I hit once, but that was a few years ago.
I've long since solved that problem permanently and have run into other
mysterious alert problems I can't explain.

It might be worth checking into this first.
list Betsy Schwartz · Fri, 31 Oct 2014 14:07:13 -0400 ·
Verified that permissions are OK on the entire tree

Not sure if it's related but we're also seeing an issue where sometimes
when we go to ACK a custom test, we see "No Active Alerts"  and can't ack,
or "No Acks Requested" after ack


(This is driving my boss's boss nuts, he's going to push us to Nagios if we
can't reassure him that Xymon is working correctly)
quoted from Mark Felder


On Tue, Oct 28, 2014 at 9:16 AM, Mark Felder <user-db141d317836@xymon.invalid> wrote:
On Tue, Oct 28, 2014, at 08:14, Betsy Schwartz wrote:
This is getting to be  a REALLY BIG problem for us.
I've got two alerts that keep emailing after ack, but not consistently
my boss's boss wants me to make fixing this my next #1 priority


Anyone else experiencing this or have any thoughts on what might be
triggering it?

I recall running into this once where the directory the ack database is
stored in wasn't writable or didn't exist, so the ack existed in memory
but the alerts couldn't read the database and just sent alerts anyway.

I'm pretty sure this is what I hit once, but that was a few years ago.
I've long since solved that problem permanently and have run into other
mysterious alert problems I can't explain.

It might be worth checking into this first.