Xymon Mailing List Archive search

HA solutions

8 messages in this thread

list Gary Baluha · Tue, 10 Feb 2009 10:36:53 -0500 ·
I know there was a thread on this a while back, and if I recall, there was
mention that the 4.3.0 line would have features for HA.

Anyway, I've been working on implementing a 2-node HA Xymon cluster using
LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD,
though).  The Apache and Xymon services fail over to the other node fairly
nicely, and I have successfully testing that using a shared IP to the Xymon
web page works.  It's still in the early stages of implementation and
testing, but so far it appears hold hold promise.

I'd be interesting in hearing about other HA solutions that have been
implemented (if any).  I don't recall previous mention of LinuxHA, so if no
one else has done it, I'll be glad to update the list with my progress.
list Gary Baluha · Tue, 10 Feb 2009 10:46:43 -0500 ·
Oh yes, forgot about this:
http://en.wikibooks.org/wiki/System_Monitoring_with_Hobbit/Administration_Guide#Hobbit_in_HA
quoted from Gary Baluha


On Tue, Feb 10, 2009 at 10:36 AM, Gary Baluha <user-ae3e15c22de1@xymon.invalid> wrote:
I know there was a thread on this a while back, and if I recall, there was
mention that the 4.3.0 line would have features for HA.

Anyway, I've been working on implementing a 2-node HA Xymon cluster using
LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD,
though).  The Apache and Xymon services fail over to the other node fairly
nicely, and I have successfully testing that using a shared IP to the Xymon
web page works.  It's still in the early stages of implementation and
testing, but so far it appears hold hold promise.

I'd be interesting in hearing about other HA solutions that have been
implemented (if any).  I don't recall previous mention of LinuxHA, so if no
one else has done it, I'll be glad to update the list with my progress.
list Martin Flemming · Tue, 10 Feb 2009 18:28:26 +0100 (CET) ·
This will be great ! :-)

.. i've got new hardware since 3 month for that (my) "project",
but not real enough time for a striking success :-(

This would be very helpful !

cheers,
 	martin
quoted from Gary Baluha

On Tue, 10 Feb 2009, Gary Baluha wrote:
I know there was a thread on this a while back, and if I recall, there was
mention that the 4.3.0 line would have features for HA.

Anyway, I've been working on implementing a 2-node HA Xymon cluster using
LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD,
though).  The Apache and Xymon services fail over to the other node fairly
nicely, and I have successfully testing that using a shared IP to the Xymon
web page works.  It's still in the early stages of implementation and
testing, but so far it appears hold hold promise.

I'd be interesting in hearing about other HA solutions that have been
implemented (if any).  I don't recall previous mention of LinuxHA, so if no
one else has done it, I'll be glad to update the list with my progress.
list Sean Clark · Tue, 10 Feb 2009 13:43:26 -0500 ·
I have it running with Conga/Ricci/lucci redhat-cluster and a 3510
shared disk array.
[2 node with shared IP]
 
I was thinking of switching it over to linux-HA though, because it takes
3-5 seconds to fail over, and this seems too slow for me
 
how fast does linuxHA failover?
 
 
-Sean
quoted from Gary Baluha


From: Gary Baluha [mailto:user-ae3e15c22de1@xymon.invalid] 
Sent: Tuesday, February 10, 2009 10:37 AM
To: user-ae9b8668bcde@xymon.invalid
Subject: [hobbit] HA solutions


I know there was a thread on this a while back, and if I recall, there
was mention that the 4.3.0 line would have features for HA.

Anyway, I've been working on implementing a 2-node HA Xymon cluster
using LinuxHA, and probably DRBD for shared disk (I have yet to
implement DRBD, though).  The Apache and Xymon services fail over to the
other node fairly nicely, and I have successfully testing that using a
shared IP to the Xymon web page works.  It's still in the early stages
of implementation and testing, but so far it appears hold hold promise.

I'd be interesting in hearing about other HA solutions that have been
implemented (if any).  I don't recall previous mention of LinuxHA, so if
no one else has done it, I'll be glad to update the list with my
progress.


P Go Green! Print this email only when necessary. Thank you for helping Time Warner Cable be environmentally responsible.
 
 
This E-mail and any of its attachments may contain Time Warner
Cable proprietary information, which is privileged, confidential,
or subject to copyright belonging to Time Warner Cable. This E-mail
is intended solely for the use of the individual or entity to which
it is addressed. If you are not the intended recipient of this
E-mail, you are hereby notified that any dissemination,
distribution, copying, or action taken in relation to the contents
of and attachments to this E-mail is strictly prohibited and may be
unlawful. If you have received this E-mail in error, please notify
the sender immediately and permanently delete the original and any
copy of this E-mail and any printout.
list Gary Baluha · Tue, 10 Feb 2009 14:14:19 -0500 ·
I haven't gotten down to tuning the setup, but out of the box, LinuxHA seems
to fail over in about that same 3-5 second time frame.  That's with the
heartbeat going through the primary LAN interface, and just going by what
crm_mon says (with a 5 second update interval).

Once I get DRBD up and running on the nodes, I'm going to do a bit more
exhaustive testing and see how quickly and smoothly I get can things to fail
over.  Maybe now that I've started this thread, it will give me an incentive
to spend some more time on this ;-)
quoted from Sean Clark

On Tue, Feb 10, 2009 at 1:43 PM, Clark, Sean <user-2db5fbcae9a7@xymon.invalid> wrote:
 I have it running with Conga/Ricci/lucci redhat-cluster and a 3510 shared
disk array.
[2 node with shared IP]

I was thinking of switching it over to linux-HA though, because it takes
3-5 seconds to fail over, and this seems too slow for me

how fast does linuxHA failover?


-Sean


P Go Green! Print this email only when necessary. Thank you for helping
Time Warner Cable be environmentally responsible.


*From:* Gary Baluha [mailto:user-ae3e15c22de1@xymon.invalid]
*Sent:* Tuesday, February 10, 2009 10:37 AM
*To:* user-ae9b8668bcde@xymon.invalid
*Subject:* [hobbit] HA solutions

I know there was a thread on this a while back, and if I recall, there was
mention that the 4.3.0 line would have features for HA.

Anyway, I've been working on implementing a 2-node HA Xymon cluster using
LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD,
though).  The Apache and Xymon services fail over to the other node fairly
nicely, and I have successfully testing that using a shared IP to the Xymon
web page works.  It's still in the early stages of implementation and
testing, but so far it appears hold hold promise.

I'd be interesting in hearing about other HA solutions that have been
implemented (if any).  I don't recall previous mention of LinuxHA, so if no
one else has done it, I'll be glad to update the list with my progress.

This E-mail and any of its attachments may contain Time Warner
Cable proprietary information, which is privileged, confidential,
or subject to copyright belonging to Time Warner Cable. This E-mail
is intended solely for the use of the individual or entity to which
it is addressed. If you are not the intended recipient of this
E-mail, you are hereby notified that any dissemination,
distribution, copying, or action taken in relation to the contents
of and attachments to this E-mail is strictly prohibited and may be
unlawful. If you have received this E-mail in error, please notify
the sender immediately and permanently delete the original and any
copy of this E-mail and any printout.

list T.J. Yang · Tue, 10 Feb 2009 13:32:46 -0600 ·
Gary, 


HA over WAN is the approach I am using due to WAN network structure I have.

I tested linux-ha+drbd for HA over LAN requirement a few years ago and this approach is not needed for my current requirements. So please submit your notes about this HA-LAN using drbd approach to xymon wiki site.


T.J. Yang
quoted from Gary Baluha


Date: Tue, 10 Feb 2009 14:14:19 -0500
From: user-ae3e15c22de1@xymon.invalid
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] HA solutions

I haven't gotten down to tuning the setup, but out of the box, LinuxHA seems to fail over in about that same 3-5 second time frame.  That's with the heartbeat going through the primary LAN interface, and just going by what crm_mon says (with a 5 second update interval).


Once I get DRBD up and running on the nodes, I'm going to do a bit more exhaustive testing and see how quickly and smoothly I get can things to fail over.  Maybe now that I've started this thread, it will give me an incentive to spend some more time on this ;-)


On Tue, Feb 10, 2009 at 1:43 PM, Clark, Sean <user-2db5fbcae9a7@xymon.invalid> wrote:


I have it running with Conga/Ricci/lucci redhat-cluster and a 3510 shared disk array.
[2 node with shared IP]
 I was thinking of switching it over to linux-HA though, because it takes 3-5 seconds to fail over, and this seems too slow for me
 how fast does linuxHA failover?
  -Sean


P Go Green! Print this email only when necessary. Thank you for helping Time Warner Cable be environmentally responsible.
 
 
From: Gary Baluha [mailto:user-ae3e15c22de1@xymon.invalid] 
Sent: Tuesday, February 10, 2009 10:37 AM
To: user-ae9b8668bcde@xymon.invalid
Subject: [hobbit] HA solutions


I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA.

Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though).  The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works.  It's still in the early stages of implementation and testing, but so far it appears hold hold promise.

I'd be interesting in hearing about other HA solutions that have been implemented (if any).  I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress.
This E-mail and any of its attachments may contain Time Warner
Cable proprietary information, which is privileged, confidential,
or subject to copyright belonging to Time Warner Cable. This E-mail
is intended solely for the use of the individual or entity to which
it is addressed. If you are not the intended recipient of this
E-mail, you are hereby notified that any dissemination,
distribution, copying, or action taken in relation to the contents
of and attachments to this E-mail is strictly prohibited and may be
unlawful. If you have received this E-mail in error, please notify
the sender immediately and permanently delete the original and any
copy of this E-mail and any printout.


Windows Live™: Keep your life in sync. http://windowslive.com/explore?ocid=TXT_TAGLM_WL_t1_allup_explore_022009
list Alexander Keller · Tue, 10 Feb 2009 20:48:07 +0100 ·
Hi,

we're running a 2-node Linux-HA cluster using DRBD 8 and Xen. Xymon/hobbit is installed in a Xen domain.

If a node fails the Xen domain is automatically switched over to the other node.

With DRBD 8.0.x even live migration is possible. There is no need to use a cluster filesystem (like ocfs2 or gfs).
Have a look at http://fghaas.wordpress.com/2007/09/03/drbd-806-brings-full-live-migration-for-xen-on-drbd

Our current installation uses Debian Etch 4.0, Xen 3.2 (Backports), DRBD (Source) and Heartbeat 2.1.x (Backports).

Regards,
Alexander
quoted from Gary Baluha

Am 10.02.2009 16:36, Gary Baluha schrieb:
I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA.

Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though).  The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works.  It's still in the early stages of implementation and testing, but so far it appears hold hold promise.

I'd be interesting in hearing about other HA solutions that have been implemented (if any).  I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress.
list Jason Hand · Tue, 10 Feb 2009 15:01:37 -0500 ·
Anyone know how I can check to see why the 'bbd' service isn't apparently
running on my Hobbit server and better yet, how do I start it?
 
Thanks,
Jason