HA solutions
list Gary Baluha
I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA. Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though). The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works. It's still in the early stages of implementation and testing, but so far it appears hold hold promise. I'd be interesting in hearing about other HA solutions that have been implemented (if any). I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress.
list Gary Baluha
Oh yes, forgot about this: http://en.wikibooks.org/wiki/System_Monitoring_with_Hobbit/Administration_Guide#Hobbit_in_HA
▸
On Tue, Feb 10, 2009 at 10:36 AM, Gary Baluha <user-ae3e15c22de1@xymon.invalid> wrote:
I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA. Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though). The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works. It's still in the early stages of implementation and testing, but so far it appears hold hold promise. I'd be interesting in hearing about other HA solutions that have been implemented (if any). I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress.
list Martin Flemming
This will be great ! :-) .. i've got new hardware since 3 month for that (my) "project", but not real enough time for a striking success :-( This would be very helpful ! cheers, martin
▸
On Tue, 10 Feb 2009, Gary Baluha wrote:
I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA. Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though). The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works. It's still in the early stages of implementation and testing, but so far it appears hold hold promise. I'd be interesting in hearing about other HA solutions that have been implemented (if any). I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress.
list Sean Clark
I have it running with Conga/Ricci/lucci redhat-cluster and a 3510 shared disk array. [2 node with shared IP] I was thinking of switching it over to linux-HA though, because it takes 3-5 seconds to fail over, and this seems too slow for me how fast does linuxHA failover? -Sean
▸
From: Gary Baluha [mailto:user-ae3e15c22de1@xymon.invalid]
Sent: Tuesday, February 10, 2009 10:37 AM
To: user-ae9b8668bcde@xymon.invalid
Subject: [hobbit] HA solutions
I know there was a thread on this a while back, and if I recall, there
was mention that the 4.3.0 line would have features for HA.
Anyway, I've been working on implementing a 2-node HA Xymon cluster
using LinuxHA, and probably DRBD for shared disk (I have yet to
implement DRBD, though). The Apache and Xymon services fail over to the
other node fairly nicely, and I have successfully testing that using a
shared IP to the Xymon web page works. It's still in the early stages
of implementation and testing, but so far it appears hold hold promise.
I'd be interesting in hearing about other HA solutions that have been
implemented (if any). I don't recall previous mention of LinuxHA, so if
no one else has done it, I'll be glad to update the list with my
progress.
P Go Green! Print this email only when necessary. Thank you for helping Time Warner Cable be environmentally responsible.
This E-mail and any of its attachments may contain Time Warner
Cable proprietary information, which is privileged, confidential,
or subject to copyright belonging to Time Warner Cable. This E-mail
is intended solely for the use of the individual or entity to which
it is addressed. If you are not the intended recipient of this
E-mail, you are hereby notified that any dissemination,
distribution, copying, or action taken in relation to the contents
of and attachments to this E-mail is strictly prohibited and may be
unlawful. If you have received this E-mail in error, please notify
the sender immediately and permanently delete the original and any
copy of this E-mail and any printout.
list Gary Baluha
I haven't gotten down to tuning the setup, but out of the box, LinuxHA seems to fail over in about that same 3-5 second time frame. That's with the heartbeat going through the primary LAN interface, and just going by what crm_mon says (with a 5 second update interval). Once I get DRBD up and running on the nodes, I'm going to do a bit more exhaustive testing and see how quickly and smoothly I get can things to fail over. Maybe now that I've started this thread, it will give me an incentive to spend some more time on this ;-)
▸
On Tue, Feb 10, 2009 at 1:43 PM, Clark, Sean <user-2db5fbcae9a7@xymon.invalid> wrote:
I have it running with Conga/Ricci/lucci redhat-cluster and a 3510 shared disk array. [2 node with shared IP] I was thinking of switching it over to linux-HA though, because it takes 3-5 seconds to fail over, and this seems too slow for me how fast does linuxHA failover? -Sean P Go Green! Print this email only when necessary. Thank you for helping Time Warner Cable be environmentally responsible. *From:* Gary Baluha [mailto:user-ae3e15c22de1@xymon.invalid] *Sent:* Tuesday, February 10, 2009 10:37 AM *To:* user-ae9b8668bcde@xymon.invalid *Subject:* [hobbit] HA solutions I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA. Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though). The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works. It's still in the early stages of implementation and testing, but so far it appears hold hold promise. I'd be interesting in hearing about other HA solutions that have been implemented (if any). I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress. This E-mail and any of its attachments may contain Time Warner Cable proprietary information, which is privileged, confidential, or subject to copyright belonging to Time Warner Cable. This E-mail is intended solely for the use of the individual or entity to which it is addressed. If you are not the intended recipient of this E-mail, you are hereby notified that any dissemination, distribution, copying, or action taken in relation to the contents of and attachments to this E-mail is strictly prohibited and may be unlawful. If you have received this E-mail in error, please notify the sender immediately and permanently delete the original and any copy of this E-mail and any printout.
list T.J. Yang
Gary, HA over WAN is the approach I am using due to WAN network structure I have. I tested linux-ha+drbd for HA over LAN requirement a few years ago and this approach is not needed for my current requirements. So please submit your notes about this HA-LAN using drbd approach to xymon wiki site. T.J. Yang
▸
Date: Tue, 10 Feb 2009 14:14:19 -0500
From: user-ae3e15c22de1@xymon.invalid
To: user-ae9b8668bcde@xymon.invalid
Subject: Re: [hobbit] HA solutions
I haven't gotten down to tuning the setup, but out of the box, LinuxHA seems to fail over in about that same 3-5 second time frame. That's with the heartbeat going through the primary LAN interface, and just going by what crm_mon says (with a 5 second update interval).
Once I get DRBD up and running on the nodes, I'm going to do a bit more exhaustive testing and see how quickly and smoothly I get can things to fail over. Maybe now that I've started this thread, it will give me an incentive to spend some more time on this ;-)
On Tue, Feb 10, 2009 at 1:43 PM, Clark, Sean <user-2db5fbcae9a7@xymon.invalid> wrote:
I have it running with Conga/Ricci/lucci redhat-cluster and a 3510 shared disk array.
[2 node with shared IP]
I was thinking of switching it over to linux-HA though, because it takes 3-5 seconds to fail over, and this seems too slow for me
how fast does linuxHA failover?
-Sean
P Go Green! Print this email only when necessary. Thank you for helping Time Warner Cable be environmentally responsible.
From: Gary Baluha [mailto:user-ae3e15c22de1@xymon.invalid]
Sent: Tuesday, February 10, 2009 10:37 AM
To: user-ae9b8668bcde@xymon.invalid
Subject: [hobbit] HA solutions
I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA.
Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though). The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works. It's still in the early stages of implementation and testing, but so far it appears hold hold promise.
I'd be interesting in hearing about other HA solutions that have been implemented (if any). I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress.
This E-mail and any of its attachments may contain Time Warner
Cable proprietary information, which is privileged, confidential,
or subject to copyright belonging to Time Warner Cable. This E-mail
is intended solely for the use of the individual or entity to which
it is addressed. If you are not the intended recipient of this
E-mail, you are hereby notified that any dissemination,
distribution, copying, or action taken in relation to the contents
of and attachments to this E-mail is strictly prohibited and may be
unlawful. If you have received this E-mail in error, please notify
the sender immediately and permanently delete the original and any
copy of this E-mail and any printout.
Windows Live™: Keep your life in sync. http://windowslive.com/explore?ocid=TXT_TAGLM_WL_t1_allup_explore_022009
list Alexander Keller
Hi, we're running a 2-node Linux-HA cluster using DRBD 8 and Xen. Xymon/hobbit is installed in a Xen domain. If a node fails the Xen domain is automatically switched over to the other node. With DRBD 8.0.x even live migration is possible. There is no need to use a cluster filesystem (like ocfs2 or gfs). Have a look at http://fghaas.wordpress.com/2007/09/03/drbd-806-brings-full-live-migration-for-xen-on-drbd Our current installation uses Debian Etch 4.0, Xen 3.2 (Backports), DRBD (Source) and Heartbeat 2.1.x (Backports). Regards, Alexander
▸
Am 10.02.2009 16:36, Gary Baluha schrieb:I know there was a thread on this a while back, and if I recall, there was mention that the 4.3.0 line would have features for HA. Anyway, I've been working on implementing a 2-node HA Xymon cluster using LinuxHA, and probably DRBD for shared disk (I have yet to implement DRBD, though). The Apache and Xymon services fail over to the other node fairly nicely, and I have successfully testing that using a shared IP to the Xymon web page works. It's still in the early stages of implementation and testing, but so far it appears hold hold promise. I'd be interesting in hearing about other HA solutions that have been implemented (if any). I don't recall previous mention of LinuxHA, so if no one else has done it, I'll be glad to update the list with my progress.
list Jason Hand
Anyone know how I can check to see why the 'bbd' service isn't apparently running on my Hobbit server and better yet, how do I start it? Thanks, Jason