Tod Hansmann wrote:
Let me see if I understand. You have several bb servers at one
datacenter, each with their twin at the other datacenter, and both sets
do the tests. They report to one central display server, but only one
set reports at a time, depending on failover state, correct?
You have the basic idea, but there is no single central server, just
pairs of bb servers, one to a data center, in each lan which is being
monitored. For each pair of bb servers, only the server at data center A
does reporting, unless the server in data center B cannot reach the
server in data center A, in which case the server in data center B will
take over the reporting duties until the bb server in data center A
becomes reachable again. While this could theoretically lead to a split
brain condition, the failover condition has only ever triggered when
there was a wan outage.
Is this failover automatic? If so, how is this failover determined?
What if this failover has a false positive? If not, what is your
timeframe to swap over?
IIRC It takes one bb cycle to kick in.
We've not seen a false positive, as I mentioned above.
It's just the standard built-in bb failover -
head ~bb/ext/failover follows:
#!/bin/sh
# failover
#
# BIG BROTHER - FAILOVER SCRIPT
# Sean MacGuire
#
# (c) Copyright Quest Software, Inc. 1997-2003 All rights reserved.
#
#
# failover WATCHES BBNET and BBPAGER
#
# IF BBNET OR BBPAGER BECOMES UNAVAILABLE, THEN TAKE OVER UNTIL THEY
RETURN
#
# To use, just add failover to the BBEXT variable in etc/bbdef.sh
#
# To configure BBPAGER failover:
# define both the primary and failover machines as BBPAGERS in
etc/bb-hosts
# and set bbwarn: FAILOVER in etc/bbwarnsetup.cfg
Joe