[Linux-cluster] Tiebreaker IP Address
Barry Brimer
lists at brimer.org
Sun Jan 20 15:31:26 UTC 2008
> Can any one explain me what exactly is the tiebreaker IP and how does it
> function? What is the use if we set the tiebreaker IP as the Default Gateway
> address?
In clustering, it is important that the cluster nodes are able to
communicate with one another. It is also important that the cluster nodes
agree on the status of the cluster. To acheive this, various methods are
used to communicate between cluster nodes to inform the other nodes that
this node is active and participating in the cluster. Quorum is usually
defined as "greater than one half". In a cluster larger than 2 nodes,
the cluster nodes can determine that if they stop receiving cluster
communications (usually referred to as heartbeat) from a particular node,
they assume that the non-responsive node is not functioning correctly, and
one of the remaining nodes in the cluster will fence the non-responsive
node. Fencing is a nice way of saying "make sure the non-responsive node
can not write anything to our disks, by whatever means necessary". This
usually involves the equivalent of pulling the power plug out of the
non-responsive node. Why be so harsh? Why not do a normal shutdown? If
the non-responsive node has data in buffers that has not been written to
disk, and the other cluster nodes feel that this node is having a problem,
they want to ensure that the non-responsive node can not write its buffers
out to disk, in order to make sure that the non-responsive node has no
chance of corrupting the data used by the cluster. This is all fine,
because if you have greater than 2 nodes, you should be able to get
agreement by a majority on whether a node is functioning, and therefore
whether the cluster is allowed to operate. In a two-node cluster, we need
to have some other way to determine which cluster member is healthy, and
which one isn't. If a cluster node were functioning correctly, it would
be able to reach its default gateway. Therefore the tiebreaker IP address
is the default gateway because both machines should be able to reach it if
they were functioning properly. Therefore if one node is able to reach
the tiebreaker IP address, and one isn't, it is assumed that the properly
running node is the one that can reach the default gateway, and that
allows the tie to be broken and allows that node to fence the other node.
Barry
More information about the Linux-cluster
mailing list