[Linux-cluster] Adding a node back to cluster failing
Jan Pokorný
jpokorny at redhat.com
Thu Nov 7 17:15:20 UTC 2013
On 07/11/13 13:35 +0800, Zama Ques wrote:
>> A. The host "%s" is already a member of cluster "%s"
>>
>> or
>>
>> B. %s is already a member of a cluster named "%s"
>>
>> or some other message (interpolate "%s" above with your values)?
>>
>>
>>> But the node name is not there in cluster.conf file.
>>
>> you are talking about the original surviving node now, right?
>>
>> It seems more likely to me that B. from above is right, so then,
>> please make sure there is no /etc/cluster/cluster.conf on the node you
>> are trying to add (may be a leftover from the original setup of this
>> recovered node).
>
> Thanks Jan, you assumed correctly.
>
> Deleting /etc/cluster/cluster.conf actually resolved the issue.
> Node is successfully added back to cluster.
Great to hear that :)
The problem there was that we distinguish two situations in luci:
a. node is associated with the cluster (as per clusternode entry
in cluster.conf), however not an active member of the cluster,
(due to cman service not running there as e.g., when the node
has been a subject of "leave cluster" action in luci)
- this node is explicitly listed amongst the cluster nodes
with "not a cluster member" status and from here, it
can be selected for "join cluster" action, which will
start cman + rgmanager on that node again, leading
to a cluster membership (if nothing goes wrong)
- in this case the node is expected (enforced) to carry
cluster.conf (which is also being updated throughout the changes
in cluster configuration as long as ricci is run there
and cluster.conf is not deleted manually in between)
b. node is not a priori associated with the cluster (it's not mentioned
in the cluster.conf across the cluster)
- this node is apparently not (no hint for that) listed amongst
the cluster nodes and can be added via "add" from that view
- in this case the node is expected *not* to contain cluster.conf
upon being added to an existing cluster; simply because by
mistake, an attempt to add a node being already a member of
a different cluster could be made, possibly leading to some
inconsistencies in the luci view of the cluster
NB: we might tighten this constraint and allow the cluster
configuration to be already present on the node being
added provided that the cluster name matches the destination
(which would help in this very case, IMHO)
Apparently this was case of b. and the above description explains
why removing cluster.conf from the node to be added was of help.
It's up to consideration if to provide a solution to such class
of cases as suggested, feel free to comment on:
https://bugzilla.redhat.com/show_bug.cgi?id=1028092
note: I mentioned some other little discrepancies I've discovered
there
--
Jan
More information about the Linux-cluster
mailing list