[Linux-cluster] what should fence_xvm do if dom0 is down

Joel Heenan joelh at planetjoel.com
Fri Oct 1 06:11:43 UTC 2010


Further to this, I found a Red Hat bug about this exact issue:

https://bugzilla.redhat.com/show_bug.cgi?id=570373

For me it works perfectly if the dom0 is fenced using fence_node on the
command line. However, if the host simply becomes unavailable, it is not
fenced. From reading the fenced man page, this seems to be because there
is no shared resource such as clvm or gfs, so the cluster sees no need to
fence the host. As a result, subsequent fence_xvm commands fail.

I guess I need to find a way to force fenced to operate without clvm and
to fence the dom0s?

Joel

On Fri, Oct 1, 2010 at 3:12 PM, Joel Heenan <joelh at planetjoel.com> wrote:

> I'm conducting some HA testing and I found that if I destroy a dom0, the
> other guest nodes (running on separate dom0s) all try to fence the guest
> that has disappeared but they fail because they can't reach fence_xvmd (the
> dom0 is down). They sit there trying to fence forever and never make any
> progress until the dom0 has rebooted and the guest is back up. This is a lot
> of downtime.
>
> I tried adding a secondary fence_manual after the fence_xen, but the
> problem is that it then requires me to manually acknowledge the fencing.
>
> What I would like is "try fencing; if it fails, don't worry, just continue
> as if it succeeded". Is there an option for this?
>
> Thanks
>
> Joel
>
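
For illustration only, the "try fencing, and if it fails continue as if it
succeeded" behaviour asked about above could be sketched as a small wrapper
around the real fence command. This is not an existing cluster option: the
function name fence_with_fallback, its parameters, and the 30 second timeout
are all assumptions made for the example. Reporting success without actually
fencing is only tolerable when, as described above, there is no shared
resource such as clvm or gfs that an unfenced node could corrupt.

```python
import subprocess
import sys


def fence_with_fallback(primary_cmd, timeout_s=30):
    """Run the real fence command, but report success to the caller either way.

    Hypothetical helper, not part of any cluster package.
    Returns (exit_code, really_fenced). exit_code is always 0, which is the
    "continue as if it succeeded" behaviour; really_fenced records whether
    the primary command actually exited with status 0.
    """
    try:
        result = subprocess.run(primary_cmd, timeout=timeout_s)
        really_fenced = result.returncode == 0
    except (OSError, subprocess.TimeoutExpired):
        # The command is missing, not executable, or hung, for example
        # because fence_xvmd on the dead dom0 cannot be reached.
        really_fenced = False

    if not really_fenced:
        # Leave a trace so the skipped fence can be investigated later.
        print("WARNING: primary fencing failed; reporting success anyway",
              file=sys.stderr)
    return 0, really_fenced
```

A caller would pass the real fencing command as a list, for example
["fence_xvm", "-H", "guest1"], where guest1 is a made-up domain name. Turning
this into an agent that fenced can invoke would also need argument handling,
which is left out here.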