[Linux-cluster] (new) problems with qdisk, running test rpms
Frederik Ferner
frederik.ferner at diamond.ac.uk
Thu May 10 10:31:28 UTC 2007
Hi Lon,
thanks for your reply.
Unfortunately I'm currently not in a position to test this at the
moment. I really should get myself a proper test setup :-(
Before I try the next time, I have one small detail to clarify here.
Lon Hohberger wrote:
> On Wed, May 02, 2007 at 02:44:06PM +0100, Frederik Ferner wrote:
>> With the new version of qdiskd it seems the heuristics are not tested
>> anymore after it reaches a sufficient score once. When the outside
>> network is lost qdiskd on both server still claim the same score in the
>> status file and both servers report the votes for the qdisk to cman.
> Hmm, could you add 'tko="1"' to your cluster.conf for the heuristics? I
^^^^^^^
> wonder if it's an initialization problem.
>> If qdiskd is started while the outside network is unreachable the scores
>> start without the scores for the failing heuristics. Once network is
>> restored the score jumps to at least the minimum required for operation
>> and once again stays there.
> This seems to work for me:
>
> [10538] debug: Heuristic: 'ping 192.168.79.254 -c1 -t3' missed (1/3)
> [10538] debug: Heuristic: 'ping 192.168.79.254 -c1 -t3' missed (2/3)
> [10538] info: Heuristic: 'ping 192.168.79.254 -c1 -t3' DOWN (3/3)
> [10537] notice: Score insufficient for master operation (0/11;
> required=6); downgrading
>
> Message from syslogd at green at Mon May 7 10:36:43 2007 ...
> green clurgmgrd[7305]: <emerg> #1: Quorum Dissolved
>
> (machine rebooted)
[snip]
> Hmm, try adding tko="3" to each of your ping heuristics, like this:
^^^^^^^
Is this the same suggestion as above (tko="1")? In any case I'll try
that next time I get a chance.
Many thanks,
Frederik
--
Frederik Ferner
Linux Systems Administrator phone: +44 1235 77 8624
Diamond Light Source Ltd. mob: +44 7917 08 5110
More information about the Linux-cluster
mailing list