[Linux-cluster] Cluster 3.0.0.rc3 release

Guido Günther agx at sigxcpu.org
Wed Jul 1 16:40:07 UTC 2009


Hi Fabio,
On Wed, Jul 01, 2009 at 03:23:56PM +0200, Fabio M. Di Nitto wrote:
> Now my question is: did you mangle the configuration you sent me
> manually? because there is no matching entry between device to use for a
> node and the fencedevices section and I get:
Yes, I had to get some internal names out. This is what went wrong:

-<fencedevice agent="fence_ilo" hostname="rnode1.foo.bar" login="reboot" name="node1" passwd="pass"/>
+<fencedevice agent="fence_ilo" hostname="rnode1.foo.bar" login="reboot" name="fence1" passwd="pass"/>
									       ^^^^^^

(same for node2/fence2).
> Since you can reproduce the problem regularly I'd really like to see
> some debugging output of libfence to start with. I'd really appreciate
> if you could help us.
> 
> test 1:
> 
> Please add a bunch fprintf(stderr, to agents.c to see the created XPath
> queries and the result coming back from libccs.

# fence_node -vv node2
make_args(149): /cluster/fencedevices/fencedevice[@name="fence2"]/@*
make_args(156)
Segmentation fault

> test 2:
> 
> If you could please find:
> 
> cd = ccs_connect(); (line 287 in agent.c)
> and right before that add:
> fullxpath=1;
>
> That change will ask libccs to use a different Xpath engine internally.
> 
> And then re-run test1.
# fence_node -vv node2
fence_node(289): fullxpath: 0
fence_node(291): fullxpath: 1
make_args(149): /cluster/fencedevices/fencedevice[@name="fence2"]/@*
make_args(156)
Segmentation fault

make_args(156) is just before the strncmp. Trying to print out str
results in a segfault too (that's why it's missing from the output).

[..snip..]  
> the next question is: are you running on some fancy architecture? Maybe
> something in that environment is not initialized properly (the garbage
> string you get back from libccs sounds like that) but on more common
> arches like x86/x86_64 gcc takes care of that for us.... (really wild
> guessing but still something to fix!).
Nothing fancy here:

# uname -a
Linux vm41 2.6.30-1-amd64 #1 SMP Sun Jun 14 15:00:29 UTC 2009 x86_64
GNU/Linux

Cheers,
 -- Guido




More information about the Linux-cluster mailing list