[Linux-cluster] Problem with Fenced

David Teigland teigland at redhat.com
Wed Jun 3 15:40:44 UTC 2009


On Wed, Jun 03, 2009 at 04:22:30PM +0200, Marco Nietz wrote:
> Hi,
> 
> i have a Problem with (propably) the Communication between fenced and
> ccsd. After a node-failure, fenced should connect ccsd and then try to
> fence the failing node. this does not happen on one of our systems.
> 
> Here's an strace from the fence-daemon.
> 
> socket(PF_FILE, SOCK_STREAM, 0)         = 9
> connect(9, {sa_family=AF_FILE, path=@"groupd_socket"}, 16) = 0
> write(9, "get_group -1 groupd\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 2200) = 2200
> read(9,
> "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"...,
> 1128) = 1128
> close(9)                                = 0
> write(7, "start_done default 3\0\0\0\0\0\0\0\0\0\0\0\0"..., 2200) = 2200
> poll([{fd=4, events=POLLIN}, {fd=5, events=POLLIN}, {fd=7,
> events=POLLIN}, {fd=-1}], 4, -1) = 1 ([{fd=7, revents=POLLIN}])
> read(7, "finish default 3\0\0\0\0\0\0\0\0\350\37Y\21\377\177\0\0"...,
> 2200) = 2200
> poll([{fd=4, events=POLLIN}, {fd=5, events=POLLIN}, {fd=7,
> events=POLLIN}, {fd=-1}], 4, -1
> 
> At the Connect-Line i expect the Path to the ccsd-socket
> (/var/run/cluster/ccsd.sock).
> 
> How can i tell fenced where to find the Socket.

It's not clear from this that fenced/ccsd communication is the problem.

After the node failure, please collect from all nodes the output of
- cman_tool nodes
- group_tool -v
- group_tool dump fence
- any messages in /var/log/messages

Dave




More information about the Linux-cluster mailing list