[Linux-cluster] Error: ClientSocket(String): connect() failed: No such file or directory

Megan . nagemnna at gmail.com
Thu Jun 4 12:23:55 UTC 2015


FYI - i talked to our network folks and it looks like they were doing some
testing last night with port failover which may or may not have caused this
issue.  However, I was able to correct it by fencing the problem nodes.

On Wed, Jun 3, 2015 at 10:31 AM, Megan . <nagemnna at gmail.com> wrote:

> Anybody ever seen "Error: ClientSocket(String): connect() failed: No such
> file or directory" when doing a start all?  Something seems to have
> broken with our closer.  Our UAT setup works as expected.  I looked at
> tcpdumps the best that i could (i'm not a network person though) and i
> didn't see anything obvious.  I shutdown iptables on all nodes.
>
> We are running Centos 6,6, ccs-0.16.2-75.el6_6.1.x86_64
> cman-3.0.12.1-68.el6.x86_64.  We have a 12 node cluster in production that
> allows us to share gfs2 iscsi mounts.  no other services are used.  clvmd
> -R runs fine at this time.  ccs -h node --sync --activate also runs fine.
>
>
> [root at admin1 ~]# ccs -h admin1-ops --startall
>
> Unable to start map1-ops, possibly due to lack of quorum, try --startall
>
> Error: ClientSocket(String): connect() failed: No such file or directory
>
> Started cache2-ops
>
> Unable to start data1-ops, possibly due to lack of quorum, try --startall
>
> Error: ClientSocket(String): connect() failed: No such file or directory
>
> Started map2-ops
>
> Unable to start archive1-ops, possibly due to lack of quorum, try
> --startall
>
> Error: ClientSocket(String): connect() failed: No such file or directory
>
> Started data3-ops
>
> Started mgmt1-ops
>
> Unable to start admin1-ops, possibly due to lack of quorum, try --startall
>
> Error: ClientSocket(String): connect() failed: No such file or directory
>
> Started data2-ops
>
> Started cache1-ops
>
> [root at admin1 ~]#
>
> I have quorum:
>
> [root at admin1 ~]# clustat
>
> Cluster Status for bitsops @ Wed Jun  3 02:13:08 2015
>
> Member Status: Quorate
>
>
>  Member Name                                                     ID
> Status
>
>  ------ ----                                                     ----
> ------
>
>  admin1-ops                                                          1
> Online, Local
>
>  mgmt1-ops                                                           2
> Online
>
>  archive1-ops                                                        3
> Online
>
>  map1-ops                                                            4
> Online
>
>  map2-ops                                                            5
> Online
>
>  cache1-ops                                                          6
> Online
>
>  cache2-ops                                                          7
> Online
>
>  data1-ops                                                           8
> Online
>
>  data2-ops                                                           9
> Online
>
>  data3-ops                                                          10
> Online
>
>
>
>
> Here is what I expect, and what UAT gives me:
>
> [root at admin1-uat ~]# ccs -h admin1-uat --startall
>
> Started mgmt1-uat
>
> Started data1-uat
>
> Started data2-uat
>
> Started admin1-uat
>
> Started tools-uat
>
> Started map1-uat
>
> Started archive1-uat
>
> Started cache2-uat
>
> Started cache1-uat
>
> Started map2-uat
>
> [root at admin1-uat ~]#
>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20150604/996c88c6/attachment.htm>


More information about the Linux-cluster mailing list