[Linux-cluster] Error: ClientSocket(String): connect() failed: No such file or directory
Megan .
nagemnna at gmail.com
Thu Jun 4 12:23:55 UTC 2015
FYI: I talked to our network folks, and it looks like they were doing some
testing last night with port failover, which may or may not have caused this
issue. Either way, I was able to correct it by fencing the problem nodes.
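
In case it helps anyone who hits the same thing, here is a minimal Python
sketch for pulling the names of the nodes that failed out of saved
"ccs --startall" output, so you know which ones to look at (or fence). It is
not part of ccs; the function name and the SAMPLE text are just for
illustration, and the regex assumes the "Unable to start <node>," message
format seen in the output quoted below.

```python
import re

# Assumed message format, taken from the ccs --startall output in this thread:
#   "Unable to start map1-ops, possibly due to lack of quorum, try --startall"
FAILED_RE = re.compile(r"^Unable to start ([^\s,]+),", re.MULTILINE)


def failed_nodes(output):
    """Return node names reported as 'Unable to start', in order of appearance."""
    return FAILED_RE.findall(output)


# Illustrative sample: a shortened copy of the production output.
SAMPLE = """\
Unable to start map1-ops, possibly due to lack of quorum, try --startall
Error: ClientSocket(String): connect() failed: No such file or directory
Started cache2-ops
Unable to start data1-ops, possibly due to lack of quorum, try --startall
Error: ClientSocket(String): connect() failed: No such file or directory
Started map2-ops
"""

if __name__ == "__main__":
    for node in failed_nodes(SAMPLE):
        print(node)
```

Feeding it the full production output would list every node that reported
"Unable to start"; whether fencing is the right fix for those nodes still
depends on the underlying cause.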
On Wed, Jun 3, 2015 at 10:31 AM, Megan . <nagemnna at gmail.com> wrote:
> Has anybody ever seen "Error: ClientSocket(String): connect() failed: No such
> file or directory" when doing a start all? Something seems to have
> broken with our cluster. Our UAT setup works as expected. I looked at
> tcpdumps as best I could (I'm not a network person, though) and I
> didn't see anything obvious. I shut down iptables on all nodes.
>
> We are running CentOS 6.6 with ccs-0.16.2-75.el6_6.1.x86_64 and
> cman-3.0.12.1-68.el6.x86_64. We have a 12-node cluster in production that
> allows us to share gfs2 iSCSI mounts. No other services are used. clvmd
> -R runs fine at this time. ccs -h node --sync --activate also runs fine.
>
>
> [root at admin1 ~]# ccs -h admin1-ops --startall
> Unable to start map1-ops, possibly due to lack of quorum, try --startall
> Error: ClientSocket(String): connect() failed: No such file or directory
> Started cache2-ops
> Unable to start data1-ops, possibly due to lack of quorum, try --startall
> Error: ClientSocket(String): connect() failed: No such file or directory
> Started map2-ops
> Unable to start archive1-ops, possibly due to lack of quorum, try --startall
> Error: ClientSocket(String): connect() failed: No such file or directory
> Started data3-ops
> Started mgmt1-ops
> Unable to start admin1-ops, possibly due to lack of quorum, try --startall
> Error: ClientSocket(String): connect() failed: No such file or directory
> Started data2-ops
> Started cache1-ops
> [root at admin1 ~]#
>
> I have quorum:
>
> [root at admin1 ~]# clustat
> Cluster Status for bitsops @ Wed Jun 3 02:13:08 2015
> Member Status: Quorate
>
>  Member Name          ID   Status
>  ------ ----          ---- ------
>  admin1-ops              1 Online, Local
>  mgmt1-ops               2 Online
>  archive1-ops            3 Online
>  map1-ops                4 Online
>  map2-ops                5 Online
>  cache1-ops              6 Online
>  cache2-ops              7 Online
>  data1-ops               8 Online
>  data2-ops               9 Online
>  data3-ops              10 Online
>
> Here is what I expect, and what UAT gives me:
>
> [root at admin1-uat ~]# ccs -h admin1-uat --startall
> Started mgmt1-uat
> Started data1-uat
> Started data2-uat
> Started admin1-uat
> Started tools-uat
> Started map1-uat
> Started archive1-uat
> Started cache2-uat
> Started cache1-uat
> Started map2-uat
> [root at admin1-uat ~]#