<table cellspacing="0" cellpadding="0" border="0"><tr><td valign="top">Hi,<br/><br/>Linux clustering works reliably only when every node runs the Linux operating system. Allow persistent root SSH connections between the nodes, and run them on hardware platforms with the same specifications.<br/><br/>For a test or proof of concept, you can set up and configure the cluster between two nodes.<br/><br/>Configure the databases for clustering only after both nodes are running Linux with persistent root SSH access between them.</td></tr></table>            <div id="_origMsg_">
                <div>
                    <br />
                    <div>
                        <div style="font-size:0.9em">
                            <hr size="1">
                            <b>
                                <span style="font-weight:bold">From:</span>
                            </b>
                            Schaefer, Micah <Micah.Schaefer@jhuapl.edu>;                            <br>
                            <b>
                                <span style="font-weight:bold">To:</span>
                            </b>
                            linux clustering <linux-cluster@redhat.com>;                                                                             <br>
                            <b>
                                <span style="font-weight:bold">Subject:</span>
                            </b>
                            Re: [Linux-cluster] Node is randomly fenced                            <br>
                            <b>
                                <span style="font-weight:bold">Sent:</span>
                            </b>
                            Tue, Jun 17, 2014 2:27:29 PM                            <br>
                        </div>
                            <br>
                            <table cellspacing="0" cellpadding="0" border="0">
                                <tbody>
                                    <tr>
                                        <td valign="top">I am running Red Hat 6.4 with the HA/ load balancing packages from the<br clear="none">install DVD. <br clear="none"><br clear="none"><br clear="none">-bash-4.1$ cat /etc/redhat-release<br clear="none">Red Hat Enterprise Linux Server release 6.4 (Santiago)<br clear="none"><br clear="none">-bash-4.1$ corosync -v<br clear="none">Corosync Cluster Engine, version '1.4.1'<br clear="none">Copyright (c) 2006-2009 Red Hat, Inc.<br clear="none"><br clear="none"><br clear="none"><br clear="none"><br clear="none"><br clear="none"><br clear="none"><br clear="none">On 6/17/14, 8:41 AM, "Christine Caulfield" <<a shape="rect" ymailto="mailto:ccaulfie@redhat.com" href="javascript:return">ccaulfie@redhat.com</a>> wrote:<br clear="none"><br clear="none">>On 12/06/14 20:06, Digimer wrote:<br clear="none">>> Hrm, I'm not really sure that I am able to interpret this without
 making<br clear="none">>> guesses. I'm cc'ing one of the devs (who I hope will poke the right<br clear="none">>> person if he's not able to help at the moment). Lets see what he has to<br clear="none">>> say.<br clear="none">>><br clear="none">>> I am curious now, too. :)<br clear="none">>><br clear="none">>> On 12/06/14 03:02 PM, Schaefer, Micah wrote:<br clear="none">>>> Node4 was fenced again, I was able to get some debug logs (below), a<br clear="none">>>>new<br clear="none">>>> message :<br clear="none">>>><br clear="none">>>> "Jun 12 14:01:56 corosync [TOTEM ] The token was lost in the<br clear="none">>>>OPERATIONAL<br clear="none">>>> state."<br clear="none">>>><br clear="none">>>><br clear="none">>>> Rest of corosync logs<br clear="none">>>><br clear="none">>>> <a shape="rect"
 href="http://pastebin.com/iYFbkbhb" target="_blank">http://pastebin.com/iYFbkbhb</a><br clear="none">>>><br clear="none">>>><br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] entering OPERATIONAL state.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] A processor joined or left the<br clear="none">>>> membership and a new membership was formed.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] waiting_trans_ack changed to 0<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process pause detected for 32947 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] entering GATHER state from 12.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process pause detected for 32947 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM
 ] Process pause detected for 32947 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process pause detected for 33016 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process pause detected for 33016 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process pause detected for 33016 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process pause detected for 33016 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process pause detected for 33086 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process
 pause detected for 33086 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process pause detected for 33086 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process pause detected for 33086 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process pause detected for 33155 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process pause detected for 33155 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process pause detected for 33155 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:49 corosync [TOTEM ] Process pause
 detected for 33155 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:50 corosync [TOTEM ] Process pause detected for 33224 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:50 corosync [TOTEM ] Process pause detected for 33224 ms,<br clear="none">>>> flushing membership messages.<br clear="none">>>> Jun 12 14:44:50 corosync [TOTEM ] Process pause detected for 33225 ms,<br clear="none">>>> flushing membership messages.<br clear="none">><br clear="none">><br clear="none">>I'm concerned that the pause messages are repeating like that, it looks<br clear="none">>like it might be a fixed bug. What version of corosync do you have?<br clear="none">><br clear="none">>Chrissie<br clear="none">><br clear="none">>-- <br clear="none">>Linux-cluster mailing list<br clear="none">><a shape="rect"
 ymailto="mailto:Linux-cluster@redhat.com" href="javascript:return">Linux-cluster@redhat.com</a><br clear="none">><a shape="rect" href="https://www.redhat.com/mailman/listinfo/linux-cluster" target="_blank">https://www.redhat.com/mailman/listinfo/linux-cluster</a></td>
                                    </tr>
                                </tbody>
                            </table>
                    </div>
                </div>
            </div>