From Mark.Vallevand at UNISYS.com Tue Aug 18 14:59:54 2015
From: Mark.Vallevand at UNISYS.com (Vallevand, Mark K)
Date: Tue, 18 Aug 2015 14:59:54 +0000
Subject: [Linux-cluster] Quick question about node offline
Message-ID: <2a7803b2cd994be5af35d9b840b31e82@US-EXCH13-5.na.uis.unisys.com>

I have a report from a user about both nodes in a cluster being offline. The user explicitly issued a 'crm node standby' for one node. (Part of our testing.) There was some error with our resource, so it didn't stop correctly on that node. Then the user noticed that both nodes were offline. I don't have good logs from this incident.

My quick question: Will pacemaker/cman/corosync take a node offline without the user requesting it?

Obviously, I need to get good logs and dig deeper. But a quick answer is greatly appreciated.

Regards.
Mark K Vallevand   Mark.Vallevand at Unisys.com
Never try and teach a pig to sing: it's a waste of time, and it annoys the pig.

THIS COMMUNICATION MAY CONTAIN CONFIDENTIAL AND/OR OTHERWISE PROPRIETARY MATERIAL and is thus for use only by the intended recipient. If you received this in error, please contact the sender and delete the e-mail and its attachments from all computers.

From misch at schwartzkopff.org Tue Aug 18 15:13:22 2015
From: misch at schwartzkopff.org (Michael Schwartzkopff)
Date: Tue, 18 Aug 2015 17:13:22 +0200
Subject: [Linux-cluster] Quick question about node offline
In-Reply-To: <2a7803b2cd994be5af35d9b840b31e82@US-EXCH13-5.na.uis.unisys.com>
References: <2a7803b2cd994be5af35d9b840b31e82@US-EXCH13-5.na.uis.unisys.com>
Message-ID: <4443114.ises4hgxpW@nb003>

On Tuesday, 18 August 2015, 14:59:54, Vallevand, Mark K wrote:
> I have a report from a user about both nodes in a cluster being offline.
> The user explicitly issued a 'crm node standby' for one node. (Part of our
> testing.) There was some error with our resource so it didn't stop
> correctly on that node. Then, the user noticed that both nodes were
> offline. I don't have good logs from this incident.
>
> My quick question: Will pacemaker/cman/corosync take a node offline without
> the user requesting it?

In some cases, yes. Clusters are beasts. But you definitely need the configs and the logs to see what really happened.

> Obviously, I need to get good logs and dig deeper. But, a quick answer is
> greatly appreciated.

Yes. Configs, status and logs from BOTH nodes.

-- 
Dr. Michael Schwartzkopff
Guardinistr. 63
81375 München

Tel: (0162) 1650044
Fax: (089) 620 304 13

From Mark.Vallevand at UNISYS.com Tue Aug 18 17:57:04 2015
From: Mark.Vallevand at UNISYS.com (Vallevand, Mark K)
Date: Tue, 18 Aug 2015 17:57:04 +0000
Subject: [Linux-cluster] Quick question about node offline
In-Reply-To: <4443114.ises4hgxpW@nb003>
References: <2a7803b2cd994be5af35d9b840b31e82@US-EXCH13-5.na.uis.unisys.com> <4443114.ises4hgxpW@nb003>
Message-ID: <49bb1a32139342079962600a81c720de@US-EXCH13-5.na.uis.unisys.com>

Thanks! The user confessed that he actually did 'crm node standby' for both nodes. We are reproducing the error in our resource's stop operation and will collect all the logs.

Regards.
Mark K Vallevand   Mark.Vallevand at Unisys.com

From Mark.Vallevand at UNISYS.com Tue Aug 18 18:16:16 2015
From: Mark.Vallevand at UNISYS.com (Vallevand, Mark K)
Date: Tue, 18 Aug 2015 18:16:16 +0000
Subject: [Linux-cluster] Quick question about node offline
In-Reply-To: <49bb1a32139342079962600a81c720de@US-EXCH13-5.na.uis.unisys.com>
References: <2a7803b2cd994be5af35d9b840b31e82@US-EXCH13-5.na.uis.unisys.com> <4443114.ises4hgxpW@nb003> <49bb1a32139342079962600a81c720de@US-EXCH13-5.na.uis.unisys.com>
Message-ID:

Then I can get to the bottom of things.

Regards.
Mark K Vallevand   Mark.Vallevand at Unisys.com

-- 
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster