From pieter.baele at gmail.com  Wed Feb  2 08:43:36 2011
From: pieter.baele at gmail.com (Pieter Baele)
Date: Wed, 2 Feb 2011 09:43:36 +0100
Subject: [Linux-cluster] setting multicast address on bond0
Message-ID: <AANLkTinW9jWsCqbT=JkAHWG=O8CxbF8x2XqKV6u5ERC2@mail.gmail.com>

Hi,

What's the correct way to set the multicast address on a bonded
interface? (RHEL6 - Cluster 3.x)

I can't use multicast on the primary interface because of network
topology (2 sites....).
So I want to set up a multicast address on bond0 (2 interfaces so this
is fault-tolerant)

Adding <multicast addr="239.192.0.1"/ interface="bond0"> fails
ccs_config_validate validation.
(RHEL 5 cluster worked, but for some reason this doesn't work in the
latest version)

I've taken a look at the cluster.rng file....

Sincereley
Pieter Baele



From mgrac at redhat.com  Wed Feb  2 15:01:34 2011
From: mgrac at redhat.com (Marek Grac)
Date: Wed, 02 Feb 2011 16:01:34 +0100
Subject: [Linux-cluster] Configuring a samba resource under RHCS
In-Reply-To: <AANLkTinjFJnE6d6RDpzDnKthBZcNUPQouV71G3ctyidz@mail.gmail.com>
References: <AANLkTinjFJnE6d6RDpzDnKthBZcNUPQouV71G3ctyidz@mail.gmail.com>
Message-ID: <4D4971CE.6080905@redhat.com>

On 10/18/2010 01:46 PM, C. L. Martinez wrote:
> Hi all,
>
>   How can I configure different shared folders with samba under RHCS??
> Exists some resource agents?? I need to allow to access to sme Windows
> 7 And Windows 2008 R2 clients without AD authentication.
>

Resource agent for samba does not modify shared folders configuration, 
so there is no difference between setting one or more of them. All you 
have to do is to add samba resource agent, ip address(es) on which samba 
should listen, and filesystem on which are shared folders (there is no 
auto-detection).

m,




From pieter.baele at gmail.com  Wed Feb  2 15:47:57 2011
From: pieter.baele at gmail.com (Pieter Baele)
Date: Wed, 2 Feb 2011 16:47:57 +0100
Subject: [Linux-cluster] clvmd mirroring problems on split-site SAN (2 node
 cluster) - cpg_dispatch failed: SA_AIS_ERR_LIBRARY
Message-ID: <AANLkTin8f1hyXNUTNK9DRaYxp+g4=W+voaNK4GN4nVqS@mail.gmail.com>

Hi,

After doing a lot of research on the several ways to mirror a device
from one LUN to another on another side,
I had this problem:

Feb  2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
Feb  2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
Feb  2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
Feb  2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
Feb  2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
Feb  2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
Feb  2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
Feb  2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
Feb  2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY

............


What does it mean and could be the reason?

clustat on the one node shows bode online, on the other one is offline...



From linux at alteeve.com  Wed Feb  2 15:50:58 2011
From: linux at alteeve.com (Digimer)
Date: Wed, 02 Feb 2011 10:50:58 -0500
Subject: [Linux-cluster] setting multicast address on bond0
In-Reply-To: <AANLkTinW9jWsCqbT=JkAHWG=O8CxbF8x2XqKV6u5ERC2@mail.gmail.com>
References: <AANLkTinW9jWsCqbT=JkAHWG=O8CxbF8x2XqKV6u5ERC2@mail.gmail.com>
Message-ID: <4D497D62.2050500@alteeve.com>

On 02/02/2011 03:43 AM, Pieter Baele wrote:
> Adding <multicast addr="239.192.0.1"/ interface="bond0"> fails
> ccs_config_validate validation.
> (RHEL 5 cluster worked, but for some reason this doesn't work in the
> latest version)
> 
> I've taken a look at the cluster.rng file....
> 
> Sincereley
> Pieter Baele

I don't believe that 'interface=' is valid.

-- 
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin:  http://nodeassassin.org



From pieter.baele at gmail.com  Wed Feb  2 15:57:39 2011
From: pieter.baele at gmail.com (Pieter Baele)
Date: Wed, 2 Feb 2011 16:57:39 +0100
Subject: [Linux-cluster] clvmd mirroring problems on split-site SAN (2
 node cluster) - cpg_dispatch failed: SA_AIS_ERR_LIBRARY
In-Reply-To: <AANLkTin8f1hyXNUTNK9DRaYxp+g4=W+voaNK4GN4nVqS@mail.gmail.com>
References: <AANLkTin8f1hyXNUTNK9DRaYxp+g4=W+voaNK4GN4nVqS@mail.gmail.com>
Message-ID: <AANLkTi=qMvsWfQsDXrsJZuvKmp3F_pV535kRwYXa-PGg@mail.gmail.com>

On Wed, Feb 2, 2011 at 16:47, Pieter Baele <pieter.baele at gmail.com> wrote:
> Hi,
>
> After doing a lot of research on the several ways to mirror a device
> from one LUN to another on another side,
> I had this problem:
>
> Feb ?2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
> Feb ?2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
> Feb ?2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
> Feb ?2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
> Feb ?2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
> Feb ?2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
> Feb ?2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
> Feb ?2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
> Feb ?2 16:42:49 x cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
>
> ............
>
>
> What does it mean and could be the reason?
>
> clustat on the one node shows bode online, on the other one is offline...
>


I also received this warning some minutes ago:

Message from syslogd at x at Feb  2 16:54:35 ...
 corosync[6368]:   [TOTEM ] LOGSYS EMERGENCY: TOTEM Unable to write to
/var/log/cluster/corosync.log.

Message from syslogd at x at Feb  2 16:54:36 ...
 corosync[6368]:   [QUORUM] LOGSYS EMERGENCY: QUORUM Unable to write
to /var/log/cluster/corosync.log.

Do I have to fiddle with the mcast parameters?



From brett.dellegrazie at gmail.com  Wed Feb  2 16:52:50 2011
From: brett.dellegrazie at gmail.com (Brett Delle Grazie)
Date: Wed, 2 Feb 2011 16:52:50 +0000
Subject: [Linux-cluster] clvmd mirroring problems on split-site SAN (2
 node cluster) - cpg_dispatch failed: SA_AIS_ERR_LIBRARY
In-Reply-To: <AANLkTi=qMvsWfQsDXrsJZuvKmp3F_pV535kRwYXa-PGg@mail.gmail.com>
References: <AANLkTin8f1hyXNUTNK9DRaYxp+g4=W+voaNK4GN4nVqS@mail.gmail.com>
	<AANLkTi=qMvsWfQsDXrsJZuvKmp3F_pV535kRwYXa-PGg@mail.gmail.com>
Message-ID: <AANLkTi=99Z_fMZknmMs+hdeXEk+Qm9oPzgvn5Zxv7as9@mail.gmail.com>

Hi,

On 2 February 2011 15:57, Pieter Baele <pieter.baele at gmail.com> wrote:
> On Wed, Feb 2, 2011 at 16:47, Pieter Baele <pieter.baele at gmail.com> wrote:

<snip>
> I also received this warning some minutes ago:
>
> Message from syslogd at x at Feb ?2 16:54:35 ...
> ?corosync[6368]: ? [TOTEM ] LOGSYS EMERGENCY: TOTEM Unable to write to
> /var/log/cluster/corosync.log.
>
> Message from syslogd at x at Feb ?2 16:54:36 ...
> ?corosync[6368]: ? [QUORUM] LOGSYS EMERGENCY: QUORUM Unable to write
> to /var/log/cluster/corosync.log.
>
> Do I have to fiddle with the mcast parameters?
>

I know this is obvious but is /var/log full?

-- 
Best Regards,

Brett Delle Grazie



From pieter.baele at gmail.com  Wed Feb  2 19:59:54 2011
From: pieter.baele at gmail.com (Pieter Baele)
Date: Wed, 2 Feb 2011 20:59:54 +0100
Subject: [Linux-cluster] clvmd mirroring problems on split-site SAN (2
 node cluster) - cpg_dispatch failed: SA_AIS_ERR_LIBRARY
In-Reply-To: <AANLkTi=99Z_fMZknmMs+hdeXEk+Qm9oPzgvn5Zxv7as9@mail.gmail.com>
References: <AANLkTin8f1hyXNUTNK9DRaYxp+g4=W+voaNK4GN4nVqS@mail.gmail.com>
	<AANLkTi=qMvsWfQsDXrsJZuvKmp3F_pV535kRwYXa-PGg@mail.gmail.com>
	<AANLkTi=99Z_fMZknmMs+hdeXEk+Qm9oPzgvn5Zxv7as9@mail.gmail.com>
Message-ID: <AANLkTinypeXsJmoycVRLvwyYQZ4S-h47H9TRUGe-zqQN@mail.gmail.com>

On Wed, Feb 2, 2011 at 17:52, Brett Delle Grazie
<brett.dellegrazie at gmail.com> wrote:
> Hi,
>>
>> Do I have to fiddle with the mcast parameters?
>>
>
> I know this is obvious but is /var/log full?
>

> 4 GB free ;-)

Regards, Pieter



From dgmorales at gmail.com  Wed Feb  2 22:53:33 2011
From: dgmorales at gmail.com (Diego Morales)
Date: Wed, 2 Feb 2011 20:53:33 -0200
Subject: [Linux-cluster] Fence agent for Citrix XenServer / XCP
Message-ID: <AANLkTinxMkXUtcoitMQG0p0e53VnAUB74VXRzC1+YaeL@mail.gmail.com>

I'm setting up some GFS clusters on top of Citrix XenServer (XCP is
its "free as in freedom" counterpart). And so I'm looking for some
fencing agents to use with that.

I did some googling and it seems that these do not support libvirt (at
least not "officially", not yet).  So I guess the use of fence_xvm or
fence_virsh may be tricky or even impossible.

What I expected to find was some fence_agent built using XenAPI.py
(that they support) or SSH'ing and using the xe command. Didn't find,
thought about doing it myself. But before that... probably I'm not the
only one using rhcs & friends on XenServer, so does anybody has some
nice pointers?


Thanks in advance,


Diego Morales



From sklemer at gmail.com  Thu Feb  3 07:26:35 2011
From: sklemer at gmail.com (=?UTF-8?B?16nXnNeV150g16fXnNee16g=?=)
Date: Thu, 3 Feb 2011 09:26:35 +0200
Subject: [Linux-cluster] falied to implement HA-LVM with clvmd rhcs5.6
Message-ID: <AANLkTimBQ6_2PDr=pwQO=Hnb5Xsp9nxM4BXzyXsawag5@mail.gmail.com>

Hello.



I followed redhat instruction trying install HA-LVM with clvmd. ( rhcs 5.6 -
rgmanager 2.0.52-9 )



I can't make it work.



lvm.conf- locking_type=3

clvmd work

Its failed saying HA-LVM is not configured correctly.

The manual said that we should run "lvchange -a n lvxx" edit the
cluster.conf & start the service.



But From lvm.conf :



case $1 in

start)

        if ! [[ $(vgs -o attr --noheadings $OCF_RESKEY_vg_name) =~ .....c
]]; then

                ha_lvm_proper_setup_check || exit 1



If the vg is not taged as cluster than the ha_lvm is looking for volume_list
in lvm.conf.



I am confused- Does the VG should taged as cluster ??  ( BTW - the old
fashion HA-LVM is worked with no problems )


redhat instructions :


*To set up HA LVM Failover (using the preferred CLVM variant), perform the
following steps:*



1. Ensure that the parameter locking_type in the global section of
/etc/lvm/lvm.conf is set to the value '3', that all the necessary LVM
cluster packages are installed, and the necessary daemons are started (like
'clvmd' and the cluster mirror log daemon - if necessary).



2. Create the logical volume and filesystem using standard LVM2 and file
system commands. For example:

# pvcreate /dev/sd[cde]1

 # vgcreate <volume group name> /dev/sd[cde]1

 # lvcreate -L 10G -n <logical volume name> <volume group name>

 # mkfs.ext3 /dev/<volume group name>/<logical volume name>

 # lvchange -an <volume group name>/<logical volume name>



3. Edit /etc/cluster/cluster.conf to include the newly created logical
volume as a resource in one of your services. Alternatively, configuration
tools such as Conga or system-config-cluster may be used to create these
entries.  Below is a sample resource manager section from
/etc/cluster/cluster.conf:



<rm>      <failoverdomains>        <failoverdomain name="FD"
ordered="1" restricted="0">           <failoverdomainnode
name="neo-01" priority="1"/>           <failoverdomainnode
name="neo-02" priority="2"/>        </failoverdomain>
</failoverdomains>    <resources>        <lvm name="lvm"
vg_name="shared_vg" lv_name="ha-lv"/>        <fs name="FS"
device="/dev/shared_vg/ha-lv" force_fsck="0" force_unmount="1"
fsid="64050" fstype="ext3" mountpoint="/mnt" options=""
self_fence="0"/>    </resources>    <service autostart="1" domain="FD"
name="serv" recovery="relocate">        <lvm ref="lvm"/>        <fs
ref="FS"/>    </service> </rm>




Regards

Shalom.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110203/19b27daf/attachment.htm>

From pieter.baele at gmail.com  Thu Feb  3 07:37:20 2011
From: pieter.baele at gmail.com (Pieter Baele)
Date: Thu, 3 Feb 2011 08:37:20 +0100
Subject: [Linux-cluster] clvmd mirroring problems on split-site SAN (2
 node cluster) - cpg_dispatch failed: SA_AIS_ERR_LIBRARY
In-Reply-To: <AANLkTinypeXsJmoycVRLvwyYQZ4S-h47H9TRUGe-zqQN@mail.gmail.com>
References: <AANLkTin8f1hyXNUTNK9DRaYxp+g4=W+voaNK4GN4nVqS@mail.gmail.com>
	<AANLkTi=qMvsWfQsDXrsJZuvKmp3F_pV535kRwYXa-PGg@mail.gmail.com>
	<AANLkTi=99Z_fMZknmMs+hdeXEk+Qm9oPzgvn5Zxv7as9@mail.gmail.com>
	<AANLkTinypeXsJmoycVRLvwyYQZ4S-h47H9TRUGe-zqQN@mail.gmail.com>
Message-ID: <AANLkTikFWSQFprpC_AH91yOzBagVQ_YWoidXhsVteuyy@mail.gmail.com>

On Wed, Feb 2, 2011 at 20:59, Pieter Baele <pieter.baele at gmail.com> wrote:
> On Wed, Feb 2, 2011 at 17:52, Brett Delle Grazie
> <brett.dellegrazie at gmail.com> wrote:
>> Hi,
>>>
>>> Do I have to fiddle with the mcast parameters?
>>>
>>
>> I know this is obvious but is /var/log full?

I was wrong, looked at the wrong server
/var/log/messages is full very very fast

always the same message:

Feb  3 08:35:13 nodex cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY



From corey.kovacs at gmail.com  Thu Feb  3 09:13:47 2011
From: corey.kovacs at gmail.com (Corey Kovacs)
Date: Thu, 3 Feb 2011 09:13:47 +0000
Subject: [Linux-cluster] falied to implement HA-LVM with clvmd rhcs5.6
In-Reply-To: <AANLkTimBQ6_2PDr=pwQO=Hnb5Xsp9nxM4BXzyXsawag5@mail.gmail.com>
References: <AANLkTimBQ6_2PDr=pwQO=Hnb5Xsp9nxM4BXzyXsawag5@mail.gmail.com>
Message-ID: <AANLkTineaTjMex9oVdNysG3=RBsPg21J3PENoCSmm0aV@mail.gmail.com>

Is using ha-lvm with clvmd a new capability? It's always been my
understanding that the lvm locking type for using ha-lvm had to be set
to '1'.

I'd much rather be using clvmd if it is the way to go. Can you point
me to the docs you are seeing these instructions in please?

As for why your config isn't working, clvmd requires that it's
resources are indeed tagged as cluster volumes, so you might try doing
that and see how it goes.

-C

On Thu, Feb 3, 2011 at 7:26 AM, ???? ???? <sklemer at gmail.com> wrote:
> Hello.
>
>
>
> I followed redhat instruction trying install HA-LVM with clvmd. ( rhcs 5.6 -
> rgmanager 2.0.52-9 )
>
>
>
> I can't make it work.
>
>
>
> lvm.conf- locking_type=3
>
> clvmd work
>
> Its failed saying HA-LVM is not configured correctly.
>
> The manual said that we should run "lvchange -a n lvxx" edit the
> cluster.conf & start the service.
>
>
>
> But From lvm.conf :
>
>
>
> case $1 in
>
> start)
>
> ?? ? ? ?if ! [[ $(vgs -o attr --noheadings $OCF_RESKEY_vg_name) =~ .....c
> ]]; then
>
> ?? ? ? ? ? ? ? ?ha_lvm_proper_setup_check || exit 1
>
>
>
> If the vg is not taged as cluster than the ha_lvm is looking for volume_list
> in lvm.conf.
>
>
>
> I am confused- Does the VG should taged as cluster ?? ?( BTW - the old
> fashion HA-LVM is worked with no problems )
>
> redhat instructions :
>
> To set up HA LVM Failover (using the preferred CLVM variant), perform the
> following steps:
>
>
>
> 1. Ensure that the parameter?locking_type?in the global section
> of?/etc/lvm/lvm.conf?is set to the value?'3', that all the necessary LVM
> cluster packages are installed, and the necessary daemons are started (like
> 'clvmd' and the cluster mirror log daemon - if necessary).
>
>
>
> 2. Create the logical volume and filesystem using standard LVM2 and file
> system commands. For example:
>
> # pvcreate /dev/sd[cde]1
>
> ?# vgcreate <volume group name> /dev/sd[cde]1
>
> ?# lvcreate -L 10G -n <logical volume name> <volume group name>
>
> ?# mkfs.ext3 /dev/<volume group name>/<logical volume name>
>
> ?# lvchange -an <volume group name>/<logical volume name>
>
>
>
> 3. Edit /etc/cluster/cluster.conf to include the newly created logical
> volume as a resource in one of your services. Alternatively, configuration
> tools such as?Conga?or?system-config-cluster?may be used to create these
> entries.? Below is a sample resource manager section
> from?/etc/cluster/cluster.conf:
>
>
>
> <rm>?  ?? <failoverdomains> ?????? <failoverdomain name="FD" ordered="1"
> restricted="0"> ????????? <failoverdomainnode name="neo-01" priority="1"/>
> ????????? <failoverdomainnode name="neo-02" priority="2"/>
> </failoverdomain> ?? </failoverdomains> ?? <resources> ?????? <lvm
> name="lvm" vg_name="shared_vg" lv_name="ha-lv"/> ?????? <fs name="FS"
> device="/dev/shared_vg/ha-lv" force_fsck="0" force_unmount="1" fsid="64050"
> fstype="ext3" mountpoint="/mnt" options="" self_fence="0"/> ?? </resources>
> ?? <service autostart="1" domain="FD" name="serv" recovery="relocate">
> ?????? <lvm ref="lvm"/> ?????? <fs ref="FS"/> ?? </service> </rm>
>
>
>
> Regards
>
> Shalom.
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>



From brett.dellegrazie at gmail.com  Thu Feb  3 09:17:14 2011
From: brett.dellegrazie at gmail.com (Brett Delle Grazie)
Date: Thu, 3 Feb 2011 09:17:14 +0000
Subject: [Linux-cluster] clvmd mirroring problems on split-site SAN (2
 node cluster) - cpg_dispatch failed: SA_AIS_ERR_LIBRARY
In-Reply-To: <AANLkTikFWSQFprpC_AH91yOzBagVQ_YWoidXhsVteuyy@mail.gmail.com>
References: <AANLkTin8f1hyXNUTNK9DRaYxp+g4=W+voaNK4GN4nVqS@mail.gmail.com>
	<AANLkTi=qMvsWfQsDXrsJZuvKmp3F_pV535kRwYXa-PGg@mail.gmail.com>
	<AANLkTi=99Z_fMZknmMs+hdeXEk+Qm9oPzgvn5Zxv7as9@mail.gmail.com>
	<AANLkTinypeXsJmoycVRLvwyYQZ4S-h47H9TRUGe-zqQN@mail.gmail.com>
	<AANLkTikFWSQFprpC_AH91yOzBagVQ_YWoidXhsVteuyy@mail.gmail.com>
Message-ID: <AANLkTimPyHHLFBNrKNTAG6FGtQ0b25VbSVCMOr+1-PGK@mail.gmail.com>

On 3 February 2011 07:37, Pieter Baele <pieter.baele at gmail.com> wrote:
> On Wed, Feb 2, 2011 at 20:59, Pieter Baele <pieter.baele at gmail.com> wrote:
>> On Wed, Feb 2, 2011 at 17:52, Brett Delle Grazie
>> <brett.dellegrazie at gmail.com> wrote:
>>> Hi,
>>>>
>>>> Do I have to fiddle with the mcast parameters?
>>>>
>>>
>>> I know this is obvious but is /var/log full?
>
> I was wrong, looked at the wrong server
> /var/log/messages is full very very fast
>
> always the same message:
>
> Feb ?3 08:35:13 nodex cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY

Something is very odd and/or broken.

What versions are you running:
OS:?
Kernel:?
cman:?
openais:?
lvm2:?
lvm2-cluster:?

This is one you're probably going to have to raise with RedHat or
someone on this list far
more experienced than I.

-- 
Best Regards,

Brett Delle Grazie



From sklemer at gmail.com  Thu Feb  3 10:35:13 2011
From: sklemer at gmail.com (=?UTF-8?B?16nXnNeV150g16fXnNee16g=?=)
Date: Thu, 3 Feb 2011 12:35:13 +0200
Subject: [Linux-cluster] falied to implement HA-LVM with clvmd rhcs5.6
In-Reply-To: <AANLkTineaTjMex9oVdNysG3=RBsPg21J3PENoCSmm0aV@mail.gmail.com>
References: <AANLkTimBQ6_2PDr=pwQO=Hnb5Xsp9nxM4BXzyXsawag5@mail.gmail.com>
	<AANLkTineaTjMex9oVdNysG3=RBsPg21J3PENoCSmm0aV@mail.gmail.com>
Message-ID: <AANLkTi=AXnPoC5kpoH=Yq8W3tc71onrcY=E9aHQCU91g@mail.gmail.com>

https://access.redhat.com/kb/docs/DOC-3068

On Thu, Feb 3, 2011 at 11:13 AM, Corey Kovacs <corey.kovacs at gmail.com>wrote:

> Is using ha-lvm with clvmd a new capability? It's always been my
> understanding that the lvm locking type for using ha-lvm had to be set
> to '1'.
>
> I'd much rather be using clvmd if it is the way to go. Can you point
> me to the docs you are seeing these instructions in please?
>
> As for why your config isn't working, clvmd requires that it's
> resources are indeed tagged as cluster volumes, so you might try doing
> that and see how it goes.
>
> -C
>
> On Thu, Feb 3, 2011 at 7:26 AM, ???? ???? <sklemer at gmail.com> wrote:
> > Hello.
> >
> >
> >
> > I followed redhat instruction trying install HA-LVM with clvmd. ( rhcs
> 5.6 -
> > rgmanager 2.0.52-9 )
> >
> >
> >
> > I can't make it work.
> >
> >
> >
> > lvm.conf- locking_type=3
> >
> > clvmd work
> >
> > Its failed saying HA-LVM is not configured correctly.
> >
> > The manual said that we should run "lvchange -a n lvxx" edit the
> > cluster.conf & start the service.
> >
> >
> >
> > But From lvm.conf :
> >
> >
> >
> > case $1 in
> >
> > start)
> >
> >         if ! [[ $(vgs -o attr --noheadings $OCF_RESKEY_vg_name) =~ .....c
> > ]]; then
> >
> >                 ha_lvm_proper_setup_check || exit 1
> >
> >
> >
> > If the vg is not taged as cluster than the ha_lvm is looking for
> volume_list
> > in lvm.conf.
> >
> >
> >
> > I am confused- Does the VG should taged as cluster ??  ( BTW - the old
> > fashion HA-LVM is worked with no problems )
> >
> > redhat instructions :
> >
> > To set up HA LVM Failover (using the preferred CLVM variant), perform the
> > following steps:
> >
> >
> >
> > 1. Ensure that the parameter locking_type in the global section
> > of /etc/lvm/lvm.conf is set to the value '3', that all the necessary LVM
> > cluster packages are installed, and the necessary daemons are started
> (like
> > 'clvmd' and the cluster mirror log daemon - if necessary).
> >
> >
> >
> > 2. Create the logical volume and filesystem using standard LVM2 and file
> > system commands. For example:
> >
> > # pvcreate /dev/sd[cde]1
> >
> >  # vgcreate <volume group name> /dev/sd[cde]1
> >
> >  # lvcreate -L 10G -n <logical volume name> <volume group name>
> >
> >  # mkfs.ext3 /dev/<volume group name>/<logical volume name>
> >
> >  # lvchange -an <volume group name>/<logical volume name>
> >
> >
> >
> > 3. Edit /etc/cluster/cluster.conf to include the newly created logical
> > volume as a resource in one of your services. Alternatively,
> configuration
> > tools such as Conga or system-config-cluster may be used to create these
> > entries.  Below is a sample resource manager section
> > from /etc/cluster/cluster.conf:
> >
> >
> >
> > <rm>      <failoverdomains>        <failoverdomain name="FD" ordered="1"
> > restricted="0">           <failoverdomainnode name="neo-01"
> priority="1"/>
> >           <failoverdomainnode name="neo-02" priority="2"/>
> > </failoverdomain>    </failoverdomains>    <resources>        <lvm
> > name="lvm" vg_name="shared_vg" lv_name="ha-lv"/>        <fs name="FS"
> > device="/dev/shared_vg/ha-lv" force_fsck="0" force_unmount="1"
> fsid="64050"
> > fstype="ext3" mountpoint="/mnt" options="" self_fence="0"/>
> </resources>
> >    <service autostart="1" domain="FD" name="serv" recovery="relocate">
> >        <lvm ref="lvm"/>        <fs ref="FS"/>    </service> </rm>
> >
> >
> >
> > Regards
> >
> > Shalom.
> >
> >
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110203/9600ec62/attachment.htm>

From sklemer at gmail.com  Thu Feb  3 10:38:54 2011
From: sklemer at gmail.com (=?UTF-8?B?16nXnNeV150g16fXnNee16g=?=)
Date: Thu, 3 Feb 2011 12:38:54 +0200
Subject: [Linux-cluster] falied to implement HA-LVM with clvmd rhcs5.6
In-Reply-To: <AANLkTi=AXnPoC5kpoH=Yq8W3tc71onrcY=E9aHQCU91g@mail.gmail.com>
References: <AANLkTimBQ6_2PDr=pwQO=Hnb5Xsp9nxM4BXzyXsawag5@mail.gmail.com>
	<AANLkTineaTjMex9oVdNysG3=RBsPg21J3PENoCSmm0aV@mail.gmail.com>
	<AANLkTi=AXnPoC5kpoH=Yq8W3tc71onrcY=E9aHQCU91g@mail.gmail.com>
Message-ID: <AANLkTik0F+tUWNtW65tDvpmEeLRErdo=eCEnRwmBwM9K@mail.gmail.com>

On Thu, Feb 3, 2011 at 12:35 PM, ???? ???? <sklemer at gmail.com> wrote:

>
> https://access.redhat.com/kb/docs/DOC-3068
>
>
> On Thu, Feb 3, 2011 at 11:13 AM, Corey Kovacs <corey.kovacs at gmail.com>wrote:
>
>> Is using ha-lvm with clvmd a new capability? It's always been my
>> understanding that the lvm locking type for using ha-lvm had to be set
>> to '1'.
>>
>> I'd much rather be using clvmd if it is the way to go. Can you point
>> me to the docs you are seeing these instructions in please?
>>
>> As for why your config isn't working, clvmd requires that it's
>> resources are indeed tagged as cluster volumes, so you might try doing
>> that and see how it goes.
>>
>> -C
>>
>> On Thu, Feb 3, 2011 at 7:26 AM, ???? ???? <sklemer at gmail.com> wrote:
>> > Hello.
>> >
>> >
>> >
>> > I followed redhat instruction trying install HA-LVM with clvmd. ( rhcs
>> 5.6 -
>> > rgmanager 2.0.52-9 )
>> >
>> >
>> >
>> > I can't make it work.
>> >
>> >
>> >
>> > lvm.conf- locking_type=3
>> >
>> > clvmd work
>> >
>> > Its failed saying HA-LVM is not configured correctly.
>> >
>> > The manual said that we should run "lvchange -a n lvxx" edit the
>> > cluster.conf & start the service.
>> >
>> >
>> >
>> > But From lvm.conf :
>> >
>> >
>> >
>> > case $1 in
>> >
>> > start)
>> >
>> >         if ! [[ $(vgs -o attr --noheadings $OCF_RESKEY_vg_name) =~
>> .....c
>> > ]]; then
>> >
>> >                 ha_lvm_proper_setup_check || exit 1
>> >
>> >
>> >
>> > If the vg is not taged as cluster than the ha_lvm is looking for
>> volume_list
>> > in lvm.conf.
>> >
>> >
>> >
>> > I am confused- Does the VG should taged as cluster ??  ( BTW - the old
>> > fashion HA-LVM is worked with no problems )
>> >
>> > redhat instructions :
>> >
>> > To set up HA LVM Failover (using the preferred CLVM variant), perform
>> the
>> > following steps:
>> >
>> >
>> >
>> > 1. Ensure that the parameter locking_type in the global section
>> > of /etc/lvm/lvm.conf is set to the value '3', that all the necessary LVM
>> > cluster packages are installed, and the necessary daemons are started
>> (like
>> > 'clvmd' and the cluster mirror log daemon - if necessary).
>> >
>> >
>> >
>> > 2. Create the logical volume and filesystem using standard LVM2 and file
>> > system commands. For example:
>> >
>> > # pvcreate /dev/sd[cde]1
>> >
>> >  # vgcreate <volume group name> /dev/sd[cde]1
>> >
>> >  # lvcreate -L 10G -n <logical volume name> <volume group name>
>> >
>> >  # mkfs.ext3 /dev/<volume group name>/<logical volume name>
>> >
>> >  # lvchange -an <volume group name>/<logical volume name>
>> >
>> >
>> >
>> > 3. Edit /etc/cluster/cluster.conf to include the newly created logical
>> > volume as a resource in one of your services. Alternatively,
>> configuration
>> > tools such as Conga or system-config-cluster may be used to create these
>> > entries.  Below is a sample resource manager section
>> > from /etc/cluster/cluster.conf:
>> >
>> >
>> >
>> > <rm>      <failoverdomains>        <failoverdomain name="FD" ordered="1"
>> > restricted="0">           <failoverdomainnode name="neo-01"
>> priority="1"/>
>> >           <failoverdomainnode name="neo-02" priority="2"/>
>> > </failoverdomain>    </failoverdomains>    <resources>        <lvm
>> > name="lvm" vg_name="shared_vg" lv_name="ha-lv"/>        <fs name="FS"
>> > device="/dev/shared_vg/ha-lv" force_fsck="0" force_unmount="1"
>> fsid="64050"
>> > fstype="ext3" mountpoint="/mnt" options="" self_fence="0"/>
>> </resources>
>> >    <service autostart="1" domain="FD" name="serv" recovery="relocate">
>> >        <lvm ref="lvm"/>        <fs ref="FS"/>    </service> </rm>
>> >
>> >
>> >
>> > Regards
>> >
>> > Shalom.
>> >
>> >
>> >
>> > --
>> > Linux-cluster mailing list
>> > Linux-cluster at redhat.com
>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>> >
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110203/7d486938/attachment.htm>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110203/7d486938/attachment-0001.htm>

From brett.dellegrazie at gmail.com  Thu Feb  3 10:39:43 2011
From: brett.dellegrazie at gmail.com (Brett Delle Grazie)
Date: Thu, 3 Feb 2011 10:39:43 +0000
Subject: [Linux-cluster] clvmd mirroring problems on split-site SAN (2
 node cluster) - cpg_dispatch failed: SA_AIS_ERR_LIBRARY
In-Reply-To: <AANLkTimnjnPyfvc559o5v17fH04Z6a6W3fDdsPe6m4VH@mail.gmail.com>
References: <AANLkTin8f1hyXNUTNK9DRaYxp+g4=W+voaNK4GN4nVqS@mail.gmail.com>
	<AANLkTi=qMvsWfQsDXrsJZuvKmp3F_pV535kRwYXa-PGg@mail.gmail.com>
	<AANLkTi=99Z_fMZknmMs+hdeXEk+Qm9oPzgvn5Zxv7as9@mail.gmail.com>
	<AANLkTinypeXsJmoycVRLvwyYQZ4S-h47H9TRUGe-zqQN@mail.gmail.com>
	<AANLkTikFWSQFprpC_AH91yOzBagVQ_YWoidXhsVteuyy@mail.gmail.com>
	<AANLkTimPyHHLFBNrKNTAG6FGtQ0b25VbSVCMOr+1-PGK@mail.gmail.com>
	<AANLkTimnjnPyfvc559o5v17fH04Z6a6W3fDdsPe6m4VH@mail.gmail.com>
Message-ID: <AANLkTikQQ3WQTGupcZX9s=vRhAkEA8SyQh4WuxCRnPW8@mail.gmail.com>

On 3 February 2011 10:22, Pieter Baele <pieter.baele at gmail.com> wrote:
> On Thu, Feb 3, 2011 at 10:17, Brett Delle Grazie
> <brett.dellegrazie at gmail.com> wrote:
>> On 3 February 2011 07:37, Pieter Baele <pieter.baele at gmail.com> wrote:
>>> Feb ?3 08:35:13 nodex cmirrord[3682]: cpg_dispatch failed: SA_AIS_ERR_LIBRARY
>>
>> Something is very odd and/or broken.
>>
>> What versions are you running:
>> OS:?
>> Kernel:?
>> cman:?
>> openais:?
>> lvm2:?
>> lvm2-cluster:?
>>
>
> OS: RH 6.0
> Kernel: 2.6.32-71.el6.x86_64
> cman-3.0.12-23.el6.x86_64
> openais-1.1.1-6.el6.x86_64
> lvm2-2.02.72-8.el6.x86_64
> lvm2-cluster-2.02.72-8.el6.x86_64
>
>> This is one you're probably going to have to raise with RedHat or
>> someone on this list far
>> more experienced than I.
>>
> Already placed it on the customer portal as well.
> But mailing list are a better way to get the right specialists ;-)

Then I suggest you mail lvm2 and/or OpenAIS mailing lists as well.

Please don't take the discussion off list as it prevents others who
have your problem from
seeing the solution. CLVMD mirroring is quite new and its unlikely
many people have experience
with it.  At this stage, there is nothing else I can provide apart
from suggesting more obvious things
like checking the state of OpenAIS / Corosync / whatever back-end
you're using and checking you
have no networking issues (failures, packet drops, broken multicast etc.)

Good luck.

>
> Greetings, PieterB
>

-- 
Best Regards,

Brett Delle Grazie



From corey.kovacs at gmail.com  Thu Feb  3 10:49:18 2011
From: corey.kovacs at gmail.com (Corey Kovacs)
Date: Thu, 3 Feb 2011 10:49:18 +0000
Subject: [Linux-cluster] Multi-homing in rhel5
Message-ID: <AANLkTimAesyx5mz4eRrcVKj9O0ZeeoUxHEo-kBMge0A3@mail.gmail.com>

The cluster2 docs outline a procedure for multihoming which is
unsupported by redhat.

Is anyone actually using this method or are people more inclined to
use configs in which secondary interfaces are given names by which the
cluster then uses them as primary config nodes.

For example, on my cluster I have eth0 as the primary interface for
all normal system traffic, and eth1 as my cluster interconnect.

eth0 - nodename
eth1 - nodename-clu <-- cluster config points to this as nodes....

clients access the cluster services via eth0.

I've seen other configs where people configure the cluster to use eth0
for cluster coms so that ricci/luci work correctly, but I don't use
those.

Is there an advantage of one method over the other ?



From corey.kovacs at gmail.com  Thu Feb  3 14:32:20 2011
From: corey.kovacs at gmail.com (Corey Kovacs)
Date: Thu, 3 Feb 2011 14:32:20 +0000
Subject: [Linux-cluster] falied to implement HA-LVM with clvmd rhcs5.6
In-Reply-To: <AANLkTik0F+tUWNtW65tDvpmEeLRErdo=eCEnRwmBwM9K@mail.gmail.com>
References: <AANLkTimBQ6_2PDr=pwQO=Hnb5Xsp9nxM4BXzyXsawag5@mail.gmail.com>
	<AANLkTineaTjMex9oVdNysG3=RBsPg21J3PENoCSmm0aV@mail.gmail.com>
	<AANLkTi=AXnPoC5kpoH=Yq8W3tc71onrcY=E9aHQCU91g@mail.gmail.com>
	<AANLkTik0F+tUWNtW65tDvpmEeLRErdo=eCEnRwmBwM9K@mail.gmail.com>
Message-ID: <AANLkTimHa+MhiiE0e2+ZzbMnxWyQvwpv=s6geXBsWcVQ@mail.gmail.com>

Excellent,


Thanks

-C

On Thu, Feb 3, 2011 at 10:38 AM, ???? ???? <sklemer at gmail.com> wrote:
>
>
> On Thu, Feb 3, 2011 at 12:35 PM, ???? ???? <sklemer at gmail.com> wrote:
>>
>>
>>
>> https://access.redhat.com/kb/docs/DOC-3068
>>
>> On Thu, Feb 3, 2011 at 11:13 AM, Corey Kovacs <corey.kovacs at gmail.com>
>> wrote:
>>>
>>> Is using ha-lvm with clvmd a new capability? It's always been my
>>> understanding that the lvm locking type for using ha-lvm had to be set
>>> to '1'.
>>>
>>> I'd much rather be using clvmd if it is the way to go. Can you point
>>> me to the docs you are seeing these instructions in please?
>>>
>>> As for why your config isn't working, clvmd requires that it's
>>> resources are indeed tagged as cluster volumes, so you might try doing
>>> that and see how it goes.
>>>
>>> -C
>>>
>>> On Thu, Feb 3, 2011 at 7:26 AM, ???? ???? <sklemer at gmail.com> wrote:
>>> > Hello.
>>> >
>>> >
>>> >
>>> > I followed redhat instruction trying install HA-LVM with clvmd. ( rhcs
>>> > 5.6 -
>>> > rgmanager 2.0.52-9 )
>>> >
>>> >
>>> >
>>> > I can't make it work.
>>> >
>>> >
>>> >
>>> > lvm.conf- locking_type=3
>>> >
>>> > clvmd work
>>> >
>>> > Its failed saying HA-LVM is not configured correctly.
>>> >
>>> > The manual said that we should run "lvchange -a n lvxx" edit the
>>> > cluster.conf & start the service.
>>> >
>>> >
>>> >
>>> > But From lvm.conf :
>>> >
>>> >
>>> >
>>> > case $1 in
>>> >
>>> > start)
>>> >
>>> > ?? ? ? ?if ! [[ $(vgs -o attr --noheadings $OCF_RESKEY_vg_name) =~
>>> > .....c
>>> > ]]; then
>>> >
>>> > ?? ? ? ? ? ? ? ?ha_lvm_proper_setup_check || exit 1
>>> >
>>> >
>>> >
>>> > If the vg is not taged as cluster than the ha_lvm is looking for
>>> > volume_list
>>> > in lvm.conf.
>>> >
>>> >
>>> >
>>> > I am confused- Does the VG should taged as cluster ?? ?( BTW - the old
>>> > fashion HA-LVM is worked with no problems )
>>> >
>>> > redhat instructions :
>>> >
>>> > To set up HA LVM Failover (using the preferred CLVM variant), perform
>>> > the
>>> > following steps:
>>> >
>>> >
>>> >
>>> > 1. Ensure that the parameter?locking_type?in the global section
>>> > of?/etc/lvm/lvm.conf?is set to the value?'3', that all the necessary
>>> > LVM
>>> > cluster packages are installed, and the necessary daemons are started
>>> > (like
>>> > 'clvmd' and the cluster mirror log daemon - if necessary).
>>> >
>>> >
>>> >
>>> > 2. Create the logical volume and filesystem using standard LVM2 and
>>> > file
>>> > system commands. For example:
>>> >
>>> > # pvcreate /dev/sd[cde]1
>>> >
>>> > ?# vgcreate <volume group name> /dev/sd[cde]1
>>> >
>>> > ?# lvcreate -L 10G -n <logical volume name> <volume group name>
>>> >
>>> > ?# mkfs.ext3 /dev/<volume group name>/<logical volume name>
>>> >
>>> > ?# lvchange -an <volume group name>/<logical volume name>
>>> >
>>> >
>>> >
>>> > 3. Edit /etc/cluster/cluster.conf to include the newly created logical
>>> > volume as a resource in one of your services. Alternatively,
>>> > configuration
>>> > tools such as?Conga?or?system-config-cluster?may be used to create
>>> > these
>>> > entries.? Below is a sample resource manager section
>>> > from?/etc/cluster/cluster.conf:
>>> >
>>> >
>>> >
>>> > <rm>? ??? <failoverdomains> ?????? <failoverdomain name="FD"
>>> > ordered="1"
>>> > restricted="0"> ????????? <failoverdomainnode name="neo-01"
>>> > priority="1"/>
>>> > ????????? <failoverdomainnode name="neo-02" priority="2"/>
>>> > </failoverdomain> ?? </failoverdomains> ?? <resources> ?????? <lvm
>>> > name="lvm" vg_name="shared_vg" lv_name="ha-lv"/> ?????? <fs name="FS"
>>> > device="/dev/shared_vg/ha-lv" force_fsck="0" force_unmount="1"
>>> > fsid="64050"
>>> > fstype="ext3" mountpoint="/mnt" options="" self_fence="0"/>
>>> > </resources>
>>> > ?? <service autostart="1" domain="FD" name="serv" recovery="relocate">
>>> > ?????? <lvm ref="lvm"/> ?????? <fs ref="FS"/> ?? </service> </rm>
>>> >
>>> >
>>> >
>>> > Regards
>>> >
>>> > Shalom.
>>> >
>>> >
>>> >
>>> > --
>>> > Linux-cluster mailing list
>>> > Linux-cluster at redhat.com
>>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>>> >
>>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>



From Colin.Simpson at iongeo.com  Thu Feb  3 15:37:41 2011
From: Colin.Simpson at iongeo.com (Colin Simpson)
Date: Thu, 03 Feb 2011 15:37:41 +0000
Subject: [Linux-cluster] Multi-homing in rhel5
In-Reply-To: <AANLkTimAesyx5mz4eRrcVKj9O0ZeeoUxHEo-kBMge0A3@mail.gmail.com>
References: <AANLkTimAesyx5mz4eRrcVKj9O0ZeeoUxHEo-kBMge0A3@mail.gmail.com>
Message-ID: <1296747461.23971.15.camel@cowie.iouk.ioroot.tld>

I'd like to know best practice on this too. 

It's always seemed a bit unclear to me how to configure this if fail
over is used "alt-name". Or how good or quick the failover is or worth
it over bonding. 


Colin

On Thu, 2011-02-03 at 10:49 +0000, Corey Kovacs wrote:
> The cluster2 docs outline a procedure for multihoming which is
> unsupported by redhat.
> 
> Is anyone actually using this method or are people more inclined to
> use configs in which secondary interfaces are given names by which the
> cluster then uses them as primary config nodes.
> 
> For example, on my cluster I have eth0 as the primary interface for
> all normal system traffic, and eth1 as my cluster interconnect.
> 
> eth0 - nodename
> eth1 - nodename-clu <-- cluster config points to this as nodes....
> 
> clients access the cluster services via eth0.
> 
> I've seen other configs where people configure the cluster to use eth0
> for cluster coms so that ricci/luci work correctly, but I don't use
> those.
> 
> Is there an advantage of one method over the other ?
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 

This email and any files transmitted with it are confidential and are intended solely for the use of the individual or entity to whom they are addressed.  If you are not the original recipient or the person responsible for delivering the email to the intended recipient, be advised that you have received this email in error, and that any use, dissemination, forwarding, printing, or copying of this email is strictly prohibited. If you received this email in error, please immediately notify the sender and delete the original.





From Bennie_R_Thomas at raytheon.com  Thu Feb  3 20:35:07 2011
From: Bennie_R_Thomas at raytheon.com (Bennie Thomas)
Date: Thu, 03 Feb 2011 14:35:07 -0600
Subject: [Linux-cluster] Running cluster tools using non-root user
In-Reply-To: <AANLkTimjg0ADMaiT+V6A2=EVUnUciiQhmQkhnOg4o2N=@mail.gmail.com>
References: <AANLkTinPmPZOe77J-1mxzFovL9zoVz1vGoGr2BcniVGK@mail.gmail.com>	<AANLkTin390eSrG=RCuocD-0VjBOJQ9DXaSnms26N8QjS@mail.gmail.com>	<AANLkTikRoLgf-_A5PShjKvGzgzjm1JAA1XwE2_aKSC6A@mail.gmail.com>
	<AANLkTimjg0ADMaiT+V6A2=EVUnUciiQhmQkhnOg4o2N=@mail.gmail.com>
Message-ID: <4D4B117B.2060804@raytheon.com>

For the /usr/sbin/clustat to work for the basic user you must set uid.

chmod u+s /usr/sbin/clustat

You do not need sudo for this command to work. Now for clusvcadm. I 
would set up sudo for this, that way
you can limit the users.

Andrew Beekhof wrote:
> On Thu, Jan 27, 2011 at 10:56 AM, Parvez Shaikh
> <parvez.h.shaikh at gmail.com> wrote:
>   
>> I believe Pacemaker is not same as "RHCS"
>>     
>
> Correct. At least not yet anyway.
> Thats why I called my reply a shameless plug since it was for a
> competing project.
>
> Pacemaker does ship in RHEL6 though.
>
>   
>> or do they share code?
>>     
>
> A Pacemaker installation shares almost all the underlying
> infrastructure of what you know as RHCS - it just replaces the
> rgmanager part.
>
>   
>> If yes, in which version of RHCS would this feature would be available?
>>     
>
> We can't comment on future releases sorry.
>
>   
>> I require to enable service, disable service, and get status. I am using CLI
>> tools and any scripting trick can help me running clusvcadm and/or clustat.
>>
>> su -c "clusvcadm...." require entering password, can this also be eliminated
>> using sudoers?
>>
>> Thanks
>>
>> On Wed, Jan 26, 2011 at 3:22 PM, Andrew Beekhof <andrew at beekhof.net> wrote:
>>     
>>> [Shameless plug]
>>>
>>> The next version of Pacemaker (1.1.6) will have this feature :-)
>>> The patches were merged form our devel branch about a week ago.
>>>
>>> [/Shameless plug]
>>>
>>> On Tue, Jan 25, 2011 at 10:39 AM, Parvez Shaikh
>>> <parvez.h.shaikh at gmail.com> wrote:
>>>       
>>>> Hi all
>>>>
>>>> Is it possible to run cluster tools like clustat or clusvcadm etc. using
>>>> non-root user?
>>>>
>>>> If yes, to which groups this user should belong to? Otherwise can this
>>>> be
>>>> done using sudo(and sudoers) file.
>>>>
>>>> As of now I get following error on clustat -
>>>>
>>>> Could not connect to CMAN: Permission denied
>>>>
>>>>
>>>> Thanks,
>>>> Parvez
>>>>
>>>> --
>>>> Linux-cluster mailing list
>>>> Linux-cluster at redhat.com
>>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>
>>>>         
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>       
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>>     
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>   

-- 
Bennie Thomas
Sr. Information Systems Technologist II
Raytheon Company

972.205.4126
972.205.6363 fax
888.347.1660 pager
Bennie_R_Thomas at raytheon.com


DISCLAIMER: This message contains information that may be confidential and privileged. Unless you are the addressee (or authorized to receive mail for the addressee), you should not use, copy or disclose to anyone this message or any information contained in this message. If you have received this message in error, please so advise the sender by reply e-mail and delete this message. Thank you for your cooperation.

Any views or opinions presented are solely those of the author and do not necessarily represent those of Raytheon unless specifically stated. 
Electronic communications including email may be monitored by Raytheon
for operational or business reasons.



-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110203/58d5ea7e/attachment.htm>

From punit_j at rediffmail.com  Fri Feb  4 12:14:41 2011
From: punit_j at rediffmail.com (punit_j)
Date: 4 Feb 2011 12:14:41 -0000
Subject: [Linux-cluster] =?utf-8?q?Redhat_cluster_not_Quorate?=
Message-ID: <20110204121441.17755.qmail@f5mail-236-235.rediffmail.com>

Hi ,

I am using Redhat cluster suite for HA for my services. I have a 3+ 1 node cluster with 1 vote each for a node and also a Quoram disk with votes=3. So the total expected_votes is 7.

The problem I am facing is if my 1 node goes down it causes all the nodes to be fenced and cluster to go inquorate.

Is this an issue with my number of votes I assigned ?

Thanks and Regards,
Punit
 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110204/b89cf39b/attachment.htm>

From nehemiasjahcob at gmail.com  Fri Feb  4 12:27:28 2011
From: nehemiasjahcob at gmail.com (Nehemias Urzua Q.)
Date: Fri, 4 Feb 2011 09:27:28 -0300
Subject: [Linux-cluster] Redhat cluster not Quorate
In-Reply-To: <20110204121441.17755.qmail@f5mail-236-235.rediffmail.com>
References: <20110204121441.17755.qmail@f5mail-236-235.rediffmail.com>
Message-ID: <AANLkTimsPO2_JXcbZXcYu=W+qL0XXGxTC78+a-t0gh+o@mail.gmail.com>

Hi

You can send your configuration file please.

best regards

2011/2/4 punit_j <punit_j at rediffmail.com>

> Hi ,
>
> I am using Redhat cluster suite for HA for my services. I have a 3+ 1 node
> cluster with 1 vote each for a node and also a Quoram disk with votes=3. So
> the total expected_votes is 7.
>
> The problem I am facing is if my 1 node goes down it causes all the nodes
> to be fenced and cluster to go inquorate.
>
> Is this an issue with my number of votes I assigned ?
>
> Thanks and Regards,
> Punit
>
>
> <http://sigads.rediff.com/RealMedia/ads/click_nx.ads/www.rediffmail.com/signatureline.htm at Middle?>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110204/80a5e6c0/attachment.htm>

From sklemer at gmail.com  Fri Feb  4 13:13:01 2011
From: sklemer at gmail.com (=?UTF-8?B?16nXnNeV150g16fXnNee16g=?=)
Date: Fri, 4 Feb 2011 15:13:01 +0200
Subject: [Linux-cluster] Redhat cluster not Quorate
In-Reply-To: <20110204121441.17755.qmail@f5mail-236-235.rediffmail.com>
References: <20110204121441.17755.qmail@f5mail-236-235.rediffmail.com>
Message-ID: <AANLkTi=XXLY-fbFRCpg6Com61yDNgA8QvFONjh3pETc9@mail.gmail.com>

Hi.
Can you please attach the cluster.conf file ?

Shalom.

On Fri, Feb 4, 2011 at 2:14 PM, punit_j <punit_j at rediffmail.com> wrote:

> Hi ,
>
> I am using Redhat cluster suite for HA for my services. I have a 3+ 1 node
> cluster with 1 vote each for a node and also a Quoram disk with votes=3. So
> the total expected_votes is 7.
>
> The problem I am facing is if my 1 node goes down it causes all the nodes
> to be fenced and cluster to go inquorate.
>
> Is this an issue with my number of votes I assigned ?
>
> Thanks and Regards,
> Punit
>
>
> <http://sigads.rediff.com/RealMedia/ads/click_nx.ads/www.rediffmail.com/signatureline.htm at Middle?>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110204/4f88b2d7/attachment.htm>

From share2dom at gmail.com  Fri Feb  4 14:32:55 2011
From: share2dom at gmail.com (Dominic Geevarghese)
Date: Fri, 4 Feb 2011 20:02:55 +0530
Subject: [Linux-cluster] falied to implement HA-LVM with clvmd rhcs5.6
In-Reply-To: <AANLkTimHa+MhiiE0e2+ZzbMnxWyQvwpv=s6geXBsWcVQ@mail.gmail.com>
References: <AANLkTimBQ6_2PDr=pwQO=Hnb5Xsp9nxM4BXzyXsawag5@mail.gmail.com>
	<AANLkTineaTjMex9oVdNysG3=RBsPg21J3PENoCSmm0aV@mail.gmail.com>
	<AANLkTi=AXnPoC5kpoH=Yq8W3tc71onrcY=E9aHQCU91g@mail.gmail.com>
	<AANLkTik0F+tUWNtW65tDvpmEeLRErdo=eCEnRwmBwM9K@mail.gmail.com>
	<AANLkTimHa+MhiiE0e2+ZzbMnxWyQvwpv=s6geXBsWcVQ@mail.gmail.com>
Message-ID: <AANLkTinw7NWLncpo7M7hg_u_fMn420E0tQTBuUNNiThs@mail.gmail.com>

Hi,

I am not sure about the error you are getting but it would be great if you
could try the preferred method

locking_type = 1
volume_list [ "your-root-vg-name" , "@hostname" ]

rebuild initrd

add the <lvm> and <fs> resources in cluster.conf , start the cman, rgmanager
.


Thanks,

On Thu, Feb 3, 2011 at 8:02 PM, Corey Kovacs <corey.kovacs at gmail.com> wrote:

> Excellent,
>
>
> Thanks
>
> -C
>
> On Thu, Feb 3, 2011 at 10:38 AM, ???? ???? <sklemer at gmail.com> wrote:
> >
> >
> > On Thu, Feb 3, 2011 at 12:35 PM, ???? ???? <sklemer at gmail.com> wrote:
> >>
> >>
> >>
> >> https://access.redhat.com/kb/docs/DOC-3068
> >>
> >> On Thu, Feb 3, 2011 at 11:13 AM, Corey Kovacs <corey.kovacs at gmail.com>
> >> wrote:
> >>>
> >>> Is using ha-lvm with clvmd a new capability? It's always been my
> >>> understanding that the lvm locking type for using ha-lvm had to be set
> >>> to '1'.
> >>>
> >>> I'd much rather be using clvmd if it is the way to go. Can you point
> >>> me to the docs you are seeing these instructions in please?
> >>>
> >>> As for why your config isn't working, clvmd requires that it's
> >>> resources are indeed tagged as cluster volumes, so you might try doing
> >>> that and see how it goes.
> >>>
> >>> -C
> >>>
> >>> On Thu, Feb 3, 2011 at 7:26 AM, ???? ???? <sklemer at gmail.com> wrote:
> >>> > Hello.
> >>> >
> >>> >
> >>> >
> >>> > I followed redhat instruction trying install HA-LVM with clvmd. (
> rhcs
> >>> > 5.6 -
> >>> > rgmanager 2.0.52-9 )
> >>> >
> >>> >
> >>> >
> >>> > I can't make it work.
> >>> >
> >>> >
> >>> >
> >>> > lvm.conf- locking_type=3
> >>> >
> >>> > clvmd work
> >>> >
> >>> > Its failed saying HA-LVM is not configured correctly.
> >>> >
> >>> > The manual said that we should run "lvchange -a n lvxx" edit the
> >>> > cluster.conf & start the service.
> >>> >
> >>> >
> >>> >
> >>> > But From lvm.conf :
> >>> >
> >>> >
> >>> >
> >>> > case $1 in
> >>> >
> >>> > start)
> >>> >
> >>> >         if ! [[ $(vgs -o attr --noheadings $OCF_RESKEY_vg_name) =~
> >>> > .....c
> >>> > ]]; then
> >>> >
> >>> >                 ha_lvm_proper_setup_check || exit 1
> >>> >
> >>> >
> >>> >
> >>> > If the vg is not taged as cluster than the ha_lvm is looking for
> >>> > volume_list
> >>> > in lvm.conf.
> >>> >
> >>> >
> >>> >
> >>> > I am confused- Does the VG should taged as cluster ??  ( BTW - the
> old
> >>> > fashion HA-LVM is worked with no problems )
> >>> >
> >>> > redhat instructions :
> >>> >
> >>> > To set up HA LVM Failover (using the preferred CLVM variant), perform
> >>> > the
> >>> > following steps:
> >>> >
> >>> >
> >>> >
> >>> > 1. Ensure that the parameter locking_type in the global section
> >>> > of /etc/lvm/lvm.conf is set to the value '3', that all the necessary
> >>> > LVM
> >>> > cluster packages are installed, and the necessary daemons are started
> >>> > (like
> >>> > 'clvmd' and the cluster mirror log daemon - if necessary).
> >>> >
> >>> >
> >>> >
> >>> > 2. Create the logical volume and filesystem using standard LVM2 and
> >>> > file
> >>> > system commands. For example:
> >>> >
> >>> > # pvcreate /dev/sd[cde]1
> >>> >
> >>> >  # vgcreate <volume group name> /dev/sd[cde]1
> >>> >
> >>> >  # lvcreate -L 10G -n <logical volume name> <volume group name>
> >>> >
> >>> >  # mkfs.ext3 /dev/<volume group name>/<logical volume name>
> >>> >
> >>> >  # lvchange -an <volume group name>/<logical volume name>
> >>> >
> >>> >
> >>> >
> >>> > 3. Edit /etc/cluster/cluster.conf to include the newly created
> logical
> >>> > volume as a resource in one of your services. Alternatively,
> >>> > configuration
> >>> > tools such as Conga or system-config-cluster may be used to create
> >>> > these
> >>> > entries.  Below is a sample resource manager section
> >>> > from /etc/cluster/cluster.conf:
> >>> >
> >>> >
> >>> >
> >>> > <rm>      <failoverdomains>        <failoverdomain name="FD"
> >>> > ordered="1"
> >>> > restricted="0">           <failoverdomainnode name="neo-01"
> >>> > priority="1"/>
> >>> >           <failoverdomainnode name="neo-02" priority="2"/>
> >>> > </failoverdomain>    </failoverdomains>    <resources>        <lvm
> >>> > name="lvm" vg_name="shared_vg" lv_name="ha-lv"/>        <fs name="FS"
> >>> > device="/dev/shared_vg/ha-lv" force_fsck="0" force_unmount="1"
> >>> > fsid="64050"
> >>> > fstype="ext3" mountpoint="/mnt" options="" self_fence="0"/>
> >>> > </resources>
> >>> >    <service autostart="1" domain="FD" name="serv"
> recovery="relocate">
> >>> >        <lvm ref="lvm"/>        <fs ref="FS"/>    </service> </rm>
> >>> >
> >>> >
> >>> >
> >>> > Regards
> >>> >
> >>> > Shalom.
> >>> >
> >>> >
> >>> >
> >>> > --
> >>> > Linux-cluster mailing list
> >>> > Linux-cluster at redhat.com
> >>> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >>> >
> >>>
> >>> --
> >>> Linux-cluster mailing list
> >>> Linux-cluster at redhat.com
> >>> https://www.redhat.com/mailman/listinfo/linux-cluster
> >
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110204/dff98533/attachment.htm>

From sklemer at gmail.com  Fri Feb  4 15:04:44 2011
From: sklemer at gmail.com (=?UTF-8?B?16nXnNeV150g16fXnNee16g=?=)
Date: Fri, 4 Feb 2011 17:04:44 +0200
Subject: [Linux-cluster] falied to implement HA-LVM with clvmd rhcs5.6
In-Reply-To: <AANLkTinw7NWLncpo7M7hg_u_fMn420E0tQTBuUNNiThs@mail.gmail.com>
References: <AANLkTimBQ6_2PDr=pwQO=Hnb5Xsp9nxM4BXzyXsawag5@mail.gmail.com>
	<AANLkTineaTjMex9oVdNysG3=RBsPg21J3PENoCSmm0aV@mail.gmail.com>
	<AANLkTi=AXnPoC5kpoH=Yq8W3tc71onrcY=E9aHQCU91g@mail.gmail.com>
	<AANLkTik0F+tUWNtW65tDvpmEeLRErdo=eCEnRwmBwM9K@mail.gmail.com>
	<AANLkTimHa+MhiiE0e2+ZzbMnxWyQvwpv=s6geXBsWcVQ@mail.gmail.com>
	<AANLkTinw7NWLncpo7M7hg_u_fMn420E0tQTBuUNNiThs@mail.gmail.com>
Message-ID: <AANLkTikyJuRV30c0872Aoy5ebAsf7mygZAb3xn68px-m@mail.gmail.com>

Thanks.

This is the old methos and its work great but its hard to maintain such
cluster.

Shalom.

On Fri, Feb 4, 2011 at 4:32 PM, Dominic Geevarghese <share2dom at gmail.com>wrote:

>
> Hi,
>
> I am not sure about the error you are getting but it would be great if you
> could try the preferred method
>
> locking_type = 1
> volume_list [ "your-root-vg-name" , "@hostname" ]
>
> rebuild initrd
>
> add the <lvm> and <fs> resources in cluster.conf , start the cman,
> rgmanager .
>
>
> Thanks,
>
> On Thu, Feb 3, 2011 at 8:02 PM, Corey Kovacs <corey.kovacs at gmail.com>wrote:
>
>> Excellent,
>>
>>
>> Thanks
>>
>> -C
>>
>> On Thu, Feb 3, 2011 at 10:38 AM, ???? ???? <sklemer at gmail.com> wrote:
>> >
>> >
>> > On Thu, Feb 3, 2011 at 12:35 PM, ???? ???? <sklemer at gmail.com> wrote:
>> >>
>> >>
>> >>
>> >> https://access.redhat.com/kb/docs/DOC-3068
>> >>
>> >> On Thu, Feb 3, 2011 at 11:13 AM, Corey Kovacs <corey.kovacs at gmail.com>
>> >> wrote:
>> >>>
>> >>> Is using ha-lvm with clvmd a new capability? It's always been my
>> >>> understanding that the lvm locking type for using ha-lvm had to be set
>> >>> to '1'.
>> >>>
>> >>> I'd much rather be using clvmd if it is the way to go. Can you point
>> >>> me to the docs you are seeing these instructions in please?
>> >>>
>> >>> As for why your config isn't working, clvmd requires that it's
>> >>> resources are indeed tagged as cluster volumes, so you might try doing
>> >>> that and see how it goes.
>> >>>
>> >>> -C
>> >>>
>> >>> On Thu, Feb 3, 2011 at 7:26 AM, ???? ???? <sklemer at gmail.com> wrote:
>> >>> > Hello.
>> >>> >
>> >>> >
>> >>> >
>> >>> > I followed redhat instruction trying install HA-LVM with clvmd. (
>> rhcs
>> >>> > 5.6 -
>> >>> > rgmanager 2.0.52-9 )
>> >>> >
>> >>> >
>> >>> >
>> >>> > I can't make it work.
>> >>> >
>> >>> >
>> >>> >
>> >>> > lvm.conf- locking_type=3
>> >>> >
>> >>> > clvmd work
>> >>> >
>> >>> > Its failed saying HA-LVM is not configured correctly.
>> >>> >
>> >>> > The manual said that we should run "lvchange -a n lvxx" edit the
>> >>> > cluster.conf & start the service.
>> >>> >
>> >>> >
>> >>> >
>> >>> > But From lvm.conf :
>> >>> >
>> >>> >
>> >>> >
>> >>> > case $1 in
>> >>> >
>> >>> > start)
>> >>> >
>> >>> >         if ! [[ $(vgs -o attr --noheadings $OCF_RESKEY_vg_name) =~
>> >>> > .....c
>> >>> > ]]; then
>> >>> >
>> >>> >                 ha_lvm_proper_setup_check || exit 1
>> >>> >
>> >>> >
>> >>> >
>> >>> > If the vg is not taged as cluster than the ha_lvm is looking for
>> >>> > volume_list
>> >>> > in lvm.conf.
>> >>> >
>> >>> >
>> >>> >
>> >>> > I am confused- Does the VG should taged as cluster ??  ( BTW - the
>> old
>> >>> > fashion HA-LVM is worked with no problems )
>> >>> >
>> >>> > redhat instructions :
>> >>> >
>> >>> > To set up HA LVM Failover (using the preferred CLVM variant),
>> perform
>> >>> > the
>> >>> > following steps:
>> >>> >
>> >>> >
>> >>> >
>> >>> > 1. Ensure that the parameter locking_type in the global section
>> >>> > of /etc/lvm/lvm.conf is set to the value '3', that all the necessary
>> >>> > LVM
>> >>> > cluster packages are installed, and the necessary daemons are
>> started
>> >>> > (like
>> >>> > 'clvmd' and the cluster mirror log daemon - if necessary).
>> >>> >
>> >>> >
>> >>> >
>> >>> > 2. Create the logical volume and filesystem using standard LVM2 and
>> >>> > file
>> >>> > system commands. For example:
>> >>> >
>> >>> > # pvcreate /dev/sd[cde]1
>> >>> >
>> >>> >  # vgcreate <volume group name> /dev/sd[cde]1
>> >>> >
>> >>> >  # lvcreate -L 10G -n <logical volume name> <volume group name>
>> >>> >
>> >>> >  # mkfs.ext3 /dev/<volume group name>/<logical volume name>
>> >>> >
>> >>> >  # lvchange -an <volume group name>/<logical volume name>
>> >>> >
>> >>> >
>> >>> >
>> >>> > 3. Edit /etc/cluster/cluster.conf to include the newly created
>> logical
>> >>> > volume as a resource in one of your services. Alternatively,
>> >>> > configuration
>> >>> > tools such as Conga or system-config-cluster may be used to create
>> >>> > these
>> >>> > entries.  Below is a sample resource manager section
>> >>> > from /etc/cluster/cluster.conf:
>> >>> >
>> >>> >
>> >>> >
>> >>> > <rm>      <failoverdomains>        <failoverdomain name="FD"
>> >>> > ordered="1"
>> >>> > restricted="0">           <failoverdomainnode name="neo-01"
>> >>> > priority="1"/>
>> >>> >           <failoverdomainnode name="neo-02" priority="2"/>
>> >>> > </failoverdomain>    </failoverdomains>    <resources>        <lvm
>> >>> > name="lvm" vg_name="shared_vg" lv_name="ha-lv"/>        <fs
>> name="FS"
>> >>> > device="/dev/shared_vg/ha-lv" force_fsck="0" force_unmount="1"
>> >>> > fsid="64050"
>> >>> > fstype="ext3" mountpoint="/mnt" options="" self_fence="0"/>
>> >>> > </resources>
>> >>> >    <service autostart="1" domain="FD" name="serv"
>> recovery="relocate">
>> >>> >        <lvm ref="lvm"/>        <fs ref="FS"/>    </service> </rm>
>> >>> >
>> >>> >
>> >>> >
>> >>> > Regards
>> >>> >
>> >>> > Shalom.
>> >>> >
>> >>> >
>> >>> >
>> >>> > --
>> >>> > Linux-cluster mailing list
>> >>> > Linux-cluster at redhat.com
>> >>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>> >>> >
>> >>>
>> >>> --
>> >>> Linux-cluster mailing list
>> >>> Linux-cluster at redhat.com
>> >>> https://www.redhat.com/mailman/listinfo/linux-cluster
>> >
>> >
>> > --
>> > Linux-cluster mailing list
>> > Linux-cluster at redhat.com
>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>> >
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110204/12b67693/attachment.htm>

From share2dom at gmail.com  Fri Feb  4 15:23:02 2011
From: share2dom at gmail.com (dOminic)
Date: Fri, 4 Feb 2011 20:53:02 +0530
Subject: [Linux-cluster] falied to implement HA-LVM with clvmd rhcs5.6
In-Reply-To: <AANLkTikyJuRV30c0872Aoy5ebAsf7mygZAb3xn68px-m@mail.gmail.com>
References: <AANLkTimBQ6_2PDr=pwQO=Hnb5Xsp9nxM4BXzyXsawag5@mail.gmail.com>
	<AANLkTineaTjMex9oVdNysG3=RBsPg21J3PENoCSmm0aV@mail.gmail.com>
	<AANLkTi=AXnPoC5kpoH=Yq8W3tc71onrcY=E9aHQCU91g@mail.gmail.com>
	<AANLkTik0F+tUWNtW65tDvpmEeLRErdo=eCEnRwmBwM9K@mail.gmail.com>
	<AANLkTimHa+MhiiE0e2+ZzbMnxWyQvwpv=s6geXBsWcVQ@mail.gmail.com>
	<AANLkTinw7NWLncpo7M7hg_u_fMn420E0tQTBuUNNiThs@mail.gmail.com>
	<AANLkTikyJuRV30c0872Aoy5ebAsf7mygZAb3xn68px-m@mail.gmail.com>
Message-ID: <AANLkTi=fWQu_LzdaDOOZGePJghKdkvfHK7FkH9PJD+Sz@mail.gmail.com>

Hi Shalom,

Are you still facing problem implementing HA-LVM with locking_type = 3
setting ?. If yes, it would be great if you could provide the following
details .
So that others can also check

* steps you are following along with complete output
* status in "clustat" after making changes in cluster.conf
* attach cluster.conf and /var/log/messages.

dominic

On Fri, Feb 4, 2011 at 8:34 PM, ???? ???? <sklemer at gmail.com> wrote:

> Thanks.
>
> This is the old methos and its work great but its hard to maintain such
> cluster.
>
> Shalom.
>
>
> On Fri, Feb 4, 2011 at 4:32 PM, Dominic Geevarghese <share2dom at gmail.com>wrote:
>
>>
>> Hi,
>>
>> I am not sure about the error you are getting but it would be great if you
>> could try the preferred method
>>
>> locking_type = 1
>> volume_list [ "your-root-vg-name" , "@hostname" ]
>>
>> rebuild initrd
>>
>> add the <lvm> and <fs> resources in cluster.conf , start the cman,
>> rgmanager .
>>
>>
>> Thanks,
>>
>> On Thu, Feb 3, 2011 at 8:02 PM, Corey Kovacs <corey.kovacs at gmail.com>wrote:
>>
>>> Excellent,
>>>
>>>
>>> Thanks
>>>
>>> -C
>>>
>>> On Thu, Feb 3, 2011 at 10:38 AM, ???? ???? <sklemer at gmail.com> wrote:
>>> >
>>> >
>>> > On Thu, Feb 3, 2011 at 12:35 PM, ???? ???? <sklemer at gmail.com> wrote:
>>> >>
>>> >>
>>> >>
>>> >> https://access.redhat.com/kb/docs/DOC-3068
>>> >>
>>> >> On Thu, Feb 3, 2011 at 11:13 AM, Corey Kovacs <corey.kovacs at gmail.com
>>> >
>>> >> wrote:
>>> >>>
>>> >>> Is using ha-lvm with clvmd a new capability? It's always been my
>>> >>> understanding that the lvm locking type for using ha-lvm had to be
>>> set
>>> >>> to '1'.
>>> >>>
>>> >>> I'd much rather be using clvmd if it is the way to go. Can you point
>>> >>> me to the docs you are seeing these instructions in please?
>>> >>>
>>> >>> As for why your config isn't working, clvmd requires that it's
>>> >>> resources are indeed tagged as cluster volumes, so you might try
>>> doing
>>> >>> that and see how it goes.
>>> >>>
>>> >>> -C
>>> >>>
>>> >>> On Thu, Feb 3, 2011 at 7:26 AM, ???? ???? <sklemer at gmail.com> wrote:
>>> >>> > Hello.
>>> >>> >
>>> >>> >
>>> >>> >
>>> >>> > I followed redhat instruction trying install HA-LVM with clvmd. (
>>> rhcs
>>> >>> > 5.6 -
>>> >>> > rgmanager 2.0.52-9 )
>>> >>> >
>>> >>> >
>>> >>> >
>>> >>> > I can't make it work.
>>> >>> >
>>> >>> >
>>> >>> >
>>> >>> > lvm.conf- locking_type=3
>>> >>> >
>>> >>> > clvmd work
>>> >>> >
>>> >>> > Its failed saying HA-LVM is not configured correctly.
>>> >>> >
>>> >>> > The manual said that we should run "lvchange -a n lvxx" edit the
>>> >>> > cluster.conf & start the service.
>>> >>> >
>>> >>> >
>>> >>> >
>>> >>> > But From lvm.conf :
>>> >>> >
>>> >>> >
>>> >>> >
>>> >>> > case $1 in
>>> >>> >
>>> >>> > start)
>>> >>> >
>>> >>> >         if ! [[ $(vgs -o attr --noheadings $OCF_RESKEY_vg_name) =~
>>> >>> > .....c
>>> >>> > ]]; then
>>> >>> >
>>> >>> >                 ha_lvm_proper_setup_check || exit 1
>>> >>> >
>>> >>> >
>>> >>> >
>>> >>> > If the vg is not taged as cluster than the ha_lvm is looking for
>>> >>> > volume_list
>>> >>> > in lvm.conf.
>>> >>> >
>>> >>> >
>>> >>> >
>>> >>> > I am confused- Does the VG should taged as cluster ??  ( BTW - the
>>> old
>>> >>> > fashion HA-LVM is worked with no problems )
>>> >>> >
>>> >>> > redhat instructions :
>>> >>> >
>>> >>> > To set up HA LVM Failover (using the preferred CLVM variant),
>>> perform
>>> >>> > the
>>> >>> > following steps:
>>> >>> >
>>> >>> >
>>> >>> >
>>> >>> > 1. Ensure that the parameter locking_type in the global section
>>> >>> > of /etc/lvm/lvm.conf is set to the value '3', that all the
>>> necessary
>>> >>> > LVM
>>> >>> > cluster packages are installed, and the necessary daemons are
>>> started
>>> >>> > (like
>>> >>> > 'clvmd' and the cluster mirror log daemon - if necessary).
>>> >>> >
>>> >>> >
>>> >>> >
>>> >>> > 2. Create the logical volume and filesystem using standard LVM2 and
>>> >>> > file
>>> >>> > system commands. For example:
>>> >>> >
>>> >>> > # pvcreate /dev/sd[cde]1
>>> >>> >
>>> >>> >  # vgcreate <volume group name> /dev/sd[cde]1
>>> >>> >
>>> >>> >  # lvcreate -L 10G -n <logical volume name> <volume group name>
>>> >>> >
>>> >>> >  # mkfs.ext3 /dev/<volume group name>/<logical volume name>
>>> >>> >
>>> >>> >  # lvchange -an <volume group name>/<logical volume name>
>>> >>> >
>>> >>> >
>>> >>> >
>>> >>> > 3. Edit /etc/cluster/cluster.conf to include the newly created
>>> logical
>>> >>> > volume as a resource in one of your services. Alternatively,
>>> >>> > configuration
>>> >>> > tools such as Conga or system-config-cluster may be used to create
>>> >>> > these
>>> >>> > entries.  Below is a sample resource manager section
>>> >>> > from /etc/cluster/cluster.conf:
>>> >>> >
>>> >>> >
>>> >>> >
>>> >>> > <rm>      <failoverdomains>        <failoverdomain name="FD"
>>> >>> > ordered="1"
>>> >>> > restricted="0">           <failoverdomainnode name="neo-01"
>>> >>> > priority="1"/>
>>> >>> >           <failoverdomainnode name="neo-02" priority="2"/>
>>> >>> > </failoverdomain>    </failoverdomains>    <resources>        <lvm
>>> >>> > name="lvm" vg_name="shared_vg" lv_name="ha-lv"/>        <fs
>>> name="FS"
>>> >>> > device="/dev/shared_vg/ha-lv" force_fsck="0" force_unmount="1"
>>> >>> > fsid="64050"
>>> >>> > fstype="ext3" mountpoint="/mnt" options="" self_fence="0"/>
>>> >>> > </resources>
>>> >>> >    <service autostart="1" domain="FD" name="serv"
>>> recovery="relocate">
>>> >>> >        <lvm ref="lvm"/>        <fs ref="FS"/>    </service> </rm>
>>> >>> >
>>> >>> >
>>> >>> >
>>> >>> > Regards
>>> >>> >
>>> >>> > Shalom.
>>> >>> >
>>> >>> >
>>> >>> >
>>> >>> > --
>>> >>> > Linux-cluster mailing list
>>> >>> > Linux-cluster at redhat.com
>>> >>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>>> >>> >
>>> >>>
>>> >>> --
>>> >>> Linux-cluster mailing list
>>> >>> Linux-cluster at redhat.com
>>> >>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>> >
>>> >
>>> > --
>>> > Linux-cluster mailing list
>>> > Linux-cluster at redhat.com
>>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>>> >
>>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>
>>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110204/ded79353/attachment.htm>

From rhayden.public at gmail.com  Fri Feb  4 18:57:45 2011
From: rhayden.public at gmail.com (Robert Hayden)
Date: Fri, 4 Feb 2011 12:57:45 -0600
Subject: [Linux-cluster] IPv6 Setup with RHCS
Message-ID: <AANLkTi=cxCaYZR=LvdQd_dOw5ea3HPRnxNSpJocEc2t4@mail.gmail.com>

I have searched for a concrete example of RHCS in a pure IPv6 environment,
but I have only found references that IPv6 is supported.

Does anyone have experience with setting up RHCS with IPv6 that they would
be willing to share?  Any good, technical papers out there?  In particular,
I would like to stay in the RHEL 5.x release, but would consider RHEL 6
options as well.  I am wanting to protect a simple, custom application that
requires a floating IPv6 IP address resource.

Thanks
Robert
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110204/bdfbe667/attachment.htm>

From shea.benn at gmail.com  Fri Feb  4 19:07:03 2011
From: shea.benn at gmail.com (Shea Bennett)
Date: Fri, 4 Feb 2011 14:07:03 -0500
Subject: [Linux-cluster] IPv6 Setup with RHCS
In-Reply-To: <AANLkTi=cxCaYZR=LvdQd_dOw5ea3HPRnxNSpJocEc2t4@mail.gmail.com>
References: <AANLkTi=cxCaYZR=LvdQd_dOw5ea3HPRnxNSpJocEc2t4@mail.gmail.com>
Message-ID: <AANLkTimQSuFTcmbJO7EC5WTx5W=Jo7d6Gahqm+qf54=u@mail.gmail.com>

Robert,

I am searching for the same thing and haven't found anything definitive on
http://docs.redhat.com. I have a support call to RedHat setup for Monday. I
will update with what I find out.

Shea

On Fri, Feb 4, 2011 at 13:57, Robert Hayden <rhayden.public at gmail.com>wrote:

> I have searched for a concrete example of RHCS in a pure IPv6 environment,
> but I have only found references that IPv6 is supported.
>
> Does anyone have experience with setting up RHCS with IPv6 that they would
> be willing to share?  Any good, technical papers out there?  In particular,
> I would like to stay in the RHEL 5.x release, but would consider RHEL 6
> options as well.  I am wanting to protect a simple, custom application that
> requires a floating IPv6 IP address resource.
>
> Thanks
> Robert
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>



-- 

*P** Please consider the environment before printing this e-mail*

This e-mail message and all documents that accompany it may contain
privileged or confidential information, and are intended only for the use of
the individual or entity to which addressed. Any unauthorized disclosure or
distribution of this e-mail message is prohibited. If you have received this
e-mail message in error, please notify me immediately. Thank you.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110204/94cc94c7/attachment.htm>

From sklemer at gmail.com  Fri Feb  4 22:07:36 2011
From: sklemer at gmail.com (=?UTF-8?B?16nXnNeV150g16fXnNee16g=?=)
Date: Sat, 5 Feb 2011 00:07:36 +0200
Subject: [Linux-cluster] falied to implement HA-LVM with clvmd rhcs5.6
In-Reply-To: <AANLkTi=fWQu_LzdaDOOZGePJghKdkvfHK7FkH9PJD+Sz@mail.gmail.com>
References: <AANLkTimBQ6_2PDr=pwQO=Hnb5Xsp9nxM4BXzyXsawag5@mail.gmail.com>
	<AANLkTineaTjMex9oVdNysG3=RBsPg21J3PENoCSmm0aV@mail.gmail.com>
	<AANLkTi=AXnPoC5kpoH=Yq8W3tc71onrcY=E9aHQCU91g@mail.gmail.com>
	<AANLkTik0F+tUWNtW65tDvpmEeLRErdo=eCEnRwmBwM9K@mail.gmail.com>
	<AANLkTimHa+MhiiE0e2+ZzbMnxWyQvwpv=s6geXBsWcVQ@mail.gmail.com>
	<AANLkTinw7NWLncpo7M7hg_u_fMn420E0tQTBuUNNiThs@mail.gmail.com>
	<AANLkTikyJuRV30c0872Aoy5ebAsf7mygZAb3xn68px-m@mail.gmail.com>
	<AANLkTi=fWQu_LzdaDOOZGePJghKdkvfHK7FkH9PJD+Sz@mail.gmail.com>
Message-ID: <AANLkTikB7drAprGsccUCoh3HeJjPLwj4mYbe_3deJHr=@mail.gmail.com>

Hello Dominic.

I will be in lab on monday and collect all steps & logs files.

btw - Is it  redhat recommendation to preferre  using  HA-LVM with the
locking_type=1 method ??


Regards

Shalom

On Fri, Feb 4, 2011 at 5:23 PM, dOminic <share2dom at gmail.com> wrote:

> Hi Shalom,
>
> Are you still facing problem implementing HA-LVM with locking_type = 3
> setting ?. If yes, it would be great if you could provide the following
> details .
> So that others can also check
>
> * steps you are following along with complete output
> * status in "clustat" after making changes in cluster.conf
> * attach cluster.conf and /var/log/messages.
>
> dominic
>
> On Fri, Feb 4, 2011 at 8:34 PM, ???? ???? <sklemer at gmail.com> wrote:
>
>> Thanks.
>>
>> This is the old methos and its work great but its hard to maintain such
>> cluster.
>>
>> Shalom.
>>
>>
>> On Fri, Feb 4, 2011 at 4:32 PM, Dominic Geevarghese <share2dom at gmail.com>wrote:
>>
>>>
>>> Hi,
>>>
>>> I am not sure about the error you are getting but it would be great if
>>> you could try the preferred method
>>>
>>> locking_type = 1
>>> volume_list [ "your-root-vg-name" , "@hostname" ]
>>>
>>> rebuild initrd
>>>
>>> add the <lvm> and <fs> resources in cluster.conf , start the cman,
>>> rgmanager .
>>>
>>>
>>> Thanks,
>>>
>>> On Thu, Feb 3, 2011 at 8:02 PM, Corey Kovacs <corey.kovacs at gmail.com>wrote:
>>>
>>>> Excellent,
>>>>
>>>>
>>>> Thanks
>>>>
>>>> -C
>>>>
>>>> On Thu, Feb 3, 2011 at 10:38 AM, ???? ???? <sklemer at gmail.com> wrote:
>>>> >
>>>> >
>>>> > On Thu, Feb 3, 2011 at 12:35 PM, ???? ???? <sklemer at gmail.com> wrote:
>>>> >>
>>>> >>
>>>> >>
>>>> >> https://access.redhat.com/kb/docs/DOC-3068
>>>> >>
>>>> >> On Thu, Feb 3, 2011 at 11:13 AM, Corey Kovacs <
>>>> corey.kovacs at gmail.com>
>>>> >> wrote:
>>>> >>>
>>>> >>> Is using ha-lvm with clvmd a new capability? It's always been my
>>>> >>> understanding that the lvm locking type for using ha-lvm had to be
>>>> set
>>>> >>> to '1'.
>>>> >>>
>>>> >>> I'd much rather be using clvmd if it is the way to go. Can you point
>>>> >>> me to the docs you are seeing these instructions in please?
>>>> >>>
>>>> >>> As for why your config isn't working, clvmd requires that it's
>>>> >>> resources are indeed tagged as cluster volumes, so you might try
>>>> doing
>>>> >>> that and see how it goes.
>>>> >>>
>>>> >>> -C
>>>> >>>
>>>> >>> On Thu, Feb 3, 2011 at 7:26 AM, ???? ???? <sklemer at gmail.com>
>>>> wrote:
>>>> >>> > Hello.
>>>> >>> >
>>>> >>> >
>>>> >>> >
>>>> >>> > I followed redhat instruction trying install HA-LVM with clvmd. (
>>>> rhcs
>>>> >>> > 5.6 -
>>>> >>> > rgmanager 2.0.52-9 )
>>>> >>> >
>>>> >>> >
>>>> >>> >
>>>> >>> > I can't make it work.
>>>> >>> >
>>>> >>> >
>>>> >>> >
>>>> >>> > lvm.conf- locking_type=3
>>>> >>> >
>>>> >>> > clvmd work
>>>> >>> >
>>>> >>> > Its failed saying HA-LVM is not configured correctly.
>>>> >>> >
>>>> >>> > The manual said that we should run "lvchange -a n lvxx" edit the
>>>> >>> > cluster.conf & start the service.
>>>> >>> >
>>>> >>> >
>>>> >>> >
>>>> >>> > But From lvm.conf :
>>>> >>> >
>>>> >>> >
>>>> >>> >
>>>> >>> > case $1 in
>>>> >>> >
>>>> >>> > start)
>>>> >>> >
>>>> >>> >         if ! [[ $(vgs -o attr --noheadings $OCF_RESKEY_vg_name) =~
>>>> >>> > .....c
>>>> >>> > ]]; then
>>>> >>> >
>>>> >>> >                 ha_lvm_proper_setup_check || exit 1
>>>> >>> >
>>>> >>> >
>>>> >>> >
>>>> >>> > If the vg is not taged as cluster than the ha_lvm is looking for
>>>> >>> > volume_list
>>>> >>> > in lvm.conf.
>>>> >>> >
>>>> >>> >
>>>> >>> >
>>>> >>> > I am confused- Does the VG should taged as cluster ??  ( BTW - the
>>>> old
>>>> >>> > fashion HA-LVM is worked with no problems )
>>>> >>> >
>>>> >>> > redhat instructions :
>>>> >>> >
>>>> >>> > To set up HA LVM Failover (using the preferred CLVM variant),
>>>> perform
>>>> >>> > the
>>>> >>> > following steps:
>>>> >>> >
>>>> >>> >
>>>> >>> >
>>>> >>> > 1. Ensure that the parameter locking_type in the global section
>>>> >>> > of /etc/lvm/lvm.conf is set to the value '3', that all the
>>>> necessary
>>>> >>> > LVM
>>>> >>> > cluster packages are installed, and the necessary daemons are
>>>> started
>>>> >>> > (like
>>>> >>> > 'clvmd' and the cluster mirror log daemon - if necessary).
>>>> >>> >
>>>> >>> >
>>>> >>> >
>>>> >>> > 2. Create the logical volume and filesystem using standard LVM2
>>>> and
>>>> >>> > file
>>>> >>> > system commands. For example:
>>>> >>> >
>>>> >>> > # pvcreate /dev/sd[cde]1
>>>> >>> >
>>>> >>> >  # vgcreate <volume group name> /dev/sd[cde]1
>>>> >>> >
>>>> >>> >  # lvcreate -L 10G -n <logical volume name> <volume group name>
>>>> >>> >
>>>> >>> >  # mkfs.ext3 /dev/<volume group name>/<logical volume name>
>>>> >>> >
>>>> >>> >  # lvchange -an <volume group name>/<logical volume name>
>>>> >>> >
>>>> >>> >
>>>> >>> >
>>>> >>> > 3. Edit /etc/cluster/cluster.conf to include the newly created
>>>> logical
>>>> >>> > volume as a resource in one of your services. Alternatively,
>>>> >>> > configuration
>>>> >>> > tools such as Conga or system-config-cluster may be used to create
>>>> >>> > these
>>>> >>> > entries.  Below is a sample resource manager section
>>>> >>> > from /etc/cluster/cluster.conf:
>>>> >>> >
>>>> >>> >
>>>> >>> >
>>>> >>> > <rm>      <failoverdomains>        <failoverdomain name="FD"
>>>> >>> > ordered="1"
>>>> >>> > restricted="0">           <failoverdomainnode name="neo-01"
>>>> >>> > priority="1"/>
>>>> >>> >           <failoverdomainnode name="neo-02" priority="2"/>
>>>> >>> > </failoverdomain>    </failoverdomains>    <resources>        <lvm
>>>> >>> > name="lvm" vg_name="shared_vg" lv_name="ha-lv"/>        <fs
>>>> name="FS"
>>>> >>> > device="/dev/shared_vg/ha-lv" force_fsck="0" force_unmount="1"
>>>> >>> > fsid="64050"
>>>> >>> > fstype="ext3" mountpoint="/mnt" options="" self_fence="0"/>
>>>> >>> > </resources>
>>>> >>> >    <service autostart="1" domain="FD" name="serv"
>>>> recovery="relocate">
>>>> >>> >        <lvm ref="lvm"/>        <fs ref="FS"/>    </service> </rm>
>>>> >>> >
>>>> >>> >
>>>> >>> >
>>>> >>> > Regards
>>>> >>> >
>>>> >>> > Shalom.
>>>> >>> >
>>>> >>> >
>>>> >>> >
>>>> >>> > --
>>>> >>> > Linux-cluster mailing list
>>>> >>> > Linux-cluster at redhat.com
>>>> >>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>>>> >>> >
>>>> >>>
>>>> >>> --
>>>> >>> Linux-cluster mailing list
>>>> >>> Linux-cluster at redhat.com
>>>> >>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>> >
>>>> >
>>>> > --
>>>> > Linux-cluster mailing list
>>>> > Linux-cluster at redhat.com
>>>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>>>> >
>>>>
>>>> --
>>>> Linux-cluster mailing list
>>>> Linux-cluster at redhat.com
>>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>
>>>
>>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>
>>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110205/b8938cbf/attachment.htm>

From fdinitto at redhat.com  Sat Feb  5 06:55:55 2011
From: fdinitto at redhat.com (Fabio M. Di Nitto)
Date: Sat, 05 Feb 2011 07:55:55 +0100
Subject: [Linux-cluster] IPv6 Setup with RHCS
In-Reply-To: <AANLkTi=cxCaYZR=LvdQd_dOw5ea3HPRnxNSpJocEc2t4@mail.gmail.com>
References: <AANLkTi=cxCaYZR=LvdQd_dOw5ea3HPRnxNSpJocEc2t4@mail.gmail.com>
Message-ID: <4D4CF47B.5060103@redhat.com>

Hi Robert

On 02/04/2011 07:57 PM, Robert Hayden wrote:
> I have searched for a concrete example of RHCS in a pure IPv6
> environment, but I have only found references that IPv6 is supported.
> 
> Does anyone have experience with setting up RHCS with IPv6 that they
> would be willing to share?

Yes, I use IPv6 for testing RHCS before each release.

The real question is: do you want to use IPv6 for cluster
heartbeat/backend and/or do you want RHCS to drive for example IPv6
virtual IPs?

>  Any good, technical papers out there?  In
> particular, I would like to stay in the RHEL 5.x release, but would
> consider RHEL 6 options as well.  I am wanting to protect a simple,
> custom application that requires a floating IPv6 IP address resource.

VIP should work just fine in both, but I only tested RHEL6 deeply with
IPv6 (both backend and VIP).

Keep in mind that any application you want to use (that being custom
made or any other resource) must be IPv6 aware. RHCS will only manage
the VIP for you and will not make your application IPv6 compliant (a
common thing people ask, while it sounds obvious to many, it is source
of confusion to many more).

In terms of configuration, it is really no different than an IPv4 VIP.

Simply replace the ipv4 address (or add one.. you get the idea) with an
IPv6 address.

Same requirements apply too.

The IPv4 VIP requires one node interface to have an IPv4 on the same
network. This is no different for IPv6. That means, before you start
setting up IPv6 in RHCS, make sure that IPv6 is configured and working
on the host system otherwise you will spend endless time debugging.

Fabio



From sklemer at gmail.com  Sat Feb  5 07:37:37 2011
From: sklemer at gmail.com (=?UTF-8?B?16nXnNeV150g16fXnNee16g=?=)
Date: Sat, 5 Feb 2011 09:37:37 +0200
Subject: [Linux-cluster] falied to implement HA-LVM with clvmd rhcs5.6
In-Reply-To: <AANLkTikB7drAprGsccUCoh3HeJjPLwj4mYbe_3deJHr=@mail.gmail.com>
References: <AANLkTimBQ6_2PDr=pwQO=Hnb5Xsp9nxM4BXzyXsawag5@mail.gmail.com>
	<AANLkTineaTjMex9oVdNysG3=RBsPg21J3PENoCSmm0aV@mail.gmail.com>
	<AANLkTi=AXnPoC5kpoH=Yq8W3tc71onrcY=E9aHQCU91g@mail.gmail.com>
	<AANLkTik0F+tUWNtW65tDvpmEeLRErdo=eCEnRwmBwM9K@mail.gmail.com>
	<AANLkTimHa+MhiiE0e2+ZzbMnxWyQvwpv=s6geXBsWcVQ@mail.gmail.com>
	<AANLkTinw7NWLncpo7M7hg_u_fMn420E0tQTBuUNNiThs@mail.gmail.com>
	<AANLkTikyJuRV30c0872Aoy5ebAsf7mygZAb3xn68px-m@mail.gmail.com>
	<AANLkTi=fWQu_LzdaDOOZGePJghKdkvfHK7FkH9PJD+Sz@mail.gmail.com>
	<AANLkTikB7drAprGsccUCoh3HeJjPLwj4mYbe_3deJHr=@mail.gmail.com>
Message-ID: <AANLkTincL6Y=fxZ1NAfr1jhQjvWwW8K_n_JbaFbYOtL6@mail.gmail.com>

Hi.

 After reading the manual again ,I think i know  what was my setup problem.

The VGs should be taged as cluster , ( this action will path the lvm.sh
check ) & the LVs should be deactivated.

The cluster will activate the LVs as exclusive. (i will check it on monday
).

Shalom.


On Sat, Feb 5, 2011 at 12:07 AM, ???? ???? <sklemer at gmail.com> wrote:

> Hello Dominic.
>
> I will be in lab on monday and collect all steps & logs files.
>
> btw - Is it  redhat recommendation to preferre  using  HA-LVM with the
> locking_type=1 method ??
>
>
> Regards
>
> Shalom
>
>
> On Fri, Feb 4, 2011 at 5:23 PM, dOminic <share2dom at gmail.com> wrote:
>
>> Hi Shalom,
>>
>> Are you still facing problem implementing HA-LVM with locking_type = 3
>> setting ?. If yes, it would be great if you could provide the following
>> details .
>> So that others can also check
>>
>> * steps you are following along with complete output
>> * status in "clustat" after making changes in cluster.conf
>> * attach cluster.conf and /var/log/messages.
>>
>> dominic
>>
>> On Fri, Feb 4, 2011 at 8:34 PM, ???? ???? <sklemer at gmail.com> wrote:
>>
>>> Thanks.
>>>
>>> This is the old methos and its work great but its hard to maintain such
>>> cluster.
>>>
>>> Shalom.
>>>
>>>
>>> On Fri, Feb 4, 2011 at 4:32 PM, Dominic Geevarghese <share2dom at gmail.com
>>> > wrote:
>>>
>>>>
>>>> Hi,
>>>>
>>>> I am not sure about the error you are getting but it would be great if
>>>> you could try the preferred method
>>>>
>>>> locking_type = 1
>>>> volume_list [ "your-root-vg-name" , "@hostname" ]
>>>>
>>>> rebuild initrd
>>>>
>>>> add the <lvm> and <fs> resources in cluster.conf , start the cman,
>>>> rgmanager .
>>>>
>>>>
>>>> Thanks,
>>>>
>>>> On Thu, Feb 3, 2011 at 8:02 PM, Corey Kovacs <corey.kovacs at gmail.com>wrote:
>>>>
>>>>> Excellent,
>>>>>
>>>>>
>>>>> Thanks
>>>>>
>>>>> -C
>>>>>
>>>>> On Thu, Feb 3, 2011 at 10:38 AM, ???? ???? <sklemer at gmail.com> wrote:
>>>>> >
>>>>> >
>>>>> > On Thu, Feb 3, 2011 at 12:35 PM, ???? ???? <sklemer at gmail.com>
>>>>> wrote:
>>>>> >>
>>>>> >>
>>>>> >>
>>>>> >> https://access.redhat.com/kb/docs/DOC-3068
>>>>> >>
>>>>> >> On Thu, Feb 3, 2011 at 11:13 AM, Corey Kovacs <
>>>>> corey.kovacs at gmail.com>
>>>>> >> wrote:
>>>>> >>>
>>>>> >>> Is using ha-lvm with clvmd a new capability? It's always been my
>>>>> >>> understanding that the lvm locking type for using ha-lvm had to be
>>>>> set
>>>>> >>> to '1'.
>>>>> >>>
>>>>> >>> I'd much rather be using clvmd if it is the way to go. Can you
>>>>> point
>>>>> >>> me to the docs you are seeing these instructions in please?
>>>>> >>>
>>>>> >>> As for why your config isn't working, clvmd requires that it's
>>>>> >>> resources are indeed tagged as cluster volumes, so you might try
>>>>> doing
>>>>> >>> that and see how it goes.
>>>>> >>>
>>>>> >>> -C
>>>>> >>>
>>>>> >>> On Thu, Feb 3, 2011 at 7:26 AM, ???? ???? <sklemer at gmail.com>
>>>>> wrote:
>>>>> >>> > Hello.
>>>>> >>> >
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > I followed redhat instruction trying install HA-LVM with clvmd. (
>>>>> rhcs
>>>>> >>> > 5.6 -
>>>>> >>> > rgmanager 2.0.52-9 )
>>>>> >>> >
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > I can't make it work.
>>>>> >>> >
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > lvm.conf- locking_type=3
>>>>> >>> >
>>>>> >>> > clvmd work
>>>>> >>> >
>>>>> >>> > Its failed saying HA-LVM is not configured correctly.
>>>>> >>> >
>>>>> >>> > The manual said that we should run "lvchange -a n lvxx" edit the
>>>>> >>> > cluster.conf & start the service.
>>>>> >>> >
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > But From lvm.conf :
>>>>> >>> >
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > case $1 in
>>>>> >>> >
>>>>> >>> > start)
>>>>> >>> >
>>>>> >>> >         if ! [[ $(vgs -o attr --noheadings $OCF_RESKEY_vg_name)
>>>>> =~
>>>>> >>> > .....c
>>>>> >>> > ]]; then
>>>>> >>> >
>>>>> >>> >                 ha_lvm_proper_setup_check || exit 1
>>>>> >>> >
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > If the vg is not taged as cluster than the ha_lvm is looking for
>>>>> >>> > volume_list
>>>>> >>> > in lvm.conf.
>>>>> >>> >
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > I am confused- Does the VG should taged as cluster ??  ( BTW -
>>>>> the old
>>>>> >>> > fashion HA-LVM is worked with no problems )
>>>>> >>> >
>>>>> >>> > redhat instructions :
>>>>> >>> >
>>>>> >>> > To set up HA LVM Failover (using the preferred CLVM variant),
>>>>> perform
>>>>> >>> > the
>>>>> >>> > following steps:
>>>>> >>> >
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > 1. Ensure that the parameter locking_type in the global section
>>>>> >>> > of /etc/lvm/lvm.conf is set to the value '3', that all the
>>>>> necessary
>>>>> >>> > LVM
>>>>> >>> > cluster packages are installed, and the necessary daemons are
>>>>> started
>>>>> >>> > (like
>>>>> >>> > 'clvmd' and the cluster mirror log daemon - if necessary).
>>>>> >>> >
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > 2. Create the logical volume and filesystem using standard LVM2
>>>>> and
>>>>> >>> > file
>>>>> >>> > system commands. For example:
>>>>> >>> >
>>>>> >>> > # pvcreate /dev/sd[cde]1
>>>>> >>> >
>>>>> >>> >  # vgcreate <volume group name> /dev/sd[cde]1
>>>>> >>> >
>>>>> >>> >  # lvcreate -L 10G -n <logical volume name> <volume group name>
>>>>> >>> >
>>>>> >>> >  # mkfs.ext3 /dev/<volume group name>/<logical volume name>
>>>>> >>> >
>>>>> >>> >  # lvchange -an <volume group name>/<logical volume name>
>>>>> >>> >
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > 3. Edit /etc/cluster/cluster.conf to include the newly created
>>>>> logical
>>>>> >>> > volume as a resource in one of your services. Alternatively,
>>>>> >>> > configuration
>>>>> >>> > tools such as Conga or system-config-cluster may be used to
>>>>> create
>>>>> >>> > these
>>>>> >>> > entries.  Below is a sample resource manager section
>>>>> >>> > from /etc/cluster/cluster.conf:
>>>>> >>> >
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > <rm>      <failoverdomains>        <failoverdomain name="FD"
>>>>> >>> > ordered="1"
>>>>> >>> > restricted="0">           <failoverdomainnode name="neo-01"
>>>>> >>> > priority="1"/>
>>>>> >>> >           <failoverdomainnode name="neo-02" priority="2"/>
>>>>> >>> > </failoverdomain>    </failoverdomains>    <resources>
>>>>> <lvm
>>>>> >>> > name="lvm" vg_name="shared_vg" lv_name="ha-lv"/>        <fs
>>>>> name="FS"
>>>>> >>> > device="/dev/shared_vg/ha-lv" force_fsck="0" force_unmount="1"
>>>>> >>> > fsid="64050"
>>>>> >>> > fstype="ext3" mountpoint="/mnt" options="" self_fence="0"/>
>>>>> >>> > </resources>
>>>>> >>> >    <service autostart="1" domain="FD" name="serv"
>>>>> recovery="relocate">
>>>>> >>> >        <lvm ref="lvm"/>        <fs ref="FS"/>    </service> </rm>
>>>>> >>> >
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > Regards
>>>>> >>> >
>>>>> >>> > Shalom.
>>>>> >>> >
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > --
>>>>> >>> > Linux-cluster mailing list
>>>>> >>> > Linux-cluster at redhat.com
>>>>> >>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>> >>> >
>>>>> >>>
>>>>> >>> --
>>>>> >>> Linux-cluster mailing list
>>>>> >>> Linux-cluster at redhat.com
>>>>> >>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>> >
>>>>> >
>>>>> > --
>>>>> > Linux-cluster mailing list
>>>>> > Linux-cluster at redhat.com
>>>>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>> >
>>>>>
>>>>> --
>>>>> Linux-cluster mailing list
>>>>> Linux-cluster at redhat.com
>>>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>>
>>>>
>>>>
>>>> --
>>>> Linux-cluster mailing list
>>>> Linux-cluster at redhat.com
>>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>
>>>
>>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>
>>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110205/98a61499/attachment.htm>

From share2dom at gmail.com  Sat Feb  5 10:24:24 2011
From: share2dom at gmail.com (dOminic)
Date: Sat, 5 Feb 2011 15:54:24 +0530
Subject: [Linux-cluster] falied to implement HA-LVM with clvmd rhcs5.6
In-Reply-To: <AANLkTincL6Y=fxZ1NAfr1jhQjvWwW8K_n_JbaFbYOtL6@mail.gmail.com>
References: <AANLkTimBQ6_2PDr=pwQO=Hnb5Xsp9nxM4BXzyXsawag5@mail.gmail.com>
	<AANLkTineaTjMex9oVdNysG3=RBsPg21J3PENoCSmm0aV@mail.gmail.com>
	<AANLkTi=AXnPoC5kpoH=Yq8W3tc71onrcY=E9aHQCU91g@mail.gmail.com>
	<AANLkTik0F+tUWNtW65tDvpmEeLRErdo=eCEnRwmBwM9K@mail.gmail.com>
	<AANLkTimHa+MhiiE0e2+ZzbMnxWyQvwpv=s6geXBsWcVQ@mail.gmail.com>
	<AANLkTinw7NWLncpo7M7hg_u_fMn420E0tQTBuUNNiThs@mail.gmail.com>
	<AANLkTikyJuRV30c0872Aoy5ebAsf7mygZAb3xn68px-m@mail.gmail.com>
	<AANLkTi=fWQu_LzdaDOOZGePJghKdkvfHK7FkH9PJD+Sz@mail.gmail.com>
	<AANLkTikB7drAprGsccUCoh3HeJjPLwj4mYbe_3deJHr=@mail.gmail.com>
	<AANLkTincL6Y=fxZ1NAfr1jhQjvWwW8K_n_JbaFbYOtL6@mail.gmail.com>
Message-ID: <AANLkTimUss1YgG9MGfhKBMHmKnWXO_rR516NkyqEWf-D@mail.gmail.com>

Hi,

If you are you using traditional HA-LVM setup with locking_type = 1 , then
you need to setup tagging / initrd rebuild
Anything went wrong with tagging.... then it won't prevent admin to activate
vg, remove the VG which is activated/used in another node.

# vgchange -ay domvg
  1 logical volume(s) in volume group "domvg" now active
# lvremove /dev/domvg/domlv
Do you really want to remove active logical volume domlv? [y/n]: y
  Logical volume "domlv" successfully removed

If you are using clvmd variant with locking_type = 3 and cluster is
*active/passive* then clvmd won't let you to remove the VG which is
activate/used in another node.

# lvremove /dev/domvg/domlv
Do you really want to remove active clustered logical volume domlv? [y/n]: y
  Error locking on node node1.domtest.com: LV domvg/domlv in use: not
deactivating
  Unable to deactivate logical volume "domlv"

- dominic

On Sat, Feb 5, 2011 at 1:07 PM, ???? ???? <sklemer at gmail.com> wrote:

> Hi.
>
>  After reading the manual again ,I think i know  what was my setup problem.
>
> The VGs should be taged as cluster , ( this action will path the lvm.sh
> check ) & the LVs should be deactivated.
>
> The cluster will activate the LVs as exclusive. (i will check it on monday
> ).
>
> Shalom.
>
>
> On Sat, Feb 5, 2011 at 12:07 AM, ???? ???? <sklemer at gmail.com> wrote:
>
>> Hello Dominic.
>>
>> I will be in lab on monday and collect all steps & logs files.
>>
>> btw - Is it  redhat recommendation to preferre  using  HA-LVM with the
>> locking_type=1 method ??
>>
>>
>> Regards
>>
>> Shalom
>>
>>
>> On Fri, Feb 4, 2011 at 5:23 PM, dOminic <share2dom at gmail.com> wrote:
>>
>>> Hi Shalom,
>>>
>>> Are you still facing problem implementing HA-LVM with locking_type = 3
>>> setting ?. If yes, it would be great if you could provide the following
>>> details .
>>> So that others can also check
>>>
>>> * steps you are following along with complete output
>>> * status in "clustat" after making changes in cluster.conf
>>> * attach cluster.conf and /var/log/messages.
>>>
>>> dominic
>>>
>>> On Fri, Feb 4, 2011 at 8:34 PM, ???? ???? <sklemer at gmail.com> wrote:
>>>
>>>> Thanks.
>>>>
>>>> This is the old methos and its work great but its hard to maintain such
>>>> cluster.
>>>>
>>>> Shalom.
>>>>
>>>>
>>>> On Fri, Feb 4, 2011 at 4:32 PM, Dominic Geevarghese <
>>>> share2dom at gmail.com> wrote:
>>>>
>>>>>
>>>>> Hi,
>>>>>
>>>>> I am not sure about the error you are getting but it would be great if
>>>>> you could try the preferred method
>>>>>
>>>>> locking_type = 1
>>>>> volume_list [ "your-root-vg-name" , "@hostname" ]
>>>>>
>>>>> rebuild initrd
>>>>>
>>>>> add the <lvm> and <fs> resources in cluster.conf , start the cman,
>>>>> rgmanager .
>>>>>
>>>>>
>>>>> Thanks,
>>>>>
>>>>> On Thu, Feb 3, 2011 at 8:02 PM, Corey Kovacs <corey.kovacs at gmail.com>wrote:
>>>>>
>>>>>> Excellent,
>>>>>>
>>>>>>
>>>>>> Thanks
>>>>>>
>>>>>> -C
>>>>>>
>>>>>> On Thu, Feb 3, 2011 at 10:38 AM, ???? ???? <sklemer at gmail.com> wrote:
>>>>>> >
>>>>>> >
>>>>>> > On Thu, Feb 3, 2011 at 12:35 PM, ???? ???? <sklemer at gmail.com>
>>>>>> wrote:
>>>>>> >>
>>>>>> >>
>>>>>> >>
>>>>>> >> https://access.redhat.com/kb/docs/DOC-3068
>>>>>> >>
>>>>>> >> On Thu, Feb 3, 2011 at 11:13 AM, Corey Kovacs <
>>>>>> corey.kovacs at gmail.com>
>>>>>> >> wrote:
>>>>>> >>>
>>>>>> >>> Is using ha-lvm with clvmd a new capability? It's always been my
>>>>>> >>> understanding that the lvm locking type for using ha-lvm had to be
>>>>>> set
>>>>>> >>> to '1'.
>>>>>> >>>
>>>>>> >>> I'd much rather be using clvmd if it is the way to go. Can you
>>>>>> point
>>>>>> >>> me to the docs you are seeing these instructions in please?
>>>>>> >>>
>>>>>> >>> As for why your config isn't working, clvmd requires that it's
>>>>>> >>> resources are indeed tagged as cluster volumes, so you might try
>>>>>> doing
>>>>>> >>> that and see how it goes.
>>>>>> >>>
>>>>>> >>> -C
>>>>>> >>>
>>>>>> >>> On Thu, Feb 3, 2011 at 7:26 AM, ???? ???? <sklemer at gmail.com>
>>>>>> wrote:
>>>>>> >>> > Hello.
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> > I followed redhat instruction trying install HA-LVM with clvmd.
>>>>>> ( rhcs
>>>>>> >>> > 5.6 -
>>>>>> >>> > rgmanager 2.0.52-9 )
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> > I can't make it work.
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> > lvm.conf- locking_type=3
>>>>>> >>> >
>>>>>> >>> > clvmd work
>>>>>> >>> >
>>>>>> >>> > Its failed saying HA-LVM is not configured correctly.
>>>>>> >>> >
>>>>>> >>> > The manual said that we should run "lvchange -a n lvxx" edit the
>>>>>> >>> > cluster.conf & start the service.
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> > But From lvm.conf :
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> > case $1 in
>>>>>> >>> >
>>>>>> >>> > start)
>>>>>> >>> >
>>>>>> >>> >         if ! [[ $(vgs -o attr --noheadings $OCF_RESKEY_vg_name)
>>>>>> =~
>>>>>> >>> > .....c
>>>>>> >>> > ]]; then
>>>>>> >>> >
>>>>>> >>> >                 ha_lvm_proper_setup_check || exit 1
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> > If the vg is not taged as cluster than the ha_lvm is looking for
>>>>>> >>> > volume_list
>>>>>> >>> > in lvm.conf.
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> > I am confused- Does the VG should taged as cluster ??  ( BTW -
>>>>>> the old
>>>>>> >>> > fashion HA-LVM is worked with no problems )
>>>>>> >>> >
>>>>>> >>> > redhat instructions :
>>>>>> >>> >
>>>>>> >>> > To set up HA LVM Failover (using the preferred CLVM variant),
>>>>>> perform
>>>>>> >>> > the
>>>>>> >>> > following steps:
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> > 1. Ensure that the parameter locking_type in the global section
>>>>>> >>> > of /etc/lvm/lvm.conf is set to the value '3', that all the
>>>>>> necessary
>>>>>> >>> > LVM
>>>>>> >>> > cluster packages are installed, and the necessary daemons are
>>>>>> started
>>>>>> >>> > (like
>>>>>> >>> > 'clvmd' and the cluster mirror log daemon - if necessary).
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> > 2. Create the logical volume and filesystem using standard LVM2
>>>>>> and
>>>>>> >>> > file
>>>>>> >>> > system commands. For example:
>>>>>> >>> >
>>>>>> >>> > # pvcreate /dev/sd[cde]1
>>>>>> >>> >
>>>>>> >>> >  # vgcreate <volume group name> /dev/sd[cde]1
>>>>>> >>> >
>>>>>> >>> >  # lvcreate -L 10G -n <logical volume name> <volume group name>
>>>>>> >>> >
>>>>>> >>> >  # mkfs.ext3 /dev/<volume group name>/<logical volume name>
>>>>>> >>> >
>>>>>> >>> >  # lvchange -an <volume group name>/<logical volume name>
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> > 3. Edit /etc/cluster/cluster.conf to include the newly created
>>>>>> logical
>>>>>> >>> > volume as a resource in one of your services. Alternatively,
>>>>>> >>> > configuration
>>>>>> >>> > tools such as Conga or system-config-cluster may be used to
>>>>>> create
>>>>>> >>> > these
>>>>>> >>> > entries.  Below is a sample resource manager section
>>>>>> >>> > from /etc/cluster/cluster.conf:
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> > <rm>      <failoverdomains>        <failoverdomain name="FD"
>>>>>> >>> > ordered="1"
>>>>>> >>> > restricted="0">           <failoverdomainnode name="neo-01"
>>>>>> >>> > priority="1"/>
>>>>>> >>> >           <failoverdomainnode name="neo-02" priority="2"/>
>>>>>> >>> > </failoverdomain>    </failoverdomains>    <resources>
>>>>>> <lvm
>>>>>> >>> > name="lvm" vg_name="shared_vg" lv_name="ha-lv"/>        <fs
>>>>>> name="FS"
>>>>>> >>> > device="/dev/shared_vg/ha-lv" force_fsck="0" force_unmount="1"
>>>>>> >>> > fsid="64050"
>>>>>> >>> > fstype="ext3" mountpoint="/mnt" options="" self_fence="0"/>
>>>>>> >>> > </resources>
>>>>>> >>> >    <service autostart="1" domain="FD" name="serv"
>>>>>> recovery="relocate">
>>>>>> >>> >        <lvm ref="lvm"/>        <fs ref="FS"/>    </service>
>>>>>> </rm>
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> > Regards
>>>>>> >>> >
>>>>>> >>> > Shalom.
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> >
>>>>>> >>> > --
>>>>>> >>> > Linux-cluster mailing list
>>>>>> >>> > Linux-cluster at redhat.com
>>>>>> >>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>>> >>> >
>>>>>> >>>
>>>>>> >>> --
>>>>>> >>> Linux-cluster mailing list
>>>>>> >>> Linux-cluster at redhat.com
>>>>>> >>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>>> >
>>>>>> >
>>>>>> > --
>>>>>> > Linux-cluster mailing list
>>>>>> > Linux-cluster at redhat.com
>>>>>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>>> >
>>>>>>
>>>>>> --
>>>>>> Linux-cluster mailing list
>>>>>> Linux-cluster at redhat.com
>>>>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> Linux-cluster mailing list
>>>>> Linux-cluster at redhat.com
>>>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>>
>>>>
>>>>
>>>> --
>>>> Linux-cluster mailing list
>>>> Linux-cluster at redhat.com
>>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>
>>>
>>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>
>>
>>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110205/7133d25a/attachment.htm>

From fdinitto at redhat.com  Tue Feb  8 07:44:22 2011
From: fdinitto at redhat.com (Fabio M. Di Nitto)
Date: Tue, 08 Feb 2011 08:44:22 +0100
Subject: [Linux-cluster] fence-agents 3.1.1 stable release
Message-ID: <4D50F456.30009@redhat.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

Welcome to the second fence-agents standalone release.

This release contains a few bug fixes and a brand new agent for Eaton
ePDU devices (courtesy of Arnaud Quette).

The new source tarball can be downloaded here:

https://fedorahosted.org/releases/f/e/fence-agents/fence-agents-3.1.1.tar.xz

To report bugs or issues:

   https://bugzilla.redhat.com/

Would you like to meet the cluster team or members of its community?

   Join us on IRC (irc.freenode.net #linux-cluster) and share your
   experience  with other sysadministrators or power users.

Thanks/congratulations to all people that contributed to achieve this
great milestone.

Happy clustering,
Fabio

Under the hood (from 3.1.0):

Fabio M. Di Nitto (3):
      Fix build for distributions that don't use bash as default shell
      build: fix make dist target
      Update COPYRIGHT file

Marek 'marx' Grac (3):
      fence_ipmilan: Add "diag" option to support "ipmitool chassis
power diag"
      fence_ipmilan: Fix manual page to describe usage with HP iLO 3
      fence_eaton_snmp: New fence agent for Eaton devices

Ryan O'Hara (6):
      fence_scsi: identify dm-multipath devices correctly
      fence_scsi: fix regular expression for grep
      fence_scsi: always do sg_turs before registration
      fence_scsi: always do sg_turs for dm-mp devices
      fence_scsi: verify that on/off actions succeed
      fence_scsi: properly log errors for all commands

 configure.ac                                |    1 +
 doc/COPYRIGHT                               |   19 ++--
 fence/agents/Makefile.am                    |    1 +
 fence/agents/alom/Makefile.am               |    2 +-
 fence/agents/apc/Makefile.am                |    2 +-
 fence/agents/apc_snmp/Makefile.am           |    2 +-
 fence/agents/bladecenter/Makefile.am        |    2 +-
 fence/agents/cisco_mds/Makefile.am          |    2 +-
 fence/agents/cisco_ucs/Makefile.am          |    2 +-
 fence/agents/drac5/Makefile.am              |    2 +-
 fence/agents/eaton_snmp/Makefile.am         |   16 +++
 fence/agents/eaton_snmp/README              |   20 +++
 fence/agents/eaton_snmp/fence_eaton_snmp.py |  177
+++++++++++++++++++++++++
 fence/agents/eps/Makefile.am                |    2 +-
 fence/agents/ibmblade/Makefile.am           |    2 +-
 fence/agents/ifmib/Makefile.am              |    2 +-
 fence/agents/ilo/Makefile.am                |    2 +-
 fence/agents/ilo_mp/Makefile.am             |    2 +-
 fence/agents/intelmodular/Makefile.am       |    2 +-
 fence/agents/ipmilan/Makefile.am            |    2 +-
 fence/agents/ipmilan/ipmilan.c              |   36 +++++-
 fence/agents/ldom/Makefile.am               |    2 +-
 fence/agents/lpar/Makefile.am               |    2 +-
 fence/agents/node_assassin/Makefile.am      |    2 +-
 fence/agents/rhevm/Makefile.am              |    2 +-
 fence/agents/rsa/Makefile.am                |    2 +-
 fence/agents/sanbox2/Makefile.am            |    2 +-
 fence/agents/scsi/fence_scsi.pl             |  189
+++++++++++++++++++++++----
 fence/agents/virsh/Makefile.am              |    2 +-
 fence/agents/vmware/Makefile.am             |    2 +-
 fence/agents/wti/Makefile.am                |    2 +-
 make/fencebuild.mk                          |    2 +-
 make/release.mk                             |    1 -
 33 files changed, 445 insertions(+), 63 deletions(-)
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iQIcBAEBCAAGBQJNUPRVAAoJEFA6oBJjVJ+OvR8P/RhSI+A8HaF8817LlxMFP/5v
bmP/tr3TrLpUC+gnnauTizrGuBjVogmUz9aO8VWS2wFcpf8NZpwzPrps8v2HAIZr
dEdB8l2yhQsis5cPuIWV8YiOPrp1S/+ewQxadFfmNQUuS+OrwSR4qA8pxAlw/mxW
4OuXhJzLTsK4RxV/rD3K8q1vrEiN3MgAW/ql1sDL94U5Rgs8RTL+FhXMqEqmBXl6
D/ZMnSD5KCYXNOw9r4wblxDkTdm1zP0s6oTM/6VZimYS1UxvuBZJaaxLcnixj+k8
MTCaVawCJtK6PcJXyf3+iHT9OuaFPvQCnn20sNerHuMJWd5jEoyY4lrDMvas73/F
ryJwHMwc/JpiXvbbNuMyS+oYyMFLqW1HSqR3SigiNtgMcoFPRYo1/UdbsTFvHxQe
p9V9W6mTggODLukEex5ShWFkyTS5IoZMniACey4bXdpvU/DJ797l0tqsJIouRv8Z
oRuOPCpX2BP7YAPj34fq82CgmUrPHklDevC6/qyjw8dp+PyRpLKXVyCJeotvYvrF
I4KzW4kjbgsXzRYdGPcIC27HbQ9lF0St21zinZQZzaZMLxFw9D4Re8/50dxILT41
AZs/nhVEZxz+lHKOjx5nW1bLXS8+oYUrhCxt2zNoBs6Evnz/5avsYlCBgOn7g37L
FmiqOW4U2iOdZDme10SN
=jrmg
-----END PGP SIGNATURE-----



From rossnick-lists at cybercat.ca  Wed Feb  9 15:26:17 2011
From: rossnick-lists at cybercat.ca (Nicolas Ross)
Date: Wed, 9 Feb 2011 10:26:17 -0500
Subject: [Linux-cluster] Heartbeat ?
Message-ID: <D561F9133D8844C690819A0B72534633@versa>

Hi !

I curently have a router (in fact 2) in CentOS 5.5 that uses the redhat's 
heartbeat package to manage high-availibility ressource (an ip each of 2 
interfaces, and that's about it).

This package was removed from RHEL 6. Was it replace by pacemaker ? What's 
the equivalent in RHEL 6 ?

Regards, 



From gordan at bobich.net  Wed Feb  9 16:14:13 2011
From: gordan at bobich.net (Gordan Bobic)
Date: Wed, 09 Feb 2011 16:14:13 +0000
Subject: [Linux-cluster] Heartbeat ?
In-Reply-To: <D561F9133D8844C690819A0B72534633@versa>
References: <D561F9133D8844C690819A0B72534633@versa>
Message-ID: <4D52BD55.1060603@bobich.net>

Nicolas Ross wrote:

> I curently have a router (in fact 2) in CentOS 5.5 that uses the 
> redhat's heartbeat package to manage high-availibility ressource (an ip 
> each of 2 interfaces, and that's about it).
> 
> This package was removed from RHEL 6. Was it replace by pacemaker ? 
> What's the equivalent in RHEL 6 ?

I'm using heartbeat from clusterlabs.org yum repository. The F13 
packages work just fine on RHEL6.

Have a look here:

http://www.clusterlabs.org/wiki/Install

Gordan



From veliogluh at itu.edu.tr  Wed Feb  9 16:38:46 2011
From: veliogluh at itu.edu.tr (Hakan VELIOGLU)
Date: Wed, 09 Feb 2011 18:38:46 +0200
Subject: [Linux-cluster] Piranha ipv6 support
In-Reply-To: <4D52BD55.1060603@bobich.net>
References: <D561F9133D8844C690819A0B72534633@versa>
	<4D52BD55.1060603@bobich.net>
Message-ID: <20110209183846.5866264gswpd0z6e@webmail.itu.edu.tr>

Hi,

Does piranha package (LVS tool) has ipv6 load balance support in RHEL  
6? ?f yes how can I set ipv6 addresses in /etc/sysconfig/ha/lvs.cf  
config file?

Thanks...

Hakan VELIOGLU
RHCE LPIC-1






From sklemer at gmail.com  Thu Feb 10 07:08:50 2011
From: sklemer at gmail.com (=?UTF-8?B?16nXnNeV150g16fXnNee16g=?=)
Date: Thu, 10 Feb 2011 09:08:50 +0200
Subject: [Linux-cluster] falied to implement HA-LVM with clvmd rhcs5.6
In-Reply-To: <AANLkTimUss1YgG9MGfhKBMHmKnWXO_rR516NkyqEWf-D@mail.gmail.com>
References: <AANLkTimBQ6_2PDr=pwQO=Hnb5Xsp9nxM4BXzyXsawag5@mail.gmail.com>
	<AANLkTineaTjMex9oVdNysG3=RBsPg21J3PENoCSmm0aV@mail.gmail.com>
	<AANLkTi=AXnPoC5kpoH=Yq8W3tc71onrcY=E9aHQCU91g@mail.gmail.com>
	<AANLkTik0F+tUWNtW65tDvpmEeLRErdo=eCEnRwmBwM9K@mail.gmail.com>
	<AANLkTimHa+MhiiE0e2+ZzbMnxWyQvwpv=s6geXBsWcVQ@mail.gmail.com>
	<AANLkTinw7NWLncpo7M7hg_u_fMn420E0tQTBuUNNiThs@mail.gmail.com>
	<AANLkTikyJuRV30c0872Aoy5ebAsf7mygZAb3xn68px-m@mail.gmail.com>
	<AANLkTi=fWQu_LzdaDOOZGePJghKdkvfHK7FkH9PJD+Sz@mail.gmail.com>
	<AANLkTikB7drAprGsccUCoh3HeJjPLwj4mYbe_3deJHr=@mail.gmail.com>
	<AANLkTincL6Y=fxZ1NAfr1jhQjvWwW8K_n_JbaFbYOtL6@mail.gmail.com>
	<AANLkTimUss1YgG9MGfhKBMHmKnWXO_rR516NkyqEWf-D@mail.gmail.com>
Message-ID: <AANLkTinfTJSTVR77RsG74vUdvzFECkMgi=PkEU8sB1-h@mail.gmail.com>

Hello.

The HA-LVM + clvmd is working!!!

The Vg should be tagged as cluster.



The major problem I noticed when I checked the cluster is that on the
passive system I am able to run vgchange -c n vgxx ; vgchange -a y vgxx ;
mount the LV out of the cluster - While this lv is mounted on the active
member.


Shalom.




On Sat, Feb 5, 2011 at 12:24 PM, dOminic <share2dom at gmail.com> wrote:

> Hi,
>
> If you are you using traditional HA-LVM setup with locking_type = 1 , then
> you need to setup tagging / initrd rebuild
> Anything went wrong with tagging.... then it won't prevent admin to
> activate vg, remove the VG which is activated/used in another node.
>
> # vgchange -ay domvg
>   1 logical volume(s) in volume group "domvg" now active
> # lvremove /dev/domvg/domlv
> Do you really want to remove active logical volume domlv? [y/n]: y
>   Logical volume "domlv" successfully removed
>
> If you are using clvmd variant with locking_type = 3 and cluster is
> *active/passive* then clvmd won't let you to remove the VG which is
> activate/used in another node.
>
> # lvremove /dev/domvg/domlv
> Do you really want to remove active clustered logical volume domlv? [y/n]:
> y
>   Error locking on node node1.domtest.com: LV domvg/domlv in use: not
> deactivating
>   Unable to deactivate logical volume "domlv"
>
> - dominic
>
> On Sat, Feb 5, 2011 at 1:07 PM, ???? ???? <sklemer at gmail.com> wrote:
>
>> Hi.
>>
>>  After reading the manual again ,I think i know  what was my setup
>> problem.
>>
>> The VGs should be taged as cluster , ( this action will path the lvm.sh
>> check ) & the LVs should be deactivated.
>>
>> The cluster will activate the LVs as exclusive. (i will check it on
>> monday ).
>>
>> Shalom.
>>
>>
>> On Sat, Feb 5, 2011 at 12:07 AM, ???? ???? <sklemer at gmail.com> wrote:
>>
>>> Hello Dominic.
>>>
>>> I will be in lab on monday and collect all steps & logs files.
>>>
>>> btw - Is it  redhat recommendation to preferre  using  HA-LVM with the
>>> locking_type=1 method ??
>>>
>>>
>>> Regards
>>>
>>> Shalom
>>>
>>>
>>> On Fri, Feb 4, 2011 at 5:23 PM, dOminic <share2dom at gmail.com> wrote:
>>>
>>>> Hi Shalom,
>>>>
>>>> Are you still facing problem implementing HA-LVM with locking_type = 3
>>>> setting ?. If yes, it would be great if you could provide the following
>>>> details .
>>>> So that others can also check
>>>>
>>>> * steps you are following along with complete output
>>>> * status in "clustat" after making changes in cluster.conf
>>>> * attach cluster.conf and /var/log/messages.
>>>>
>>>> dominic
>>>>
>>>> On Fri, Feb 4, 2011 at 8:34 PM, ???? ???? <sklemer at gmail.com> wrote:
>>>>
>>>>> Thanks.
>>>>>
>>>>> This is the old methos and its work great but its hard to maintain such
>>>>> cluster.
>>>>>
>>>>> Shalom.
>>>>>
>>>>>
>>>>> On Fri, Feb 4, 2011 at 4:32 PM, Dominic Geevarghese <
>>>>> share2dom at gmail.com> wrote:
>>>>>
>>>>>>
>>>>>> Hi,
>>>>>>
>>>>>> I am not sure about the error you are getting but it would be great if
>>>>>> you could try the preferred method
>>>>>>
>>>>>> locking_type = 1
>>>>>> volume_list [ "your-root-vg-name" , "@hostname" ]
>>>>>>
>>>>>> rebuild initrd
>>>>>>
>>>>>> add the <lvm> and <fs> resources in cluster.conf , start the cman,
>>>>>> rgmanager .
>>>>>>
>>>>>>
>>>>>> Thanks,
>>>>>>
>>>>>> On Thu, Feb 3, 2011 at 8:02 PM, Corey Kovacs <corey.kovacs at gmail.com>wrote:
>>>>>>
>>>>>>> Excellent,
>>>>>>>
>>>>>>>
>>>>>>> Thanks
>>>>>>>
>>>>>>> -C
>>>>>>>
>>>>>>> On Thu, Feb 3, 2011 at 10:38 AM, ???? ???? <sklemer at gmail.com>
>>>>>>> wrote:
>>>>>>> >
>>>>>>> >
>>>>>>> > On Thu, Feb 3, 2011 at 12:35 PM, ???? ???? <sklemer at gmail.com>
>>>>>>> wrote:
>>>>>>> >>
>>>>>>> >>
>>>>>>> >>
>>>>>>> >> https://access.redhat.com/kb/docs/DOC-3068
>>>>>>> >>
>>>>>>> >> On Thu, Feb 3, 2011 at 11:13 AM, Corey Kovacs <
>>>>>>> corey.kovacs at gmail.com>
>>>>>>> >> wrote:
>>>>>>> >>>
>>>>>>> >>> Is using ha-lvm with clvmd a new capability? It's always been my
>>>>>>> >>> understanding that the lvm locking type for using ha-lvm had to
>>>>>>> be set
>>>>>>> >>> to '1'.
>>>>>>> >>>
>>>>>>> >>> I'd much rather be using clvmd if it is the way to go. Can you
>>>>>>> point
>>>>>>> >>> me to the docs you are seeing these instructions in please?
>>>>>>> >>>
>>>>>>> >>> As for why your config isn't working, clvmd requires that it's
>>>>>>> >>> resources are indeed tagged as cluster volumes, so you might try
>>>>>>> doing
>>>>>>> >>> that and see how it goes.
>>>>>>> >>>
>>>>>>> >>> -C
>>>>>>> >>>
>>>>>>> >>> On Thu, Feb 3, 2011 at 7:26 AM, ???? ???? <sklemer at gmail.com>
>>>>>>> wrote:
>>>>>>> >>> > Hello.
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> > I followed redhat instruction trying install HA-LVM with clvmd.
>>>>>>> ( rhcs
>>>>>>> >>> > 5.6 -
>>>>>>> >>> > rgmanager 2.0.52-9 )
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> > I can't make it work.
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> > lvm.conf- locking_type=3
>>>>>>> >>> >
>>>>>>> >>> > clvmd work
>>>>>>> >>> >
>>>>>>> >>> > Its failed saying HA-LVM is not configured correctly.
>>>>>>> >>> >
>>>>>>> >>> > The manual said that we should run "lvchange -a n lvxx" edit
>>>>>>> the
>>>>>>> >>> > cluster.conf & start the service.
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> > But From lvm.conf :
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> > case $1 in
>>>>>>> >>> >
>>>>>>> >>> > start)
>>>>>>> >>> >
>>>>>>> >>> >         if ! [[ $(vgs -o attr --noheadings $OCF_RESKEY_vg_name)
>>>>>>> =~
>>>>>>> >>> > .....c
>>>>>>> >>> > ]]; then
>>>>>>> >>> >
>>>>>>> >>> >                 ha_lvm_proper_setup_check || exit 1
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> > If the vg is not taged as cluster than the ha_lvm is looking
>>>>>>> for
>>>>>>> >>> > volume_list
>>>>>>> >>> > in lvm.conf.
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> > I am confused- Does the VG should taged as cluster ??  ( BTW -
>>>>>>> the old
>>>>>>> >>> > fashion HA-LVM is worked with no problems )
>>>>>>> >>> >
>>>>>>> >>> > redhat instructions :
>>>>>>> >>> >
>>>>>>> >>> > To set up HA LVM Failover (using the preferred CLVM variant),
>>>>>>> perform
>>>>>>> >>> > the
>>>>>>> >>> > following steps:
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> > 1. Ensure that the parameter locking_type in the global section
>>>>>>> >>> > of /etc/lvm/lvm.conf is set to the value '3', that all the
>>>>>>> necessary
>>>>>>> >>> > LVM
>>>>>>> >>> > cluster packages are installed, and the necessary daemons are
>>>>>>> started
>>>>>>> >>> > (like
>>>>>>> >>> > 'clvmd' and the cluster mirror log daemon - if necessary).
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> > 2. Create the logical volume and filesystem using standard LVM2
>>>>>>> and
>>>>>>> >>> > file
>>>>>>> >>> > system commands. For example:
>>>>>>> >>> >
>>>>>>> >>> > # pvcreate /dev/sd[cde]1
>>>>>>> >>> >
>>>>>>> >>> >  # vgcreate <volume group name> /dev/sd[cde]1
>>>>>>> >>> >
>>>>>>> >>> >  # lvcreate -L 10G -n <logical volume name> <volume group name>
>>>>>>> >>> >
>>>>>>> >>> >  # mkfs.ext3 /dev/<volume group name>/<logical volume name>
>>>>>>> >>> >
>>>>>>> >>> >  # lvchange -an <volume group name>/<logical volume name>
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> > 3. Edit /etc/cluster/cluster.conf to include the newly created
>>>>>>> logical
>>>>>>> >>> > volume as a resource in one of your services. Alternatively,
>>>>>>> >>> > configuration
>>>>>>> >>> > tools such as Conga or system-config-cluster may be used to
>>>>>>> create
>>>>>>> >>> > these
>>>>>>> >>> > entries.  Below is a sample resource manager section
>>>>>>> >>> > from /etc/cluster/cluster.conf:
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> > <rm>      <failoverdomains>        <failoverdomain name="FD"
>>>>>>> >>> > ordered="1"
>>>>>>> >>> > restricted="0">           <failoverdomainnode name="neo-01"
>>>>>>> >>> > priority="1"/>
>>>>>>> >>> >           <failoverdomainnode name="neo-02" priority="2"/>
>>>>>>> >>> > </failoverdomain>    </failoverdomains>    <resources>
>>>>>>> <lvm
>>>>>>> >>> > name="lvm" vg_name="shared_vg" lv_name="ha-lv"/>        <fs
>>>>>>> name="FS"
>>>>>>> >>> > device="/dev/shared_vg/ha-lv" force_fsck="0" force_unmount="1"
>>>>>>> >>> > fsid="64050"
>>>>>>> >>> > fstype="ext3" mountpoint="/mnt" options="" self_fence="0"/>
>>>>>>> >>> > </resources>
>>>>>>> >>> >    <service autostart="1" domain="FD" name="serv"
>>>>>>> recovery="relocate">
>>>>>>> >>> >        <lvm ref="lvm"/>        <fs ref="FS"/>    </service>
>>>>>>> </rm>
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> > Regards
>>>>>>> >>> >
>>>>>>> >>> > Shalom.
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> >
>>>>>>> >>> > --
>>>>>>> >>> > Linux-cluster mailing list
>>>>>>> >>> > Linux-cluster at redhat.com
>>>>>>> >>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>>>> >>> >
>>>>>>> >>>
>>>>>>> >>> --
>>>>>>> >>> Linux-cluster mailing list
>>>>>>> >>> Linux-cluster at redhat.com
>>>>>>> >>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>>>> >
>>>>>>> >
>>>>>>> > --
>>>>>>> > Linux-cluster mailing list
>>>>>>> > Linux-cluster at redhat.com
>>>>>>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>>>> >
>>>>>>>
>>>>>>> --
>>>>>>> Linux-cluster mailing list
>>>>>>> Linux-cluster at redhat.com
>>>>>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>>>>
>>>>>>
>>>>>>
>>>>>> --
>>>>>> Linux-cluster mailing list
>>>>>> Linux-cluster at redhat.com
>>>>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> Linux-cluster mailing list
>>>>> Linux-cluster at redhat.com
>>>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>>
>>>>
>>>>
>>>> --
>>>> Linux-cluster mailing list
>>>> Linux-cluster at redhat.com
>>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>>>
>>>
>>>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110210/8198f41d/attachment.htm>

From parvez.h.shaikh at gmail.com  Fri Feb 11 07:21:27 2011
From: parvez.h.shaikh at gmail.com (Parvez Shaikh)
Date: Fri, 11 Feb 2011 12:51:27 +0530
Subject: [Linux-cluster] Tuning red hat cluster
Message-ID: <AANLkTikd3LpD+qFPg9cDM4uw89HuUxHmgHFSqvVe0Avs@mail.gmail.com>

Hi,

As per my understanding rgmanager invokes 'status' on resource groups
periodically to determine if these resources are up or down.

I observed that this period is of around 30 seconds. Is it possible to tune
or adjust this period for individual services or resource groups?

Thanks
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110211/656b8494/attachment.htm>

From zachar at awst.at  Fri Feb 11 10:34:45 2011
From: zachar at awst.at (zachar at awst.at)
Date: Fri, 11 Feb 2011 11:34:45 +0100 (CET)
Subject: [Linux-cluster] =?utf-8?q?Tuning_red_hat_cluster?=
Message-ID: <mtranet.20110211113445.309700584@telekom.at>

Hi,

http://sources.redhat.com/cluster/wiki/FAQ/RGManager#rgm_interval

Regards,
Balazs

Parvez Shaikh schrieb:
> Hi,
> 
> As per my understanding rgmanager invokes 'status' on resource groups
> periodically to determine if these resources are up or down.
> 
> I observed that this period is of around 30 seconds. Is it possible to 
> tune
> or adjust this period for individual services or resource groups?
> 
> Thanks




From kitgerrits at gmail.com  Sat Feb 12 10:51:09 2011
From: kitgerrits at gmail.com (Kit Gerrits)
Date: Sat, 12 Feb 2011 11:51:09 +0100
Subject: [Linux-cluster] A better understanding of multicast issues
In-Reply-To: <4D3D9CA1.7040707@alteeve.com>
Message-ID: <4d566626.857a0e0a.6cd4.0d41@mx.google.com>


Digimer,

Did you ever get a reply from anyone?

If what you say is true, failure of one of our HSRP(HA) switches/routers
might break the cluster.
(if they don't share multicast menberships)

I would guess that  multicast groups originate in the cluster, not the
switch.
In that case, if the switch has been rebooted, the cluster needs to
re-create the multicast groups on the switch.

I would guess that the cluster itself needs to check if the switch is
properly handling multicast.
(subscribe to its own group and check if the packets are being handles
correctly)

This should provide an insight into clustering/multicast:
http://www.cisco.com/en/US/products/hw/switches/ps708/products_tech_note0918
6a008059a9df.shtml


Regards,

Kit


-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Digimer
Sent: maandag 24 januari 2011 16:37
To: linux clustering
Subject: [Linux-cluster] A better understanding of multicast issues

Hi all,

  It seems to me that a very good number of clustering problems end up being
multicast and smart switch related. I know that IGMP snooping and STP are
often the cause, and PIM can help solve it. Despite understanding this,
though, I can't quite understand exactly *why* IGMP snooping and STP break
things.

  Reading up on them leads me to think that they should cleanly create and
handle multicast groups, but this obviously isn't the case. When a switch
restarts, shouldn't it send a request to clients asking to resubscribe to
multicast groups? When corosync starts, I expect it would also send
multicast joins.

  Sorry if the question is a little vague or odd. I'm trying to get my head
around the troubles when, on the surface, the docs seem to make the process
of creating/managing multicast quite simple and straight forward.

Thanks!

--
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin:  http://nodeassassin.org

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster



From corey.kovacs at gmail.com  Sat Feb 12 15:36:20 2011
From: corey.kovacs at gmail.com (Corey Kovacs)
Date: Sat, 12 Feb 2011 15:36:20 +0000
Subject: [Linux-cluster] A better understanding of multicast issues
In-Reply-To: <4d566626.857a0e0a.6cd4.0d41@mx.google.com>
References: <4D3D9CA1.7040707@alteeve.com>
	<4d566626.857a0e0a.6cd4.0d41@mx.google.com>
Message-ID: <AANLkTikMdhKMgtLmGnidOxA-eCxo5=digFCL7WKkVUP3@mail.gmail.com>

When a multicast group is "joined" the switch/router will periodically
(three mins i think) send out a query to the members to see if the
connection is still needed. If a member does not reply to this query,
then the connection is dropped for that port. If a switch is rebooted,
then it's up to the member to re-establish the connection I believe,
not the switch. Snooping is not generally a problem unless it's broken
in the switch/router firmware. If it is, then you might need an
upgrade. We use Multicast for all sorts of things and have indeed run
into some problems on devices like Flex-10 cards for HP c7000 blade
chassis that didn't do igmp snooping correctly, but we have gotten
fixes for these issues from various vendors.

If you have a planned outage for a switch, you can have your network
people relocate the querier for a particular multicast group to
another switch accessible to say a bonded pair or something. Things
get really odd if you are on two separate switches that aren't
stacked.

Generally speaking, multicast isn't hard, you just have to think backwards.

-C

On Sat, Feb 12, 2011 at 10:51 AM, Kit Gerrits <kitgerrits at gmail.com> wrote:
>
> Digimer,
>
> Did you ever get a reply from anyone?
>
> If what you say is true, failure of one of our HSRP(HA) switches/routers
> might break the cluster.
> (if they don't share multicast menberships)
>
> I would guess that ?multicast groups originate in the cluster, not the
> switch.
> In that case, if the switch has been rebooted, the cluster needs to
> re-create the multicast groups on the switch.
>
> I would guess that the cluster itself needs to check if the switch is
> properly handling multicast.
> (subscribe to its own group and check if the packets are being handles
> correctly)
>
> This should provide an insight into clustering/multicast:
> http://www.cisco.com/en/US/products/hw/switches/ps708/products_tech_note0918
> 6a008059a9df.shtml
>
>
> Regards,
>
> Kit
>
>
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Digimer
> Sent: maandag 24 januari 2011 16:37
> To: linux clustering
> Subject: [Linux-cluster] A better understanding of multicast issues
>
> Hi all,
>
> ?It seems to me that a very good number of clustering problems end up being
> multicast and smart switch related. I know that IGMP snooping and STP are
> often the cause, and PIM can help solve it. Despite understanding this,
> though, I can't quite understand exactly *why* IGMP snooping and STP break
> things.
>
> ?Reading up on them leads me to think that they should cleanly create and
> handle multicast groups, but this obviously isn't the case. When a switch
> restarts, shouldn't it send a request to clients asking to resubscribe to
> multicast groups? When corosync starts, I expect it would also send
> multicast joins.
>
> ?Sorry if the question is a little vague or odd. I'm trying to get my head
> around the troubles when, on the surface, the docs seem to make the process
> of creating/managing multicast quite simple and straight forward.
>
> Thanks!
>
> --
> Digimer
> E-Mail: digimer at alteeve.com
> AN!Whitepapers: http://alteeve.com
> Node Assassin: ?http://nodeassassin.org
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>



From linux at alteeve.com  Sat Feb 12 16:21:44 2011
From: linux at alteeve.com (Digimer)
Date: Sat, 12 Feb 2011 11:21:44 -0500
Subject: [Linux-cluster] A better understanding of multicast issues
In-Reply-To: <4d566626.857a0e0a.6cd4.0d41@mx.google.com>
References: <4d566626.857a0e0a.6cd4.0d41@mx.google.com>
Message-ID: <4D56B398.70102@alteeve.com>

On 02/12/2011 05:51 AM, Kit Gerrits wrote:
> 
> Digimer,
> 
> Did you ever get a reply from anyone?
> 
> If what you say is true, failure of one of our HSRP(HA) switches/routers
> might break the cluster.
> (if they don't share multicast menberships)
> 
> I would guess that  multicast groups originate in the cluster, not the
> switch.
> In that case, if the switch has been rebooted, the cluster needs to
> re-create the multicast groups on the switch.
> 
> I would guess that the cluster itself needs to check if the switch is
> properly handling multicast.
> (subscribe to its own group and check if the packets are being handles
> correctly)
> 
> This should provide an insight into clustering/multicast:
> http://www.cisco.com/en/US/products/hw/switches/ps708/products_tech_note0918
> 6a008059a9df.shtml
> 
> 
> Regards,
> 
> Kit

Hi Kit,

  I did not, and thank you for replying.

  So the frequent multicast breakdowns, given that it's fairly rare for
switches to reset, is probably in the periodic checks done by the
switches. I wonder then if corosync, for whatever reasons, doesn't or
isn't able to answer the requests (quickly enough). Perhaps the process
takes too much time? Corosync will, by default, decare a ring dead after
~3s.

  More to think about, and I appreciate that link. Thanks. :)

-- 
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin:  http://nodeassassin.org



From linux at alteeve.com  Sat Feb 12 16:23:48 2011
From: linux at alteeve.com (Digimer)
Date: Sat, 12 Feb 2011 11:23:48 -0500
Subject: [Linux-cluster] A better understanding of multicast issues
In-Reply-To: <AANLkTikMdhKMgtLmGnidOxA-eCxo5=digFCL7WKkVUP3@mail.gmail.com>
References: <4D3D9CA1.7040707@alteeve.com>	<4d566626.857a0e0a.6cd4.0d41@mx.google.com>
	<AANLkTikMdhKMgtLmGnidOxA-eCxo5=digFCL7WKkVUP3@mail.gmail.com>
Message-ID: <4D56B414.1050706@alteeve.com>

On 02/12/2011 10:36 AM, Corey Kovacs wrote:
> When a multicast group is "joined" the switch/router will periodically
> (three mins i think) send out a query to the members to see if the
> connection is still needed. If a member does not reply to this query,
> then the connection is dropped for that port. If a switch is rebooted,
> then it's up to the member to re-establish the connection I believe,
> not the switch. Snooping is not generally a problem unless it's broken
> in the switch/router firmware. If it is, then you might need an
> upgrade. We use Multicast for all sorts of things and have indeed run
> into some problems on devices like Flex-10 cards for HP c7000 blade
> chassis that didn't do igmp snooping correctly, but we have gotten
> fixes for these issues from various vendors.
> 
> If you have a planned outage for a switch, you can have your network
> people relocate the querier for a particular multicast group to
> another switch accessible to say a bonded pair or something. Things
> get really odd if you are on two separate switches that aren't
> stacked.
> 
> Generally speaking, multicast isn't hard, you just have to think backwards.
> 
> -C

Thanks for the reply, Corey.

  When I replied to Kit, I think I addressed issues you both brought up
(yay head colds!). So let me extend my thanks to you here, this has
given my more to consider. I'll have to fire up tcpdump on the the
cluster and wait and see.

-- 
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin:  http://nodeassassin.org



From sachinbhugra at hotmail.com  Sun Feb 13 09:14:43 2011
From: sachinbhugra at hotmail.com (Sachin Bhugra)
Date: Sun, 13 Feb 2011 14:44:43 +0530
Subject: [Linux-cluster] Cluster node hangs
Message-ID: <SNT112-W544548584AD789563BB1ECDAD10@phx.gbl>


Hi ,



 I have setup a two node cluster in lab, with Vmware Server, and hence 
used manual fencing. It includes a iSCSI GFS2 partition and it service 
Apache in Active/Passive mode.



Cluster works and I am able to relocate service between nodes with no 
issues. However, the problem comes when I shutdown the node, for 
testing, which is presently holding the service. When the node becomes 
unavailable, service gets relocated and GFS partition gets mounted on 
the other node, however it is not accessible. If I try to do a "ls/du" 
on GFS partition, the command hangs. On the other hand the node which 
was shutdown gets stuck at "unmounting file system". 



I tried using fence_manual -n nodename and then fence_ack_manual -n nodename, however it still remains the same.



Can someone please help me is what I am doing wrong?



Thanks, 		 	   		  
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110213/86d1c478/attachment.htm>

From ekuric at redhat.com  Sun Feb 13 09:41:55 2011
From: ekuric at redhat.com (Elvir Kuric)
Date: Sun, 13 Feb 2011 10:41:55 +0100
Subject: [Linux-cluster] Cluster node hangs
In-Reply-To: <SNT112-W544548584AD789563BB1ECDAD10@phx.gbl>
References: <SNT112-W544548584AD789563BB1ECDAD10@phx.gbl>
Message-ID: <4D57A763.8030700@redhat.com>

On 02/13/2011 10:14 AM, Sachin Bhugra wrote:
> Hi ,
>
> I have setup a two node cluster in lab, with Vmware Server, and hence 
> used manual fencing. It includes a iSCSI GFS2 partition and it service 
> Apache in Active/Passive mode.
>
> Cluster works and I am able to relocate service between nodes with no 
> issues. However, the problem comes when I shutdown the node, for 
> testing, which is presently holding the service. When the node becomes 
> unavailable, service gets relocated and GFS partition gets mounted on 
> the other node, however it is not accessible. If I try to do a "ls/du" 
> on GFS partition, the command hangs. On the other hand the node which 
> was shutdown gets stuck at "unmounting file system".
>
> I tried using fence_manual -n nodename and then fence_ack_manual -n 
> nodename, however it still remains the same.
>
> Can someone please help me is what I am doing wrong?
>
> Thanks,
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
It would be good to see  /etc/fstab configuration used on cluster nodes. 
If /gfs partition is mounted manually it will not be unmounted correctly 
in case you restart node ( and not executing umount prior restart ), and 
will hang during shutdown/reboot process.

More at:  
http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/6/html-single/Global_File_System_2/index.html 


Regards,

Elvir


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110213/53eecada/attachment.htm>

From ekuric at redhat.com  Sun Feb 13 09:52:51 2011
From: ekuric at redhat.com (Elvir Kuric)
Date: Sun, 13 Feb 2011 10:52:51 +0100
Subject: [Linux-cluster] Cluster node hangs
In-Reply-To: <4D57A763.8030700@redhat.com>
References: <SNT112-W544548584AD789563BB1ECDAD10@phx.gbl>
	<4D57A763.8030700@redhat.com>
Message-ID: <4D57A9F3.90408@redhat.com>

On 02/13/2011 10:41 AM, Elvir Kuric wrote:
> On 02/13/2011 10:14 AM, Sachin Bhugra wrote:
>> Hi ,
>>
>> I have setup a two node cluster in lab, with Vmware Server, and hence 
>> used manual fencing. It includes a iSCSI GFS2 partition and it 
>> service Apache in Active/Passive mode.
>>
>> Cluster works and I am able to relocate service between nodes with no 
>> issues. However, the problem comes when I shutdown the node, for 
>> testing, which is presently holding the service. When the node 
>> becomes unavailable, service gets relocated and GFS partition gets 
>> mounted on the other node, however it is not accessible. If I try to 
>> do a "ls/du" on GFS partition, the command hangs. On the other hand 
>> the node which was shutdown gets stuck at "unmounting file system".
>>
>> I tried using fence_manual -n nodename and then fence_ack_manual -n 
>> nodename, however it still remains the same.
>>
>> Can someone please help me is what I am doing wrong?
>>
>> Thanks,
>>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
> It would be good to see  /etc/fstab configuration used on cluster 
> nodes. If /gfs partition is mounted manually it will not be unmounted 
> correctly in case you restart node ( and not executing umount prior 
> restart ), and will hang during shutdown/reboot process.
>
> More at: 
> http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/6/html-single/Global_File_System_2/index.html

Edit: above link, section 3.4Special Considerations when Mounting GFS2 
File Systems
>
>
> Regards,
>
> Elvir
>

>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110213/5443b0fb/attachment.htm>

From sachinbhugra at hotmail.com  Sun Feb 13 10:19:01 2011
From: sachinbhugra at hotmail.com (Sachin Bhugra)
Date: Sun, 13 Feb 2011 15:49:01 +0530
Subject: [Linux-cluster] Cluster node hangs
In-Reply-To: <4D57A9F3.90408@redhat.com>
References: <SNT112-W544548584AD789563BB1ECDAD10@phx.gbl>,
	<4D57A763.8030700@redhat.com>, <4D57A9F3.90408@redhat.com>
Message-ID: <SNT112-W456C3B1F45B101BABEDFDFDAD10@phx.gbl>


Thank for the reply and link. However, GFS2 is not listed in fstab, it is only handled by cluster config.

Date: Sun, 13 Feb 2011 10:52:51 +0100
From: ekuric at redhat.com
To: linux-cluster at redhat.com
Subject: Re: [Linux-cluster] Cluster node hangs



  


    
    Message body
  
  
    On 02/13/2011 10:41 AM, Elvir Kuric wrote:
    
      
      
      On 02/13/2011 10:14 AM, Sachin Bhugra wrote:
      
         Hi ,

        

        I have setup a two node cluster in lab, with Vmware Server, and
        hence used manual fencing. It includes a iSCSI GFS2 partition
        and it service Apache in Active/Passive mode.

        

        Cluster works and I am able to relocate service between nodes
        with no issues. However, the problem comes when I shutdown the
        node, for testing, which is presently holding the service. When
        the node becomes unavailable, service gets relocated and GFS
        partition gets mounted on the other node, however it is not
        accessible. If I try to do a "ls/du" on GFS partition, the
        command hangs. On the other hand the node which was shutdown
        gets stuck at "unmounting file system". 

        

        I tried using fence_manual -n nodename and then fence_ack_manual
        -n nodename, however it still remains the same.

        

        Can someone please help me is what I am doing wrong?

        

        Thanks,
        
--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster
      
      It would be good to see  /etc/fstab configuration used on
        cluster nodes. If /gfs partition is mounted manually it will not
        be unmounted correctly in case you restart node ( and not
        executing umount prior restart ), and will hang during
        shutdown/reboot process.

        

        More at:  http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/6/html-single/Global_File_System_2/index.html
    

    Edit: above link, section 3.4 Special Considerations
      when Mounting GFS2 File Systems
    
        

        

        Regards, 

        

        Elvir 

        

      
    

    
        

      
      
--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster
    
    

  


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster 		 	   		  
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110213/a8b6f2e0/attachment.htm>

From shariq.siddiqui at yahoo.com  Sun Feb 13 11:16:28 2011
From: shariq.siddiqui at yahoo.com (Shariq Siddiqui)
Date: Sun, 13 Feb 2011 03:16:28 -0800 (PST)
Subject: [Linux-cluster] want to use GFS2
In-Reply-To: <SNT112-W544548584AD789563BB1ECDAD10@phx.gbl>
References: <SNT112-W544548584AD789563BB1ECDAD10@phx.gbl>
Message-ID: <428984.82494.qm@web39802.mail.mud.yahoo.com>

Dear All,
I want to use GFS2 filesystem in shared storage.

and trying to mount it in two node,

what initially I need to do to fullfill this task.
I don't need whole cluster suit i need minimal configuration.

Please HELP

 
Best Regards,

Shariq Siddiqui
Advanced Operations Technology
PO.Box : 25904 - Riyadh 11476
Riyadh Saudi Arabia
Tel : +966 1 291 0605 -
Fax:+966 1 291 3328


      
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110213/c6f54338/attachment.htm>

From share2dom at gmail.com  Sun Feb 13 14:32:55 2011
From: share2dom at gmail.com (dOminic)
Date: Sun, 13 Feb 2011 20:02:55 +0530
Subject: [Linux-cluster] Cluster node hangs
In-Reply-To: <SNT112-W456C3B1F45B101BABEDFDFDAD10@phx.gbl>
References: <SNT112-W544548584AD789563BB1ECDAD10@phx.gbl>
	<4D57A763.8030700@redhat.com> <4D57A9F3.90408@redhat.com>
	<SNT112-W456C3B1F45B101BABEDFDFDAD10@phx.gbl>
Message-ID: <AANLkTimZ9E4xm8qS_bb7-NcxY7X+u0Eg1nYaCXK0iuKj@mail.gmail.com>

Hi,

Whats the msg you are getting in logs ?. It would be great if you could
attach log mesgs along with cluster.conf

-dominic


On Sun, Feb 13, 2011 at 3:49 PM, Sachin Bhugra <sachinbhugra at hotmail.com>wrote:

>  Thank for the reply and link. However, GFS2 is not listed in fstab, it is
> only handled by cluster config.
>
> ------------------------------
> Date: Sun, 13 Feb 2011 10:52:51 +0100
> From: ekuric at redhat.com
> To: linux-cluster at redhat.com
> Subject: Re: [Linux-cluster] Cluster node hangs
>
>
> On 02/13/2011 10:41 AM, Elvir Kuric wrote:
>
> On 02/13/2011 10:14 AM, Sachin Bhugra wrote:
>
> Hi ,
>
> I have setup a two node cluster in lab, with Vmware Server, and hence used
> manual fencing. It includes a iSCSI GFS2 partition and it service Apache in
> Active/Passive mode.
>
> Cluster works and I am able to relocate service between nodes with no
> issues. However, the problem comes when I shutdown the node, for testing,
> which is presently holding the service. When the node becomes unavailable,
> service gets relocated and GFS partition gets mounted on the other node,
> however it is not accessible. If I try to do a "ls/du" on GFS partition, the
> command hangs. On the other hand the node which was shutdown gets stuck at
> "unmounting file system".
>
> I tried using fence_manual -n nodename and then fence_ack_manual -n
> nodename, however it still remains the same.
>
> Can someone please help me is what I am doing wrong?
>
> Thanks,
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>  It would be good to see  /etc/fstab configuration used on cluster nodes.
> If /gfs partition is mounted manually it will not be unmounted correctly in
> case you restart node ( and not executing umount prior restart ), and will
> hang during shutdown/reboot process.
>
> More at:
> http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/6/html-single/Global_File_System_2/index.html
>
>
> Edit: above link, section 3.4 Special Considerations when Mounting GFS2
> File Systems
>
>
>
> Regards,
>
> Elvir
>
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
>
> -- Linux-cluster mailing list Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110213/7453f7cf/attachment.htm>

From share2dom at gmail.com  Sun Feb 13 14:39:15 2011
From: share2dom at gmail.com (dOminic)
Date: Sun, 13 Feb 2011 20:09:15 +0530
Subject: [Linux-cluster] want to use GFS2
In-Reply-To: <428984.82494.qm@web39802.mail.mud.yahoo.com>
References: <SNT112-W544548584AD789563BB1ECDAD10@phx.gbl>
	<428984.82494.qm@web39802.mail.mud.yahoo.com>
Message-ID: <AANLkTi=dX8E19iQ4T1YZNAhsec-sUpXCcw5AOf-Yp+Av@mail.gmail.com>

Hi,

You need to setup a simple cluster and a proper fencing mechanism . No need
to configure any services since you want to use GFS2 on both the nodes.
Start the cluster, mount the gfs2 by /etc/fstab entry.

Note: You can't use GFS2 without a Cluster setup.

Dominic

On Sun, Feb 13, 2011 at 4:46 PM, Shariq Siddiqui
<shariq.siddiqui at yahoo.com>wrote:

> Dear All,
> I want to use GFS2 filesystem in shared storage.
>
> and trying to mount it in two node,
>
> what initially I need to do to fullfill this task.
> I don't need whole cluster suit i need minimal configuration.
>
> Please HELP
>
>
> Best Regards,
>
> Shariq Siddiqui
>
> Advanced Operations Technology
>
> PO.Box : 25904 - Riyadh 11476
>
> Riyadh Saudi Arabia
>
> Tel : +966 1 291 0605 -
>
> Fax:+966 1 291 3328
> [image: View Shariq Siddiqui's profile on LinkedIn]<http://www.linkedin.com/in/shariqsiddiqui>
>
>
>
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110213/68c4406e/attachment.htm>

From rpeterso at redhat.com  Mon Feb 14 13:37:21 2011
From: rpeterso at redhat.com (Bob Peterson)
Date: Mon, 14 Feb 2011 08:37:21 -0500 (EST)
Subject: [Linux-cluster] want to use GFS2
In-Reply-To: <AANLkTi=dX8E19iQ4T1YZNAhsec-sUpXCcw5AOf-Yp+Av@mail.gmail.com>
Message-ID: <674877142.9531.1297690641329.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>

----- Original Message -----
| Hi,
| 
| You need to setup a simple cluster and a proper fencing mechanism . No
| need
| to configure any services since you want to use GFS2 on both the
| nodes.
| Start the cluster, mount the gfs2 by /etc/fstab entry.
| 
| Note: You can't use GFS2 without a Cluster setup.
| 
| Dominic

Hi,

Well, technically you can use GFS2 without a cluster setup.
I believe Red Hat doesn't support it, and the storage can't
be mounted by more than a single computer (with "lock_nolock"
locking protocol), but it can be done.

Regards,

Bob Peterson
Red Hat File Systems



From randy.brown at noaa.gov  Mon Feb 14 13:53:21 2011
From: randy.brown at noaa.gov (Randy Brown)
Date: Mon, 14 Feb 2011 08:53:21 -0500
Subject: [Linux-cluster] Problem with machines fencing one another in 2 Node
	NFS cluster
Message-ID: <4D5933D1.10009@noaa.gov>

Hello,

I am running a 2 node cluster being used as a NAS head for a Lefthand 
Networks iSCSI SAN to provide NFS mounts out to my network.  Things have 
been OK for a while, but I recently lost one of the nodes as a result of 
a patching problem.  In an effort to recreate the failed node, I imaged 
the working node and installed that image on the failed node.  I set 
it's hostname and IP settings correctly and the machine booted and 
joined the cluster just fine.  Or at least it appeared so.  Things ran 
OK for the last few weeks, but I recently started seeing a behavior 
where the nodes start fencing each other.  I'm wondering if there is 
something as a result of cloning the nodes that could be the problem.  
Possibly something that should be different but isn't because of the 
cloning?

I am running CentOS 5.5 with the following package versions:

Kernel - 2.6.18-194.11.3.el5 #1 SMP
cman-2.0.115-34.el5_5.4
lvm2-cluster-2.02.56-7.el5_5.4
gfs2-utils-0.1.62-20.el5
kmod-gfs-0.1.34-12.el5.centos
rgmanager-2.0.52-6.el5.centos.8

I have a Qlogic qla4062 HBA in the node running: QLogic iSCSI HBA Driver 
(f8b83000) v5.01.03.04

I will gladly provide more information as needed.

Thank you,
Randy

-------------- next part --------------
A non-text attachment was scrubbed...
Name: randy_brown.vcf
Type: text/x-vcard
Size: 313 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110214/cec4c89d/attachment.vcf>

From rhurst at bidmc.harvard.edu  Mon Feb 14 13:54:05 2011
From: rhurst at bidmc.harvard.edu (rhurst at bidmc.harvard.edu)
Date: Mon, 14 Feb 2011 08:54:05 -0500
Subject: [Linux-cluster] want to use GFS2
In-Reply-To: <674877142.9531.1297690641329.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
References: <AANLkTi=dX8E19iQ4T1YZNAhsec-sUpXCcw5AOf-Yp+Av@mail.gmail.com>
	<674877142.9531.1297690641329.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
Message-ID: <50168EC934B8D64AA8D8DD37F840F3DE05670D38F2@EVS2CCR.its.caregroup.org>

Just two cents, I believe Red Hat does support GFS2 on single server using lock_nolock, because we do SAN "snaps" of actively clustered GFS2 volumes (simple flatfiles, no databases) and present the snap luns to a media agent server to backup the data oob.  RHN was okay with that configuration and we have been running it this way on GFS and GFS2 for 5-years without issue.


-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Bob Peterson
Sent: Monday, February 14, 2011 8:37 AM
To: linux clustering
Subject: Re: [Linux-cluster] want to use GFS2

----- Original Message -----
| Hi,
| 
| You need to setup a simple cluster and a proper fencing mechanism . No 
| need to configure any services since you want to use GFS2 on both the 
| nodes.
| Start the cluster, mount the gfs2 by /etc/fstab entry.
| 
| Note: You can't use GFS2 without a Cluster setup.
| 
| Dominic

Hi,

Well, technically you can use GFS2 without a cluster setup.
I believe Red Hat doesn't support it, and the storage can't be mounted by more than a single computer (with "lock_nolock"
locking protocol), but it can be done.

Regards,

Bob Peterson
Red Hat File Systems

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster



From linux at alteeve.com  Mon Feb 14 14:03:36 2011
From: linux at alteeve.com (Digimer)
Date: Mon, 14 Feb 2011 09:03:36 -0500
Subject: [Linux-cluster] Problem with machines fencing one another in 2
 Node	NFS cluster
In-Reply-To: <4D5933D1.10009@noaa.gov>
References: <4D5933D1.10009@noaa.gov>
Message-ID: <4D593638.30802@alteeve.com>

On 02/14/2011 08:53 AM, Randy Brown wrote:
> Hello,
> 
> I am running a 2 node cluster being used as a NAS head for a Lefthand
> Networks iSCSI SAN to provide NFS mounts out to my network.  Things have
> been OK for a while, but I recently lost one of the nodes as a result of
> a patching problem.  In an effort to recreate the failed node, I imaged
> the working node and installed that image on the failed node.  I set
> it's hostname and IP settings correctly and the machine booted and
> joined the cluster just fine.  Or at least it appeared so.  Things ran
> OK for the last few weeks, but I recently started seeing a behavior
> where the nodes start fencing each other.  I'm wondering if there is
> something as a result of cloning the nodes that could be the problem. 
> Possibly something that should be different but isn't because of the
> cloning?
> 
> I am running CentOS 5.5 with the following package versions:
> 
> Kernel - 2.6.18-194.11.3.el5 #1 SMP
> cman-2.0.115-34.el5_5.4
> lvm2-cluster-2.02.56-7.el5_5.4
> gfs2-utils-0.1.62-20.el5
> kmod-gfs-0.1.34-12.el5.centos
> rgmanager-2.0.52-6.el5.centos.8
> 
> I have a Qlogic qla4062 HBA in the node running: QLogic iSCSI HBA Driver
> (f8b83000) v5.01.03.04
> 
> I will gladly provide more information as needed.
> 
> Thank you,
> Randy

Silly question, but are the NICs mapped to their MAC addresses? If so,
did you update the MAC addresses after cloning the server to reflect the
actual MAC addresses? Assuming so, do you have managed switches? If so,
can you test by swapping out a simple, unmanaged switch?

This sounds like a multicast issue at some level. Fencing happens once
the totem ring is declared failed. Do you see anything interesting in
the log files prior to the fence? Can you run tcpdump to see what is
happening on the interface(s) prior to the fence?

-- 
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin:  http://nodeassassin.org



From shariq.siddiqui at yahoo.com  Mon Feb 14 18:45:26 2011
From: shariq.siddiqui at yahoo.com (Shariq Siddiqui)
Date: Mon, 14 Feb 2011 10:45:26 -0800 (PST)
Subject: [Linux-cluster] want to use GFS2
In-Reply-To: <50168EC934B8D64AA8D8DD37F840F3DE05670D38F2@EVS2CCR.its.caregroup.org>
References: <AANLkTi=dX8E19iQ4T1YZNAhsec-sUpXCcw5AOf-Yp+Av@mail.gmail.com>
	<674877142.9531.1297690641329.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
	<50168EC934B8D64AA8D8DD37F840F3DE05670D38F2@EVS2CCR.its.caregroup.org>
Message-ID: <71316.4030.qm@web39807.mail.mud.yahoo.com>

Thanks All for your reply,

I treid it with lock_nolock and its working fine with me, But only in one server 
at a time
But I want to take a benafit of GFS2 to use this storage with two servers as a 
central storage.
So?each one can write easily on it.
My point is this can we use it without making cluster UP?

or is there any other filesystem through which I can fullfill my requirment??
?
Best Regards,

Shariq Siddiqui
?
?




________________________________
From: "rhurst at bidmc.harvard.edu" <rhurst at bidmc.harvard.edu>
To: linux-cluster at redhat.com
Sent: Mon, February 14, 2011 4:54:05 PM
Subject: Re: [Linux-cluster] want to use GFS2

Just two cents, I believe Red Hat does support GFS2 on single server using 
lock_nolock, because we do SAN "snaps" of actively clustered GFS2 volumes 
(simple flatfiles, no databases) and present the snap luns to a media agent 
server to backup the data oob.? RHN was okay with that configuration and we have 
been running it this way on GFS and GFS2 for 5-years without issue.


-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] 
On Behalf Of Bob Peterson
Sent: Monday, February 14, 2011 8:37 AM
To: linux clustering
Subject: Re: [Linux-cluster] want to use GFS2

----- Original Message -----
| Hi,
| 
| You need to setup a simple cluster and a proper fencing mechanism . No 
| need to configure any services since you want to use GFS2 on both the 
| nodes.
| Start the cluster, mount the gfs2 by /etc/fstab entry.
| 
| Note: You can't use GFS2 without a Cluster setup.
| 
| Dominic

Hi,

Well, technically you can use GFS2 without a cluster setup.
I believe Red Hat doesn't support it, and the storage can't be mounted by more 
than a single computer (with "lock_nolock"
locking protocol), but it can be done.

Regards,

Bob Peterson
Red Hat File Systems

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster



      
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110214/b921c469/attachment.htm>

From rpeterso at redhat.com  Mon Feb 14 19:01:10 2011
From: rpeterso at redhat.com (Bob Peterson)
Date: Mon, 14 Feb 2011 14:01:10 -0500 (EST)
Subject: [Linux-cluster] want to use GFS2
In-Reply-To: <71316.4030.qm@web39807.mail.mud.yahoo.com>
Message-ID: <1886322021.17117.1297710070432.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>

----- Original Message -----
| Thanks All for your reply,
| 
| I treid it with lock_nolock and its working fine with me, But only in
| one server
| at a time
| But I want to take a benafit of GFS2 to use this storage with two
| servers as a
| central storage.
| So each one can write easily on it.
| My point is this can we use it without making cluster UP?
| 
| or is there any other filesystem through which I can fullfill my
| requirment?
| 
| Best Regards,
| 
| Shariq Siddiqui

Hi Shariq,

If you want to share the storage with gfs2, you need to set up
a simple cluster.

Regards,

Bob Peterson
Red Hat File Systems



From vincent.blondel at ing.be  Mon Feb 14 21:02:40 2011
From: vincent.blondel at ing.be (vincent.blondel at ing.be)
Date: Mon, 14 Feb 2011 22:02:40 +0100
Subject: [Linux-cluster] Two nodes DRBD - Fail-Over Actif/Passif Cluster.
Message-ID: <294881FE3F4013418806F0CE6E73A7B6052F302466@ing.com>


Hello all,

I just installed last week two servers, each of them with Redhat Linux Enterprise 6.0 on it for hosting in a near future Blue Coat Reporter. Installation is ok but now I am trying to configure these both servers in cluster. 

First of all, I never configured any cluster with Linux ...

Servers are both HP DL380R06 with disk cabinet directly attached on it. (twice exactly same hardware specs).

What I would like to get is simply getting an Actif/Passif clustering mode with bidirectional disk space synchronization. This means, both servers are running. Only, the first one is running Reporter. During this time, disk spaces are continuously synchronized. When first one is down, second one becomes actif and when first one is running again, it synchronizes the disks and becomes primary again.

server 1 is reporter1.lab.intranet with ip 10.30.30.90
server 2 is reporter2.lab.intranet with ip 10.30.30.91

the load balanced ip should be 10.30.30.92 ..

After some days of research on the net, I came to the conclusion that I could be happy with a solution including, DRBD/GFS2 with Redhat Cluster Suite.

I am first trying to get a complete picture running on two vmware fusion (Linux Redhat Enterprise Linux 6) on my macosx before configuring my real servers.

So, after some hours of research on the net, I found some articles and links that seem to describe what I wanna get ...

http://gcharriere.com/blog/?p=73
http://www.linuxtopia.org/online_books/rhel6/rhel_6_cluster_admin/rhel_6_cluster_ch-config-cli-CA.html
http://www.drbd.org/users-guide/users-guide.html

and the DRBD packages for RHEL6 that I did not find anywhere ..

http://elrepo.org/linux/elrepo/el6/i386/RPMS/

I just only configured till now the first part, meaning cluster services but the first issue occur ..

below the cluster.conf file ...


<?xml version="1.0"?>
<cluster name="cluster" config_version="6">
  <!-- post_join_delay: number of seconds the daemon will wait before
                        fencing any victims after a node joins the domain
       post_fail_delay: number of seconds the daemon will wait before
            	        fencing any victims after a domain member fails
       clean_start    : prevent any startup fencing the daemon might do.
		        It indicates that the daemon should assume all nodes
		        are in a clean state to start. -->
  <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
  <clusternodes>
    <clusternode name="reporter1.lab.intranet" votes="1" nodeid="1">
      <fence>
        <!-- Handle fencing manually -->
        <method name="human">
          <device name="human" nodename="reporter1.lab.intranet"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="reporter2.lab.intranet" votes="1" nodeid="2"> 
      <fence>
        <!-- Handle fencing manually -->
        <method name="human">
          <device name="human" nodename="reporter2.lab.intranet"/>
        </method>
      </fence>
    </clusternode>
  </clusternodes>
  <!-- cman two nodes specification -->
  <cman expected_votes="1" two_node="1"/>
  <fencedevices>
    <!-- Define manual fencing -->
    <fencedevice name="human" agent="fence_manual"/>
  </fencedevices>
  <rm>
     <failoverdomains>
        <failoverdomain name="example_pri" nofailback="0" ordered="1" restricted="0">
           <failoverdomainnode name="reporter1.lab.intranet" priority="1"/>
           <failoverdomainnode name="reporter2.lab.intranet" priority="2"/>
        </failoverdomain>
     </failoverdomains>
     <resources>
           <ip address="10.30.30.92" monitor_link="on" sleeptime="10"/>
           <apache config_file="conf/httpd.conf" name="example_server" server_root="/etc/httpd" shutdown_wait="0"/>
      </resources>
      <service autostart="1" domain="example_pri" exclusive="0" name="example_apache" recovery="relocate">
                <ip ref="10.30.30.92"/>
                <apache ref="example_server"/>
      </service>
  </rm>
</cluster>

and this is the result I get on both servers ...

[root at reporter1 ~]# clustat
Cluster Status for cluster @ Mon Feb 14 22:22:53 2011
Member Status: Quorate

 Member Name                                      ID   Status
 ------ ----                                      ---- ------
 reporter1.lab.intranet                               1 Online, Local, rgmanager
 reporter2.lab.intranet                               2 Online, rgmanager

 Service Name                            Owner (Last)                            State         
 ------- ----                            ----- ------                            -----         
 service:example_apache                  (none)                                  stopped       

as you can see, everything is stopped or in other words nothing runs .. so my question are :

did I forget something in my conf file ?
did I make something wrong in my conf file ?
do I have to configure manually load balanced ip 10.30.30.92 as an alias ip on both sides or is it done automatically by redhat cluster ?
I just made a simple try with apache but I do not find anywhere reference to the start/stop script for apache in the examples, is that normal ??
do you have some best practice regarding this picture ??

many thks to help me because I certainly have a bad understanding on some points.

Regards
Vincent
-----------------------------------------------------------------
ATTENTION:
This e-mail is intended for the exclusive use of the
recipient(s). This e-mail and its attachments, if any, contain
confidential information and/or information protected by
intellectual property rights or other rights. This e-mail does
not constitute any commitment for ING Belgium except when
expressly otherwise agreed in a written agreement between the
intended recipient and ING Belgium.

If you receive this message by mistake, please, notify the sender
with the "reply" option and delete immediately this e-mail from
your system, and destroy all copies of it. You may not, directly
or indirectly, use this e-mail or any part of it if you are not
the intended recipient.

Messages and attachments are scanned for all viruses known. If
this message contains password-protected attachments, the files
have NOT been scanned for viruses by the ING mail domain. Always
scan attachments before opening them.
-----------------------------------------------------------------
ING Belgium SA/NV - Bank/Lender - Avenue Marnix 24, B-1000
Brussels, Belgium - Brussels RPM/RPR - VAT BE 0403.200.393 -
BIC (SWIFT) : BBRUBEBB - Account: 310-9156027-89 (IBAN BE45 3109
1560 2789).
An insurance broker, registered with the Banking, Finance and
Insurance Commission under the code number 12381A.

ING Belgique SA - Banque/Preteur, Avenue Marnix 24, B-1000
Bruxelles - RPM Bruxelles - TVA BE 0403 200 393 - BIC (SWIFT) :
BBRUBEBB - Compte: 310-9156027-89 (IBAN: BE45 3109 1560 2789).
Courtier d'assurances inscrit a la CBFA sous le numero 12381A.

ING Belgie NV - Bank/Kredietgever - Marnixlaan 24, B-1000 Brussel
- RPR Brussel - BTW BE 0403.200.393 - BIC (SWIFT) : BBRUBEBB -
Rekening: 310-9156027-89 (IBAN: BE45 3109 1560 2789).
Verzekeringsmakelaar ingeschreven bij de CBFA onder het nr.
12381A.
-----------------------------------------------------------------




From niks at logik-internet.rs  Mon Feb 14 23:39:21 2011
From: niks at logik-internet.rs (Nikola Savic)
Date: Tue, 15 Feb 2011 00:39:21 +0100
Subject: [Linux-cluster] Cluster with shared storage on low budget
Message-ID: <4D59BD29.30906@logik-internet.rs>


  Hello,

  I need to setup cluster of 3 servers without separate storage device
(SAN). Servers should join their local hard drives to create shared
storage space. Every server in cluster has public (100Mbps) and private
(1Gbps) NIC. Private 1Gbit network will be used for exchange of data
(files) on shared storage. Additional request is that data on shared
storage is highly redundant (complete mirroring is required).

  I was wondering if following setup is possible and if anyone has any
experience or comments on it:
- Servers will export local disks of same size as iSCSI targets
- Each server will access other's two servers disk over iSCSI initiator
- CLVM will be used to set Volume Group on all 3 disks. In theory this
VG will work on all servers, because they'll have access to all disks
(either directly or over iSCSI).
- CLVM will be used to create Logical Volume with mirroring option set
to 3 (-m 3). Since there are 3 disks (physical devices) forming VG, each
server will have redundant copy of same data.
- Created Logical Volume will have GFS2 on it, so that it can be shared
by cluster.
- Web server will store web application files (scripts and photos) on
created GFS.

  If it works, this setup should provide shared storage for cluster,
built from already available local hard drives in servers forming
cluster. By using LVM mirroring, each server will have the same copy of
data, which should make cluster resistant to failure of any server.

  I was wondering, is LVM smart enough to optimize reading and use local
drive for read operations?

  Looking forward to your comments.

  Best Regards,
  Nikola
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110215/a7cf9ef2/attachment.htm>

From work at fajar.net  Mon Feb 14 23:45:21 2011
From: work at fajar.net (Fajar A. Nugraha)
Date: Tue, 15 Feb 2011 06:45:21 +0700
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D59BD29.30906@logik-internet.rs>
References: <4D59BD29.30906@logik-internet.rs>
Message-ID: <AANLkTintdyMD-yHcWoYLcrz9i8mZNMJDzdbN0kx7+NR-@mail.gmail.com>

On Tue, Feb 15, 2011 at 6:39 AM, Nikola Savic <niks at logik-internet.rs> wrote:
>
> ? Hello,
>
> ? I need to setup cluster of 3 servers without separate storage device
> (SAN). Servers should join their local hard drives to create shared storage
> space. Every server in cluster has public (100Mbps) and private (1Gbps) NIC.
> Private 1Gbit network will be used for exchange of data (files) on shared
> storage. Additional request is that data on shared storage is highly
> redundant (complete mirroring is required).
>
> ? I was wondering if following setup is possible and if anyone has any
> experience or comments on it:
> - Servers will export local disks of same size as iSCSI targets
> - Each server will access other's two servers disk over iSCSI initiator
> - CLVM will be used to set Volume Group on all 3 disks. In theory this VG
> will work on all servers, because they'll have access to all disks (either
> directly or over iSCSI).
> - CLVM will be used to create Logical Volume with mirroring option set to 3
> (-m 3). Since there are 3 disks (physical devices) forming VG, each server
> will have redundant copy of same data.
> - Created Logical Volume will have GFS2 on it, so that it can be shared by
> cluster.
> - Web server will store web application files (scripts and photos) on
> created GFS.
>
> ? If it works, this setup should provide shared storage for cluster, built
> from already available local hard drives in servers forming cluster. By
> using LVM mirroring, each server will have the same copy of data, which
> should make cluster resistant to failure of any server.
>
> ? I was wondering, is LVM smart enough to optimize reading and use local
> drive for read operations?

Can LVM mirror handle one server outage? Can it automatically pick the
difference when it's back on?
Looks like you'd better stick with drbd.

-- 
Fajar



From linux at alteeve.com  Mon Feb 14 23:56:42 2011
From: linux at alteeve.com (Digimer)
Date: Mon, 14 Feb 2011 18:56:42 -0500
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D59BD29.30906@logik-internet.rs>
References: <4D59BD29.30906@logik-internet.rs>
Message-ID: <4D59C13A.9030807@alteeve.com>

On 02/14/2011 06:39 PM, Nikola Savic wrote:
> 
>   Hello,
> 
>   I need to setup cluster of 3 servers without separate storage device
> (SAN). Servers should join their local hard drives to create shared
> storage space. Every server in cluster has public (100Mbps) and private
> (1Gbps) NIC. Private 1Gbit network will be used for exchange of data
> (files) on shared storage. Additional request is that data on shared
> storage is highly redundant (complete mirroring is required).
> 
>   I was wondering if following setup is possible and if anyone has any
> experience or comments on it:
> - Servers will export local disks of same size as iSCSI targets
> - Each server will access other's two servers disk over iSCSI initiator
> - CLVM will be used to set Volume Group on all 3 disks. In theory this
> VG will work on all servers, because they'll have access to all disks
> (either directly or over iSCSI).
> - CLVM will be used to create Logical Volume with mirroring option set
> to 3 (-m 3). Since there are 3 disks (physical devices) forming VG, each
> server will have redundant copy of same data.
> - Created Logical Volume will have GFS2 on it, so that it can be shared
> by cluster.
> - Web server will store web application files (scripts and photos) on
> created GFS.
> 
>   If it works, this setup should provide shared storage for cluster,
> built from already available local hard drives in servers forming
> cluster. By using LVM mirroring, each server will have the same copy of
> data, which should make cluster resistant to failure of any server.
> 
>   I was wondering, is LVM smart enough to optimize reading and use local
> drive for read operations?
> 
>   Looking forward to your comments.
> 
>   Best Regards,
>   Nikola

I'd recommend looking at created a three-way DRBD resource. Use this
resource as your cLVM PV/VG/LVs. On these LVs you can use GFS2 for the
actual shared file system where your data can reside.

A few notes:

- You *MUST* have fencing setup. This is not an option, and manual
fencing will not suffice. If your server have IPMI (or vendor
equivalents like DRAC, iLO, etc) then you are fine. If not, you will
need an external device like an addressable PDU (see APC or Tripplite).

- LVM optimization is not something I can comment on.

- We've not discussed complexity. Clustering is not inherently hard, but
there is a lot to know and a lot can go wrong. Give yourself ample time
to work through problems and test failure scenarios. Do not expect to
launch in a month. It will likely takes a few months at minimum to be
ready for production.

- Join the #linux-cluster IRC channel on freenode.net. There are good
people there who can help you out as you learn.

- Be patient and have fun. :)

-- 
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin:  http://nodeassassin.org



From linux at alteeve.com  Mon Feb 14 23:57:38 2011
From: linux at alteeve.com (Digimer)
Date: Mon, 14 Feb 2011 18:57:38 -0500
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <AANLkTintdyMD-yHcWoYLcrz9i8mZNMJDzdbN0kx7+NR-@mail.gmail.com>
References: <4D59BD29.30906@logik-internet.rs>
	<AANLkTintdyMD-yHcWoYLcrz9i8mZNMJDzdbN0kx7+NR-@mail.gmail.com>
Message-ID: <4D59C172.8070703@alteeve.com>

On 02/14/2011 06:45 PM, Fajar A. Nugraha wrote:
> On Tue, Feb 15, 2011 at 6:39 AM, Nikola Savic <niks at logik-internet.rs> wrote:
>>
>>   Hello,
>>
>>   I need to setup cluster of 3 servers without separate storage device
>> (SAN). Servers should join their local hard drives to create shared storage
>> space. Every server in cluster has public (100Mbps) and private (1Gbps) NIC.
>> Private 1Gbit network will be used for exchange of data (files) on shared
>> storage. Additional request is that data on shared storage is highly
>> redundant (complete mirroring is required).
>>
>>   I was wondering if following setup is possible and if anyone has any
>> experience or comments on it:
>> - Servers will export local disks of same size as iSCSI targets
>> - Each server will access other's two servers disk over iSCSI initiator
>> - CLVM will be used to set Volume Group on all 3 disks. In theory this VG
>> will work on all servers, because they'll have access to all disks (either
>> directly or over iSCSI).
>> - CLVM will be used to create Logical Volume with mirroring option set to 3
>> (-m 3). Since there are 3 disks (physical devices) forming VG, each server
>> will have redundant copy of same data.
>> - Created Logical Volume will have GFS2 on it, so that it can be shared by
>> cluster.
>> - Web server will store web application files (scripts and photos) on
>> created GFS.
>>
>>   If it works, this setup should provide shared storage for cluster, built
>> from already available local hard drives in servers forming cluster. By
>> using LVM mirroring, each server will have the same copy of data, which
>> should make cluster resistant to failure of any server.
>>
>>   I was wondering, is LVM smart enough to optimize reading and use local
>> drive for read operations?
> 
> Can LVM mirror handle one server outage? Can it automatically pick the
> difference when it's back on?
> Looks like you'd better stick with drbd.

You can use DRBD as the PV for clustered LVM. :)

-- 
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin:  http://nodeassassin.org



From niks at logik-internet.rs  Tue Feb 15 01:49:01 2011
From: niks at logik-internet.rs (Nikola Savic)
Date: Tue, 15 Feb 2011 02:49:01 +0100
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D59C13A.9030807@alteeve.com>
References: <4D59BD29.30906@logik-internet.rs> <4D59C13A.9030807@alteeve.com>
Message-ID: <4D59DB8D.5080606@logik-internet.rs>

Digimer wrote:
> On 02/14/2011 06:39 PM, Nikola Savic wrote:
>   
>>   Hello,
>>
>>   I need to setup cluster of 3 servers without separate storage device
>> (SAN). Servers should join their local hard drives to create shared
>> storage space. Every server in cluster has public (100Mbps) and private
>> (1Gbps) NIC. Private 1Gbit network will be used for exchange of data
>> (files) on shared storage. Additional request is that data on shared
>> storage is highly redundant (complete mirroring is required).
>>
>>   I was wondering if following setup is possible and if anyone has any
>> experience or comments on it:
>> - Servers will export local disks of same size as iSCSI targets
>> - Each server will access other's two servers disk over iSCSI initiator
>> - CLVM will be used to set Volume Group on all 3 disks. In theory this
>> VG will work on all servers, because they'll have access to all disks
>> (either directly or over iSCSI).
>> - CLVM will be used to create Logical Volume with mirroring option set
>> to 3 (-m 3). Since there are 3 disks (physical devices) forming VG, each
>> server will have redundant copy of same data.
>> - Created Logical Volume will have GFS2 on it, so that it can be shared
>> by cluster.
>> - Web server will store web application files (scripts and photos) on
>> created GFS.
>>
>>   If it works, this setup should provide shared storage for cluster,
>> built from already available local hard drives in servers forming
>> cluster. By using LVM mirroring, each server will have the same copy of
>> data, which should make cluster resistant to failure of any server.
>>
>>   I was wondering, is LVM smart enough to optimize reading and use local
>> drive for read operations?
>>
>>   Looking forward to your comments.
>>
>>   Best Regards,
>>   Nikola
>>     
>
> I'd recommend looking at created a three-way DRBD resource. Use this
> resource as your cLVM PV/VG/LVs. On these LVs you can use GFS2 for the
> actual shared file system where your data can reside.
>   

  Thank you for prompt reply!

  Is there howto I can look into for this kind of setup? I assume that,
when DRBD is used, only one of three mirrored devices is available for
writing. That would require that one of servers exports writable DRBD
using iSCSI or GNBD, so other cluster servers could access it. Am I right?

  What is main reason you would suggest DRBD and not solution based on
LVM mirroring? Is it
- Performance
- Reliability
- Better failover

  Best Regards,
  Nikola
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110215/e64ebbf9/attachment.htm>

From linux at alteeve.com  Tue Feb 15 01:51:00 2011
From: linux at alteeve.com (Digimer)
Date: Mon, 14 Feb 2011 20:51:00 -0500
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D59DB8D.5080606@logik-internet.rs>
References: <4D59BD29.30906@logik-internet.rs> <4D59C13A.9030807@alteeve.com>
	<4D59DB8D.5080606@logik-internet.rs>
Message-ID: <4D59DC04.6060305@alteeve.com>

On 02/14/2011 08:49 PM, Nikola Savic wrote:
> Digimer wrote:
>> On 02/14/2011 06:39 PM, Nikola Savic wrote:
>>   
>>>   Hello,
>>>
>>>   I need to setup cluster of 3 servers without separate storage device
>>> (SAN). Servers should join their local hard drives to create shared
>>> storage space. Every server in cluster has public (100Mbps) and private
>>> (1Gbps) NIC. Private 1Gbit network will be used for exchange of data
>>> (files) on shared storage. Additional request is that data on shared
>>> storage is highly redundant (complete mirroring is required).
>>>
>>>   I was wondering if following setup is possible and if anyone has any
>>> experience or comments on it:
>>> - Servers will export local disks of same size as iSCSI targets
>>> - Each server will access other's two servers disk over iSCSI initiator
>>> - CLVM will be used to set Volume Group on all 3 disks. In theory this
>>> VG will work on all servers, because they'll have access to all disks
>>> (either directly or over iSCSI).
>>> - CLVM will be used to create Logical Volume with mirroring option set
>>> to 3 (-m 3). Since there are 3 disks (physical devices) forming VG, each
>>> server will have redundant copy of same data.
>>> - Created Logical Volume will have GFS2 on it, so that it can be shared
>>> by cluster.
>>> - Web server will store web application files (scripts and photos) on
>>> created GFS.
>>>
>>>   If it works, this setup should provide shared storage for cluster,
>>> built from already available local hard drives in servers forming
>>> cluster. By using LVM mirroring, each server will have the same copy of
>>> data, which should make cluster resistant to failure of any server.
>>>
>>>   I was wondering, is LVM smart enough to optimize reading and use local
>>> drive for read operations?
>>>
>>>   Looking forward to your comments.
>>>
>>>   Best Regards,
>>>   Nikola
>>>     
>>
>> I'd recommend looking at created a three-way DRBD resource. Use this
>> resource as your cLVM PV/VG/LVs. On these LVs you can use GFS2 for the
>> actual shared file system where your data can reside.
>>   
> 
>   Thank you for prompt reply!
> 
>   Is there howto I can look into for this kind of setup? I assume that,
> when DRBD is used, only one of three mirrored devices is available for
> writing. That would require that one of servers exports writable DRBD
> using iSCSI or GNBD, so other cluster servers could access it. Am I right?
> 
>   What is main reason you would suggest DRBD and not solution based on
> LVM mirroring? Is it
> - Performance
> - Reliability
> - Better failover
> 
>   Best Regards,
>   Nikola

I have an in-progress tutorial, which I would recommend as a guide only.
If you are interested, I will send you the link off-list.

As for your question; No, you can read/write to the shared storage at
the same time without the need for iSCSI. DRBD can run in
"Primary/Primary[/Primary]" mode. Then you layer onto this clustered LVM
followed by GFS2. Once up, all three nodes can access and edit the same
storage space at the same time.

So you're taking advantage of all three technologies. As for mirrored
LVM, I've not tried it yet as DRBD->cLVM->GFS2 has worked quite well for me.

-- 
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin:  http://nodeassassin.org



From linux at alteeve.com  Tue Feb 15 02:37:39 2011
From: linux at alteeve.com (Digimer)
Date: Mon, 14 Feb 2011 21:37:39 -0500
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D59E3A1.4070508@logik-internet.rs>
References: <4D59BD29.30906@logik-internet.rs> <4D59C13A.9030807@alteeve.com>
	<4D59DB8D.5080606@logik-internet.rs> <4D59DC04.6060305@alteeve.com>
	<4D59E3A1.4070508@logik-internet.rs>
Message-ID: <4D59E6F3.3050002@alteeve.com>

On 02/14/2011 09:23 PM, Nikola Savic wrote:
>> I have an in-progress tutorial, which I would recommend as a guide only.
>> If you are interested, I will send you the link off-list.
>>
>> As for your question; No, you can read/write to the shared storage at
>> the same time without the need for iSCSI. DRBD can run in
>> "Primary/Primary[/Primary]" mode. Then you layer onto this clustered LVM
>> followed by GFS2. Once up, all three nodes can access and edit the same
>> storage space at the same time.
>>
>> So you're taking advantage of all three technologies. As for mirrored
>> LVM, I've not tried it yet as DRBD->cLVM->GFS2 has worked quite well for me.
> 
>   I just read about Primary/Primary configuration in DRBD's User Guide,
> but would love to get link to tutorial you mentioned, especially if it
> covers fancing :) When one of servers is restarted and there is delay in
> data being written to DRBD, what happens when sever is back up? Is
> booting stopped by DRBD until synchronization is done, or does it try to
> do it in background? If it's done in background, how does
> Primary/Primary mode work?
> 
>   Thanks,
>   Nikola

Once the cluster manager (corosync in Cluster3, openais in Cluster2)
stops getting messages from a node (be it hung or dead), it starts a
counter. Once the counter exceeds a set threshold, the node is declared
dead and a fence is called against that node. This should, when working
properly, reliably prevent the node from trying to access the shared
storage (ie: stop it from trying to complete a write operation).

Once, and *only* if the fence was successful, the cluster will reform.
Once the cluster configuration is in place, recovery of the file system
can begin (ie: the journal can be replayed). Finally, normal operation
can continue, albeit with one less node. This is also where the resource
manager (rgmanager or pacemaker) start shuffling around any resources
that were lost when the node went down.

Traditionally, fencing involves rebooting the lost node, in the hopes
that it will come back in a healthier state. Assuming it does come up
healthy, a couple main steps must occur.

First, it will rejoin the other DRBD members. These members will have a
"dirty block" list in memory which will allow them to quickly bring the
recovered server back into sync. During this time, you can bring that
node online (ie: set it primary and start accessing it via GFS2).
However, note that it can not be the sole primary device until it is
fully sync'ed.

Second, the cluster reforms to restore the recovered node. Once the
member has successfully joined, the resource manager (again, rgmanager
or pacemaker) will begin reorganizing the clustered resources as per
your configuration.

An important note:

If the fence call fails (either because of a fault in the fence device
or due to misconfiguration), the cluster will hang and *all* access to
the shared storage will stop.

*This is by design!*

The reason is that, should the cluster falsely assume the node was dead,
begin recovering the journal and then the hung node recovered and tried
to complete the write, the shared filesystem would be corrupted. That
is; "It is better a hung cluster than a corrupt cluster."

This is why fencing is so critical. :)

-- 
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin:  http://nodeassassin.org



From niks at logik-internet.rs  Tue Feb 15 02:53:33 2011
From: niks at logik-internet.rs (Nikola Savic)
Date: Tue, 15 Feb 2011 03:53:33 +0100
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D59E6F3.3050002@alteeve.com>
References: <4D59BD29.30906@logik-internet.rs> <4D59C13A.9030807@alteeve.com>
	<4D59DB8D.5080606@logik-internet.rs> <4D59DC04.6060305@alteeve.com>
	<4D59E3A1.4070508@logik-internet.rs> <4D59E6F3.3050002@alteeve.com>
Message-ID: <4D59EAAD.8030301@logik-internet.rs>

Digimer wrote:
> First, it will rejoin the other DRBD members. These members will have a
> "dirty block" list in memory which will allow them to quickly bring the
> recovered server back into sync. During this time, you can bring that
> node online (ie: set it primary and start accessing it via GFS2).
> However, note that it can not be the sole primary device until it is
> fully sync'ed.
>   

  If I understand you well, even before sync is completely done DRBD
will take care of reading and writing of dirty blocks on problematic
node that got back online? Let's say that node was down for longer time
and that synchronization can take few minutes, maybe more. If all
services start working before sync is complete, it can happen that web
applications tries to write into or read from dirty block(s). Will DRBD
take care of that? If not, is there way to suspend startup of services
(web server and similar) until sync is done?

  Thanks for detailed replies!

  Regards,
  Nikola



From stefan at lsd.co.za  Tue Feb 15 06:14:58 2011
From: stefan at lsd.co.za (Stefan Lesicnik)
Date: Tue, 15 Feb 2011 08:14:58 +0200 (SAST)
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D59DC04.6060305@alteeve.com>
Message-ID: <1170617551.7574.1297750498041.JavaMail.root@zcs-jhb-lsd>



I have an in-progress tutorial, which I would recommend as a guide only.
If you are interested, I will send you the link off-list.

As for your question; No, you can read/write to the shared storage at
the same time without the need for iSCSI. DRBD can run in
"Primary/Primary[/Primary]" mode. Then you layer onto this clustered LVM
followed by GFS2. Once up, all three nodes can access and edit the same
storage space at the same time.

So you're taking advantage of all three technologies. As for mirrored
LVM, I've not tried it yet as DRBD->cLVM->GFS2 has worked quite well for me.

-- 
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin:  http://nodeassassin.org


Hi,

Sorry my mail client isn't indenting above reply (zimbra - shrug). I have used DRBD to mirror 2 SAN's. I tested the active / active with ocfs2 and must say the performance knock was really terrible. It may have been application specific (many little files), but i think the general consensus is use active / passive with some cluster failover if you can.

But please do test active / active and let us know if its better!  (I also know there is a new version of drbd that is meant to improve dual primary mode)

Stefan



--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster



From gordan at bobich.net  Tue Feb 15 09:57:18 2011
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 15 Feb 2011 09:57:18 +0000
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D59EAAD.8030301@logik-internet.rs>
References: <4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>
	<4D59DC04.6060305@alteeve.com>	<4D59E3A1.4070508@logik-internet.rs>
	<4D59E6F3.3050002@alteeve.com> <4D59EAAD.8030301@logik-internet.rs>
Message-ID: <4D5A4DFE.8050101@bobich.net>

Nikola Savic wrote:
> Digimer wrote:
>> First, it will rejoin the other DRBD members. These members will have a
>> "dirty block" list in memory which will allow them to quickly bring the
>> recovered server back into sync. During this time, you can bring that
>> node online (ie: set it primary and start accessing it via GFS2).
>> However, note that it can not be the sole primary device until it is
>> fully sync'ed.
>>   
> 
>   If I understand you well, even before sync is completely done DRBD
> will take care of reading and writing of dirty blocks on problematic
> node that got back online? Let's say that node was down for longer time
> and that synchronization can take few minutes, maybe more. If all
> services start working before sync is complete, it can happen that web
> applications tries to write into or read from dirty block(s). Will DRBD
> take care of that? If not, is there way to suspend startup of services
> (web server and similar) until sync is done?

DRBD and GFS will take care of that for you. DRBD directs reads to nodes 
that are up to date until everything is in sync.

Make sure that in drbd.conf you put in a stonith parameter pointing at 
your fencing agent with suitable parameters, and set the timeout to 
slightly less than what you have it set in cluster.conf. That will 
ensure that you are protected from the race condition where DRBD might 
drop out but the node starts heartbeating between then and when the 
fencing timeout occurs.

Oh, and if you are going to use DRBD there is no reason to use LVM.

Gordan



From work at fajar.net  Tue Feb 15 10:08:45 2011
From: work at fajar.net (Fajar A. Nugraha)
Date: Tue, 15 Feb 2011 17:08:45 +0700
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D5A4DFE.8050101@bobich.net>
References: <4D59BD29.30906@logik-internet.rs> <4D59C13A.9030807@alteeve.com>
	<4D59DB8D.5080606@logik-internet.rs> <4D59DC04.6060305@alteeve.com>
	<4D59E3A1.4070508@logik-internet.rs> <4D59E6F3.3050002@alteeve.com>
	<4D59EAAD.8030301@logik-internet.rs> <4D5A4DFE.8050101@bobich.net>
Message-ID: <AANLkTikj3iaK1HLhh7JXs-1ZuVhwscM5v1ZO7g8ZpUci@mail.gmail.com>

On Tue, Feb 15, 2011 at 4:57 PM, Gordan Bobic <gordan at bobich.net> wrote:
> Nikola Savic wrote:
>> ?If I understand you well, even before sync is completely done DRBD
>> will take care of reading and writing of dirty blocks on problematic
>> node that got back online? Let's say that node was down for longer time
>> and that synchronization can take few minutes, maybe more. If all
>> services start working before sync is complete, it can happen that web
>> applications tries to write into or read from dirty block(s). Will DRBD
>> take care of that? If not, is there way to suspend startup of services
>> (web server and similar) until sync is done?
>
> DRBD and GFS will take care of that for you. DRBD directs reads to nodes
> that are up to date until everything is in sync.

Really? Can you point to a documentation that said so?
IIRC the block device /dev/drbd* on a node will not be accessible for
read/write until it's synced.

>
> Make sure that in drbd.conf you put in a stonith parameter pointing at your
> fencing agent with suitable parameters, and set the timeout to slightly less
> than what you have it set in cluster.conf. That will ensure that you are
> protected from the race condition where DRBD might drop out but the node
> starts heartbeating between then and when the fencing timeout occurs.
>
> Oh, and if you are going to use DRBD there is no reason to use LVM.

There are two ways to use DRBD with LVM in a cluster:
(1) Use drbd on partition/disk, and use CLVM on top of that
(2) create local LVM, and use drbd on top of the LVs

Personally I prefer (2), since this setup allows LVM snapshots, and
faster to resync if I want to reinitialize a drbd device on one of the
nodes (like when a split brain occurred, which was often on my
fencingless-test-setup a while back).

-- 
Fajar



From gordan at bobich.net  Tue Feb 15 10:23:36 2011
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 15 Feb 2011 10:23:36 +0000
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <AANLkTikj3iaK1HLhh7JXs-1ZuVhwscM5v1ZO7g8ZpUci@mail.gmail.com>
References: <4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>
	<4D59DC04.6060305@alteeve.com>	<4D59E3A1.4070508@logik-internet.rs>
	<4D59E6F3.3050002@alteeve.com>	<4D59EAAD.8030301@logik-internet.rs>
	<4D5A4DFE.8050101@bobich.net>
	<AANLkTikj3iaK1HLhh7JXs-1ZuVhwscM5v1ZO7g8ZpUci@mail.gmail.com>
Message-ID: <4D5A5428.20400@bobich.net>

Fajar A. Nugraha wrote:
> On Tue, Feb 15, 2011 at 4:57 PM, Gordan Bobic <gordan at bobich.net> wrote:
>> Nikola Savic wrote:
>>>  If I understand you well, even before sync is completely done DRBD
>>> will take care of reading and writing of dirty blocks on problematic
>>> node that got back online? Let's say that node was down for longer time
>>> and that synchronization can take few minutes, maybe more. If all
>>> services start working before sync is complete, it can happen that web
>>> applications tries to write into or read from dirty block(s). Will DRBD
>>> take care of that? If not, is there way to suspend startup of services
>>> (web server and similar) until sync is done?
>> DRBD and GFS will take care of that for you. DRBD directs reads to nodes
>> that are up to date until everything is in sync.
> 
> Really? Can you point to a documentation that said so?
> IIRC the block device /dev/drbd* on a node will not be accessible for
> read/write until it's synced.

If you are running in primary/primary mode, the block device will most 
definitely be available in rw mode as soon as drbd has connected to the 
cluster and established where to get the most up to date copy from.

I haven't looked through the documentation recently so don't have a link 
handy but I have several clusters with this setup deployed, so I'm 
reasonably confident I know what I'm talking about. :)

>> Make sure that in drbd.conf you put in a stonith parameter pointing at your
>> fencing agent with suitable parameters, and set the timeout to slightly less
>> than what you have it set in cluster.conf. That will ensure that you are
>> protected from the race condition where DRBD might drop out but the node
>> starts heartbeating between then and when the fencing timeout occurs.
>>
>> Oh, and if you are going to use DRBD there is no reason to use LVM.
> 
> There are two ways to use DRBD with LVM in a cluster:
> (1) Use drbd on partition/disk, and use CLVM on top of that
> (2) create local LVM, and use drbd on top of the LVs
> 
> Personally I prefer (2), since this setup allows LVM snapshots, and
> faster to resync if I want to reinitialize a drbd device on one of the
> nodes (like when a split brain occurred, which was often on my
> fencingless-test-setup a while back).

I don't see what the purpose of (1) is. I can sort of see where you are 
coming from with snapshots in (2), but what you are describing doesn't 
sound like something you would ever want to use in production.

Just because you _can_ use LVM doesn't mean that you _should_ use it. 
Another bad thing about LVM if you are using it on top of RAID or an SSD 
is that its headers will throw the FS completely out of alignment if you 
don't pre-compensate for it.

Gordan



From niks at logik-internet.rs  Tue Feb 15 11:49:38 2011
From: niks at logik-internet.rs (Nikola Savic)
Date: Tue, 15 Feb 2011 12:49:38 +0100
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D5A4DFE.8050101@bobich.net>
References: <4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>	<4D59E3A1.4070508@logik-internet.rs>	<4D59E6F3.3050002@alteeve.com>
	<4D59EAAD.8030301@logik-internet.rs> <4D5A4DFE.8050101@bobich.net>
Message-ID: <4D5A6852.5000103@logik-internet.rs>

Gordan Bobic wrote:
> DRBD and GFS will take care of that for you. DRBD directs reads to
> nodes that are up to date until everything is in sync.
>
> Make sure that in drbd.conf you put in a stonith parameter pointing at
> your fencing agent with suitable parameters, and set the timeout to
> slightly less than what you have it set in cluster.conf. That will
> ensure that you are protected from the race condition where DRBD might
> drop out but the node starts heartbeating between then and when the
> fencing timeout occurs.
>
> Oh, and if you are going to use DRBD there is no reason to use LVM.

This is interesting approach. I understand that DRBD with GFS2 doesn't
require LVM between, but it does bring some inflexibility:

    * For each logical volume, one has to setup separate DRBD
    * Cluster wide logical volume resizing not easy
    * No snapshot - this is very important to me for MySQL backups.

What is main reason for you not to use LVM on top of DRBD? Is it just
that you didn't require benefits it brings? Or, it makes more problems
by your opinion?

Best Regards,
Nikola
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110215/6fb020ad/attachment.htm>

From gordan at bobich.net  Tue Feb 15 12:04:44 2011
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 15 Feb 2011 12:04:44 +0000
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D5A6852.5000103@logik-internet.rs>
References: <4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>	<4D59E3A1.4070508@logik-internet.rs>	<4D59E6F3.3050002@alteeve.com>	<4D59EAAD.8030301@logik-internet.rs>
	<4D5A4DFE.8050101@bobich.net> <4D5A6852.5000103@logik-internet.rs>
Message-ID: <4D5A6BDC.8080708@bobich.net>

Nikola Savic wrote:
> Gordan Bobic wrote:
>> DRBD and GFS will take care of that for you. DRBD directs reads to 
>> nodes that are up to date until everything is in sync.
>>
>> Make sure that in drbd.conf you put in a stonith parameter pointing at 
>> your fencing agent with suitable parameters, and set the timeout to 
>> slightly less than what you have it set in cluster.conf. That will 
>> ensure that you are protected from the race condition where DRBD might 
>> drop out but the node starts heartbeating between then and when the 
>> fencing timeout occurs.
>>
>> Oh, and if you are going to use DRBD there is no reason to use LVM.
> 
>   This is interesting approach. I understand that DRBD with GFS2 doesn't 
> require LVM between, but it does bring some inflexibility:
> 
>     * For each logical volume, one has to setup separate DRBD

Can you elaborate what you are referring to? Partitions? There is 
technically nothing stopping you from partitioning a DRBD device. Also 
depending on what you are doing, you may find that having one DRBD 
device per disk is preferable in terms of performance and reliability to 
having a mirrored pool (effectively RAID01). Pool of mirrors (RAID10) is 
more resilient.

>     * Cluster wide logical volume resizing not easy

Are you really going to spoon-feed the space expansions that much, thus 
causing unnecessary fragmentation? If you size your storage sensibly, 
you won't need to upgrade it for a few years, and when the time comes to 
upgrade it you may well need to replace the servers while you're at it. 
Volume resizing is, IMO, over-rated and unnecessary in most cases, 
except where data growth is quite mind-boggling (in which case you won't 
be using MySQL anyway).

>     * No snapshot - this is very important to me for MySQL backups.

Last I checked CLVM couldn't do snapshots, but that may have changed 
recently. Snapshots also aren't even remotely ideal for MySQL backups. 
You really need a replicated server to take reliable backups from.

>   What is main reason for you not to use LVM on top of DRBD? Is it just 
> that you didn't require benefits it brings? Or, it makes more problems 
> by your opinion?

Traditionally, CLVM didn't provide any tangible benefits (no snapshots), 
and I never found myself in a situation where dynamically growing a 
volume with randomly assembled storage was required. If you are JBOD-ing 
a bunch of cheap SATA disks, you might as well size the storage 
correctly to begin with and not have to bother with LVM. I'm assuming 
this is what you are doing since you are doing it on the cheap 
(SAN-less). If you are using a SAN, the SAN will provide functionality 
to grow the exported block device and you can just grow the fs onto 
that, without needing LVM.

So apart from snapshots (non-clustered) or a setup like what was 
suggested earlier, to have DRBD on top of local LVM to gain 
local-consistency snapshot capability in a cluster (not sure I'd trust 
that with my data, but it may be good for non-production environments), 
I don't really see the advantage. Snapshots also only give you 
crash-level consistency, which I never felt was good enough for 
applications like databases. A replicated slave that you can shut down 
is generally a more reliable solution for backups.

Gordan



From thomas at sjolshagen.net  Tue Feb 15 12:12:25 2011
From: thomas at sjolshagen.net (Thomas Sjolshagen)
Date: Tue, 15 Feb 2011 07:12:25 -0500
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D5A6852.5000103@logik-internet.rs>
References: "\"<4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>"
	<4D59E3A1.4070508@logik-internet.rs>" <4D59E6F3.3050002@alteeve.com>
	<4D59EAAD.8030301@logik-internet.rs> <4D5A4DFE.8050101@bobich.net>
	<4D5A6852.5000103@logik-internet.rs>
Message-ID: <6b22c9afe6618e0ab302dae12c4b74aa@sjolshagen.net>

  

On Tue, 15 Feb 2011 12:49:38 +0100, Nikola Savic wrote: 

> This is
interesting approach. I understand that DRBD with GFS2 doesn't require
LVM between, but it does bring some inflexibility:
> 
> * For each
logical volume, one has to setup separate DRBD
> * Cluster wide logical
volume resizing not easy
> * No snapshot - this is very important to me
for MySQL backups.
> 
> What is main reason for you not to use LVM on
top of DRBD? Is it just that you didn't require benefits it brings? Or,
it makes more problems by your opinion?

Just so you realize; If you
intend to use clvm (i.e. lvme in a cluster where you expect to be able
to write to the volume from more than one node at/around the same time
w/o a full-on failover), you will _not_ have snapshot support. And no,
this isn't "not supported" as in "nobody to call if you encounter a
problem", it's "not supported" as in "the tools will not let you create
a snapshot of the LV". 

However, you ought to be able to configure one
of the DRBD mirror members as part of a split/mount read-only/merge
based equivalent and thus get a similar result, I think. 

// Thomas 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110215/34060d72/attachment.htm>

From niks at logik-internet.rs  Tue Feb 15 12:19:41 2011
From: niks at logik-internet.rs (Nikola Savic)
Date: Tue, 15 Feb 2011 13:19:41 +0100
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D59E6F3.3050002@alteeve.com>
References: <4D59BD29.30906@logik-internet.rs> <4D59C13A.9030807@alteeve.com>
	<4D59DB8D.5080606@logik-internet.rs> <4D59DC04.6060305@alteeve.com>
	<4D59E3A1.4070508@logik-internet.rs> <4D59E6F3.3050002@alteeve.com>
Message-ID: <4D5A6F5D.1050301@logik-internet.rs>

Digimer wrote:
> Once, and *only* if the fence was successful, the cluster will reform.
> Once the cluster configuration is in place, recovery of the file system
> can begin (ie: the journal can be replayed). Finally, normal operation
> can continue, albeit with one less node. This is also where the resource
> manager (rgmanager or pacemaker) start shuffling around any resources
> that were lost when the node went down.
>   

  From guide you sent me, I understood that fencing to work well servers
should have IPMI available on motherboards.

  My client is going to purchase servers at Hetzner from their EQ-Line.
I asked their support if IPMI is available. Since my other client
already has server with 'em, I tried to install ipmi related packages
(like you specified in guide). IPMI service doesn't start, so I assume
it's not available or not turned on in BIOS.

  How would cluster work if no IPMI or similar technology is available
for fencing? In case one of nodes dies and no fencing is available,
cluster will hang until administrator does manual fancing?

  Best Regards,
  Nikola



From gordan at bobich.net  Tue Feb 15 12:20:04 2011
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 15 Feb 2011 12:20:04 +0000
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <6b22c9afe6618e0ab302dae12c4b74aa@sjolshagen.net>
References: "\"<4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>"	<4D59E3A1.4070508@logik-internet.rs>"
	<4D59E6F3.3050002@alteeve.com>	<4D59EAAD.8030301@logik-internet.rs>
	<4D5A4DFE.8050101@bobich.net>	<4D5A6852.5000103@logik-internet.rs>
	<6b22c9afe6618e0ab302dae12c4b74aa@sjolshagen.net>
Message-ID: <4D5A6F74.5030500@bobich.net>

Thomas Sjolshagen wrote:
> On Tue, 15 Feb 2011 12:49:38 +0100, Nikola Savic wrote:
> 
>>    This is interesting approach. I understand that DRBD with GFS2 
>> doesn't require LVM between, but it does bring some inflexibility:
>>
>>     * For each logical volume, one has to setup separate DRBD
>>     * Cluster wide logical volume resizing not easy
>>     * No snapshot - this is very important to me for MySQL backups.
>>
>>   What is main reason for you not to use LVM on top of DRBD? Is it 
>> just that you didn't require benefits it brings? Or, it makes more 
>> problems by your opinion?
> 
> Just so you realize; If you intend to use clvm (i.e. lvme in a cluster 
> where you expect to be able to write to the volume from more than one 
> node at/around the same time w/o a full-on failover), you will _not_ 
> have snapshot support. And no, this isn't "not supported" as in "nobody 
> to call if you encounter a problem", it's "not supported" as in "the 
> tools will not let you create a snapshot of the LV".
> 
> However, you ought to be able to configure one of the DRBD mirror 
> members as part of a split/mount read-only/merge based equivalent and 
> thus get a similar result, I think.

Indeed, that is right - you can drop a server out of the cluster, stop 
drbd replication and mount it read-only (lock_nolock) and use that as a 
"snapshot". The added benefit is that it won't cause massive cluster 
slow-down through lock-bouncing during the backup.

Gordan



From gordan at bobich.net  Tue Feb 15 12:30:06 2011
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 15 Feb 2011 12:30:06 +0000
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D5A6F5D.1050301@logik-internet.rs>
References: <4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>
	<4D59DC04.6060305@alteeve.com>	<4D59E3A1.4070508@logik-internet.rs>
	<4D59E6F3.3050002@alteeve.com> <4D5A6F5D.1050301@logik-internet.rs>
Message-ID: <4D5A71CE.4030007@bobich.net>

Nikola Savic wrote:
> Digimer wrote:
>> Once, and *only* if the fence was successful, the cluster will reform.
>> Once the cluster configuration is in place, recovery of the file system
>> can begin (ie: the journal can be replayed). Finally, normal operation
>> can continue, albeit with one less node. This is also where the resource
>> manager (rgmanager or pacemaker) start shuffling around any resources
>> that were lost when the node went down.
>>   
> 
>   From guide you sent me, I understood that fencing to work well servers
> should have IPMI available on motherboards.
> 
>   My client is going to purchase servers at Hetzner from their EQ-Line.
> I asked their support if IPMI is available. Since my other client
> already has server with 'em, I tried to install ipmi related packages
> (like you specified in guide). IPMI service doesn't start, so I assume
> it's not available or not turned on in BIOS.

That doesn't mean much. The IPMI service isn't what you use for fencing 
in this context. It's for diagnostics (e.g. advanced sensor readings, 
fan speeds, temperatures, voltages, etc.). Think of it as lm_sensors on 
steroids. For fencing you need to connect to the machine externally over 
the network via IPMI, and this will run at firmware level (i.e. you need 
to be able to power the machine on and off without an OS running).

>   How would cluster work if no IPMI or similar technology is available
> for fencing? In case one of nodes dies and no fencing is available,
> cluster will hang until administrator does manual fancing?

Yes, that's about the size of it.

There are add-in cards you can use to add fencing functionality even if 
you don't have this built into the server, e.g. Raritan eRIC G4 and 
similar. I wrote a fencing agent for those, you should be able to find 
it in the redhat bugzilla. They can be found for about ?175 or so. That 
may or may not compare favourably to what you can get with the servers 
from the vendor.

Alternatively, you can use network controllable power bars for fencing, 
they may work out cheaper (you need one eRIC card per server, and 
assuming your servers have dual PSUs, you'd only need two power bars).

Something else just occurs to me - you mentioned MySQL. You do realize 
that the performance of it will be attrocious on a shared cluster file 
system (ANY shared cluster file system), right? Unless you only intend 
to run mysqld on a single node at a time (in which case there's no point 
in putting it on a cluster file system).

Gordan



From niks at logik-internet.rs  Tue Feb 15 13:05:35 2011
From: niks at logik-internet.rs (Nikola Savic)
Date: Tue, 15 Feb 2011 14:05:35 +0100
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D5A6BDC.8080708@bobich.net>
References: <4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>	<4D59E3A1.4070508@logik-internet.rs>	<4D59E6F3.3050002@alteeve.com>	<4D59EAAD.8030301@logik-internet.rs>	<4D5A4DFE.8050101@bobich.net>
	<4D5A6852.5000103@logik-internet.rs> <4D5A6BDC.8080708@bobich.net>
Message-ID: <4D5A7A1F.60804@logik-internet.rs>

Gordan Bobic wrote:
>> What is main reason for you not to use LVM on top of DRBD? Is it just
>> that you didn't require benefits it brings? Or, it makes more
>> problems by your opinion?
>
> Traditionally, CLVM didn't provide any tangible benefits (no
> snapshots), and I never found myself in a situation where dynamically
> growing a volume with randomly assembled storage was required. If you
> are JBOD-ing a bunch of cheap SATA disks, you might as well size the
> storage correctly to begin with and not have to bother with LVM. I'm
> assuming this is what you are doing since you are doing it on the
> cheap (SAN-less). If you are using a SAN, the SAN will provide
> functionality to grow the exported block device and you can just grow
> the fs onto that, without needing LVM.
>
> So apart from snapshots (non-clustered) or a setup like what was
> suggested earlier, to have DRBD on top of local LVM to gain
> local-consistency snapshot capability in a cluster (not sure I'd trust
> that with my data, but it may be good for non-production
> environments), I don't really see the advantage. Snapshots also only
> give you crash-level consistency, which I never felt was good enough
> for applications like databases. A replicated slave that you can shut
> down is generally a more reliable solution for backups.

Thank you for detailed response!

I generally like idea of removing unneeded levels of technology.

In case DRBD+GFS2 is used for shared storage, do I need cluster suite?
Can GFS2 in this setup without cluster setup?

Thanks,
Nikola



From niks at logik-internet.rs  Tue Feb 15 13:14:42 2011
From: niks at logik-internet.rs (Nikola Savic)
Date: Tue, 15 Feb 2011 14:14:42 +0100
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D5A71CE.4030007@bobich.net>
References: <4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>	<4D59E3A1.4070508@logik-internet.rs>	<4D59E6F3.3050002@alteeve.com>
	<4D5A6F5D.1050301@logik-internet.rs> <4D5A71CE.4030007@bobich.net>
Message-ID: <4D5A7C42.3080003@logik-internet.rs>

Gordan Bobic wrote:
> Something else just occurs to me - you mentioned MySQL. You do realize
> that the performance of it will be attrocious on a shared cluster file
> system (ANY shared cluster file system), right? Unless you only intend
> to run mysqld on a single node at a time (in which case there's no
> point in putting it on a cluster file system). 

  MySQL Master and Slave(s) will run on single node. No two MySQL
instances will run on same set of data. Shared storage for MySQL data
should enable easier movement of MySQL instance between nodes. Eg. when
MySQL master needs to be moved from one node to other, I assume it would
be easier with DRBD, because I would "only" need to stop MySQL on one
node and start it on other configured to use same set of data.
Additionally, floating IP address assigned to MySQL master would need to
be re-assigned to new node. Slaves would also need to be restarted to
connect to new master. Even without floating IP used only my MySQL
Master, slaves and web application can easily be reconfigured to use new
IP. Do you see problem in this kind of setup?

  Thanks,
  Nikola



From gordan at bobich.net  Tue Feb 15 13:12:58 2011
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 15 Feb 2011 13:12:58 +0000
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D5A7A1F.60804@logik-internet.rs>
References: <4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>	<4D59E3A1.4070508@logik-internet.rs>	<4D59E6F3.3050002@alteeve.com>	<4D59EAAD.8030301@logik-internet.rs>	<4D5A4DFE.8050101@bobich.net>	<4D5A6852.5000103@logik-internet.rs>
	<4D5A6BDC.8080708@bobich.net> <4D5A7A1F.60804@logik-internet.rs>
Message-ID: <4D5A7BDA.30104@bobich.net>

Nikola Savic wrote:
> Gordan Bobic wrote:
>>> What is main reason for you not to use LVM on top of DRBD? Is it just
>>> that you didn't require benefits it brings? Or, it makes more
>>> problems by your opinion?
>> Traditionally, CLVM didn't provide any tangible benefits (no
>> snapshots), and I never found myself in a situation where dynamically
>> growing a volume with randomly assembled storage was required. If you
>> are JBOD-ing a bunch of cheap SATA disks, you might as well size the
>> storage correctly to begin with and not have to bother with LVM. I'm
>> assuming this is what you are doing since you are doing it on the
>> cheap (SAN-less). If you are using a SAN, the SAN will provide
>> functionality to grow the exported block device and you can just grow
>> the fs onto that, without needing LVM.
>>
>> So apart from snapshots (non-clustered) or a setup like what was
>> suggested earlier, to have DRBD on top of local LVM to gain
>> local-consistency snapshot capability in a cluster (not sure I'd trust
>> that with my data, but it may be good for non-production
>> environments), I don't really see the advantage. Snapshots also only
>> give you crash-level consistency, which I never felt was good enough
>> for applications like databases. A replicated slave that you can shut
>> down is generally a more reliable solution for backups.
> 
> Thank you for detailed response!
> 
> I generally like idea of removing unneeded levels of technology.
> 
> In case DRBD+GFS2 is used for shared storage, do I need cluster suite?
> Can GFS2 in this setup without cluster setup?

No, it cannot. GFS2's locking is dependant on the cman service being up 
and quorate, so yes, you still need the cluster suite being up and 
running, since that is what handles fencing. You could replace 
DRBD+GFS+RHCS with, say, DRBD+OCFS2+Heartbeat, but that wouldn't gain 
you anything either way - you'd still need fencing configured and working.

Note that DRBD should also have fencing (stonith) configured, and on a 
lower time-out than the rest of the cluster layer to eliminate 
possibility of split-braining.

Gordan



From gordan at bobich.net  Tue Feb 15 13:31:42 2011
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 15 Feb 2011 13:31:42 +0000
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D5A7C42.3080003@logik-internet.rs>
References: <4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>	<4D59E3A1.4070508@logik-internet.rs>	<4D59E6F3.3050002@alteeve.com>	<4D5A6F5D.1050301@logik-internet.rs>
	<4D5A71CE.4030007@bobich.net> <4D5A7C42.3080003@logik-internet.rs>
Message-ID: <4D5A803E.5090106@bobich.net>

Nikola Savic wrote:
> Gordan Bobic wrote:
>> Something else just occurs to me - you mentioned MySQL. You do realize
>> that the performance of it will be attrocious on a shared cluster file
>> system (ANY shared cluster file system), right? Unless you only intend
>> to run mysqld on a single node at a time (in which case there's no
>> point in putting it on a cluster file system). 
> 
>   MySQL Master and Slave(s) will run on single node. No two MySQL
> instances will run on same set of data. Shared storage for MySQL data
> should enable easier movement of MySQL instance between nodes. Eg. when
> MySQL master needs to be moved from one node to other, I assume it would
> be easier with DRBD, because I would "only" need to stop MySQL on one
> node and start it on other configured to use same set of data.

There is a better way to do that. Run DRBD in active-passive mode, and 
grab the fail-over scripts from heartbeat. Then set up a dependency in 
cluster.conf that will handle a combined service of DRBD disk (handling 
active/passive switch), file system (mounting the fs once the DRBD 
becomes active locally, and mysql. You define them as dependant on each 
other in cluster.conf by suitable nesting.

> Additionally, floating IP address assigned to MySQL master would need to
> be re-assigned to new node.

You can make that IP a part of the dependency stack mentioned above.

> Slaves would also need to be restarted to
> connect to new master. Even without floating IP used only my MySQL
> Master, slaves and web application can easily be reconfigured to use new
> IP. Do you see problem in this kind of setup?

If the IP fails over and the FS is consistent you don't need to change 
any configs - MySQL slaves will re-try connecting until they succeed. 
Just make sure your bin-logs are on the same mount as the rest of MySQL, 
since they have to fail over with the rest of the DB.

Gordan



From jeff.sturm at eprize.com  Tue Feb 15 15:55:43 2011
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Tue, 15 Feb 2011 10:55:43 -0500
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D5A6BDC.8080708@bobich.net>
References: <4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>	<4D59E3A1.4070508@logik-internet.rs>	<4D59E6F3.3050002@alteeve.com>	<4D59EAAD.8030301@logik-internet.rs><4D5A4DFE.8050101@bobich.net>
	<4D5A6852.5000103@logik-internet.rs> <4D5A6BDC.8080708@bobich.net>
Message-ID: <64D0546C5EBBD147B75DE133D798665F0855C0F4@hugo.eprize.local>

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com]
> On Behalf Of Gordan Bobic
> Sent: Tuesday, February 15, 2011 7:05 AM
> 
> Volume resizing is, IMO, over-rated and unnecessary in most cases,
except where data
> growth is quite mind-boggling (in which case you won't be using MySQL
anyway).

We actually resize volumes often.  Some of our storage volumes have 30
LUNs or more.  We have so many because we've virtualized most of our
infrastructure, and some of the hosts are single-purpose hosts.

We don't want to allocate too more storage in advance, simply because
it's easier to grow than to shrink.  Stop the host, grow the volume,
e2fsck/resize2fs, start up and go.  Much nicer than increasing disk
capacity on physical hosts.

CLVM works well for this, but that's about all it's good for IMHO.  I
prefer to use the SAN's native volume management over CLVM when
available.

Haven't tried DRBD yet but I'm really tempted... it sounds like it has
come a long way since its modest beginnings.

-Jeff





From gordan at bobich.net  Tue Feb 15 16:17:03 2011
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 15 Feb 2011 16:17:03 +0000
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <64D0546C5EBBD147B75DE133D798665F0855C0F4@hugo.eprize.local>
References: <4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>	<4D59E3A1.4070508@logik-internet.rs>	<4D59E6F3.3050002@alteeve.com>	<4D59EAAD.8030301@logik-internet.rs><4D5A4DFE.8050101@bobich.net>	<4D5A6852.5000103@logik-internet.rs>
	<4D5A6BDC.8080708@bobich.net>
	<64D0546C5EBBD147B75DE133D798665F0855C0F4@hugo.eprize.local>
Message-ID: <4D5AA6FF.8080608@bobich.net>

Jeff Sturm wrote:
>> -----Original Message-----
>> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com]
>> On Behalf Of Gordan Bobic
>> Sent: Tuesday, February 15, 2011 7:05 AM
>>
>> Volume resizing is, IMO, over-rated and unnecessary in most cases,
> except where data
>> growth is quite mind-boggling (in which case you won't be using MySQL
> anyway).
> 
> We actually resize volumes often.  Some of our storage volumes have 30
> LUNs or more.  We have so many because we've virtualized most of our
> infrastructure, and some of the hosts are single-purpose hosts.
> 
> We don't want to allocate too more storage in advance, simply because
> it's easier to grow than to shrink.  Stop the host, grow the volume,
> e2fsck/resize2fs, start up and go.  Much nicer than increasing disk
> capacity on physical hosts.

Seems labour and downtime intensive to me. Maybe I'm just used to 
environments where that is an unacceptable tradeoff vs. ?40/TB for storage.

Not to mention that it makes you totally reliant on SAN level 
redundancy, which I also generally deem unacceptable except on very high 
end SANs that have mirroring features.

Additionally, considering you can self-build a multi-TB iSCSI SAN for a 
few hundred ?/$/? which will have volume growing ability (use sparse 
files for iSCSI volumes and write a byte to a greater offset), I cannot 
really see any justification whatsoever for using LVM with SAN based 
storage.

> Haven't tried DRBD yet but I'm really tempted... it sounds like it has
> come a long way since its modest beginnings.

Not sure how far back you are talking about but I have been using it in 
production in both active-active and active-passive configurations since 
at least 2007 with no problems. From the usage point of view, the 
changes have been negligible.

Gordan



From rpeterso at redhat.com  Tue Feb 15 16:24:26 2011
From: rpeterso at redhat.com (Bob Peterson)
Date: Tue, 15 Feb 2011 11:24:26 -0500 (EST)
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <64D0546C5EBBD147B75DE133D798665F0855C0F4@hugo.eprize.local>
Message-ID: <263367529.33108.1297787066881.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>

----- Original Message -----
| We don't want to allocate too more storage in advance, simply because
| it's easier to grow than to shrink. Stop the host, grow the volume,
| e2fsck/resize2fs, start up and go. Much nicer than increasing disk
| capacity on physical hosts.

These might be good for ext3/4, but with gfs and gfs2 you can lvresize
and gfs2_grow while the lv is mounted.  In fact, we expect it.
Just make sure the vg has the clustered bit set (vgchange -cy) first.

Regards,

Bob Peterson
Red Hat File Systems



From ajb2 at mssl.ucl.ac.uk  Tue Feb 15 17:59:08 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Tue, 15 Feb 2011 17:59:08 +0000
Subject: [Linux-cluster] optimising DLM speed?
Message-ID: <4D5ABEEC.30609@mssl.ucl.ac.uk>


After lots of headbanging, I'm slowly realising that limits on GFS2 lock 
rates and totem message passing appears to be the main inhibitor of 
cluster performance.

Even on disks which are only mounted on one node (using lock_dlm), the 
ping_pong rate is - quite frankly - appalling, at about 5000 
locks/second, falling off to single digits when 3 nodes are active on 
the same directory.

totem's defaults are pretty low:

(from man openais.conf)

max messages/second = 17
window_size = 50
encryption = on
encryption/decryption threads = 1
netmtu = 1500

I suspect tuning these would have a marked effect on performance

gfs_controld and dlm_controld aren't even appearing in the CPU usage 
tables (24Gb dual 5560CPUs)

We have 2 GFS clusters, 2 nodes (imap) and 3 nodes (fileserving)

The imap system has around 2.5-3 million small files in the Maildir imap 
tree, whilst the fileserver cluster has ~90 1Tb filesystems of 1-4 
million files apiece (fileserver total is around 150 million files)

When things get busy or when users get silly and drop 10,000 files in a 
directory, performance across the entire cluster goes downhill badly - 
not just in the affected disk or directory.

Even worse: backups - it takes 20-28 hours to run a 0 file incremental 
backup of a 2.1million file system (ext4 takes about 8 minutes for the 
same file set!)


All heartbeat/lock traffic is handled across a dedicated Gb switch with 
each cluster in its own vlan to ensure no external cruft gets in to 
cause problems.

I'm seeing heartbeat/lock lan traffic peak out at about 120kb/s and 
4000pps per node at the moment. Clearly the switch isn't the problem - 
and using hardware acclerated igb devices I'm pretty sure the 
networking's fine too.

SAN side, there are 4 8Gb Qlogic cards facing the fabric and right now 
the whole mess talks to a Nexsan atabeast (which is slow, but seldom 
gets its commmand queue maxed out.)

Has anyone played much with the totem message timings? if so what 
results have you had?

As a comparison, the same hardware using EXT4 on a standalone system can 
trivially max out multiple 1Gb/s interfaces while transferring 1-2Mb/s 
files and gives lock rates of 1.8-2.5 million locks/second even with 
multiple ping_pong processes running.





From swhiteho at redhat.com  Tue Feb 15 18:20:20 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Tue, 15 Feb 2011 18:20:20 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D5ABEEC.30609@mssl.ucl.ac.uk>
References: <4D5ABEEC.30609@mssl.ucl.ac.uk>
Message-ID: <1297794020.2711.21.camel@dolmen>

Hi,

On Tue, 2011-02-15 at 17:59 +0000, Alan Brown wrote:
> After lots of headbanging, I'm slowly realising that limits on GFS2 lock 
> rates and totem message passing appears to be the main inhibitor of 
> cluster performance.
> 
> Even on disks which are only mounted on one node (using lock_dlm), the 
> ping_pong rate is - quite frankly - appalling, at about 5000 
> locks/second, falling off to single digits when 3 nodes are active on 
> the same directory.
> 
Let me try and explain what is going on here.... the posix (fcntl) locks
which you are using, do not go through the dlm, or at least not the main
part of the dlm.

The lock requests are sent to either gfs_controld or dlm_controld,
depending upon the version of RHCS where the requests are processed in
userspace via corosync/openais.

> totem's defaults are pretty low:
> 
> (from man openais.conf)
> 
> max messages/second = 17
> window_size = 50
> encryption = on
> encryption/decryption threads = 1
> netmtu = 1500
> 
> I suspect tuning these would have a marked effect on performance
> 
> gfs_controld and dlm_controld aren't even appearing in the CPU usage 
> tables (24Gb dual 5560CPUs)
> 
Only one of gfs_controld/dlm_controld will have any part in dealing with
the locks that you are concerned with, depending on the version.

> We have 2 GFS clusters, 2 nodes (imap) and 3 nodes (fileserving)
> 
> The imap system has around 2.5-3 million small files in the Maildir imap 
> tree, whilst the fileserver cluster has ~90 1Tb filesystems of 1-4 
> million files apiece (fileserver total is around 150 million files)
> 
> When things get busy or when users get silly and drop 10,000 files in a 
> directory, performance across the entire cluster goes downhill badly - 
> not just in the affected disk or directory.
> 
> Even worse: backups - it takes 20-28 hours to run a 0 file incremental 
> backup of a 2.1million file system (ext4 takes about 8 minutes for the 
> same file set!)
> 
The issues you've reported here don't sound to me as if they are related
to the rate of posix locks which can be granted. These sound to me a lot
more like issues relating to the I/O pattern on the filesystem.

How is the data spread out across directories and across nodes? Do you
try to keep users local to a single node for the imap servers? Is the
backup just doing a single pass scan over the whole fileystem?

> 
> All heartbeat/lock traffic is handled across a dedicated Gb switch with 
> each cluster in its own vlan to ensure no external cruft gets in to 
> cause problems.
> 
> I'm seeing heartbeat/lock lan traffic peak out at about 120kb/s and 
> 4000pps per node at the moment. Clearly the switch isn't the problem - 
> and using hardware acclerated igb devices I'm pretty sure the 
> networking's fine too.
> 
During the actual workload, or just during the ping pong test?

Steve.

> SAN side, there are 4 8Gb Qlogic cards facing the fabric and right now 
> the whole mess talks to a Nexsan atabeast (which is slow, but seldom 
> gets its commmand queue maxed out.)
> 
> Has anyone played much with the totem message timings? if so what 
> results have you had?
> 
> As a comparison, the same hardware using EXT4 on a standalone system can 
> trivially max out multiple 1Gb/s interfaces while transferring 1-2Mb/s 
> files and gives lock rates of 1.8-2.5 million locks/second even with 
> multiple ping_pong processes running.
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster




From yvette at dbtgroup.com  Tue Feb 15 18:19:53 2011
From: yvette at dbtgroup.com (yvette hirth)
Date: Tue, 15 Feb 2011 18:19:53 +0000
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <6b22c9afe6618e0ab302dae12c4b74aa@sjolshagen.net>
References: "\"<4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>"	<4D59E3A1.4070508@logik-internet.rs>"
	<4D59E6F3.3050002@alteeve.com>	<4D59EAAD.8030301@logik-internet.rs>
	<4D5A4DFE.8050101@bobich.net>	<4D5A6852.5000103@logik-internet.rs>
	<6b22c9afe6618e0ab302dae12c4b74aa@sjolshagen.net>
Message-ID: <4D5AC3C9.8000401@dbtgroup.com>

Thomas Sjolshagen wrote:

> Just so you realize; If you intend to use clvm (i.e. lvme in a cluster 
> where you expect to be able to write to the volume from more than one 
> node at/around the same time w/o a full-on failover), you will _not_ 
> have snapshot support. And no, this isn't "not supported" as in "nobody 
> to call if you encounter a problem", it's "not supported" as in "the 
> tools will not let you create a snapshot of the LV".

i've been listening to this discussion with much interest, as we would 
like to improve the currency of our backup files.

right now we have an ensemble of GFS2 LV's ("pri") as our primary data 
store, and a "matching ensemble" of XFS LV's ("bak") as our backup data 
store.  an hourly cron job rsync's all LV's in the ensemble from pri => 
bak.  it's incredibly reliable, but this reduces our mean backup 
currency by 1/2 hour.  one upside is that i've got snapshots that are 
only 1/2 hour old, and are daily backed up to tape.

the conversation seems to indicate that we can change the bak LV's from 
XFS to GFS2 and have drbd auto-sync the pri LV changes made to the bak 
LV's - yes?  this would reduce our backup currency from a mean of 1/2 
hour to theoretically, "atomic" (more likely "mere seconds").  i assUme 
we have to change from XFS to GFS2, as drbd doesn't appear to do file 
system conversions...

if our assumptions are correct, are there any guides / manuals / doc on 
how to do this?  it's most tempting to try, since if it doesn't work, 
the hourly cron rsync's could be simply reinstated.

many thanks in advance to any and all who can advise...

yvette



From niks at logik-internet.rs  Tue Feb 15 20:09:26 2011
From: niks at logik-internet.rs (Nikola Savic)
Date: Tue, 15 Feb 2011 21:09:26 +0100
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <64D0546C5EBBD147B75DE133D798665F0855C0F4@hugo.eprize.local>
References: <4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>	<4D59E3A1.4070508@logik-internet.rs>	<4D59E6F3.3050002@alteeve.com>	<4D59EAAD.8030301@logik-internet.rs><4D5A4DFE.8050101@bobich.net>	<4D5A6852.5000103@logik-internet.rs>
	<4D5A6BDC.8080708@bobich.net>
	<64D0546C5EBBD147B75DE133D798665F0855C0F4@hugo.eprize.local>
Message-ID: <4D5ADD76.2050600@logik-internet.rs>

Jeff Sturm wrote:
> We actually resize volumes often.  Some of our storage volumes have 30
> LUNs or more.  We have so many because we've virtualized most of our
> infrastructure, and some of the hosts are single-purpose hosts.
>   

  Can you please provide more information on how storage is organized?

  Are you using SAN or local hard disks in nodes? Is there mirroring of
data and how is it implemented in your system?

  Thanks,
  Nikola



From grimme at atix.de  Tue Feb 15 20:07:31 2011
From: grimme at atix.de (Marc Grimme)
Date: Tue, 15 Feb 2011 21:07:31 +0100 (CET)
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <1297794020.2711.21.camel@dolmen>
Message-ID: <13588217.76.1297800448396.JavaMail.marc@mobilix-20>

Hi Steve,
I think lately I observed a very similar behavior with RHEL5 and gfs2.
It was a gfs2 filesystem that had about 2Mio files with sum of 2GB in a directory. When I did a du -shx . in this directory it took about 5 Minutes (noatime mountoption given). Independently on how much nodes took part in the cluster (in the end I only tested with one node). This was only for the first time running all later executed du commands were much faster.
When I mounted the exact same filesystem with lockproto=lock_nolock it took about 10-20 seconds to proceed with the same command.

Next I started to analyze this with oprofile and observed the following result:

opreport --long-file-names:
CPU: AMD64 family10, speed 2900.11 MHz (estimated)
Counted CPU_CLK_UNHALTED events (Cycles outside of halt state) with a unit mask of 0x00 (No unit mask) count 100000
samples  %        symbol name
200569   46.7639  search_rsb_list
118905   27.7234  create_lkb
32499     7.5773  search_bucket
4125      0.9618  find_lkb
3641      0.8489  process_send_sockets
3420      0.7974  dlm_scan_rsbs
3184      0.7424  _request_lock
3012      0.7023  find_rsb
2735      0.6377  receive_from_sock
2610      0.6085  _receive_message
2543      0.5929  dlm_allocate_rsb
2299      0.5360  dlm_hash2nodeid
2228      0.5195  _create_message
2180      0.5083  dlm_astd
2163      0.5043  dlm_find_lockspace_global
2109      0.4917  dlm_find_lockspace_local
2074      0.4836  dlm_lowcomms_get_buffer
2060      0.4803  dlm_lock
1982      0.4621  put_rsb
..

opreport --image /gfs2
CPU: AMD64 family10, speed 2900.11 MHz (estimated)
Counted CPU_CLK_UNHALTED events (Cycles outside of halt state) with a unit mask of 0x00 (No unit mask) count 100000
samples  %        symbol name
9310     15.5600  search_bucket
6268     10.4758  do_promote
2704      4.5192  gfs2_glock_put
2289      3.8256  gfs2_glock_hold
2286      3.8206  gfs2_glock_schedule_for_reclaim
2204      3.6836  gfs2_glock_nq
2204      3.6836  run_queue
2001      3.3443  gfs2_holder_wake
..

opreport --image /dlm
CPU: AMD64 family10, speed 2900.11 MHz (estimated)
Counted CPU_CLK_UNHALTED events (Cycles outside of halt state) with a unit mask of 0x00 (No unit mask) count 100000
samples  %        symbol name
200569   46.7639  search_rsb_list
118905   27.7234  create_lkb
32499     7.5773  search_bucket
4125      0.9618  find_lkb
3641      0.8489  process_send_sockets
3420      0.7974  dlm_scan_rsbs
3184      0.7424  _request_lock
3012      0.7023  find_rsb
2735      0.6377  receive_from_sock
2610      0.6085  _receive_message
2543      0.5929  dlm_allocate_rsb
2299      0.5360  dlm_hash2nodeid
2228      0.5195  _create_message
..

This very much reminded me on a similar test we've done years ago with gfs (see http://www.open-sharedroot.org/Members/marc/blog/blog-on-dlm/red-hat-dlm-__find_lock_by_id/profile-data-with-diffrent-table-sizes).

Does this not show that during the du command 46% of the time the kernel stays in the dlm:search_rsb_list function while looking out for locks. It still looks like the hashtable for the lock in dlm is much too small and searching inside the hashmap is not constant anymore?

I would be really interesting how long the described backup takes when the gfs2 filesystem is mounted exclusively on one node without locking.
For me it looks like you're facing a similar problem with gfs2 that has been worked around with gfs by introducing the glock_purge functionality that leads to a much smaller glock->dlm->hashtable and makes backups and the like much faster.

I hope this helps.

Thanks and regards
Marc.

----- Original Message -----
From: "Steven Whitehouse" <swhiteho at redhat.com>
To: "linux clustering" <linux-cluster at redhat.com>
Sent: Dienstag, 15. Februar 2011 19:20:20
Subject: Re: [Linux-cluster] optimising DLM speed?

Hi,

On Tue, 2011-02-15 at 17:59 +0000, Alan Brown wrote:
> After lots of headbanging, I'm slowly realising that limits on GFS2 lock 
> rates and totem message passing appears to be the main inhibitor of 
> cluster performance.
> 
> Even on disks which are only mounted on one node (using lock_dlm), the 
> ping_pong rate is - quite frankly - appalling, at about 5000 
> locks/second, falling off to single digits when 3 nodes are active on 
> the same directory.
> 
Let me try and explain what is going on here.... the posix (fcntl) locks
which you are using, do not go through the dlm, or at least not the main
part of the dlm.

The lock requests are sent to either gfs_controld or dlm_controld,
depending upon the version of RHCS where the requests are processed in
userspace via corosync/openais.

> totem's defaults are pretty low:
> 
> (from man openais.conf)
> 
> max messages/second = 17
> window_size = 50
> encryption = on
> encryption/decryption threads = 1
> netmtu = 1500
> 
> I suspect tuning these would have a marked effect on performance
> 
> gfs_controld and dlm_controld aren't even appearing in the CPU usage 
> tables (24Gb dual 5560CPUs)
> 
Only one of gfs_controld/dlm_controld will have any part in dealing with
the locks that you are concerned with, depending on the version.

> We have 2 GFS clusters, 2 nodes (imap) and 3 nodes (fileserving)
> 
> The imap system has around 2.5-3 million small files in the Maildir imap 
> tree, whilst the fileserver cluster has ~90 1Tb filesystems of 1-4 
> million files apiece (fileserver total is around 150 million files)
> 
> When things get busy or when users get silly and drop 10,000 files in a 
> directory, performance across the entire cluster goes downhill badly - 
> not just in the affected disk or directory.
> 
> Even worse: backups - it takes 20-28 hours to run a 0 file incremental 
> backup of a 2.1million file system (ext4 takes about 8 minutes for the 
> same file set!)
> 
The issues you've reported here don't sound to me as if they are related
to the rate of posix locks which can be granted. These sound to me a lot
more like issues relating to the I/O pattern on the filesystem.

How is the data spread out across directories and across nodes? Do you
try to keep users local to a single node for the imap servers? Is the
backup just doing a single pass scan over the whole fileystem?

> 
> All heartbeat/lock traffic is handled across a dedicated Gb switch with 
> each cluster in its own vlan to ensure no external cruft gets in to 
> cause problems.
> 
> I'm seeing heartbeat/lock lan traffic peak out at about 120kb/s and 
> 4000pps per node at the moment. Clearly the switch isn't the problem - 
> and using hardware acclerated igb devices I'm pretty sure the 
> networking's fine too.
> 
During the actual workload, or just during the ping pong test?

Steve.

> SAN side, there are 4 8Gb Qlogic cards facing the fabric and right now 
> the whole mess talks to a Nexsan atabeast (which is slow, but seldom 
> gets its commmand queue maxed out.)
> 
> Has anyone played much with the totem message timings? if so what 
> results have you had?
> 
> As a comparison, the same hardware using EXT4 on a standalone system can 
> trivially max out multiple 1Gb/s interfaces while transferring 1-2Mb/s 
> files and gives lock rates of 1.8-2.5 million locks/second even with 
> multiple ping_pong processes running.
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

-- 

Marc Grimme

Tel: +49 89 4523538-14
Fax: +49 89 9901766-0
E-Mail: grimme at atix.de

ATIX Informationstechnologie und Consulting AG | Einsteinstrasse 10 |
85716 Unterschleissheim | www.atix.de

Registergericht: Amtsgericht Muenchen, Registernummer: HRB 168930, USt.-Id.:
DE209485962 | Vorstand: Thomas Merz (Vors.), Marc Grimme, Mark Hlawatschek, Jan R. Bergrath |
Vorsitzender des Aufsichtsrats: Dr. Martin Buss



From ajb2 at mssl.ucl.ac.uk  Tue Feb 15 20:24:44 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Tue, 15 Feb 2011 20:24:44 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D5ABEEC.30609@mssl.ucl.ac.uk>
References: <4D5ABEEC.30609@mssl.ucl.ac.uk>
Message-ID: <4D5AE10C.9020106@mssl.ucl.ac.uk>


The setup described is all on RHEL5.6.

Fileserver filesystems are each mounted on one cluster node only 
(scattered  across nodes) and then NFS exported as individual services 
for portability. (That exposed a major race condition with exportfs as 
it's not parallel aware in any way, shape or form)

Imapserver filesystems are mounted on one node and ALL imap activity 
happens on that node (hot standby)

Up to EL5.6 this has been pretty unstable, panicing regularly under load 
and losing filesystem to a FC driver bug I'll describe separately.





From ajb2 at mssl.ucl.ac.uk  Tue Feb 15 20:36:16 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Tue, 15 Feb 2011 20:36:16 +0000
Subject: [Linux-cluster] QLA2xxx tagged queue bug.
Message-ID: <4D5AE3C0.5060006@mssl.ucl.ac.uk>

I'm documenting this in case anyone else gets bitten

(This is supposed to have been fixed since October, but we encountered 
it in the last few days on RHEL5.6 - either it's not fully fixed or the 
patch has fallen out of the production kernel)

We kept getting GFS and GFS2 filesystems mysteriously going dead with 
"input/output" errors over the last 2 years, which has been traced to a 
bug in qla2xxx:

A QUEUE FULL or BUSY from the target results in a generic error being 
passed up to dm-multipath from the qla2xxx driver (instead of the driver 
backing off the queue size and trying again a few milliseconds later.)

When Dm-multipath receives an error, it marks the path to the target 
"bad" and tries another path. If the queue full condition doesn't clear 
quickly there is a cascade of path failures followed by the target being 
marked as BAD when they've all failed.

If "queue_if_no_path" isn't explicitly enabled in /etc/multipath,conf, 
that causes the i/o error symptoms described above.

Even if the target's tagged queue recovers before all paths fail, there 
tends to be a big hiccup in GFS(2) operations.

If multipathing's queue_if_no_path is enabled and the OS has to wait for 
the target to return, there will be an even longer glitch.


Currently the only workaround available is to set the qla2xxx tagged 
queue depth to a very low value via module options.


Qla2xxx's tagged queue depth is PER LUN, while most target tagged queues 
are PER DEVICE (eg: A Nexsan Satabeast presenting 6 luns has 255 
commands in total, not per lun). It's pretty easy to end up with more 
requests coming out of the initiators than the targets can handle 
simultaneously.





From ajb2 at mssl.ucl.ac.uk  Tue Feb 15 20:45:09 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Tue, 15 Feb 2011 20:45:09 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D5AE10C.9020106@mssl.ucl.ac.uk>
References: <4D5ABEEC.30609@mssl.ucl.ac.uk> <4D5AE10C.9020106@mssl.ucl.ac.uk>
Message-ID: <4D5AE5D5.1050002@mssl.ucl.ac.uk>


 > It would be really interesting how long the described backup takes 
when the gfs2 filesystem is mounted exclusively on one node without locking.

The 2 million inode system backs up in about 30 minutes when mounted 
lock_nolock (0 file incremental backup using bacula)

 > For me it looks like you're facing a similar problem with gfs2 that 
has been worked around with gfs by introducing the glock_purge 
functionality that leads to a much smaller glock->dlm->hashtable and 
makes backups and the like much faster.

Quite likely. Backup performance under GFS2 is slightly worse than with 
GFS. "ls -l" in a directory is _significantly_ worse and can take up to 
4 minutes for a directory with 4000 files onboard (Remember: This is 
with the GFS2 filesystem mounted lock_dlm on one node only!)


Compounding matters, we have a network /home - mounted on the 
fileservers and NFSv3 exported to ~150 RHEL5 desktops (Lots of small 
files, LOTS of random access).

KDE, Openoffice, Thunderbird, Mozilla are all pretty lock/cachefile 
happy and hit the network /home export fairly hard, so when there's a 
performance issue the users get pretty noisy.






From vincent.blondel at ing.be  Tue Feb 15 20:50:04 2011
From: vincent.blondel at ing.be (vincent.blondel at ing.be)
Date: Tue, 15 Feb 2011 21:50:04 +0100
Subject: [Linux-cluster] Two nodes DRBD - Fail-Over Actif/Passif Cluster.
In-Reply-To: <294881FE3F4013418806F0CE6E73A7B6052F302466@VPNLCMS92081.europe.intranet>
References: <294881FE3F4013418806F0CE6E73A7B6052F302466@VPNLCMS92081.europe.intranet>
Message-ID: <294881FE3F4013418806F0CE6E73A7B6052F302474@ing.com>

> Hello all,
>
> I just installed last week two servers, each of them with Redhat Linux Enterprise 6.0 on it for hosting in a near future Blue Coat Reporter. Installation is ok but now I am trying to configure these both servers in cluster.
>
> First of all, I never configured any cluster with Linux ...
>
> Servers are both HP DL380R06 with disk cabinet directly attached on it. (twice exactly same hardware specs).
>
> What I would like to get is simply getting an Actif/Passif clustering mode with bidirectional disk space synchronization. This means, both servers are running. Only, the first one is running Reporter. During this time, disk spaces are continuously synchronized. When first one is down, second one becomes actif and when first one is running again, it synchronizes the disks and becomes primary again.
>
> server 1 is reporter1.lab.intranet with ip 10.30.30.90
> server 2 is reporter2.lab.intranet with ip 10.30.30.91
>
> the load balanced ip should be 10.30.30.92 ..
>
> After some days of research on the net, I came to the conclusion that I could be happy with a solution including, DRBD/GFS2 with Redhat Cluster Suite.
>
> I am first trying to get a complete picture running on two vmware fusion (Linux Redhat Enterprise Linux 6) on my macosx before configuring my real servers.
>
> So, after some hours of research on the net, I found some articles and links that seem to describe what I wanna get ...
>
> http://gcharriere.com/blog/?p=73
> http://www.linuxtopia.org/online_books/rhel6/rhel_6_cluster_admin/rhel_6_cluster_ch-config-cli-CA.html
> http://www.drbd.org/users-guide/users-guide.html
>
> and the DRBD packages for RHEL6 that I did not find anywhere ..
>
> http://elrepo.org/linux/elrepo/el6/i386/RPMS/
>
> I just only configured till now the first part, meaning cluster services but the first issue occur ..
>
> below the cluster.conf file ...
>
>
> <?xml version="1.0"?>
> <cluster name="cluster" config_version="6">
>   <!-- post_join_delay: number of seconds the daemon will wait before
>                         fencing any victims after a node joins the domain
>        post_fail_delay: number of seconds the daemon will wait before
>                       fencing any victims after a domain member fails
>        clean_start    : prevent any startup fencing the daemon might do.
>                       It indicates that the daemon should assume all nodes
>                       are in a clean state to start. -->
>   <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
>   <clusternodes>
>     <clusternode name="reporter1.lab.intranet" votes="1" nodeid="1">
>       <fence>
>         <!-- Handle fencing manually -->
>         <method name="human">
>           <device name="human" nodename="reporter1.lab.intranet"/>
>         </method>
>       </fence>
>     </clusternode>
>     <clusternode name="reporter2.lab.intranet" votes="1" nodeid="2">
>       <fence>
>         <!-- Handle fencing manually -->
>         <method name="human">
>           <device name="human" nodename="reporter2.lab.intranet"/>
>         </method>
>       </fence>
>     </clusternode>
>   </clusternodes>
>   <!-- cman two nodes specification -->
>   <cman expected_votes="1" two_node="1"/>
>   <fencedevices>
>     <!-- Define manual fencing -->
>     <fencedevice name="human" agent="fence_manual"/>
>   </fencedevices>
>   <rm>
>      <failoverdomains>
>         <failoverdomain name="example_pri" nofailback="0" ordered="1" restricted="0">
>            <failoverdomainnode name="reporter1.lab.intranet" priority="1"/>
>            <failoverdomainnode name="reporter2.lab.intranet" priority="2"/>
>         </failoverdomain>
>      </failoverdomains>
>      <resources>
>            <ip address="10.30.30.92" monitor_link="on" sleeptime="10"/>
>            <apache config_file="conf/httpd.conf" name="example_server" server_root="/etc/httpd" shutdown_wait="0"/>
>       </resources>
>       <service autostart="1" domain="example_pri" exclusive="0" name="example_apache" recovery="relocate">
>                 <ip ref="10.30.30.92"/>
>                 <apache ref="example_server"/>
>       </service>
>   </rm>
> </cluster>
>
> and this is the result I get on both servers ...
>
> [root at reporter1 ~]# clustat
> Cluster Status for cluster @ Mon Feb 14 22:22:53 2011
> Member Status: Quorate
>
>  Member Name                                      ID   Status
>  ------ ----                                      ---- ------
>  reporter1.lab.intranet                               1 Online, Local, rgmanager
>  reporter2.lab.intranet                               2 Online, rgmanager
>
>  Service Name                            Owner (Last)                            State
>  ------- ----                            ----- ------                            -----
>  service:example_apache                  (none)                                  stopped
>
> as you can see, everything is stopped or in other words nothing runs .. so my question are :
>
> did I forget something in my conf file ?
> did I make something wrong in my conf file ?
> do I have to configure manually load balanced ip 10.30.30.92 as an alias ip on both sides or is it done automatically by redhat cluster ?
> I just made a simple try with apache but I do not find anywhere reference to the start/stop script for apache in the examples, is that normal ??
> do you have some best practice regarding this picture ??
>
> many thks to help me because I certainly have a bad understanding on some points.
>

any idea to solve this problem, .. many thks ??

> Regards
> Vincent
-----------------------------------------------------------------
ATTENTION:
This e-mail is intended for the exclusive use of the
recipient(s). This e-mail and its attachments, if any, contain
confidential information and/or information protected by
intellectual property rights or other rights. This e-mail does
not constitute any commitment for ING Belgium except when
expressly otherwise agreed in a written agreement between the
intended recipient and ING Belgium.

If you receive this message by mistake, please, notify the sender
with the "reply" option and delete immediately this e-mail from
your system, and destroy all copies of it. You may not, directly
or indirectly, use this e-mail or any part of it if you are not
the intended recipient.

Messages and attachments are scanned for all viruses known. If
this message contains password-protected attachments, the files
have NOT been scanned for viruses by the ING mail domain. Always
scan attachments before opening them.
-----------------------------------------------------------------
ING Belgium SA/NV - Bank/Lender - Avenue Marnix 24, B-1000
Brussels, Belgium - Brussels RPM/RPR - VAT BE 0403.200.393 -
BIC (SWIFT) : BBRUBEBB - Account: 310-9156027-89 (IBAN BE45 3109
1560 2789).
An insurance broker, registered with the Banking, Finance and
Insurance Commission under the code number 12381A.

ING Belgique SA - Banque/Preteur, Avenue Marnix 24, B-1000
Bruxelles - RPM Bruxelles - TVA BE 0403 200 393 - BIC (SWIFT) :
BBRUBEBB - Compte: 310-9156027-89 (IBAN: BE45 3109 1560 2789).
Courtier d'assurances inscrit a la CBFA sous le numero 12381A.

ING Belgie NV - Bank/Kredietgever - Marnixlaan 24, B-1000 Brussel
- RPR Brussel - BTW BE 0403.200.393 - BIC (SWIFT) : BBRUBEBB -
Rekening: 310-9156027-89 (IBAN: BE45 3109 1560 2789).
Verzekeringsmakelaar ingeschreven bij de CBFA onder het nr.
12381A.
-----------------------------------------------------------------




From gordan at bobich.net  Tue Feb 15 21:01:21 2011
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 15 Feb 2011 21:01:21 +0000
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D5AC3C9.8000401@dbtgroup.com>
References: "\"<4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>"	<4D59E3A1.4070508@logik-internet.rs>"	<4D59E6F3.3050002@alteeve.com>	<4D59EAAD.8030301@logik-internet.rs>	<4D5A4DFE.8050101@bobich.net>	<4D5A6852.5000103@logik-internet.rs>	<6b22c9afe6618e0ab302dae12c4b74aa@sjolshagen.net>
	<4D5AC3C9.8000401@dbtgroup.com>
Message-ID: <4D5AE9A1.9080604@bobich.net>

On 02/15/2011 06:19 PM, yvette hirth wrote:
> Thomas Sjolshagen wrote:
>
>> Just so you realize; If you intend to use clvm (i.e. lvme in a cluster
>> where you expect to be able to write to the volume from more than one
>> node at/around the same time w/o a full-on failover), you will _not_
>> have snapshot support. And no, this isn't "not supported" as in
>> "nobody to call if you encounter a problem", it's "not supported" as
>> in "the tools will not let you create a snapshot of the LV".
>
> i've been listening to this discussion with much interest, as we would
> like to improve the currency of our backup files.
>
> right now we have an ensemble of GFS2 LV's ("pri") as our primary data
> store, and a "matching ensemble" of XFS LV's ("bak") as our backup data
> store. an hourly cron job rsync's all LV's in the ensemble from pri =>
> bak. it's incredibly reliable, but this reduces our mean backup currency
> by 1/2 hour. one upside is that i've got snapshots that are only 1/2
> hour old, and are daily backed up to tape.
>
> the conversation seems to indicate that we can change the bak LV's from
> XFS to GFS2 and have drbd auto-sync the pri LV changes made to the bak
> LV's - yes? this would reduce our backup currency from a mean of 1/2
> hour to theoretically, "atomic" (more likely "mere seconds"). i assUme
> we have to change from XFS to GFS2, as drbd doesn't appear to do file
> system conversions...
>
> if our assumptions are correct, are there any guides / manuals / doc on
> how to do this? it's most tempting to try, since if it doesn't work, the
> hourly cron rsync's could be simply reinstated.

I'm not sure you realize what this would require. DRBD is a block 
device. You would have to start with a new partition/disk, "format" it 
for DRBD (creates DRBD metadata on the block device), then create GFS on 
top of it and put your files in. It's a backup+restore job to migrate to 
and from it.

If you were to do this, your backup node would have to be a part of your 
DRBD cluster (all nodes need to share the DRBD device, unless you plan 
to only use it on the SAN that all the nodes connect to the volume 
from). You would then drop the backup node out of the cluster completely 
and make sure it cannot reconnect (this is vitally important), mount the 
GFS FS from DRBD ro with lock_nolock, and then back that up. Unless you 
are happy with just a block level mirror, which won't help you if data 
is accidentally deleted. DRBD is network RAID1, and RAID (of any level) 
is not a replacement for backups - but I'm sure you know that.

Gordan



From gordan at bobich.net  Tue Feb 15 21:09:14 2011
From: gordan at bobich.net (Gordan Bobic)
Date: Tue, 15 Feb 2011 21:09:14 +0000
Subject: [Linux-cluster] Two nodes DRBD - Fail-Over Actif/Passif Cluster.
In-Reply-To: <294881FE3F4013418806F0CE6E73A7B6052F302474@ing.com>
References: <294881FE3F4013418806F0CE6E73A7B6052F302466@VPNLCMS92081.europe.intranet>
	<294881FE3F4013418806F0CE6E73A7B6052F302474@ing.com>
Message-ID: <4D5AEB7A.6090207@bobich.net>

On 02/15/2011 08:50 PM, vincent.blondel at ing.be wrote:

>> below the cluster.conf file ...
>>
>>
>> <?xml version="1.0"?>
>> <cluster name="cluster" config_version="6">
>>    <!-- post_join_delay: number of seconds the daemon will wait before
>>                          fencing any victims after a node joins the domain
>>         post_fail_delay: number of seconds the daemon will wait before
>>                        fencing any victims after a domain member fails
>>         clean_start    : prevent any startup fencing the daemon might do.
>>                        It indicates that the daemon should assume all nodes
>>                        are in a clean state to start. -->
>>    <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
>>    <clusternodes>
>>      <clusternode name="reporter1.lab.intranet" votes="1" nodeid="1">
>>        <fence>
>>          <!-- Handle fencing manually -->
>>          <method name="human">
>>            <device name="human" nodename="reporter1.lab.intranet"/>
>>          </method>
>>        </fence>
>>      </clusternode>
>>      <clusternode name="reporter2.lab.intranet" votes="1" nodeid="2">
>>        <fence>
>>          <!-- Handle fencing manually -->
>>          <method name="human">
>>            <device name="human" nodename="reporter2.lab.intranet"/>
>>          </method>
>>        </fence>
>>      </clusternode>
>>    </clusternodes>
>>    <!-- cman two nodes specification -->
>>    <cman expected_votes="1" two_node="1"/>
>>    <fencedevices>
>>      <!-- Define manual fencing -->
>>      <fencedevice name="human" agent="fence_manual"/>
>>    </fencedevices>
>>    <rm>
>>       <failoverdomains>
>>          <failoverdomain name="example_pri" nofailback="0" ordered="1" restricted="0">
>>             <failoverdomainnode name="reporter1.lab.intranet" priority="1"/>
>>             <failoverdomainnode name="reporter2.lab.intranet" priority="2"/>
>>          </failoverdomain>
>>       </failoverdomains>
>>       <resources>
>>             <ip address="10.30.30.92" monitor_link="on" sleeptime="10"/>
>>             <apache config_file="conf/httpd.conf" name="example_server" server_root="/etc/httpd" shutdown_wait="0"/>
>>        </resources>
>>        <service autostart="1" domain="example_pri" exclusive="0" name="example_apache" recovery="relocate">
>>                  <ip ref="10.30.30.92"/>
>>                  <apache ref="example_server"/>
>>        </service>
>>    </rm>
>> </cluster>
>>
>> and this is the result I get on both servers ...
>>
>> [root at reporter1 ~]# clustat
>> Cluster Status for cluster @ Mon Feb 14 22:22:53 2011
>> Member Status: Quorate
>>
>>   Member Name                                      ID   Status
>>   ------ ----                                      ---- ------
>>   reporter1.lab.intranet                               1 Online, Local, rgmanager
>>   reporter2.lab.intranet                               2 Online, rgmanager
>>
>>   Service Name                            Owner (Last)                            State
>>   ------- ----                            ----- ------                            -----
>>   service:example_apache                  (none)                                  stopped
>>
>> as you can see, everything is stopped or in other words nothing runs .. so my question are :

Having a read through /var/log/messages for possible causes would be a 
good start.

>> do I have to configure manually load balanced ip 10.30.30.92 as an alias ip on both sides or is it done automatically by redhat cluster ?

RHCS will automatically assign the IP to an interface that is on the 
same subnet. You most definitely shouldn't create the IP manually on any 
of the nodes.

>> I just made a simple try with apache but I do not find anywhere reference to the start/stop script for apache in the examples, is that normal ??
>> do you have some best practice regarding this picture ??

I'm not familiar with the <apache> tag in cluster.conf, I usually 
configure most things as init script resources.

Gordan



From niks at logik-internet.rs  Tue Feb 15 21:26:19 2011
From: niks at logik-internet.rs (Nikola Savic)
Date: Tue, 15 Feb 2011 22:26:19 +0100
Subject: [Linux-cluster] Organizing 3 servers into cluster
Message-ID: <4D5AEF7B.7030700@logik-internet.rs>


  Hello,

  I need to setup cluster using 3 servers. Thanks to everybody involved
from this mailing list in previous post, we have concluded that
DRBD+GFS2 is the best approach for building shared storage from local
hard drives. It will enable mirroring of data between nodes using DRBD,
and concurrent access to file systems thanks to GFS2.

  Main purpose of this cluster is hosting of single web site (web
application). Main services we'll have are Web server (httpd) and MySQL.
We also use memcached for shared session and caching. Cluster should
provide following benefits:
- High Availability
- High Performance (balancing of web application execution on cluster nodes)
- Traffic balancing

  This means that all 3 servers will execute web application and provide
content to visitors. We didn't plan to use load balancers in front of
web servers, because traffic balancing is important. That is why DNS
round-robin approach was planned, which we already use for two server
architecture used at moment. Web server on each node will be directly
accessed by visitors, spread by use of DNS round-robin. One of servers
will have MySQL Master used for writing, while other two will have MySQL
Slave instances for reading. Each MySQL instance will execute on
separate data on shared storage.

  I was confused by following line in RedHat's Cluster documentation,
related to High Availablity: "An HA service can run on only one cluster
node at a time to maintain data integrity". Does this mean that web
servers can not work in parallel on all cluster nodes? Or, is this
limitaion related to combination of IP address and service (eg. web
server on IP 10.1.1.1)?

  When MySQL is in question, I have even more doubts on how to implement
HA automatically. Failure of node where MySQL Master executes should
result in automatic start of MySQL service on different node. In our
configuration with 3 servers, there is no spare node, so MySQL Master
and one of MySQL Slave instances should run on same server. If two nodes
fail, all three MySQL instances will fail to single node :). If I
understand MySQL docs well, this is not problem, but each instance must
use different port, socket, data folders (which we already have
separated). I didn't notice that MySQL instance can connect to specific IP.

  Does anyone have experience with this kind of setup? I know that one
can run more than one instance of MySQL on single server, executed on
different sets of data and connected to different ports. However, I'm
not sure if it's possible to setup and if it's stable in cluster
environment with HA? if it's possible, what are things I should take
care of?

  Finally, for setup like this would you use available servers (3 of 'em
in my case) as cluster nodes, or would you use 'em as bare metal
machines for virtual servers with specific roles (eg. web server, MySQL
master, Memcached, etc.), that can be easily moved from one base metal
machine to other, in case of failure?

  Best Regards,
  Nikola
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110215/5816f441/attachment.htm>

From ajb2 at mssl.ucl.ac.uk  Tue Feb 15 21:35:37 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Tue, 15 Feb 2011 21:35:37 +0000
Subject: [Linux-cluster] Linux-cluster Digest, Vol 82, Issue 20
In-Reply-To: <mailman.1288.1297804172.10127.linux-cluster@redhat.com>
References: <mailman.1288.1297804172.10127.linux-cluster@redhat.com>
Message-ID: <4D5AF1A9.3000307@mssl.ucl.ac.uk>



>> I'm seeing heartbeat/lock lan traffic peak out at about 120kb/s and
>> 4000pps per node at the moment. Clearly the switch isn't the problem -
>> and using hardware acclerated igb devices I'm pretty sure the
>> networking's fine too.
>>
> During the actual workload, or just during the ping pong test?

During the actual workload.






From list at fajar.net  Tue Feb 15 22:09:36 2011
From: list at fajar.net (Fajar A. Nugraha)
Date: Wed, 16 Feb 2011 05:09:36 +0700
Subject: [Linux-cluster] Organizing 3 servers into cluster
In-Reply-To: <4D5AEF7B.7030700@logik-internet.rs>
References: <4D5AEF7B.7030700@logik-internet.rs>
Message-ID: <AANLkTikaF-vNzd4qkSMrw_OSJEiNzRgxOLCTPDzpzuCT@mail.gmail.com>

On Wed, Feb 16, 2011 at 4:26 AM, Nikola Savic <niks at logik-internet.rs> wrote:
>
> ? Hello,
>
> ? I need to setup cluster using 3 servers. Thanks to everybody involved from
> this mailing list in previous post, we have concluded that DRBD+GFS2 is the
> best approach for building shared storage from local hard drives. It will
> enable mirroring of data between nodes using DRBD, and concurrent access to
> file systems thanks to GFS2.
>
> ? Main purpose of this cluster is hosting of single web site (web
> application). Main services we'll have are Web server (httpd) and MySQL. We
> also use memcached for shared session and caching. Cluster should provide
> following benefits:
> - High Availability
> - High Performance (balancing of web application execution on cluster nodes)
> - Traffic balancing

Before you get your hopes too high, make sure you test it first.
DRBD will have some performance penalty compared to plain local block
device, and GFS (or any other cluster file system) will have some
performance penalty compared to ext3/4. Then there's also the
additional layer of complexity involved (e.g.fencing, cluster service,
etc.). Whether or not the penalty is acceptable depends on your needs.

Depending on your needs, it might be possible that the "best" setup
would be to dedicate one of the nodes as NAS server using nfs4 on top
of ext4 for the other two nodes, and setup two floating IPs with
something like vrrp.

>
> ? This means that all 3 servers will execute web application and provide
> content to visitors. We didn't plan to use load balancers in front of web
> servers, because traffic balancing is important. That is why DNS round-robin
> approach was planned, which we already use for two server architecture used
> at moment. Web server on each node will be directly accessed by visitors,
> spread by use of DNS round-robin. One of servers will have MySQL Master used
> for writing, while other two will have MySQL Slave instances for reading.
> Each MySQL instance will execute on separate data on shared storage.
>
> ? I was confused by following line in RedHat's Cluster documentation,
> related to High Availablity: "An HA service can run on only one cluster node
> at a time to maintain data integrity". Does this mean that web servers can
> not work in parallel on all cluster nodes? Or, is this limitaion related to
> combination of IP address and service (eg. web server on IP 10.1.1.1)?

I believe it's also related to what filesystem and what service you
use. When you use ext3/4 for storage, obviously it can only be mounted
on one node. Similar thing with application, MySQL requires exclusive
access to its data directory while httpd has no problem sharing it's
DocumentRoot with other http instances.

-- 
Fajar



From sachinbhugra at hotmail.com  Tue Feb 15 22:24:34 2011
From: sachinbhugra at hotmail.com (sachin)
Date: Wed, 16 Feb 2011 03:54:34 +0530
Subject: [Linux-cluster] Cluster node hangs
In-Reply-To: <AANLkTimZ9E4xm8qS_bb7-NcxY7X+u0Eg1nYaCXK0iuKj@mail.gmail.com>
References: <SNT112-W544548584AD789563BB1ECDAD10@phx.gbl>	<4D57A763.8030700@redhat.com>
	<4D57A9F3.90408@redhat.com>	<SNT112-W456C3B1F45B101BABEDFDFDAD10@phx.gbl>
	<AANLkTimZ9E4xm8qS_bb7-NcxY7X+u0Eg1nYaCXK0iuKj@mail.gmail.com>
Message-ID: <SNT112-DS164BF14E62B91918518549DAD30@phx.gbl>

Sorry for the delay friends. Actually, logs are scattered in different log
files:

 

1.       For rgmamager logs I have configured /var/log/cluster.log

2.       Other cluster logs are going to messages file. Presently I am
trying to find a way using which I can gather all the logs under one file
other than messages. Seems I can use <logging> feature in cluster.conf,
comments??

 

I am having openldap logging enabled on this server which is also using
local4 facility and logs from cluster and ldap are getting mixed up.

 

 

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of dOminic
Sent: Sunday, February 13, 2011 8:03 PM
To: linux clustering
Subject: Re: [Linux-cluster] Cluster node hangs

 

Hi,

 

Whats the msg you are getting in logs ?. It would be great if you could
attach log mesgs along with cluster.conf 

 

-dominic 

 

On Sun, Feb 13, 2011 at 3:49 PM, Sachin Bhugra <sachinbhugra at hotmail.com>
wrote:

Thank for the reply and link. However, GFS2 is not listed in fstab, it is
only handled by cluster config.

  _____  

Date: Sun, 13 Feb 2011 10:52:51 +0100
From: ekuric at redhat.com
To: linux-cluster at redhat.com
Subject: Re: [Linux-cluster] Cluster node hangs



On 02/13/2011 10:41 AM, Elvir Kuric wrote: 

On 02/13/2011 10:14 AM, Sachin Bhugra wrote: 

Hi ,

I have setup a two node cluster in lab, with Vmware Server, and hence used
manual fencing. It includes a iSCSI GFS2 partition and it service Apache in
Active/Passive mode.

Cluster works and I am able to relocate service between nodes with no
issues. However, the problem comes when I shutdown the node, for testing,
which is presently holding the service. When the node becomes unavailable,
service gets relocated and GFS partition gets mounted on the other node,
however it is not accessible. If I try to do a "ls/du" on GFS partition, the
command hangs. On the other hand the node which was shutdown gets stuck at
"unmounting file system". 

I tried using fence_manual -n nodename and then fence_ack_manual -n
nodename, however it still remains the same.

Can someone please help me is what I am doing wrong?

Thanks, 




--


Linux-cluster mailing list


Linux-cluster at redhat.com


https://www.redhat.com/mailman/listinfo/linux-cluster

It would be good to see  /etc/fstab configuration used on cluster nodes. If
/gfs partition is mounted manually it will not be unmounted correctly in
case you restart node ( and not executing umount prior restart ), and will
hang during shutdown/reboot process.

More at:
http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/6/html-single/Glo
bal_File_System_2/index.html


Edit: above link, section 3.4 Special Considerations when Mounting GFS2 File
Systems 



Regards, 

Elvir 

 

 




--


Linux-cluster mailing list


Linux-cluster at redhat.com


https://www.redhat.com/mailman/listinfo/linux-cluster

 

-- Linux-cluster mailing list Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster 


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110216/37af12f5/attachment.htm>

From vincent.blondel at ing.be  Wed Feb 16 05:55:35 2011
From: vincent.blondel at ing.be (vincent.blondel at ing.be)
Date: Wed, 16 Feb 2011 06:55:35 +0100
Subject: [Linux-cluster] Two nodes DRBD - Fail-Over Actif/Passif Cluster.
In-Reply-To: <4D5AEB7A.6090207@bobich.net>
References: <294881FE3F4013418806F0CE6E73A7B6052F302466@VPNLCMS92081.europe.intranet>
	<294881FE3F4013418806F0CE6E73A7B6052F302474@ing.com>
	<4D5AEB7A.6090207@bobich.net>
Message-ID: <294881FE3F4013418806F0CE6E73A7B6052F302477@ing.com>

>>> below the cluster.conf file ...
>>>
>>>
>>> <?xml version="1.0"?>
>>> <cluster name="cluster" config_version="6">
>>>    <!-- post_join_delay: number of seconds the daemon will wait before
>>>                          fencing any victims after a node joins the domain
>>>         post_fail_delay: number of seconds the daemon will wait before
>>>                        fencing any victims after a domain member fails
>>>         clean_start    : prevent any startup fencing the daemon might do.
>>>                        It indicates that the daemon should assume all nodes
>>>                        are in a clean state to start. -->
>>>    <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
>>>    <clusternodes>
>>>      <clusternode name="reporter1.lab.intranet" votes="1" nodeid="1">
>>>        <fence>
>>>          <!-- Handle fencing manually -->
>>>          <method name="human">
>>>            <device name="human" nodename="reporter1.lab.intranet"/>
>>>          </method>
>>>        </fence>
>>>      </clusternode>
>>>      <clusternode name="reporter2.lab.intranet" votes="1" nodeid="2">
>>>        <fence>
>>>          <!-- Handle fencing manually -->
>>>          <method name="human">
>>>            <device name="human" nodename="reporter2.lab.intranet"/>
>>>          </method>
>>>        </fence>
>>>      </clusternode>
>>>    </clusternodes>
>>>    <!-- cman two nodes specification -->
>>>    <cman expected_votes="1" two_node="1"/>
>>>    <fencedevices>
>>>      <!-- Define manual fencing -->
>>>      <fencedevice name="human" agent="fence_manual"/>
>>>    </fencedevices>
>>>    <rm>
>>>       <failoverdomains>
>>>          <failoverdomain name="example_pri" nofailback="0" ordered="1" restricted="0">
>>>             <failoverdomainnode name="reporter1.lab.intranet" priority="1"/>
>>>             <failoverdomainnode name="reporter2.lab.intranet" priority="2"/>
>>>          </failoverdomain>
>>>       </failoverdomains>
>>>       <resources>
>>>             <ip address="10.30.30.92" monitor_link="on" sleeptime="10"/>
>>>             <apache config_file="conf/httpd.conf" name="example_server" server_root="/etc/httpd" shutdown_wait="0"/>
>>>        </resources>
>>>        <service autostart="1" domain="example_pri" exclusive="0" name="example_apache" recovery="relocate">
>>>                  <ip ref="10.30.30.92"/>
>>>                  <apache ref="example_server"/>
>>>        </service>
>>>    </rm>
>>> </cluster>
>>>
>>> and this is the result I get on both servers ...
>>>
>>> [root at reporter1 ~]# clustat
>>> Cluster Status for cluster @ Mon Feb 14 22:22:53 2011
>>> Member Status: Quorate
>>>
>>>   Member Name                                      ID   Status
>>>   ------ ----                                      ---- ------
>>>   reporter1.lab.intranet                               1 Online, Local, rgmanager
>>>   reporter2.lab.intranet                               2 Online, rgmanager
>>>
>>>   Service Name                            Owner (Last)                            State
>>>   ------- ----                            ----- ------                            -----
>>>   service:example_apache                  (none)                                  stopped
>>>
>>> as you can see, everything is stopped or in other words nothing runs .. so my question are :
>
>Having a read through /var/log/messages for possible causes would be a
>good start.
>

this is what I see in the /var/log/messages file ...

Feb 16 07:36:54 reporter1 corosync[1250]:   [MAIN  ] Corosync Cluster Engine ('1.2.3'): started and ready to provide service.
Feb 16 07:36:54 reporter1 corosync[1250]:   [MAIN  ] Corosync built-in features: nss rdma
Feb 16 07:36:54 reporter1 corosync[1250]:   [MAIN  ] Successfully read config from /etc/cluster/cluster.conf
Feb 16 07:36:54 reporter1 corosync[1250]:   [MAIN  ] Successfully parsed cman config
Feb 16 07:36:54 reporter1 corosync[1250]:   [TOTEM ] Initializing transport (UDP/IP).
Feb 16 07:36:54 reporter1 corosync[1250]:   [TOTEM ] Initializing transmit/receive security: libtomcrypt SOBER128/SHA1HMAC (mode 0).
Feb 16 07:36:55 reporter1 corosync[1250]:   [TOTEM ] The network interface [10.30.30.90] is now up.
Feb 16 07:36:55 reporter1 corosync[1250]:   [QUORUM] Using quorum provider quorum_cman
Feb 16 07:36:55 reporter1 corosync[1250]:   [SERV  ] Service engine loaded: corosync cluster quorum service v0.1
Feb 16 07:36:55 reporter1 corosync[1250]:   [CMAN  ] CMAN 3.0.12 (built Aug 17 2010 14:08:49) started
Feb 16 07:36:55 reporter1 corosync[1250]:   [SERV  ] Service engine loaded: corosync CMAN membership service 2.90
Feb 16 07:36:55 reporter1 corosync[1250]:   [SERV  ] Service engine loaded: openais checkpoint service B.01.01
Feb 16 07:36:55 reporter1 corosync[1250]:   [SERV  ] Service engine loaded: corosync extended virtual synchrony service
Feb 16 07:36:55 reporter1 corosync[1250]:   [SERV  ] Service engine loaded: corosync configuration service
Feb 16 07:36:55 reporter1 corosync[1250]:   [SERV  ] Service engine loaded: corosync cluster closed process group service v1.01
Feb 16 07:36:55 reporter1 corosync[1250]:   [SERV  ] Service engine loaded: corosync cluster config database access v1.01
Feb 16 07:36:55 reporter1 corosync[1250]:   [SERV  ] Service engine loaded: corosync profile loading service
Feb 16 07:36:55 reporter1 corosync[1250]:   [QUORUM] Using quorum provider quorum_cman
Feb 16 07:36:55 reporter1 corosync[1250]:   [SERV  ] Service engine loaded: corosync cluster quorum service v0.1
Feb 16 07:36:55 reporter1 corosync[1250]:   [MAIN  ] Compatibility mode set to whitetank.  Using V1 and V2 of the synchronization engine.
Feb 16 07:36:55 reporter1 corosync[1250]:   [TOTEM ] A processor joined or left the membership and a new membership was formed.
Feb 16 07:36:55 reporter1 corosync[1250]:   [CMAN  ] quorum regained, resuming activity
Feb 16 07:36:55 reporter1 corosync[1250]:   [QUORUM] This node is within the primary component and will provide service.
Feb 16 07:36:55 reporter1 corosync[1250]:   [QUORUM] Members[1]: 1
Feb 16 07:36:55 reporter1 corosync[1250]:   [QUORUM] Members[1]: 1
Feb 16 07:36:55 reporter1 corosync[1250]:   [CPG   ] downlist received left_list: 0
Feb 16 07:36:55 reporter1 corosync[1250]:   [CPG   ] chosen downlist from node r(0) ip(10.30.30.90)
Feb 16 07:36:55 reporter1 corosync[1250]:   [MAIN  ] Completed service synchronization, ready to provide service.
Feb 16 07:36:56 reporter1 fenced[1302]: fenced 3.0.12 started
Feb 16 07:36:57 reporter1 dlm_controld[1319]: dlm_controld 3.0.12 started
Feb 16 07:36:57 reporter1 gfs_controld[1374]: gfs_controld 3.0.12 started
Feb 16 07:37:03 reporter1 corosync[1250]:   [TOTEM ] A processor joined or left the membership and a new membership was formed.
Feb 16 07:37:03 reporter1 corosync[1250]:   [QUORUM] Members[2]: 1 2
Feb 16 07:37:03 reporter1 corosync[1250]:   [QUORUM] Members[2]: 1 2
Feb 16 07:37:03 reporter1 corosync[1250]:   [CPG   ] downlist received left_list: 0
Feb 16 07:37:03 reporter1 corosync[1250]:   [CPG   ] downlist received left_list: 0
Feb 16 07:37:03 reporter1 corosync[1250]:   [CPG   ] chosen downlist from node r(0) ip(10.30.30.90)

>>> do I have to configure manually load balanced ip 10.30.30.92 as an alias ip on both sides or is it done automatically by redhat cluster ?
>
>RHCS will automatically assign the IP to an interface that is on the
>same subnet. You most definitely shouldn't create the IP manually on any
>of the nodes.
>
>>> I just made a simple try with apache but I do not find anywhere reference to the start/stop script for apache in the examples, is that normal ??
>>> do you have some best practice regarding this picture ??
>
>I'm not familiar with the <apache> tag in cluster.conf, I usually
>configure most things as init script resources.
>
>Gordan
-----------------------------------------------------------------
ATTENTION:
This e-mail is intended for the exclusive use of the
recipient(s). This e-mail and its attachments, if any, contain
confidential information and/or information protected by
intellectual property rights or other rights. This e-mail does
not constitute any commitment for ING Belgium except when
expressly otherwise agreed in a written agreement between the
intended recipient and ING Belgium.

If you receive this message by mistake, please, notify the sender
with the "reply" option and delete immediately this e-mail from
your system, and destroy all copies of it. You may not, directly
or indirectly, use this e-mail or any part of it if you are not
the intended recipient.

Messages and attachments are scanned for all viruses known. If
this message contains password-protected attachments, the files
have NOT been scanned for viruses by the ING mail domain. Always
scan attachments before opening them.
-----------------------------------------------------------------
ING Belgium SA/NV - Bank/Lender - Avenue Marnix 24, B-1000
Brussels, Belgium - Brussels RPM/RPR - VAT BE 0403.200.393 -
BIC (SWIFT) : BBRUBEBB - Account: 310-9156027-89 (IBAN BE45 3109
1560 2789).
An insurance broker, registered with the Banking, Finance and
Insurance Commission under the code number 12381A.

ING Belgique SA - Banque/Preteur, Avenue Marnix 24, B-1000
Bruxelles - RPM Bruxelles - TVA BE 0403 200 393 - BIC (SWIFT) :
BBRUBEBB - Compte: 310-9156027-89 (IBAN: BE45 3109 1560 2789).
Courtier d'assurances inscrit a la CBFA sous le numero 12381A.

ING Belgie NV - Bank/Kredietgever - Marnixlaan 24, B-1000 Brussel
- RPR Brussel - BTW BE 0403.200.393 - BIC (SWIFT) : BBRUBEBB -
Rekening: 310-9156027-89 (IBAN: BE45 3109 1560 2789).
Verzekeringsmakelaar ingeschreven bij de CBFA onder het nr.
12381A.
-----------------------------------------------------------------




From shariq.siddiqui at yahoo.com  Wed Feb 16 10:04:00 2011
From: shariq.siddiqui at yahoo.com (Shariq Siddiqui)
Date: Wed, 16 Feb 2011 02:04:00 -0800 (PST)
Subject: [Linux-cluster] RAW Devices performance issue
Message-ID: <506336.34162.qm@web39801.mail.mud.yahoo.com>





Dear All,

I am going to install Oracle RAC on two Servers, With shared SAN storage 
(Servers and Storage is IBM)
OS = RHEL 5u5 x64 bit

And we used multipathing mechanism and created multipathing devices.
i.e. /dev/mapper/mpath1.

Then I created raw device /dev/raw/raw1 of this /dev/mapper/mpath1 Block device 
as per pre-reqs for Oracle Cluster.

Every thing looks good, But we faced the performance issue as under...

when we run command :
#dd if=/dev/zero of=/dev/mapper/mpath1 bs=1024 count=1000
the writing rate is approx. 34 MB/s

But If we run command
#dd if=/dev/zero of=/dev/raw/raw1 bs=1024 count=1000
the writing rate is very slow like 253 KB/s

Please advice how to tune the performance.




 
Best Regards,

Shariq Siddiqui


      
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110216/ff6dea47/attachment.htm>

From list at fajar.net  Wed Feb 16 10:34:07 2011
From: list at fajar.net (Fajar A. Nugraha)
Date: Wed, 16 Feb 2011 17:34:07 +0700
Subject: [Linux-cluster] RAW Devices performance issue
In-Reply-To: <506336.34162.qm@web39801.mail.mud.yahoo.com>
References: <506336.34162.qm@web39801.mail.mud.yahoo.com>
Message-ID: <AANLkTin-Ce2qJ5zOhzm9gssCW=oExFNF50D_z6RZUL=Y@mail.gmail.com>

On Wed, Feb 16, 2011 at 5:04 PM, Shariq Siddiqui
<shariq.siddiqui at yahoo.com> wrote:
>
>
> Dear All,
>
> I am going to install Oracle RAC on two Servers, With shared SAN storage (Servers and Storage is IBM)
> OS = RHEL 5u5 x64 bit
>
> And we used multipathing mechanism and created multipathing devices.
> i.e. /dev/mapper/mpath1.
>
> Then I created raw device /dev/raw/raw1 of this /dev/mapper/mpath1 Block device as per pre-reqs for Oracle Cluster.
>
> Every thing looks good, But we faced the performance issue as under...
>
> when we run command :
> #dd if=/dev/zero of=/dev/mapper/mpath1 bs=1024 count=1000
> the writing rate is approx. 34 MB/s
>
> But If we run command
> #dd if=/dev/zero of=/dev/raw/raw1 bs=1024 count=1000
> the writing rate is very slow like 253 KB/s
>
> Please advice how to tune the performance.

Shouldn't you ask Oracle about that?

My GUESS is that in the first one the I/O is buffered, while in the
second /dev/raw/raw* is simply a block device opened with O_DIRECT
(thus bypassing buffer cache). You may want to retry dd with
"oflags=direct" and compare the results.

You might want to look at
http://en.wikipedia.org/wiki/Raw_device
http://download.oracle.com/docs/cd/B19306_01/relnotes.102/b15659/toc.htm#CJAICHEG

Again, better ask Oracle if you want to be sure.

-- 
Fajar



From fajar at fajar.net  Wed Feb 16 10:37:08 2011
From: fajar at fajar.net (Fajar A. Nugraha)
Date: Wed, 16 Feb 2011 17:37:08 +0700
Subject: [Linux-cluster] RAW Devices performance issue
In-Reply-To: <506336.34162.qm@web39801.mail.mud.yahoo.com>
References: <506336.34162.qm@web39801.mail.mud.yahoo.com>
Message-ID: <AANLkTinrRbK+mAPHbcRsRRB2n4zAWD8jK5TSLYL8WptM@mail.gmail.com>

On Wed, Feb 16, 2011 at 5:04 PM, Shariq Siddiqui
<shariq.siddiqui at yahoo.com> wrote:
>
>
> Dear All,
>
> I am going to install Oracle RAC on two Servers, With shared SAN storage (Servers and Storage is IBM)
> OS = RHEL 5u5 x64 bit
>
> And we used multipathing mechanism and created multipathing devices.
> i.e. /dev/mapper/mpath1.
>
> Then I created raw device /dev/raw/raw1 of this /dev/mapper/mpath1 Block device as per pre-reqs for Oracle Cluster.
>
> Every thing looks good, But we faced the performance issue as under...
>
> when we run command :
> #dd if=/dev/zero of=/dev/mapper/mpath1 bs=1024 count=1000
> the writing rate is approx. 34 MB/s
>
> But If we run command
> #dd if=/dev/zero of=/dev/raw/raw1 bs=1024 count=1000
> the writing rate is very slow like 253 KB/s
>
> Please advice how to tune the performance.

Shouldn't you ask Oracle about that?

My GUESS is that in the first one the I/O is buffered, while in the
second /dev/raw/raw* is simply a block device opened with O_DIRECT
(thus bypassing buffer cache). You may want to retry dd with
"oflags=direct" and compare the results.

You might want to look at
http://en.wikipedia.org/wiki/Raw_device
http://download.oracle.com/docs/cd/B19306_01/relnotes.102/b15659/toc.htm#CJAICHEG

Again, better ask Oracle if you want to be sure.

-- 
Fajar



From stefan at lsd.co.za  Wed Feb 16 10:36:37 2011
From: stefan at lsd.co.za (Stefan Lesicnik)
Date: Wed, 16 Feb 2011 12:36:37 +0200 (SAST)
Subject: [Linux-cluster] RAW Devices performance issue
In-Reply-To: <506336.34162.qm@web39801.mail.mud.yahoo.com>
Message-ID: <844193781.8667.1297852597836.JavaMail.root@zcs-jhb-lsd>





----- Original Message -----
> From: "Shariq Siddiqui" <shariq.siddiqui at yahoo.com>
> To: linux4oracle at yahoogroups.com, linux-cluster at redhat.com
> Sent: Wednesday, 16 February, 2011 12:04:00 PM
> Subject: [Linux-cluster] RAW Devices performance issue
> Dear All,
> 
> I am going to install Oracle RAC on two Servers, With shared SAN
> storage (Servers and Storage is IBM)
> OS = RHEL 5u5 x64 bit
> 
> And we used multipathing mechanism and created multipathing devices.
> i.e. /dev/mapper/mpath1.
> 
> Then I created raw device /dev/raw/raw1 of this /dev/mapper/mpath1
> Block device as per pre-reqs for Oracle Cluster.
> 
> Every thing looks good, But we faced the performance issue as under...
> 
> when we run command :
> #dd if=/dev/zero of=/dev/mapper/mpath1 bs=1024 count=1000
> the writing rate is approx. 34 MB/s
> 
> But If we run command
> #dd if=/dev/zero of=/dev/raw/raw1 bs=1024 count=1000
> the writing rate is very slow like 253 KB/s
> 
> Please advice how to tune the performance.

Hi,

I dont know anything about using raw devices, but I do know the write speed through the multipath device for the SAN is slow.

Try fix that performance first - check SAN cache write is enabled, check your raid levels and over how many disks.
I cant say what you should get, but i've seen local non raided disks write much faster

Stefan
  



From swhiteho at redhat.com  Wed Feb 16 11:13:38 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Wed, 16 Feb 2011 11:13:38 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <13588217.76.1297800448396.JavaMail.marc@mobilix-20>
References: <13588217.76.1297800448396.JavaMail.marc@mobilix-20>
Message-ID: <1297854818.2464.20.camel@dolmen>

Hi,

On Tue, 2011-02-15 at 21:07 +0100, Marc Grimme wrote:
> Hi Steve,
> I think lately I observed a very similar behavior with RHEL5 and gfs2.
> It was a gfs2 filesystem that had about 2Mio files with sum of 2GB in a directory. When I did a du -shx . in this directory it took about 5 Minutes (noatime mountoption given). Independently on how much nodes took part in the cluster (in the end I only tested with one node). This was only for the first time running all later executed du commands were much faster.
> When I mounted the exact same filesystem with lockproto=lock_nolock it took about 10-20 seconds to proceed with the same command.
> 
> Next I started to analyze this with oprofile and observed the following result:
> 
> opreport --long-file-names:
> CPU: AMD64 family10, speed 2900.11 MHz (estimated)
> Counted CPU_CLK_UNHALTED events (Cycles outside of halt state) with a unit mask of 0x00 (No unit mask) count 100000
> samples  %        symbol name
> 200569   46.7639  search_rsb_list
The resource table size is by default 256 entries in size. Assuming that
you have enough ram that all 4m locks (for 2m files) are in memory at
the same time, that is approx 15625 resources per hash chain, so it
would make sense that this would start to slow things down a bit.

There is a config option to increase the resource table size though, so
perhaps you could try that?

> 118905   27.7234  create_lkb
This reads down a hash chain in the lkb table. That table is larger by
default (1024), which is probably why there is less cpu time burned
here. On the other hand, the hash chain might be read more than once if
there is a collision on the lock ids. Again it is a config option, so it
should be possible to increase the size of the table.

> 32499     7.5773  search_bucket
> 4125      0.9618  find_lkb
> 3641      0.8489  process_send_sockets
> 3420      0.7974  dlm_scan_rsbs
> 3184      0.7424  _request_lock
> 3012      0.7023  find_rsb
> 2735      0.6377  receive_from_sock
> 2610      0.6085  _receive_message
> 2543      0.5929  dlm_allocate_rsb
> 2299      0.5360  dlm_hash2nodeid
> 2228      0.5195  _create_message
> 2180      0.5083  dlm_astd
> 2163      0.5043  dlm_find_lockspace_global
> 2109      0.4917  dlm_find_lockspace_local
> 2074      0.4836  dlm_lowcomms_get_buffer
> 2060      0.4803  dlm_lock
> 1982      0.4621  put_rsb
> ..
> 
> opreport --image /gfs2
> CPU: AMD64 family10, speed 2900.11 MHz (estimated)
> Counted CPU_CLK_UNHALTED events (Cycles outside of halt state) with a unit mask of 0x00 (No unit mask) count 100000
> samples  %        symbol name
> 9310     15.5600  search_bucket
This should get better in RHEL6.1 and above, due to the new design of
glock hash table. The patch is already in upstream. The glock hash table
is much larger than the dlm hash table, though there are still
scalability issues due to the locking and that we cannot currently grow
the hash table.

> 6268     10.4758  do_promote
The result in do_promote is interesting, as I wouldn't have expected
that to show up here really, so I'll look into that when I have a moment
and try to figure out what is going on.

> 2704      4.5192  gfs2_glock_put
> 2289      3.8256  gfs2_glock_hold
> 2286      3.8206  gfs2_glock_schedule_for_reclaim
> 2204      3.6836  gfs2_glock_nq
> 2204      3.6836  run_queue
> 2001      3.3443  gfs2_holder_wake
> ..
> 
> opreport --image /dlm
> CPU: AMD64 family10, speed 2900.11 MHz (estimated)
> Counted CPU_CLK_UNHALTED events (Cycles outside of halt state) with a unit mask of 0x00 (No unit mask) count 100000
> samples  %        symbol name
> 200569   46.7639  search_rsb_list
> 118905   27.7234  create_lkb
> 32499     7.5773  search_bucket
> 4125      0.9618  find_lkb
> 3641      0.8489  process_send_sockets
> 3420      0.7974  dlm_scan_rsbs
> 3184      0.7424  _request_lock
> 3012      0.7023  find_rsb
> 2735      0.6377  receive_from_sock
> 2610      0.6085  _receive_message
> 2543      0.5929  dlm_allocate_rsb
> 2299      0.5360  dlm_hash2nodeid
> 2228      0.5195  _create_message
> ..
> 
> This very much reminded me on a similar test we've done years ago with gfs (see http://www.open-sharedroot.org/Members/marc/blog/blog-on-dlm/red-hat-dlm-__find_lock_by_id/profile-data-with-diffrent-table-sizes).
> 
> Does this not show that during the du command 46% of the time the kernel stays in the dlm:search_rsb_list function while looking out for locks. It still looks like the hashtable for the lock in dlm is much too small and searching inside the hashmap is not constant anymore?
> 
> I would be really interesting how long the described backup takes when the gfs2 filesystem is mounted exclusively on one node without locking.
> For me it looks like you're facing a similar problem with gfs2 that has been worked around with gfs by introducing the glock_purge functionality that leads to a much smaller glock->dlm->hashtable and makes backups and the like much faster.
> 
> I hope this helps.
> 
> Thanks and regards
> Marc.
> 
Many thanks for this information, it is really helpful to get feedback
like this which helps identify issues in the code,

Steve.




From chauhan.anujsingh at gmail.com  Wed Feb 16 11:54:37 2011
From: chauhan.anujsingh at gmail.com (Anuj)
Date: Wed, 16 Feb 2011 17:24:37 +0530
Subject: [Linux-cluster] Linux-cluster Digest, Vol 82, Issue 19
In-Reply-To: <mailman.47.1297789206.24599.linux-cluster@redhat.com>
References: <mailman.47.1297789206.24599.linux-cluster@redhat.com>
Message-ID: <AANLkTikqngwEnC8k+uhTXbE+d6=wZ5pgETea-7LCgyaf@mail.gmail.com>

Hi,
Hello to all !

will you please guide me how can do a practice of clustrign as well as
loadbalancer for testing enviorment can all of you please guide me what are
the basic requirements

i have three centos machine apache,Mysql and postfix is runing on these
machines


-- 
*Regards.*.//
Anuj Singh Chauhan
(Voice): 09013203509*
*


On Tue, Feb 15, 2011 at 10:30 PM, <linux-cluster-request at redhat.com> wrote:

> Send Linux-cluster mailing list submissions to
>        linux-cluster at redhat.com
>
> To subscribe or unsubscribe via the World Wide Web, visit
>        https://www.redhat.com/mailman/listinfo/linux-cluster
> or, via email, send a message with subject or body 'help' to
>        linux-cluster-request at redhat.com
>
> You can reach the person managing the list at
>        linux-cluster-owner at redhat.com
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Linux-cluster digest..."
>
>
> Today's Topics:
>
>   1. Re: Cluster with shared storage on low budget (Gordan Bobic)
>   2. Re: Cluster with shared storage on low budget (Jeff Sturm)
>   3. Re: Cluster with shared storage on low budget (Gordan Bobic)
>   4. Re: Cluster with shared storage on low budget (Bob Peterson)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Tue, 15 Feb 2011 13:31:42 +0000
> From: Gordan Bobic <gordan at bobich.net>
> To: linux clustering <linux-cluster at redhat.com>
> Subject: Re: [Linux-cluster] Cluster with shared storage on low budget
> Message-ID: <4D5A803E.5090106 at bobich.net>
> Content-Type: text/plain; charset=ISO-8859-1; format=flowed
>
> Nikola Savic wrote:
> > Gordan Bobic wrote:
> >> Something else just occurs to me - you mentioned MySQL. You do realize
> >> that the performance of it will be attrocious on a shared cluster file
> >> system (ANY shared cluster file system), right? Unless you only intend
> >> to run mysqld on a single node at a time (in which case there's no
> >> point in putting it on a cluster file system).
> >
> >   MySQL Master and Slave(s) will run on single node. No two MySQL
> > instances will run on same set of data. Shared storage for MySQL data
> > should enable easier movement of MySQL instance between nodes. Eg. when
> > MySQL master needs to be moved from one node to other, I assume it would
> > be easier with DRBD, because I would "only" need to stop MySQL on one
> > node and start it on other configured to use same set of data.
>
> There is a better way to do that. Run DRBD in active-passive mode, and
> grab the fail-over scripts from heartbeat. Then set up a dependency in
> cluster.conf that will handle a combined service of DRBD disk (handling
> active/passive switch), file system (mounting the fs once the DRBD
> becomes active locally, and mysql. You define them as dependant on each
> other in cluster.conf by suitable nesting.
>
> > Additionally, floating IP address assigned to MySQL master would need to
> > be re-assigned to new node.
>
> You can make that IP a part of the dependency stack mentioned above.
>
> > Slaves would also need to be restarted to
> > connect to new master. Even without floating IP used only my MySQL
> > Master, slaves and web application can easily be reconfigured to use new
> > IP. Do you see problem in this kind of setup?
>
> If the IP fails over and the FS is consistent you don't need to change
> any configs - MySQL slaves will re-try connecting until they succeed.
> Just make sure your bin-logs are on the same mount as the rest of MySQL,
> since they have to fail over with the rest of the DB.
>
> Gordan
>
>
>
> ------------------------------
>
> Message: 2
> Date: Tue, 15 Feb 2011 10:55:43 -0500
> From: Jeff Sturm <jeff.sturm at eprize.com>
> To: linux clustering <linux-cluster at redhat.com>
> Subject: Re: [Linux-cluster] Cluster with shared storage on low budget
> Message-ID:
>        <64D0546C5EBBD147B75DE133D798665F0855C0F4 at hugo.eprize.local>
> Content-Type: text/plain; charset="us-ascii"
>
> > -----Original Message-----
> > From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com]
> > On Behalf Of Gordan Bobic
> > Sent: Tuesday, February 15, 2011 7:05 AM
> >
> > Volume resizing is, IMO, over-rated and unnecessary in most cases,
> except where data
> > growth is quite mind-boggling (in which case you won't be using MySQL
> anyway).
>
> We actually resize volumes often.  Some of our storage volumes have 30
> LUNs or more.  We have so many because we've virtualized most of our
> infrastructure, and some of the hosts are single-purpose hosts.
>
> We don't want to allocate too more storage in advance, simply because
> it's easier to grow than to shrink.  Stop the host, grow the volume,
> e2fsck/resize2fs, start up and go.  Much nicer than increasing disk
> capacity on physical hosts.
>
> CLVM works well for this, but that's about all it's good for IMHO.  I
> prefer to use the SAN's native volume management over CLVM when
> available.
>
> Haven't tried DRBD yet but I'm really tempted... it sounds like it has
> come a long way since its modest beginnings.
>
> -Jeff
>
>
>
>
>
> ------------------------------
>
> Message: 3
> Date: Tue, 15 Feb 2011 16:17:03 +0000
> From: Gordan Bobic <gordan at bobich.net>
> To: linux clustering <linux-cluster at redhat.com>
> Subject: Re: [Linux-cluster] Cluster with shared storage on low budget
> Message-ID: <4D5AA6FF.8080608 at bobich.net>
> Content-Type: text/plain; charset=windows-1252; format=flowed
>
> Jeff Sturm wrote:
> >> -----Original Message-----
> >> From: linux-cluster-bounces at redhat.com
> > [mailto:linux-cluster-bounces at redhat.com]
> >> On Behalf Of Gordan Bobic
> >> Sent: Tuesday, February 15, 2011 7:05 AM
> >>
> >> Volume resizing is, IMO, over-rated and unnecessary in most cases,
> > except where data
> >> growth is quite mind-boggling (in which case you won't be using MySQL
> > anyway).
> >
> > We actually resize volumes often.  Some of our storage volumes have 30
> > LUNs or more.  We have so many because we've virtualized most of our
> > infrastructure, and some of the hosts are single-purpose hosts.
> >
> > We don't want to allocate too more storage in advance, simply because
> > it's easier to grow than to shrink.  Stop the host, grow the volume,
> > e2fsck/resize2fs, start up and go.  Much nicer than increasing disk
> > capacity on physical hosts.
>
> Seems labour and downtime intensive to me. Maybe I'm just used to
> environments where that is an unacceptable tradeoff vs. ?40/TB for storage.
>
> Not to mention that it makes you totally reliant on SAN level
> redundancy, which I also generally deem unacceptable except on very high
> end SANs that have mirroring features.
>
> Additionally, considering you can self-build a multi-TB iSCSI SAN for a
> few hundred ?/$/? which will have volume growing ability (use sparse
> files for iSCSI volumes and write a byte to a greater offset), I cannot
> really see any justification whatsoever for using LVM with SAN based
> storage.
>
> > Haven't tried DRBD yet but I'm really tempted... it sounds like it has
> > come a long way since its modest beginnings.
>
> Not sure how far back you are talking about but I have been using it in
> production in both active-active and active-passive configurations since
> at least 2007 with no problems. From the usage point of view, the
> changes have been negligible.
>
> Gordan
>
>
>
> ------------------------------
>
> Message: 4
> Date: Tue, 15 Feb 2011 11:24:26 -0500 (EST)
> From: Bob Peterson <rpeterso at redhat.com>
> To: linux clustering <linux-cluster at redhat.com>
> Subject: Re: [Linux-cluster] Cluster with shared storage on low budget
> Message-ID:
>        <
> 263367529.33108.1297787066881.JavaMail.root at zmail06.collab.prod.int.phx2.redhat.com
> >
>
> Content-Type: text/plain; charset=utf-8
>
> ----- Original Message -----
> | We don't want to allocate too more storage in advance, simply because
> | it's easier to grow than to shrink. Stop the host, grow the volume,
> | e2fsck/resize2fs, start up and go. Much nicer than increasing disk
> | capacity on physical hosts.
>
> These might be good for ext3/4, but with gfs and gfs2 you can lvresize
> and gfs2_grow while the lv is mounted.  In fact, we expect it.
> Just make sure the vg has the clustered bit set (vgchange -cy) first.
>
> Regards,
>
> Bob Peterson
> Red Hat File Systems
>
>
>
> ------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
> End of Linux-cluster Digest, Vol 82, Issue 19
> *********************************************
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110216/21fe7c6a/attachment.htm>

From ajb2 at mssl.ucl.ac.uk  Wed Feb 16 12:02:32 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Wed, 16 Feb 2011 12:02:32 +0000
Subject: [Linux-cluster] optimising DLM speed?
Message-ID: <4D5BBCD8.9010808@mssl.ucl.ac.uk>


 > There is a config option to increase the resource table size though, 
so perhaps you could try that?

..details?





From swhiteho at redhat.com  Wed Feb 16 12:57:27 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Wed, 16 Feb 2011 12:57:27 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D5BBCD8.9010808@mssl.ucl.ac.uk>
References: <4D5BBCD8.9010808@mssl.ucl.ac.uk>
Message-ID: <1297861047.2522.4.camel@dolmen>

Hi,

On Wed, 2011-02-16 at 12:02 +0000, Alan Brown wrote:
> > There is a config option to increase the resource table size though, 
> so perhaps you could try that?
> 
> ..details?
> 
> 
You can set it via the configfs interface:

echo "4096" > /sys/kernel/config/dlm//cluster/rsbtbl_size

It doesn't change once a lockspace has been created, so the new table
size needs to be set before mounting the filesystem, otherwise it will
not take effect. The size must be a power of two. Likewise the
lkbtbl_size can be set the same way,

Steve.

> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster




From ajb2 at mssl.ucl.ac.uk  Wed Feb 16 14:12:30 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Wed, 16 Feb 2011 14:12:30 +0000
Subject: [Linux-cluster] optimising DLM speed?
Message-ID: <4D5BDB4E.3010006@mssl.ucl.ac.uk>

 > You can set it via the configfs interface:

Given 24Gb ram, 100 filesystems, several hundred million of files and 
the usual user habits of trying to put 100k files in a directory:

Is 24Gb enough or should I add more memory? (96Gb is easy, beyond that 
is harder)

What would you consider safe maximums for these settings?

What about the following parameters?

buffer_size
dirtbl_size






From swhiteho at redhat.com  Wed Feb 16 17:33:22 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Wed, 16 Feb 2011 17:33:22 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D5BDB4E.3010006@mssl.ucl.ac.uk>
References: <4D5BDB4E.3010006@mssl.ucl.ac.uk>
Message-ID: <1297877602.2522.69.camel@dolmen>

Hi,

On Wed, 2011-02-16 at 14:12 +0000, Alan Brown wrote:
> > You can set it via the configfs interface:
> 
> Given 24Gb ram, 100 filesystems, several hundred million of files and 
> the usual user habits of trying to put 100k files in a directory:
> 
> Is 24Gb enough or should I add more memory? (96Gb is easy, beyond that 
> is harder)
> 
The more memory you add, the greater the potential for caching large
numbers of inodes, which in turn implies larger numbers of dlm locks.

So you are much more likely to see these issues with large ram sizes. If
you can easily do 96G, then I'd say start with that.

> What would you consider safe maximums for these settings?
> 
That is a more tricky question. There might be some issues if you go
above 2^16 hash buckets due to the way in which dlm organises its hash
buckets. Dave Teigland can give you more info on that.

> What about the following parameters?
> 
> buffer_size
I doubt that this will need adjusting.

> dirtbl_size
That might need adjusting too, although it didn't appear to be
significant on the profile results,

Steve.

> 
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster




From teigland at redhat.com  Wed Feb 16 17:52:27 2011
From: teigland at redhat.com (David Teigland)
Date: Wed, 16 Feb 2011 12:52:27 -0500
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <13588217.76.1297800448396.JavaMail.marc@mobilix-20>
References: <1297794020.2711.21.camel@dolmen>
	<13588217.76.1297800448396.JavaMail.marc@mobilix-20>
Message-ID: <20110216175227.GB2291@redhat.com>

On Tue, Feb 15, 2011 at 09:07:31PM +0100, Marc Grimme wrote:
> Hi Steve,
> I think lately I observed a very similar behavior with RHEL5 and gfs2.
> It was a gfs2 filesystem that had about 2Mio files with sum of 2GB in a directory. When I did a du -shx . in this directory it took about 5 Minutes (noatime mountoption given). Independently on how much nodes took part in the cluster (in the end I only tested with one node). This was only for the first time running all later executed du commands were much faster.
> When I mounted the exact same filesystem with lockproto=lock_nolock it took about 10-20 seconds to proceed with the same command.
> 
> Next I started to analyze this with oprofile and observed the following result:
> 
> opreport --long-file-names:
> CPU: AMD64 family10, speed 2900.11 MHz (estimated)
> Counted CPU_CLK_UNHALTED events (Cycles outside of halt state) with a unit mask of 0x00 (No unit mask) count 100000
> samples  %        symbol name
> 200569   46.7639  search_rsb_list
> 118905   27.7234  create_lkb

Hi Marc, thanks for sending this again, I remember that you pointed these
out a long time ago, but had forgotten just how bad those searches were.
I really do need to do some optimizing there.

> This very much reminded me on a similar test we've done years ago with
> gfs (see http://www.open-sharedroot.org/Members/marc/blog/blog-on-dlm/red-hat-dlm-__find_lock_by_id/profile-data-with-diffrent-table-sizes).
>
> Does this not show that during the du command 46% of the time the kernel
> stays in the dlm:search_rsb_list function while looking out for locks.
> It still looks like the hashtable for the lock in dlm is much too small
> and searching inside the hashmap is not constant anymore?

We should definately check if the default hash table sizes should be
increased.

Dave



From teigland at redhat.com  Wed Feb 16 17:58:48 2011
From: teigland at redhat.com (David Teigland)
Date: Wed, 16 Feb 2011 12:58:48 -0500
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D5BDB4E.3010006@mssl.ucl.ac.uk>
References: <4D5BDB4E.3010006@mssl.ucl.ac.uk>
Message-ID: <20110216175848.GC2291@redhat.com>

On Wed, Feb 16, 2011 at 02:12:30PM +0000, Alan Brown wrote:
> > You can set it via the configfs interface:
> 
> Given 24Gb ram, 100 filesystems, several hundred million of files
> and the usual user habits of trying to put 100k files in a
> directory:
> 
> Is 24Gb enough or should I add more memory? (96Gb is easy, beyond
> that is harder)
> 
> What would you consider safe maximums for these settings?
> 
> What about the following parameters?
> 
> buffer_size
> dirtbl_size

Don't change the buffer size, but I'd increase all the hash table sizes to
4096 and see if anything changes.

echo "4096" > /sys/kernel/config/dlm/cluster/rsbtbl_size
echo "4096" > /sys/kernel/config/dlm/cluster/lkbtbl_size
echo "4096" > /sys/kernel/config/dlm/cluster/dirtbl_size

(Before gfs file systems are mounted as Steve mentioned.)

Dave



From ajb2 at mssl.ucl.ac.uk  Wed Feb 16 19:07:10 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Wed, 16 Feb 2011 19:07:10 +0000
Subject: [Linux-cluster] optimising DLM speed?
Message-ID: <4D5C205E.4010708@mssl.ucl.ac.uk>

Steve:

To add some interest (and give you numbers to work with as far as dlm 
config tuning goes), here are a selection of real world lock figures 
from our file cluster (cat $d | wc -l)

/sys/kernel/debug/dlm/WwwHome-gfs2_locks  162299  (webserver exports)
/sys/kernel/debug/dlm/soft2-gfs2_locks  198890  (Mainly IDL software - 
it's hopelessly inefficient, 32Gb partition)
/sys/kernel/debug/dlm/home-gfs2_locks  74649 (users' /home directories, 
150Gb partition)
/sys/kernel/debug/dlm/User1_locks  318337 (thunderbird, mozilla, 
openoffice caches, 200gb partition)
/sys/kernel/debug/dlm/Peace04-gfs2_locks  265955  (solar wind data)
/sys/kernel/debug/dlm/Peace05-gfs2_locks  332267
/sys/kernel/debug/dlm/Peace06-gfs2_locks  283588

At the other end of the spectrum:

/sys/kernel/debug/dlm/xray0-gfs2_locks  24917 (solar observation data)
/sys/kernel/debug/dlm/xray2-gfs2_locks  558
/sys/kernel/debug/dlm/cassini2-gfs2_locks  598 (cassini probe data from 
Saturn)
/sys/kernel/debug/dlm/cassini3-gfs2_locks  80
/sys/kernel/debug/dlm/cassini4-gfs2_locks  246
/sys/kernel/debug/dlm/rgoplates-gfs2_locks 27 (global archive of 100 
years' worth of photographic plates from Greenwich observatory)


Directories may have up to 90k entries in them, although we try very 
hard to encourage users to use nested structures and keep directories 
below 1000 entries for human readability (exceptions tend to be mirrors 
of offsite archives), but the counterpoint to is that it drives the 
number of directories up - which is why I was asking about the 
dirtbl_size entry.

~98% of directories are below 4000 entries.

FSes usually have 400k-2M inodes in use.

Does that help with tuning recommendations?






From swhiteho at redhat.com  Wed Feb 16 19:25:01 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Wed, 16 Feb 2011 19:25:01 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D5C205E.4010708@mssl.ucl.ac.uk>
References: <4D5C205E.4010708@mssl.ucl.ac.uk>
Message-ID: <1297884301.2522.74.camel@dolmen>

Hi,

On Wed, 2011-02-16 at 19:07 +0000, Alan Brown wrote:
> Steve:
> 
> To add some interest (and give you numbers to work with as far as dlm 
> config tuning goes), here are a selection of real world lock figures 
> from our file cluster (cat $d | wc -l)
> 
> /sys/kernel/debug/dlm/WwwHome-gfs2_locks  162299  (webserver exports)
> /sys/kernel/debug/dlm/soft2-gfs2_locks  198890  (Mainly IDL software - 
> it's hopelessly inefficient, 32Gb partition)
> /sys/kernel/debug/dlm/home-gfs2_locks  74649 (users' /home directories, 
> 150Gb partition)
> /sys/kernel/debug/dlm/User1_locks  318337 (thunderbird, mozilla, 
> openoffice caches, 200gb partition)
> /sys/kernel/debug/dlm/Peace04-gfs2_locks  265955  (solar wind data)
> /sys/kernel/debug/dlm/Peace05-gfs2_locks  332267
> /sys/kernel/debug/dlm/Peace06-gfs2_locks  283588
> 
A faster way to just grab lock numbers is to grep for gfs2
in /proc/slabinfo as that will show how many are allocated at any one
time.

> At the other end of the spectrum:
> 
> /sys/kernel/debug/dlm/xray0-gfs2_locks  24917 (solar observation data)
> /sys/kernel/debug/dlm/xray2-gfs2_locks  558
> /sys/kernel/debug/dlm/cassini2-gfs2_locks  598 (cassini probe data from 
> Saturn)
> /sys/kernel/debug/dlm/cassini3-gfs2_locks  80
> /sys/kernel/debug/dlm/cassini4-gfs2_locks  246
> /sys/kernel/debug/dlm/rgoplates-gfs2_locks 27 (global archive of 100 
> years' worth of photographic plates from Greenwich observatory)
> 
> 
> Directories may have up to 90k entries in them, although we try very 
> hard to encourage users to use nested structures and keep directories 
> below 1000 entries for human readability (exceptions tend to be mirrors 
> of offsite archives), but the counterpoint to is that it drives the 
> number of directories up - which is why I was asking about the 
> dirtbl_size entry.
> 
The dirtbl refers to the DLM's resource directory and not to the
directories which are in the filesystem. So the dirtbl will scale
according to the number of dlm locks, which in turn scales with the
number of cached inodes.

Directories of the size (number of entries) which you have indicated
should not be causing a problem as lookup should still be quite fast at
that scale.

> ~98% of directories are below 4000 entries.
> 
> FSes usually have 400k-2M inodes in use.
> 
The important thing from the dlm tuning point of view is how many of
those inodes are cached on each node at once, so using the slabinfo
trick above will show that.

> Does that help with tuning recommendations?
> 
It is always useful to have some background information like this, and I
think as a first step trying Dave's suggested DLM table config changes
is a good plan,

Steve.

> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster




From ajb2 at mssl.ucl.ac.uk  Wed Feb 16 19:36:09 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Wed, 16 Feb 2011 19:36:09 +0000
Subject: [Linux-cluster] optimising DLM speed?
Message-ID: <4D5C2729.2070809@mssl.ucl.ac.uk>

 > A faster way to just grab lock numbers is to grep for gfs2
in /proc/slabinfo as that will show how many are allocated at any one
time.

True, but it doesn't show mow many are used per fs.

FWIW, here are current stats on each cluster node (it's evening and 
lightly loaded)


gfs2_quotad           47    108    144   27    1 : tunables  120   60 
  8 : slabdata      4      4      0
gfs2_rgrpd          9563   9618    184   21    1 : tunables  120   60 
  8 : slabdata    458    458      0
gfs2_bufdata      318804 318840     96   40    1 : tunables  120   60 
  8 : slabdata   7971   7971      1
gfs2_inode        725605 725605    800    5    1 : tunables   54   27 
  8 : slabdata 145121 145121      0
gfs2_glock        738297 738297    424    9    1 : tunables   54   27 
  8 : slabdata  82033  82033      0

gfs2_quotad           94    189    144   27    1 : tunables  120   60 
  8 : slabdata      7      7      0
gfs2_rgrpd          1658   1680    184   21    1 : tunables  120   60 
  8 : slabdata     80     80      0
gfs2_bufdata      1065806 1067080     96   40    1 : tunables  120   60 
    8 : slabdata  26677  26677      0
gfs2_inode        986986 1024845    800    5    1 : tunables   54   27 
   8 : slabdata 204969 204969      0
gfs2_glock        1105575 1812825    424    9    1 : tunables   54   27 
    8 : slabdata 201425 201425      1

gfs2_quotad           45    108    144   27    1 : tunables  120   60 
  8 : slabdata      4      4      2
gfs2_rgrpd          6515   6573    184   21    1 : tunables  120   60 
  8 : slabdata    313    313      0
gfs2_bufdata      100785 101000     96   40    1 : tunables  120   60 
  8 : slabdata   2525   2525      0
gfs2_inode        2954515 2954515    800    5    1 : tunables   54   27 
    8 : slabdata 590903 590903      0
gfs2_glock        3332311 3639843    424    9    1 : tunables   54   27 
    8 : slabdata 404427 404427      0




From ajb2 at mssl.ucl.ac.uk  Wed Feb 16 19:41:04 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Wed, 16 Feb 2011 19:41:04 +0000
Subject: [Linux-cluster] optimising DLM speed?
Message-ID: <4D5C2850.7090100@mssl.ucl.ac.uk>

 > Directories of the size (number of entries) which you have indicated
should not be causing a problem as lookup should still be quite fast at
that scale.

Perhaps, but even so 4000 file directories usually take over a minute to 
"ls -l" , while 85k file/directories take 5 mins (20-40 mins on a bad 
day) - and this is mounted lock_dlm, single-node-only







From swhiteho at redhat.com  Wed Feb 16 20:19:21 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Wed, 16 Feb 2011 20:19:21 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D5C2729.2070809@mssl.ucl.ac.uk>
References: <4D5C2729.2070809@mssl.ucl.ac.uk>
Message-ID: <1297887561.2522.77.camel@dolmen>

Hi,

On Wed, 2011-02-16 at 19:36 +0000, Alan Brown wrote:
> > A faster way to just grab lock numbers is to grep for gfs2
> in /proc/slabinfo as that will show how many are allocated at any one
> time.
> 
> True, but it doesn't show mow many are used per fs.
> 
For the GFS2 glocks, that doesn't matter - all of the glocks are held in
a single hash table no matter how many filesystems there are. The DLM
however has hash tables for each lockspace (per filesystem) so it might
make a difference there.

> FWIW, here are current stats on each cluster node (it's evening and 
> lightly loaded)
> 
> 
> gfs2_quotad           47    108    144   27    1 : tunables  120   60 
>   8 : slabdata      4      4      0
> gfs2_rgrpd          9563   9618    184   21    1 : tunables  120   60 
>   8 : slabdata    458    458      0
> gfs2_bufdata      318804 318840     96   40    1 : tunables  120   60 
>   8 : slabdata   7971   7971      1
> gfs2_inode        725605 725605    800    5    1 : tunables   54   27 
>   8 : slabdata 145121 145121      0
> gfs2_glock        738297 738297    424    9    1 : tunables   54   27 
>   8 : slabdata  82033  82033      0
> 
> gfs2_quotad           94    189    144   27    1 : tunables  120   60 
>   8 : slabdata      7      7      0
> gfs2_rgrpd          1658   1680    184   21    1 : tunables  120   60 
>   8 : slabdata     80     80      0
> gfs2_bufdata      1065806 1067080     96   40    1 : tunables  120   60 
>     8 : slabdata  26677  26677      0
> gfs2_inode        986986 1024845    800    5    1 : tunables   54   27 
>    8 : slabdata 204969 204969      0
> gfs2_glock        1105575 1812825    424    9    1 : tunables   54   27 
>     8 : slabdata 201425 201425      1
> 
> gfs2_quotad           45    108    144   27    1 : tunables  120   60 
>   8 : slabdata      4      4      2
> gfs2_rgrpd          6515   6573    184   21    1 : tunables  120   60 
>   8 : slabdata    313    313      0
> gfs2_bufdata      100785 101000     96   40    1 : tunables  120   60 
>   8 : slabdata   2525   2525      0
> gfs2_inode        2954515 2954515    800    5    1 : tunables   54   27 
>     8 : slabdata 590903 590903      0
> gfs2_glock        3332311 3639843    424    9    1 : tunables   54   27 
>     8 : slabdata 404427 404427      0
> 
Thanks for the info. There is now a bug open (bz #678102) for increasing
the default DLM hash table size,

Steve.

> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster




From swhiteho at redhat.com  Wed Feb 16 20:26:20 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Wed, 16 Feb 2011 20:26:20 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D5C2850.7090100@mssl.ucl.ac.uk>
References: <4D5C2850.7090100@mssl.ucl.ac.uk>
Message-ID: <1297887980.2522.83.camel@dolmen>

Hi,

On Wed, 2011-02-16 at 19:41 +0000, Alan Brown wrote:
> > Directories of the size (number of entries) which you have indicated
> should not be causing a problem as lookup should still be quite fast at
> that scale.
> 
> Perhaps, but even so 4000 file directories usually take over a minute to 
> "ls -l" , while 85k file/directories take 5 mins (20-40 mins on a bad 
> day) - and this is mounted lock_dlm, single-node-only
> 
> 
Yes, ls -l will always take longer because it is not just accessing the
directory, but also every inode in the directory. As a result the I/O
pattern will generally be poor.

Also, the order in which GFS2 returns the directory entries is not
efficient if it is used for doing the stat calls associated with the ls
-l. Better performance could be obtained by sorting the inodes to run
stat on into inode number order.

The reason that the ordering is not ideal is that without that we could
not maintain a uniform view of the directory from a readers point of
view while other processes are adding or removing entries. It is a
historical issue that we have inherited from GFS and I've spent some
time trying to come up with a solution in kernel space, but in the end,
a userland solution may be a better way to solve it.

I assume that once the directory has been read in once, that it acesses
will be much faster on subsequent occasions,

Steve.




From jeff.sturm at eprize.com  Wed Feb 16 20:25:56 2011
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Wed, 16 Feb 2011 15:25:56 -0500
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <4D5ADD76.2050600@logik-internet.rs>
References: <4D59BD29.30906@logik-internet.rs>	<4D59C13A.9030807@alteeve.com>	<4D59DB8D.5080606@logik-internet.rs>	<4D59DC04.6060305@alteeve.com>	<4D59E3A1.4070508@logik-internet.rs>	<4D59E6F3.3050002@alteeve.com>	<4D59EAAD.8030301@logik-internet.rs><4D5A4DFE.8050101@bobich.net>	<4D5A6852.5000103@logik-internet.rs><4D5A6BDC.8080708@bobich.net><64D0546C5EBBD147B75DE133D798665F0855C0F4@hugo.eprize.local>
	<4D5ADD76.2050600@logik-internet.rs>
Message-ID: <64D0546C5EBBD147B75DE133D798665F0855C138@hugo.eprize.local>

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com]
> On Behalf Of Nikola Savic
> Sent: Tuesday, February 15, 2011 3:09 PM
> To: linux clustering
> Subject: Re: [Linux-cluster] Cluster with shared storage on low budget
> 
> Jeff Sturm wrote:
> > We actually resize volumes often.  Some of our storage volumes have
30
> > LUNs or more.  We have so many because we've virtualized most of our
> > infrastructure, and some of the hosts are single-purpose hosts.
> >
> 
>   Can you please provide more information on how storage is organized?
> 
>   Are you using SAN or local hard disks in nodes? Is there mirroring
of data and how is
> it implemented in your system?

To answer your questions, these nodes are paravirtualized under the Xen
hypervisor.  The physical volumes are kept on a central storage server
(commercial SAN appliance), organized into a clustered volume group by
CLVM.

Out of that volume group we have 30+ logical volumes, each of which are
simple filesystem images, mounted as the root filesystem on one of the
virtual hosts.

-Jeff





From ajb2 at mssl.ucl.ac.uk  Wed Feb 16 20:38:57 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Wed, 16 Feb 2011 20:38:57 +0000
Subject: [Linux-cluster] optimising DLM speed?
Message-ID: <4D5C35E1.7050404@mssl.ucl.ac.uk>

 > For the GFS2 glocks, that doesn't matter - all of the glocks are held 
in a single hash table no matter how many filesystems there are.

Given nearly 4 mlllion glocks currently on one of the boxes in a quiet 
state (and nearly 6 million if everything was on one node), is the 
existing hash table large enough?





From jeff.sturm at eprize.com  Wed Feb 16 21:08:55 2011
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Wed, 16 Feb 2011 16:08:55 -0500
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <263367529.33108.1297787066881.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
References: <64D0546C5EBBD147B75DE133D798665F0855C0F4@hugo.eprize.local>
	<263367529.33108.1297787066881.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
Message-ID: <64D0546C5EBBD147B75DE133D798665F0855C139@hugo.eprize.local>

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com]
> On Behalf Of Bob Peterson
> Sent: Tuesday, February 15, 2011 11:24 AM
> To: linux clustering
> Subject: Re: [Linux-cluster] Cluster with shared storage on low budget
> 
> ----- Original Message -----
> | We don't want to allocate too more storage in advance, simply
because
> | it's easier to grow than to shrink. Stop the host, grow the volume,
> | e2fsck/resize2fs, start up and go. Much nicer than increasing disk
> | capacity on physical hosts.
> 
> These might be good for ext3/4, but with gfs and gfs2 you can lvresize
and gfs2_grow
> while the lv is mounted.  In fact, we expect it.
> Just make sure the vg has the clustered bit set (vgchange -cy) first.

You know, that's a good point.  We don't use GFS2 for any non-clustered
fs, right now, but why not?  Are you saying I can do an online gfs2_grow
even with lock_nolock?

-Jeff





From ajb2 at mssl.ucl.ac.uk  Wed Feb 16 21:12:58 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Wed, 16 Feb 2011 21:12:58 +0000
Subject: [Linux-cluster] optimising DLM speed?
Message-ID: <4D5C3DDA.6090906@mssl.ucl.ac.uk>

 > Yes, ls -l will always take longer because it is not just accessing 
the directory, but also every inode in the directory. As a result the 
I/O pattern will generally be poor.

I know and accept that. It's common to most filesystems but the access 
time is particularly pronounced with GFS2 (presumably because of the 
added latencies)

The problem is that users don't see things from the same point of view, 
so there's a constant flow of complaints about "slow servers".

They think that holding down the number of files/directory is an 
unreasonable restriction - and in some cases (NASA/ESA archives) I can't 
even explain the reasons why as the people involved are unreachable.

This is despite quite documentable performance gains from breaking up 
large directories even on non-cluster filesystems - We saw a ls -lR 
speedup of around 700x when moving one directory structure from flat 
(130k files) to nested.

The same poor I/O pattern has a direct bearing on incremental backup 
speeds - backup software has to stat() a file (at minimum - SHA hash 
comparisons are even more overhead) to see if anything's changed, which 
means in large directories a backup may drop down to scan rates of 10 
files/second or lower and seldom exceeds 100 files/second at best.

(Bacula is pretty good about caching and issues a fadvise(notneeded) 
after each file is checked. I just wish other filesystem-crawling 
processes did the same)

 > I assume that once the directory has been read in once, that it 
acesses will be much faster on subsequent occasions,

Correct - but after 5-10 idle minutes the cached information is lost and 
the pattern repeats.

 > It is a historical issue that we have inherited from GFS and I've 
spent some time trying to come up with a solution in kernel space, but 
in the end, a userland solution may be a better way to solve it.

In the case of NFS clients, I'm seriously looking at trying to move to 
RHEL6 and use fscache - this should help reduce load a little but won't 
help for uncached directories.

If you have any suggestions on the [nfs export|client mount] side to try 
and help things I'm open to suggestions.






From rpeterso at redhat.com  Wed Feb 16 21:20:05 2011
From: rpeterso at redhat.com (Bob Peterson)
Date: Wed, 16 Feb 2011 16:20:05 -0500 (EST)
Subject: [Linux-cluster] Cluster with shared storage on low budget
In-Reply-To: <64D0546C5EBBD147B75DE133D798665F0855C139@hugo.eprize.local>
Message-ID: <883242944.61521.1297891205419.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>

----- Original Message -----
| You know, that's a good point. We don't use GFS2 for any non-clustered
| fs, right now, but why not? Are you saying I can do an online
| gfs2_grow
| even with lock_nolock?
| 
| -Jeff

Hi Jeff,

Yes, you should be able to.

Regards,

Bob Peterson
Red Hat File Systems



From fdinitto at redhat.com  Thu Feb 17 09:19:57 2011
From: fdinitto at redhat.com (Fabio M. Di Nitto)
Date: Thu, 17 Feb 2011 10:19:57 +0100
Subject: [Linux-cluster] Announcing "Cluster in a BOX" project
Message-ID: <4D5CE83D.7050400@redhat.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

Hi all,

A lot of people find it hard to setup their first cluster or simply
don?t have time to repeat the same setup over and over, either for
development or do some basic testing or to showcase cluster technologies
to other people.

The  "Cluster in a BOX" project (cbox in short), is one script to setup
a KVM based virtual test cluster in a matter of few minutes.

cbox is still in its early development, has several limitations and some
strict requirements.

Plans are to include as many cluster technologies and configuration
examples as possible and remove as many limitations as we possibly can.

If your cluster project or technology is not there yet, or your
distribution is not supported, it?s simply because I do not have the
resources to do all by myself. Do not take it personally, we will get it
there together and I absolutely welcome comments, patches and feedback
at any time.

Support for pacemaker, DRBD and OCFS2 will come shortly.

cbox documentation is here (in temporary lack of a more neutral location):

http://sources.redhat.com/cluster/wiki/cluster_in_a_box

Please read it carefully before running cbox.

Fabio
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iQIcBAEBCAAGBQJNXOg6AAoJEFA6oBJjVJ+Oky4QAIYZptGXaEeCuOIUhyalZrxA
Piuc05L5De6Nsfe7EtVQmut+mylK2uuF8DmErenIsmy/uIlX7xPHx5vzeOxE0nWk
xTvEjVnAL4t36CNf8AfrVXJ7F+4OsRqNrXjukxTJ7lbq72aYqtr8NQDL7sEfCSBp
XQSywRWIdxjqC5JGHNtKN/rSdcD6AlMt9EutvDHJkWZtzLFAZhbxdPkj0sTDyAWp
qje2Bnsz/BfNLcfnxEhGZsZ+ZH1X/A3ps6xT5EIPo3r3l52HTOSCXNLYLBIY4+pK
E4IdzRLwJOQqYujPjYsMeESBww2cgDFSJFHW/AR5YZCE7MkjS/e/BkZxXBkZY5dq
YFR9KTU1GvQi8Hwi32pDYJXcd0bbkKT/Cq73a2/bkwEqwBR2oWisr/lI196SGKaW
PVBNUM2N4DPxv6Rc3tg966ZycAkR/PM8oa1EbojAC9hl7eM5yQADAzs9mof/YISt
1Nay4gdDYEEzH5Mt6pozUZaK6sQcz63C/vrxGsQZAjAXwZz87jmlHOoLBWGhVj6a
Z/FEWo0ofaYBBQKQnboz4V9EKFP3M4yhJ8vDcWUa3qwaDL9X2BVCg/dtuLq3Q51O
szFMibYzw73Xr4K2CS/XlVZQ4Lykfy7gEKxeEIo/7287W2keL/YgXtuVf+17ZUkl
bdhqV0OXF8hq/xMC+OeU
=QTI7
-----END PGP SIGNATURE-----



From swhiteho at redhat.com  Thu Feb 17 10:13:15 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Thu, 17 Feb 2011 10:13:15 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D5C35E1.7050404@mssl.ucl.ac.uk>
References: <4D5C35E1.7050404@mssl.ucl.ac.uk>
Message-ID: <1297937595.2552.9.camel@dolmen>

Hi,

On Wed, 2011-02-16 at 20:38 +0000, Alan Brown wrote:
> > For the GFS2 glocks, that doesn't matter - all of the glocks are held 
> in a single hash table no matter how many filesystems there are.
> 
> Given nearly 4 mlllion glocks currently on one of the boxes in a quiet 
> state (and nearly 6 million if everything was on one node), is the 
> existing hash table large enough?
> 
> 
It is a concern. The table cannot be realistically expanded forever, and
expanding it "on the fly" would be very tricky. There are however other
factors which determine the scalability of the hash table, not just the
number of hash heads. By using RCU for the upstream code, we've been
able to reduce locking and improve speed by a significant factor without
needing to increase the number of list heads in the hash table. We did
increase that number though, anyway, since the new system we are using
can put both the hash chain lock and the hash table head into a single
pointer. That means less space for locks and therefore we increased the
number of hash table heads at that time.

However large we grow the table though, it will never really be "enough"
so that probably the next development will be to have trees rather than
chains of glocks under each hash table head, and at least then the chain
lengths will scale with log(N) rather than N. The issue with doing that
is making such a thing work with RCU.

We do go to some lengths to avoid doing hash lookups at all. Once a
glock has been attached to an inode, we don't do any lookups in the hash
table again until the inode has been pushed out of the cache, so it will
only show up on a workload which is constantly scanning new inodes which
are not in cache already. At least until now, the time taken to do the
I/O associated with such operations has been much larger, so that it
didn't really show up as an important performance item.

Obviously if it causes problems, then we'll look into addressing them.
Hopefully that explains a bit more of our reasoning behind the decisions
that have been made. Please let us know if we can be of further help,

Steve.

> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster




From linux-cluster at redhat.com  Thu Feb 17 12:33:07 2011
From: linux-cluster at redhat.com (Mailbot for etexusa.com)
Date: Thu, 17 Feb 2011 04:33:07 -0800
Subject: [Linux-cluster] DSN: failed (ADROITHOUSE@REDIFFMAIL.COM)
Message-ID: <mAWtxIeCDcrRueD8002@etexusa.com>


This is a Delivery Status Notification (DSN).

I was unable to deliver your message to
adroithouse at rediffmail.com.

I said 
  (end of message)

And they gave me the error;
  552 suspicious virus code detected in executables attached, message not accepted (#5.3.4)

 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: text/rfc822-headers
Size: 515 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110217/b2dea4fc/attachment.bin>

From ajb2 at mssl.ucl.ac.uk  Thu Feb 17 21:24:41 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Thu, 17 Feb 2011 21:24:41 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <20110216175848.GC2291@redhat.com>
References: <4D5BDB4E.3010006@mssl.ucl.ac.uk>
	<20110216175848.GC2291@redhat.com>
Message-ID: <4D5D9219.8090002@mssl.ucl.ac.uk>

David Teigland wrote:
>
> Don't change the buffer size, but I'd increase all the hash table sizes to
> 4096 and see if anything changes.
> 
> echo "4096" > /sys/kernel/config/dlm/cluster/rsbtbl_size
> echo "4096" > /sys/kernel/config/dlm/cluster/lkbtbl_size
> echo "4096" > /sys/kernel/config/dlm/cluster/dirtbl_size

Increasing rsbtbl_size to 4096 or higher results in FSes refusing to 
mount and clvm refusing to start - both with "cannot allocate memory"

At 2048, it works, but gfs_controld and dlm_controld exited when I tried 
to mount all FSes on one node as a test.

At 1024 it seems stable.

The other settings seemed to have applied OK. So far, reports are 
positive (but it's quiet at the moment)

I've got a strace of clvmd trying to start with rsbtbl_size set to 4096. 
Should I post it here or would you prefer it mailed direct?





From teigland at redhat.com  Thu Feb 17 21:29:48 2011
From: teigland at redhat.com (David Teigland)
Date: Thu, 17 Feb 2011 16:29:48 -0500
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D5D9219.8090002@mssl.ucl.ac.uk>
References: <4D5BDB4E.3010006@mssl.ucl.ac.uk>
	<20110216175848.GC2291@redhat.com>
	<4D5D9219.8090002@mssl.ucl.ac.uk>
Message-ID: <20110217212948.GA9582@redhat.com>

On Thu, Feb 17, 2011 at 09:24:41PM +0000, Alan Brown wrote:
> David Teigland wrote:
> >
> >Don't change the buffer size, but I'd increase all the hash table sizes to
> >4096 and see if anything changes.
> >
> >echo "4096" > /sys/kernel/config/dlm/cluster/rsbtbl_size
> >echo "4096" > /sys/kernel/config/dlm/cluster/lkbtbl_size
> >echo "4096" > /sys/kernel/config/dlm/cluster/dirtbl_size
> 
> Increasing rsbtbl_size to 4096 or higher results in FSes refusing to
> mount and clvm refusing to start - both with "cannot allocate
> memory"
> 
> At 2048, it works, but gfs_controld and dlm_controld exited when I
> tried to mount all FSes on one node as a test.
> 
> At 1024 it seems stable.
> 
> The other settings seemed to have applied OK. So far, reports are
> positive (but it's quiet at the moment)
> 
> I've got a strace of clvmd trying to start with rsbtbl_size set to
> 4096. Should I post it here or would you prefer it mailed direct?

Thanks for testing, you can post here.



From ajb2 at mssl.ucl.ac.uk  Fri Feb 18 12:40:41 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Fri, 18 Feb 2011 12:40:41 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <20110217212948.GA9582@redhat.com>
References: <4D5BDB4E.3010006@mssl.ucl.ac.uk>
	<20110216175848.GC2291@redhat.com>
	<4D5D9219.8090002@mssl.ucl.ac.uk>
	<20110217212948.GA9582@redhat.com>
Message-ID: <4D5E68C9.1000206@mssl.ucl.ac.uk>

David Teigland wrote:

>> I've got a strace of clvmd trying to start with rsbtbl_size set to
>> 4096. Should I post it here or would you prefer it mailed direct?
> 
> Thanks for testing, you can post here.
> 




-------------- next part --------------
A non-text attachment was scrubbed...
Name: ClvmdFailedStartStrace.gz
Type: application/x-gzip
Size: 42737 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110218/6bdca379/attachment.bin>

From jon at whiteheat.org.uk  Fri Feb 18 13:30:28 2011
From: jon at whiteheat.org.uk (Jonathan Gowar)
Date: Fri, 18 Feb 2011 13:30:28 +0000
Subject: [Linux-cluster] WebData_monitor_0 (node=squeeze, call=11, rc=6,
 status=complete): not configured
Message-ID: <4D5E7474.2080500@whiteheat.org.uk>

I've been following the cluster from scratch guide, by Beekhof.  I'm 
using Debian 6, so I don't know how much that might confuse things; I 
appreciate there are a few debian-specifics.

Before adding the drbd pacemaker resource crm status looked fine.  After 
configuring the resource I get the following error from crm_mon:-

WebData_monitor_0 (node=squeeze, call=11, rc=6, status=complete): not 
configured

Here is the crm configuration, and monitor:-

root at squeeze:~# crm configure show
node sleeze
node sneeze
node squeeze
primitive ClusterIP ocf:heartbeat:IPaddr2 \
         params ip="xxx.xxx.xxx.xxx" cidr_netmask="32" \
         op monitor interval="30s"
primitive WebData ocf:linbit:drbd \
         params drbd_resource="wwwdata" \
         op monitor interval="60s"
primitive WebSite ocf:heartbeat:apache \
         params configfile="/etc/apache2/apache2.conf" \
         op monitor interval="1m"
ms WebDataClone WebData \
         meta master-max="1" master-node-max="1" clone-max="2" 
clone-node-max="1" notify="true"
colocation website-with-ip inf: WebSite ClusterIP
order apache-after-ip inf: ClusterIP WebSite
property $id="cib-bootstrap-options" \
         dc-version="1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b" \
         cluster-infrastructure="openais" \
         expected-quorum-votes="3" \
         stonith-enabled="false"
rsc_defaults $id="rsc-options" \
         resource-stickiness="100"
root at squeeze:~# crm status
============
Last updated: Fri Feb 18 13:15:53 2011
Stack: openais
Current DC: sneeze - partition with quorum
Version: 1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b
3 Nodes configured, 3 expected votes
3 Resources configured.
============

Online: [ squeeze sneeze sleeze ]

  ClusterIP      (ocf::heartbeat:IPaddr2):       Started sneeze
  WebSite        (ocf::heartbeat:apache):        Started sneeze
  Master/Slave Set: WebDataClone
      Masters: [ squeeze ]
      Slaves: [ sneeze ]

Failed actions:
     WebData_monitor_0 (node=sleeze, call=4, rc=6, status=complete): not 
configured
     WebData_monitor_0 (node=sneeze, call=9, rc=6, status=complete): not 
configured
     WebData_monitor_0 (node=squeeze, call=11, rc=6, status=complete): 
not configured

Does anyone have any ideas as to how I might investigate where the 
problem is.

Kind regards,
Jon



From Jason_Henderson at Mitel.com  Fri Feb 18 17:47:17 2011
From: Jason_Henderson at Mitel.com (Jason_Henderson at Mitel.com)
Date: Fri, 18 Feb 2011 12:47:17 -0500
Subject: [Linux-cluster] HP iLO3 and /sbin/fence_ilo not working
Message-ID: <OFBE0DFE0A.271B4614-ON8525783B.005F48E1-8525783B.0061B6BC@ottlnmta.mitel.com>

According to this knowledge base article at redhat, 
https://access.redhat.com/kb/docs/DOC-39336, the /sbin/fence_ilo script 
should work with iLO3. I am using the version of cman mentioned in the 
article. The iLO3 firmware is at the latest revision 1.16.
The fence_ilo script just returns a connect/login error as follows:

[root at node01 ~]# /sbin/fence_ilo -o status -a '10.39.170.233' -l mitel -p 
ilopassword -v
Unable to connect/login to fencing device

The login credentials are correct as I can connect via ssh:

[root at node01 ~]# ssh mitel at 10.39.170.233
mitel at 10.39.170.233's password:
User:mitel logged-in to ILOUSE103N281.(10.39.170.233)
iLO 3 Standard 1.16 at  Dec 17 2010
Server Name: host is unnamed
Server Power: On

</>hpiLO->

Is their anything else that needs to be done to execute the script 
successfully? The fence_ilo script works on previous iLO versions.


Linked Article:

Issue

    * How do I  set up HP iLO3 as a fence device in a Red Hat Cluster 
Suite (RHCS) cluster?
    * Why does fencing fail on Red Hat Cluster Suite when using HP's iLO3?
 
Environment

    * Red Hat Enterprise Linux (RHEL) 5
    * Red Hat Enterprise Linux 6
    * Red Hat Cluster Suite
    * HP iLO3

Resolution

Support for the iLO3 fence device in RHEL5 has been added with the release 
of cman 2.0.115-34.el5_5.4 through erratum RHEA-2010-0876. No special 
setup is required after installing this erratum to get the iLO3 to work.

HP has also released new firmware to address an issue with fence_ipmi 
which could cause the server to be powered on instead of off. We therefore 
advise you to upgrade to firmware version 1.15 (28 Oct 2010) as provided 
by HP. For installation instructions or problem resolution with this 
firmware version we refer you to the HP website.
 
Root Cause

HP iLO3 was not supported by fence_ilo.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110218/93551510/attachment.htm>

From Jason_Henderson at Mitel.com  Fri Feb 18 18:04:01 2011
From: Jason_Henderson at Mitel.com (Jason_Henderson at Mitel.com)
Date: Fri, 18 Feb 2011 13:04:01 -0500
Subject: [Linux-cluster] HP iLO3 and /sbin/fence_ilo not working
In-Reply-To: <OFBE0DFE0A.271B4614-ON8525783B.005F48E1-8525783B.0061B6BC@ottlnmta.mitel.com>
Message-ID: <OF26F0711F.881C93D9-ON8525783B.0062E2C7-8525783B.00633EE0@ottlnmta.mitel.com>

I think I mis-understood the article in combination with a reply from HP 
tech support on the issue. Looks like the fence_ipmilan agent is what 
changed to support fencing with iLO3, not fence_ilo.

linux-cluster-bounces at redhat.com wrote on 02/18/2011 12:47:17 PM:

> 
> According to this knowledge base article at redhat, https://access.
> redhat.com/kb/docs/DOC-39336, the /sbin/fence_ilo script should work
> with iLO3. I am using the version of cman mentioned in the article. 
> The iLO3 firmware is at the latest revision 1.16. 
> The fence_ilo script just returns a connect/login error as follows: 
> 
> [root at node01 ~]# /sbin/fence_ilo -o status -a '10.39.170.233' -l 
> mitel -p ilopassword -v 
> Unable to connect/login to fencing device 
> 
> The login credentials are correct as I can connect via ssh: 
> 
> [root at node01 ~]# ssh mitel at 10.39.170.233 
> mitel at 10.39.170.233's password: 
> User:mitel logged-in to ILOUSE103N281.(10.39.170.233) 
> iLO 3 Standard 1.16 at  Dec 17 2010 
> Server Name: host is unnamed 
> Server Power: On 
> 
> </>hpiLO-> 
> 
> Is their anything else that needs to be done to execute the script 
> successfully? The fence_ilo script works on previous iLO versions. 
> 
> 
> Linked Article: 
> 
> Issue 
> 
>     * How do I  set up HP iLO3 as a fence device in a Red Hat 
> Cluster Suite (RHCS) cluster? 
>     * Why does fencing fail on Red Hat Cluster Suite when using HP's 
iLO3? 
> 
> Environment 
> 
>     * Red Hat Enterprise Linux (RHEL) 5 
>     * Red Hat Enterprise Linux 6 
>     * Red Hat Cluster Suite 
>     * HP iLO3 
> 
> Resolution 
> 
> Support for the iLO3 fence device in RHEL5 has been added with the 
> release of cman 2.0.115-34.el5_5.4 through erratum RHEA-2010-0876. 
> No special setup is required after installing this erratum to get 
> the iLO3 to work. 
> 
> HP has also released new firmware to address an issue with 
> fence_ipmi which could cause the server to be powered on instead of 
> off. We therefore advise you to upgrade to firmware version 1.15 (28
> Oct 2010) as provided by HP. For installation instructions or 
> problem resolution with this firmware version we refer you to the HP 
website.
> 
> Root Cause 
> 
> HP iLO3 was not supported by fence_ilo.--
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster



From jon at whiteheat.org.uk  Fri Feb 18 20:52:11 2011
From: jon at whiteheat.org.uk (Jonathan Gowar)
Date: Fri, 18 Feb 2011 20:52:11 +0000
Subject: [Linux-cluster] WebData_monitor_0 (node=squeeze, call=11, rc=6,
 status=complete): not configured
In-Reply-To: <4D5E7474.2080500@whiteheat.org.uk>
References: <4D5E7474.2080500@whiteheat.org.uk>
Message-ID: <4D5EDBFB.7010400@whiteheat.org.uk>

On 18/02/11 13:30, Jonathan Gowar wrote:
> I've been following the cluster from scratch guide, by Beekhof. I'm
> using Debian 6, so I don't know how much that might confuse things; I
> appreciate there are a few debian-specifics.
>
> Before adding the drbd pacemaker resource crm status looked fine. After
> configuring the resource I get the following error from crm_mon:-
>
> WebData_monitor_0 (node=squeeze, call=11, rc=6, status=complete): not
> configured
>
> Here is the crm configuration, and monitor:-
>
> root at squeeze:~# crm configure show
> node sleeze
> node sneeze
> node squeeze
> primitive ClusterIP ocf:heartbeat:IPaddr2 \
> params ip="xxx.xxx.xxx.xxx" cidr_netmask="32" \
> op monitor interval="30s"
> primitive WebData ocf:linbit:drbd \
> params drbd_resource="wwwdata" \
> op monitor interval="60s"
> primitive WebSite ocf:heartbeat:apache \
> params configfile="/etc/apache2/apache2.conf" \
> op monitor interval="1m"
> ms WebDataClone WebData \
> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1"
> notify="true"
> colocation website-with-ip inf: WebSite ClusterIP
> order apache-after-ip inf: ClusterIP WebSite
> property $id="cib-bootstrap-options" \
> dc-version="1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b" \
> cluster-infrastructure="openais" \
> expected-quorum-votes="3" \
> stonith-enabled="false"
> rsc_defaults $id="rsc-options" \
> resource-stickiness="100"
> root at squeeze:~# crm status
> ============
> Last updated: Fri Feb 18 13:15:53 2011
> Stack: openais
> Current DC: sneeze - partition with quorum
> Version: 1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b
> 3 Nodes configured, 3 expected votes
> 3 Resources configured.
> ============
>
> Online: [ squeeze sneeze sleeze ]
>
> ClusterIP (ocf::heartbeat:IPaddr2): Started sneeze
> WebSite (ocf::heartbeat:apache): Started sneeze
> Master/Slave Set: WebDataClone
> Masters: [ squeeze ]
> Slaves: [ sneeze ]
>
> Failed actions:
> WebData_monitor_0 (node=sleeze, call=4, rc=6, status=complete): not
> configured
> WebData_monitor_0 (node=sneeze, call=9, rc=6, status=complete): not
> configured
> WebData_monitor_0 (node=squeeze, call=11, rc=6, status=complete): not
> configured
>
> Does anyone have any ideas as to how I might investigate where the
> problem is.
>
> Kind regards,
> Jon

Hi,

   Found out how to debug failing resources:-

http://www.clusterlabs.org/wiki/Debugging_Resource_Failures

I managed to clear 1 problem, fuser was not installed; that means psmisc 
for Debian users.


root at squeeze:~# crm configure show
node sleeze
node sneeze
node squeeze
primitive ClusterIP ocf:heartbeat:IPaddr2 \
         params ip="80.87.131.245" cidr_netmask="32" \
         op monitor interval="30s"
primitive WebData ocf:linbit:drbd \
         params drbd_resource="wwwdata" \
         op monitor interval="60s"
primitive WebFS ocf:heartbeat:Filesystem \
         params device="/dev/drbd/by-res/wwwdata" 
directory="/var/www/drbd" fstype="ext4" \
         meta is-managed="true"
primitive WebSite ocf:heartbeat:apache \
         params configfile="/etc/apache2/apache2.conf" \
         op monitor interval="1m"
ms WebDataClone WebData \
         meta master-max="1" master-node-max="1" clone-max="2" 
clone-node-max="1" notify="true" is-managed="false"
location cli-prefer-WebSite WebSite \
         rule $id="cli-prefer-rule-WebSite" inf: #uname eq sleeze
colocation WebSite-with-WebFS inf: WebSite WebFS
colocation fs_on_drbd inf: WebFS WebDataClone:Master
colocation website-with-ip inf: WebSite ClusterIP
order WebFS-after-WebData inf: WebDataClone:promote WebFS:start
order WebSite-after-WebFS inf: WebFS WebSite
order apache-after-ip inf: ClusterIP WebSite
property $id="cib-bootstrap-options" \
         dc-version="1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b" \
         cluster-infrastructure="openais" \
         expected-quorum-votes="3" \
         stonith-enabled="false" \
         last-lrm-refresh="1298043091"
rsc_defaults $id="rsc-options" \
         resource-stickiness="100"


Here are a couple of bad looking lines from the debug output:-


/usr/lib/ocf/resource.d/linbit/drbd: 1: [[: not found
/usr/lib/ocf/resource.d/linbit/drbd: 1: 0x080307: not found
/usr/lib/ocf/resource.d/linbit/drbd: 1: Bad substitution


n.b. See full debug report at http://pastebin.com/pjKxBu8K

OCF Return Code: 2
OCF Alias: OCF_ERR_ARGS
Description: "The resource's configuration is not valid on this machine. 
Eg. Refers to a location/tool not found on the node."
Recovery Type: hard

Let me know if there's anything else I need to post.

Kind regards,
Jon



From jon at whiteheat.org.uk  Fri Feb 18 21:51:55 2011
From: jon at whiteheat.org.uk (Jonathan Gowar)
Date: Fri, 18 Feb 2011 21:51:55 +0000
Subject: [Linux-cluster] WebData_monitor_0 (node=squeeze, call=11, rc=6,
 status=complete): not configured
In-Reply-To: <4D5EDBFB.7010400@whiteheat.org.uk>
References: <4D5E7474.2080500@whiteheat.org.uk>
	<4D5EDBFB.7010400@whiteheat.org.uk>
Message-ID: <4D5EE9FB.9040407@whiteheat.org.uk>

On 18/02/11 20:52, Jonathan Gowar wrote:
> On 18/02/11 13:30, Jonathan Gowar wrote:
>> I've been following the cluster from scratch guide, by Beekhof. I'm
>> using Debian 6, so I don't know how much that might confuse things; I
>> appreciate there are a few debian-specifics.
>>
>> Before adding the drbd pacemaker resource crm status looked fine. After
>> configuring the resource I get the following error from crm_mon:-
>>
>> WebData_monitor_0 (node=squeeze, call=11, rc=6, status=complete): not
>> configured
>>
>> Here is the crm configuration, and monitor:-
>>
>> root at squeeze:~# crm configure show
>> node sleeze
>> node sneeze
>> node squeeze
>> primitive ClusterIP ocf:heartbeat:IPaddr2 \
>> params ip="xxx.xxx.xxx.xxx" cidr_netmask="32" \
>> op monitor interval="30s"
>> primitive WebData ocf:linbit:drbd \
>> params drbd_resource="wwwdata" \
>> op monitor interval="60s"
>> primitive WebSite ocf:heartbeat:apache \
>> params configfile="/etc/apache2/apache2.conf" \
>> op monitor interval="1m"
>> ms WebDataClone WebData \
>> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1"
>> notify="true"
>> colocation website-with-ip inf: WebSite ClusterIP
>> order apache-after-ip inf: ClusterIP WebSite
>> property $id="cib-bootstrap-options" \
>> dc-version="1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b" \
>> cluster-infrastructure="openais" \
>> expected-quorum-votes="3" \
>> stonith-enabled="false"
>> rsc_defaults $id="rsc-options" \
>> resource-stickiness="100"
>> root at squeeze:~# crm status
>> ============
>> Last updated: Fri Feb 18 13:15:53 2011
>> Stack: openais
>> Current DC: sneeze - partition with quorum
>> Version: 1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b
>> 3 Nodes configured, 3 expected votes
>> 3 Resources configured.
>> ============
>>
>> Online: [ squeeze sneeze sleeze ]
>>
>> ClusterIP (ocf::heartbeat:IPaddr2): Started sneeze
>> WebSite (ocf::heartbeat:apache): Started sneeze
>> Master/Slave Set: WebDataClone
>> Masters: [ squeeze ]
>> Slaves: [ sneeze ]
>>
>> Failed actions:
>> WebData_monitor_0 (node=sleeze, call=4, rc=6, status=complete): not
>> configured
>> WebData_monitor_0 (node=sneeze, call=9, rc=6, status=complete): not
>> configured
>> WebData_monitor_0 (node=squeeze, call=11, rc=6, status=complete): not
>> configured
>>
>> Does anyone have any ideas as to how I might investigate where the
>> problem is.
>>
>> Kind regards,
>> Jon
>
> Hi,
>
> Found out how to debug failing resources:-
>
> http://www.clusterlabs.org/wiki/Debugging_Resource_Failures
>
> I managed to clear 1 problem, fuser was not installed; that means psmisc
> for Debian users.
>
>
> root at squeeze:~# crm configure show
> node sleeze
> node sneeze
> node squeeze
> primitive ClusterIP ocf:heartbeat:IPaddr2 \
> params ip="xxx.xxx.xxx.xxx" cidr_netmask="32" \
> op monitor interval="30s"
> primitive WebData ocf:linbit:drbd \
> params drbd_resource="wwwdata" \
> op monitor interval="60s"
> primitive WebFS ocf:heartbeat:Filesystem \
> params device="/dev/drbd/by-res/wwwdata" directory="/var/www/drbd"
> fstype="ext4" \
> meta is-managed="true"
> primitive WebSite ocf:heartbeat:apache \
> params configfile="/etc/apache2/apache2.conf" \
> op monitor interval="1m"
> ms WebDataClone WebData \
> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1"
> notify="true" is-managed="false"
> location cli-prefer-WebSite WebSite \
> rule $id="cli-prefer-rule-WebSite" inf: #uname eq sleeze
> colocation WebSite-with-WebFS inf: WebSite WebFS
> colocation fs_on_drbd inf: WebFS WebDataClone:Master
> colocation website-with-ip inf: WebSite ClusterIP
> order WebFS-after-WebData inf: WebDataClone:promote WebFS:start
> order WebSite-after-WebFS inf: WebFS WebSite
> order apache-after-ip inf: ClusterIP WebSite
> property $id="cib-bootstrap-options" \
> dc-version="1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b" \
> cluster-infrastructure="openais" \
> expected-quorum-votes="3" \
> stonith-enabled="false" \
> last-lrm-refresh="1298043091"
> rsc_defaults $id="rsc-options" \
> resource-stickiness="100"
>
>
> Here are a couple of bad looking lines from the debug output:-
>
>
> /usr/lib/ocf/resource.d/linbit/drbd: 1: [[: not found
> /usr/lib/ocf/resource.d/linbit/drbd: 1: 0x080307: not found
> /usr/lib/ocf/resource.d/linbit/drbd: 1: Bad substitution
>
>
> n.b. See full debug report at http://pastebin.com/pjKxBu8K
>
> OCF Return Code: 2
> OCF Alias: OCF_ERR_ARGS
> Description: "The resource's configuration is not valid on this machine.
> Eg. Refers to a location/tool not found on the node."
> Recovery Type: hard
>
> Let me know if there's anything else I need to post.
>
> Kind regards,
> Jon

Hi,

   This appeared to be a problem running 3 nodes.  Stopping corosync on 
one of the nodes levitated the problem.

Is it possible to have a 3 node cluster, 3 running apache, 2 running 
DRBD?  If so, can someone point me in the direction of how to.

Kind regards,
Jon



From jakov.sosic at srce.hr  Sat Feb 19 00:36:02 2011
From: jakov.sosic at srce.hr (Jakov Sosic)
Date: Sat, 19 Feb 2011 01:36:02 +0100
Subject: [Linux-cluster] MySQL MASTER->SLAVE agent?
Message-ID: <4D5F1072.1050202@srce.hr>

Hi.

Would it be possible to use the MySQL replication as a way of achieving HA?

When MASTER is down to take appropriate actions and declare SLAVE the
NEW master, and take on IP address? Has anyone tested this kind of
setup? Obviously, failback should be impossible and request manual action.

This would pose as a good solution for environments without shared
storage and without DRBD.


-- 
Jakov Sosic
www.srce.hr



From crosa at redhat.com  Sat Feb 19 01:05:33 2011
From: crosa at redhat.com (Cleber Rosa)
Date: Fri, 18 Feb 2011 23:05:33 -0200
Subject: [Linux-cluster] MySQL MASTER->SLAVE agent?
In-Reply-To: <4D5F1072.1050202@srce.hr>
References: <4D5F1072.1050202@srce.hr>
Message-ID: <4D5F175D.9070903@redhat.com>

On 02/18/2011 10:36 PM, Jakov Sosic wrote:
> Hi.
>
> Would it be possible to use the MySQL replication as a way of achieving HA?
>
> When MASTER is down to take appropriate actions and declare SLAVE the
> NEW master, and take on IP address? Has anyone tested this kind of
> setup? Obviously, failback should be impossible and request manual action.
>
> This would pose as a good solution for environments without shared
> storage and without DRBD.
>
>
Jakov,

Actually MySQL allows for MASTER<->MASTER replication. I've successfully 
deployed that on some 10 sites or so top of RHCS using nothing but a 
floating IP address for MySQL (plus other resources for other services).

Of course, you'd better keep an eye on MySQL's replication health when 
doing that.

CR.



From dan.candea at quah.ro  Sat Feb 19 10:22:05 2011
From: dan.candea at quah.ro (Dan Candea)
Date: Sat, 19 Feb 2011 12:22:05 +0200
Subject: [Linux-cluster] MySQL MASTER->SLAVE agent?
In-Reply-To: <4D5F175D.9070903@redhat.com>
References: <4D5F1072.1050202@srce.hr> <4D5F175D.9070903@redhat.com>
Message-ID: <4D5F99CD.2060709@quah.ro>

On 19.02.2011 03:05, Cleber Rosa wrote:
> On 02/18/2011 10:36 PM, Jakov Sosic wrote:
>> Hi.
>>
>> Would it be possible to use the MySQL replication as a way of 
>> achieving HA?
>>
>> When MASTER is down to take appropriate actions and declare SLAVE the
>> NEW master, and take on IP address? Has anyone tested this kind of
>> setup? Obviously, failback should be impossible and request manual 
>> action.
>>
>> This would pose as a good solution for environments without shared
>> storage and without DRBD.
>>
>>
> Jakov,
>
> Actually MySQL allows for MASTER<->MASTER replication. I've 
> successfully deployed that on some 10 sites or so top of RHCS using 
> nothing but a floating IP address for MySQL (plus other resources for 
> other services).
>
> Of course, you'd better keep an eye on MySQL's replication health when 
> doing that.
>
> CR.
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
you could try with ndb tables,  it's the mysql cluster engine

-- 
Dan C?ndea
Does God Play Dice?



From crosa at redhat.com  Sat Feb 19 21:27:01 2011
From: crosa at redhat.com (Cleber Rosa)
Date: Sat, 19 Feb 2011 19:27:01 -0200
Subject: [Linux-cluster] MySQL MASTER->SLAVE agent?
In-Reply-To: <4D5F99CD.2060709@quah.ro>
References: <4D5F1072.1050202@srce.hr> <4D5F175D.9070903@redhat.com>
	<4D5F99CD.2060709@quah.ro>
Message-ID: <4D6035A5.4020703@redhat.com>

On 02/19/2011 08:22 AM, Dan Candea wrote:
> On 19.02.2011 03:05, Cleber Rosa wrote:
>> On 02/18/2011 10:36 PM, Jakov Sosic wrote:
>>> Hi.
>>>
>>> Would it be possible to use the MySQL replication as a way of 
>>> achieving HA?
>>>
>>> When MASTER is down to take appropriate actions and declare SLAVE the
>>> NEW master, and take on IP address? Has anyone tested this kind of
>>> setup? Obviously, failback should be impossible and request manual 
>>> action.
>>>
>>> This would pose as a good solution for environments without shared
>>> storage and without DRBD.
>>>
>>>
>> Jakov,
>>
>> Actually MySQL allows for MASTER<->MASTER replication. I've 
>> successfully deployed that on some 10 sites or so top of RHCS using 
>> nothing but a floating IP address for MySQL (plus other resources for 
>> other services).
>>
>> Of course, you'd better keep an eye on MySQL's replication health 
>> when doing that.
>>
>> CR.
>>
>> -- 
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
> you could try with ndb tables,  it's the mysql cluster engine
>

AFAIK this still requires a "data storage" node (which is centralized), 
but I may be totally outdated on the subject.



From pieter.baele at gmail.com  Mon Feb 21 07:54:22 2011
From: pieter.baele at gmail.com (Pieter Baele)
Date: Mon, 21 Feb 2011 08:54:22 +0100
Subject: [Linux-cluster] CLVM mirror using Pacemaker (RHEL6)
Message-ID: <AANLkTim8zZEMNRU5fY4GEpJ4WcBJTEEGxX4iDgDPb7ku@mail.gmail.com>

Hi,

I added a DLM resource, but when I try to add clvm in crm, I get the
following error:
crm(live)configure# primitive clvm ocf:lvm2:clvmd params
daemon_timeout="30" op monitor interval="60" timeout="60"
ERROR: ocf:lvm2:clvmd: could not parse meta-data:
ERROR: ocf:lvm2:clvmd: no such resource agent

How can I set up clvm (mirroring) using Pacemaker DLM integration?


Met vriendelijke groeten,
Pieter Baele
www.pieterb.be



From linux-cluster at redhat.com  Mon Feb 21 07:57:29 2011
From: linux-cluster at redhat.com (Mailbot for etexusa.com)
Date: Sun, 20 Feb 2011 23:57:29 -0800
Subject: [Linux-cluster] DSN: failed (Message could not be delivered)
Message-ID: <mAWtxIeCDcrRueZ2l02@etexusa.com>


This is a Delivery Status Notification (DSN).

I was unable to deliver your message to
guhantex at eth.net.

I said 
  RCPT TO:<guhantex at eth.net>

And they gave me the error;
  550 5.1.1 unknown or illegal alias: guhantex at eth.net

 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: text/rfc822-headers
Size: 499 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110220/e3a5a136/attachment.bin>

From andrew at beekhof.net  Mon Feb 21 09:52:33 2011
From: andrew at beekhof.net (Andrew Beekhof)
Date: Mon, 21 Feb 2011 10:52:33 +0100
Subject: [Linux-cluster] WebData_monitor_0 (node=squeeze, call=11, rc=6,
 status=complete): not configured
In-Reply-To: <4D5EE9FB.9040407@whiteheat.org.uk>
References: <4D5E7474.2080500@whiteheat.org.uk>
	<4D5EDBFB.7010400@whiteheat.org.uk>
	<4D5EE9FB.9040407@whiteheat.org.uk>
Message-ID: <AANLkTimm+L3pkLE7Mef34R8Xj=AMqUVEjcFfKbWhE1D+@mail.gmail.com>

On Fri, Feb 18, 2011 at 10:51 PM, Jonathan Gowar <jon at whiteheat.org.uk> wrote:
> On 18/02/11 20:52, Jonathan Gowar wrote:
>>
>> On 18/02/11 13:30, Jonathan Gowar wrote:
>>>
>>> I've been following the cluster from scratch guide, by Beekhof. I'm
>>> using Debian 6, so I don't know how much that might confuse things; I
>>> appreciate there are a few debian-specifics.
>>>
>>> Before adding the drbd pacemaker resource crm status looked fine. After
>>> configuring the resource I get the following error from crm_mon:-
>>>
>>> WebData_monitor_0 (node=squeeze, call=11, rc=6, status=complete): not
>>> configured
>>>
>>> Here is the crm configuration, and monitor:-
>>>
>>> root at squeeze:~# crm configure show
>>> node sleeze
>>> node sneeze
>>> node squeeze
>>> primitive ClusterIP ocf:heartbeat:IPaddr2 \
>>> params ip="xxx.xxx.xxx.xxx" cidr_netmask="32" \
>>> op monitor interval="30s"
>>> primitive WebData ocf:linbit:drbd \
>>> params drbd_resource="wwwdata" \
>>> op monitor interval="60s"
>>> primitive WebSite ocf:heartbeat:apache \
>>> params configfile="/etc/apache2/apache2.conf" \
>>> op monitor interval="1m"
>>> ms WebDataClone WebData \
>>> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1"
>>> notify="true"
>>> colocation website-with-ip inf: WebSite ClusterIP
>>> order apache-after-ip inf: ClusterIP WebSite
>>> property $id="cib-bootstrap-options" \
>>> dc-version="1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b" \
>>> cluster-infrastructure="openais" \
>>> expected-quorum-votes="3" \
>>> stonith-enabled="false"
>>> rsc_defaults $id="rsc-options" \
>>> resource-stickiness="100"
>>> root at squeeze:~# crm status
>>> ============
>>> Last updated: Fri Feb 18 13:15:53 2011
>>> Stack: openais
>>> Current DC: sneeze - partition with quorum
>>> Version: 1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b
>>> 3 Nodes configured, 3 expected votes
>>> 3 Resources configured.
>>> ============
>>>
>>> Online: [ squeeze sneeze sleeze ]
>>>
>>> ClusterIP (ocf::heartbeat:IPaddr2): Started sneeze
>>> WebSite (ocf::heartbeat:apache): Started sneeze
>>> Master/Slave Set: WebDataClone
>>> Masters: [ squeeze ]
>>> Slaves: [ sneeze ]
>>>
>>> Failed actions:
>>> WebData_monitor_0 (node=sleeze, call=4, rc=6, status=complete): not
>>> configured
>>> WebData_monitor_0 (node=sneeze, call=9, rc=6, status=complete): not
>>> configured
>>> WebData_monitor_0 (node=squeeze, call=11, rc=6, status=complete): not
>>> configured
>>>
>>> Does anyone have any ideas as to how I might investigate where the
>>> problem is.
>>>
>>> Kind regards,
>>> Jon
>>
>> Hi,
>>
>> Found out how to debug failing resources:-
>>
>> http://www.clusterlabs.org/wiki/Debugging_Resource_Failures
>>
>> I managed to clear 1 problem, fuser was not installed; that means psmisc
>> for Debian users.
>>
>>
>> root at squeeze:~# crm configure show
>> node sleeze
>> node sneeze
>> node squeeze
>> primitive ClusterIP ocf:heartbeat:IPaddr2 \
>> params ip="xxx.xxx.xxx.xxx" cidr_netmask="32" \
>> op monitor interval="30s"
>> primitive WebData ocf:linbit:drbd \
>> params drbd_resource="wwwdata" \
>> op monitor interval="60s"
>> primitive WebFS ocf:heartbeat:Filesystem \
>> params device="/dev/drbd/by-res/wwwdata" directory="/var/www/drbd"
>> fstype="ext4" \
>> meta is-managed="true"
>> primitive WebSite ocf:heartbeat:apache \
>> params configfile="/etc/apache2/apache2.conf" \
>> op monitor interval="1m"
>> ms WebDataClone WebData \
>> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1"
>> notify="true" is-managed="false"
>> location cli-prefer-WebSite WebSite \
>> rule $id="cli-prefer-rule-WebSite" inf: #uname eq sleeze
>> colocation WebSite-with-WebFS inf: WebSite WebFS
>> colocation fs_on_drbd inf: WebFS WebDataClone:Master
>> colocation website-with-ip inf: WebSite ClusterIP
>> order WebFS-after-WebData inf: WebDataClone:promote WebFS:start
>> order WebSite-after-WebFS inf: WebFS WebSite
>> order apache-after-ip inf: ClusterIP WebSite
>> property $id="cib-bootstrap-options" \
>> dc-version="1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b" \
>> cluster-infrastructure="openais" \
>> expected-quorum-votes="3" \
>> stonith-enabled="false" \
>> last-lrm-refresh="1298043091"
>> rsc_defaults $id="rsc-options" \
>> resource-stickiness="100"
>>
>>
>> Here are a couple of bad looking lines from the debug output:-
>>
>>
>> /usr/lib/ocf/resource.d/linbit/drbd: 1: [[: not found
>> /usr/lib/ocf/resource.d/linbit/drbd: 1: 0x080307: not found
>> /usr/lib/ocf/resource.d/linbit/drbd: 1: Bad substitution
>>
>>
>> n.b. See full debug report at http://pastebin.com/pjKxBu8K
>>
>> OCF Return Code: 2
>> OCF Alias: OCF_ERR_ARGS
>> Description: "The resource's configuration is not valid on this machine.
>> Eg. Refers to a location/tool not found on the node."
>> Recovery Type: hard
>>
>> Let me know if there's anything else I need to post.
>>
>> Kind regards,
>> Jon
>
> Hi,
>
> ?This appeared to be a problem running 3 nodes. ?Stopping corosync on one of
> the nodes levitated the problem.
>
> Is it possible to have a 3 node cluster, 3 running apache, 2 running DRBD?

Should be possible

> ?If so, can someone point me in the direction of how to.

Depends on what errors are being thrown



From andrew at beekhof.net  Mon Feb 21 09:58:37 2011
From: andrew at beekhof.net (Andrew Beekhof)
Date: Mon, 21 Feb 2011 10:58:37 +0100
Subject: [Linux-cluster] MySQL MASTER->SLAVE agent?
In-Reply-To: <4D5F1072.1050202@srce.hr>
References: <4D5F1072.1050202@srce.hr>
Message-ID: <AANLkTi=-0DCHj_L7RJzNNaxHH8Ov-GzeKDnPobz5WC9N@mail.gmail.com>

On Sat, Feb 19, 2011 at 1:36 AM, Jakov Sosic <jakov.sosic at srce.hr> wrote:
> Hi.
>
> Would it be possible to use the MySQL replication as a way of achieving HA?
>
> When MASTER is down to take appropriate actions and declare SLAVE the
> NEW master, and take on IP address?

That would be tricky for rgmanager since it doesn't understand the
concept of multi-state resources.
I know people have done it with Pacemaker (also available in RHEL6) though.

> Has anyone tested this kind of
> setup? Obviously, failback should be impossible and request manual action.
>
> This would pose as a good solution for environments without shared
> storage and without DRBD.
>
>
> --
> Jakov Sosic
> www.srce.hr
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>



From thiagoh at digirati.com.br  Mon Feb 21 15:19:41 2011
From: thiagoh at digirati.com.br (Thiago Henrique)
Date: Mon, 21 Feb 2011 12:19:41 -0300
Subject: [Linux-cluster] Segfault in GFS2
Message-ID: <1298301581.21845.36.camel@thiagohenrique06>

Hello,

I'm making a simple test with GFS2: I run simultaneously on both nodes,
a script that make write operations in the filesystem. It causes GFS2 to
dump a stack trace and fault.

I have a cluster configured with two nodes like this:
  Ubuntu 10.04.1 LTS 
  Kernel 2.6.35-23-generic 
  drbd8-source-2:8.3.7-1ubuntu2.1
  drbd8-utils-2:8.3.8.1-0ubuntu1
  cman-3.0.2-2ubuntu3.1 
  libcman3-3.0.2-2ubuntu3.1 
  gfs2-tools-3.0.2-2ubuntu3.1


Is this known? What other kind of information could be useful to help
find this issue?

Thanks,
--
Thiago Henrique

STACK TRACE:
################################################################################
/var/log/kern.log:
Feb 20 06:29:39 wcluster1 kernel: [142560.304056] INFO: task
gfs2_quotad:1813 blocked for more than 120 seconds.
Feb 20 06:29:39 wcluster1 kernel: [142560.304075] "echo 0
> /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Feb 20 06:29:39 wcluster1 kernel: [142560.304089] gfs2_quotad   D
f4887e0c     0  1813      2 0x00000000
Feb 20 06:29:39 wcluster1 kernel: [142560.304098]  f4887e1c 00000046
00000002 f4887e0c f5778744 c05d99e0 c08c3700 c08c3700
Feb 20 06:29:39 wcluster1 kernel: [142560.304114]  e70ea676 00008184
c08c3700 c08c3700 e70c4587 00008184 00000000 c08c3700
Feb 20 06:29:39 wcluster1 kernel: [142560.304123]  c08c3700 f545bf70
00000001 f4887e50 00000000 f4887e58 f4887e24 f85ab73d
Feb 20 06:29:39 wcluster1 kernel: [142560.304133] Call Trace:
Feb 20 06:29:39 wcluster1 kernel: [142560.304174]  [<f85ab73d>]
gfs2_glock_holder_wait+0xd/0x20 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304192]  [<c05c81fd>]
__wait_on_bit+0x4d/0x70
Feb 20 06:29:39 wcluster1 kernel: [142560.304203]  [<f85ab730>] ?
gfs2_glock_holder_wait+0x0/0x20 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304214]  [<f85ab730>] ?
gfs2_glock_holder_wait+0x0/0x20 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304220]  [<c05c82cb>]
out_of_line_wait_on_bit+0xab/0xc0
Feb 20 06:29:39 wcluster1 kernel: [142560.304231]  [<c0165f20>] ?
wake_bit_function+0x0/0x50
Feb 20 06:29:39 wcluster1 kernel: [142560.304242]  [<f85ac7f2>]
gfs2_glock_wait+0x32/0x40 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304254]  [<f85adcfe>]
gfs2_glock_nq+0x29e/0x350 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304266]  [<c012cec8>] ?
default_spin_lock_flags+0x8/0x10
Feb 20 06:29:39 wcluster1 kernel: [142560.304272]  [<c05c977f>] ?
_raw_spin_lock_irqsave+0x2f/0x50
Feb 20 06:29:39 wcluster1 kernel: [142560.304296]  [<f85c5efc>]
gfs2_statfs_sync+0x4c/0x1b0 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304304]  [<c0159319>] ?
del_timer_sync+0x19/0x20
Feb 20 06:29:39 wcluster1 kernel: [142560.304319]  [<f85c5ef4>] ?
gfs2_statfs_sync+0x44/0x1b0 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304323]  [<c0158a90>] ?
process_timeout+0x0/0x10
Feb 20 06:29:39 wcluster1 kernel: [142560.304337]  [<f85bdfce>]
quotad_check_timeo+0x3e/0xa0 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304343]  [<c016603f>] ?
finish_wait+0x4f/0x70
Feb 20 06:29:39 wcluster1 kernel: [142560.304356]  [<f85be23a>]
gfs2_quotad+0x20a/0x250 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304362]  [<c05c75aa>] ?
schedule+0x37a/0x7a0
Feb 20 06:29:39 wcluster1 kernel: [142560.304367]  [<c0165ed0>] ?
autoremove_wake_function+0x0/0x50
Feb 20 06:29:39 wcluster1 kernel: [142560.304380]  [<f85be030>] ?
gfs2_quotad+0x0/0x250 [gfs2]
Feb 20 06:29:39 wcluster1 kernel: [142560.304386]  [<c0165aa4>] kthread
+0x74/0x80
Feb 20 06:29:39 wcluster1 kernel: [142560.304390]  [<c0165a30>] ?
kthread+0x0/0x80
Feb 20 06:29:39 wcluster1 kernel: [142560.304397]  [<c010363e>]
kernel_thread_helper+0x6/0x10
################################################################################



From ooolinux at 163.com  Tue Feb 22 02:55:00 2011
From: ooolinux at 163.com (yue)
Date: Tue, 22 Feb 2011 10:55:00 +0800 (CST)
Subject: [Linux-cluster] hi,question about gfs2
Message-ID: <30cb8ebe.1293c.12e4b4a8e15.Coremail.ooolinux@163.com>

1.if i can deploy gfs2 on fedora12.    if it is ok to build from source code ?
2.the max node  gfs2 can manger?  i use san, if i have 100 machines,if gfs2 can work over those nodes?
 
thanks
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110222/2b670f99/attachment.htm>

From ooolinux at 163.com  Tue Feb 22 03:04:39 2011
From: ooolinux at 163.com (yue)
Date: Tue, 22 Feb 2011 11:04:39 +0800 (CST)
Subject: [Linux-cluster] hi,question about gfs2
Message-ID: <6d94d69.12d04.12e4b5362f4.Coremail.ooolinux@163.com>

1.if i can deploy gfs2 on fedora12.    if it is ok to build from source code ?
2.the max node  gfs2 can manger?  i use san, if i have 100 machines,if gfs2 can work over those nodes?
 
thanks


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110222/df3eda06/attachment.htm>

From andrew at beekhof.net  Tue Feb 22 12:39:33 2011
From: andrew at beekhof.net (Andrew Beekhof)
Date: Tue, 22 Feb 2011 13:39:33 +0100
Subject: [Linux-cluster] hi,question about gfs2
In-Reply-To: <30cb8ebe.1293c.12e4b4a8e15.Coremail.ooolinux@163.com>
References: <30cb8ebe.1293c.12e4b4a8e15.Coremail.ooolinux@163.com>
Message-ID: <AANLkTin+Nbvp3P+zfZMegXXBhi3dXV-GrRfTW9UZuauw@mail.gmail.com>

2011/2/22 yue <ooolinux at 163.com>:
> 1.if i can deploy gfs2 on fedora12.

Why would you do that?  Isn't F-12 unsupported in less than 2 months?

> if it is ok to build from source code
> ?
> 2.the max node? gfs2 can manger?? i use san, if i have 100 machines,if gfs2
> can work over those nodes?
>
> thanks
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>



From rpeterso at redhat.com  Tue Feb 22 13:48:42 2011
From: rpeterso at redhat.com (Bob Peterson)
Date: Tue, 22 Feb 2011 08:48:42 -0500 (EST)
Subject: [Linux-cluster] hi,question about gfs2
In-Reply-To: <6d94d69.12d04.12e4b5362f4.Coremail.ooolinux@163.com>
Message-ID: <205773825.125610.1298382522204.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>

----- Original Message -----
| 1.if i can deploy gfs2 on fedora12. if it is ok to build from source
| code ?

Yes you can.  However, as Andrew said, it's probably a mistake.
You're better off using Fedora 14 where the code base is newer
and it will be supported longer.

You can build it from source, but find a source that's compatible
with your kernel may be a challenge.  GFS2 has advanced to match
ongoing kernel development.

You can build the Fedora 12 kernel from source RPMS, but you're
likely going to encounter bugs that have already been fixed by
later revisions.

If you go with Fedora 14, it may be easier to compile the latest
source from the GFS2 kernel git repo.

| 2.the max node gfs2 can manger? i use san, if i have 100 machines,if
| gfs2 can work over those nodes?
| 
| thanks

GFS2 does not care how many nodes are in your cluster.
The only thing that cares is the rest of the cluster infrastructure.
However, we don't recommend that many nodes for various reasons.
For one thing, your network may be clogged with lots of traffic,
which may interfere with proper cluster communications.

Regards,

Bob Peterson
Red Hat File Systems



From ooolinux at 163.com  Wed Feb 23 01:59:09 2011
From: ooolinux at 163.com (yue)
Date: Wed, 23 Feb 2011 09:59:09 +0800 (CST)
Subject: [Linux-cluster] hi,question about gfs2
In-Reply-To: <AANLkTin+Nbvp3P+zfZMegXXBhi3dXV-GrRfTW9UZuauw@mail.gmail.com>
References: <AANLkTin+Nbvp3P+zfZMegXXBhi3dXV-GrRfTW9UZuauw@mail.gmail.com>
	<30cb8ebe.1293c.12e4b4a8e15.Coremail.ooolinux@163.com>
Message-ID: <16cbd14.1841.12e503dc631.Coremail.ooolinux@163.com>

1.hi, i do not know fc12 unsupported in less than 2 months
now my environment is fc12

2.if there are rpm  ,rpm is good.
3.i want gfs2 to host xen image-disk.   100G--200G image file.
and need xen live migration  amoung gluster.
i do not know gfs2's performance.
thanks
----------------
At 2011-02-22 20:39:33?"Andrew Beekhof" <andrew at beekhof.net> wrote:

>2011/2/22 yue <ooolinux at 163.com>:
>> 1.if i can deploy gfs2 on fedora12.
>
>Why would you do that?  Isn't F-12 unsupported in less than 2 months?
>
>> if it is ok to build from source code
>> ?
>> 2.the max node  gfs2 can manger?  i use san, if i have 100 machines,if gfs2
>> can work over those nodes?
>>
>> thanks
>>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110223/2dce0547/attachment.htm>

From ad+lists at uni-x.org  Wed Feb 23 08:48:45 2011
From: ad+lists at uni-x.org (Alexander Dalloz)
Date: Wed, 23 Feb 2011 09:48:45 +0100
Subject: [Linux-cluster] hi,question about gfs2
In-Reply-To: <AANLkTin+Nbvp3P+zfZMegXXBhi3dXV-GrRfTW9UZuauw@mail.gmail.com>
References: <30cb8ebe.1293c.12e4b4a8e15.Coremail.ooolinux@163.com>
	<AANLkTin+Nbvp3P+zfZMegXXBhi3dXV-GrRfTW9UZuauw@mail.gmail.com>
Message-ID: <4D64C9ED.6020604@uni-x.org>

Am 22.02.2011 13:39, schrieb Andrew Beekhof:
> 2011/2/22 yue <ooolinux at 163.com>:
>> 1.if i can deploy gfs2 on fedora12.
> 
> Why would you do that?  Isn't F-12 unsupported in less than 2 months?

F 12 *is* unsupported since nearly 3 months! It went EOL on 2010-12-02

https://fedoraproject.org/wiki/Fedora_Release_Life_Cycle#Maintenance_Schedule

Alexander



From jeff.sturm at eprize.com  Wed Feb 23 13:42:23 2011
From: jeff.sturm at eprize.com (Jeff Sturm)
Date: Wed, 23 Feb 2011 08:42:23 -0500
Subject: [Linux-cluster] hi,question about gfs2
In-Reply-To: <16cbd14.1841.12e503dc631.Coremail.ooolinux@163.com>
References: <AANLkTin+Nbvp3P+zfZMegXXBhi3dXV-GrRfTW9UZuauw@mail.gmail.com><30cb8ebe.1293c.12e4b4a8e15.Coremail.ooolinux@163.com>
	<16cbd14.1841.12e503dc631.Coremail.ooolinux@163.com>
Message-ID: <64D0546C5EBBD147B75DE133D798665F0855C1EA@hugo.eprize.local>

GFS2 isn't the only way to get live Xen migration.  If it's simpler, you can implement CLVM on shared storage, and use logical volumes to contain disk images.  Your cluster infrastructure will ensure consistency of volume group metadata and still provide for domain failover.

 

-Jeff

 

From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of yue
Sent: Tuesday, February 22, 2011 8:59 PM
To: Andrew Beekhof
Cc: cluster-devel; linux clustering
Subject: Re: [Linux-cluster] hi,question about gfs2

 

1.hi, i do not know fc12 unsupported in less than 2 months

now my environment is fc12

2.if there are rpm  ,rpm is good.
3.i want gfs2 to host xen image-disk.   100G--200G image file.
and need xen live migration  amoung gluster.
i do not know gfs2's performance.
thanks
----------------
At 2011-02-22 20:39:33?"Andrew Beekhof" <andrew at beekhof.net> wrote:
 
>2011/2/22 yue <ooolinux at 163.com>:
>> 1.if i can deploy gfs2 on fedora12.
> 
>Why would you do that?  Isn't F-12 unsupported in less than 2 months?
> 
>> if it is ok to build from source code
>> ?
>> 2.the max node  gfs2 can manger?  i use san, if i have 100 machines,if gfs2
>> can work over those nodes?
>> 
>> thanks
>> 
>> 
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>> 

 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110223/c3f9b523/attachment.htm>

From zachar at awst.at  Wed Feb 23 14:45:02 2011
From: zachar at awst.at (zachar at awst.at)
Date: Wed, 23 Feb 2011 15:45:02 +0100 (CET)
Subject: [Linux-cluster] =?utf-8?q?hi=2Cquestion_about_gfs2?=
Message-ID: <mtranet.20110223154502.1505152292@telekom.at>

Is this solution supported by RedHat?

If I am correct, it isn't:
https://access.redhat.com/kb/docs/DOC-17651

With HA-LVM you will loose the live-migration...

And if I am correct, as the redhat cluster suite's resource script do not activate the vgs (or lvs)  exclusively (that can be an option when you are using clustered LVM), nothing would prevent the second node to start the vm that is already running on the master node (which would probably kill your filesystems in your vm) in a split brain situation (except a "reliable" fencing).

If I am correct, the only supported method is CLVMD+GFS(2) if you want live migration for a VM. Is this true?

Regards,
Balazs


Jeff Sturm schrieb:
> GFS2 isn't the only way to get live Xen migration.  If it's simpler, you 
> can implement CLVM on shared storage, and use logical volumes to contain 
> disk images.  Your cluster infrastructure will ensure consistency of 
> volume group metadata and still provide for domain failover.
> 
>  
> 
> -Jeff
> 
>  
> 
> From: linux-cluster-bounces at redhat.com [mailto:
> linux-cluster-bounces at redhat.com] On Behalf Of yue
> Sent: Tuesday, February 22, 2011 8:59 PM
> To: Andrew Beekhof
> Cc: cluster-devel; linux clustering
> Subject: Re: [Linux-cluster] hi,question about gfs2
> 
>  
> 
> 1.hi, i do not know fc12 unsupported in less than 2 months
> 
> now my environment is fc12
> 
> 2.if there are rpm  ,rpm is good.
> 3.i want gfs2 to host xen image-disk.   100G--200G image file.
> and need xen live migration  amoung gluster.
> i do not know gfs2's performance.
> thanks
> ----------------
> At 2011-02-22 20:39:33?"Andrew Beekhof" <andrew at beekhof.net> wrote:
>  
> >2011/2/22 yue <ooolinux at 163.com>:
> >> 1.if i can deploy gfs2 on fedora12.
> > 
> >Why would you do that?  Isn't F-12 unsupported in less than 2 months?
> > 
> >> if it is ok to build from source code
> >> ?
> >> 2.the max node  gfs2 can manger?  i use san, if i have 100 machines,
> if gfs2
> >> can work over those nodes?
> >> 
> >> thanks
> >> 
> >> 
> >> --
> >> Linux-cluster mailing list
> >> Linux-cluster at redhat.com
> >> https://www.redhat.com/mailman/listinfo/linux-cluster
> >> 
> 
>  
> 




From rpeterso at redhat.com  Wed Feb 23 18:08:07 2011
From: rpeterso at redhat.com (Bob Peterson)
Date: Wed, 23 Feb 2011 13:08:07 -0500 (EST)
Subject: [Linux-cluster] New gfs2-utils-3.1.1 release
In-Reply-To: <2038341380.151565.1298484449813.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
Message-ID: <1628649254.151580.1298484487360.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>

Hi,

I just wanted to let everyone know that I just did a new build of gfs2-utils.

The new source tarball can be downloaded here:

https://fedorahosted.org/releases/g/f/gfs2-utils/gfs2-utils-3.1.1.tar.gz

To report bugs or issues:

   https://bugzilla.redhat.com/

Regards,

Bob Peterson
Red Hat File Systems

Changes since 3.1.0:

Bob Peterson (14):
      gfs2_edit savemeta doesn't save all leaf blocks for large dirs
      fsck.gfs2: segfault in pass1b
      gfs2_edit: fix segfault in set_bitmap when block is in rgrp
      gfs2_edit: fix careless compiler warning
      gfs2_edit: Fix error message on blockalloc when outside bitmap
      gfs2_edit: add -d option for printing journal details
      gfs2_edit: has problems printing gfs1 journals
      gfs2_edit: print large block numbers better
      gfs2_edit: handle corrupt file systems better
      fsck.gfs2: can't repair rgrps resulting from gfs_grow->gfs2_convert
      fsck.gfs2: reports master/root dinodes as unused and fixes the bitmap
      gfs2-utils: minor corrections to README.build
      GFS2: mkfs.gfs2 segfaults with 18.55TB and -b512
      mkfs.gfs2 should support discard request generation

Ben Marzinski (1):
      gfs2_grow: fix growing on full filesystems

Dave Teigland (1):
      gfs_controld: remove oom_adj

Steve Whitehouse (5):
      libgfs2: Move gfs2_query into gfs2_convert
      libgfs2: Remove unused function get_sysfs_uinit()
      libgfs2: Remove calls to gettext from libgfs2
      strings: Clean up strings
      tune: Clean up and make closer to tune2fs



From Ning.Bao at statcan.gc.ca  Wed Feb 23 18:49:58 2011
From: Ning.Bao at statcan.gc.ca (Ning.Bao at statcan.gc.ca)
Date: Wed, 23 Feb 2011 13:49:58 -0500
Subject: [Linux-cluster]  question about Fabric fencing in GFS2
Message-ID: <DD4D0CFE33FD2F4F8657DCE624FE657E088AAE@STCEM17.statcan.ca>

Hi 

I am very new to GFS2. When I am reading GFS2 docs, I have noticed
fabric fencing method. If I understand correctly, fabric fencing
requires cluster node would be able to login into the SAN switch to
disable ports. Does such kind of access put admin password of the SAN
switch  in cluster.conf in clear text? If yes, it could be a bad idea
for storage admins, If not, does storage admins need create a special
account which can only disable the ports for the particular host?

Thanks
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110223/3c7cdec8/attachment.htm>

From scooter at cgl.ucsf.edu  Wed Feb 23 20:17:00 2011
From: scooter at cgl.ucsf.edu (Scooter Morris)
Date: Wed, 23 Feb 2011 12:17:00 -0800
Subject: [Linux-cluster] multiple gfs2_tool shrinks cause hang?
Message-ID: <4D656B3C.20603@cgl.ucsf.edu>

Hi all,
     I recently had a hang on our cluster that I unwittingly caused and 
wondered if anyone else has seen anything similar.  We were noticing a 
definitely slow-down in one filesystem and doing some investigation, I 
noticed that one of the nodes had a large number of locks gfs2_glock in 
/proc/slabinfo was very large.  I decided to try doing a gfs2_tool 
shrink on the filesystem that was going to slow.  I noticed some 
reduction in the number of locks, but not a lot, so I did it again.  
Everything dropped into D wait on that filesystem, as did several of the 
kernel threads.  Has anyone else seen this behavior?  Is this a known bug?

-- scooter



From sachinbhugra at hotmail.com  Wed Feb 23 20:27:08 2011
From: sachinbhugra at hotmail.com (sachin)
Date: Thu, 24 Feb 2011 01:57:08 +0530
Subject: [Linux-cluster] Cluster node hangs
In-Reply-To: <AANLkTimZ9E4xm8qS_bb7-NcxY7X+u0Eg1nYaCXK0iuKj@mail.gmail.com>
References: <SNT112-W544548584AD789563BB1ECDAD10@phx.gbl>	<4D57A763.8030700@redhat.com>
	<4D57A9F3.90408@redhat.com>	<SNT112-W456C3B1F45B101BABEDFDFDAD10@phx.gbl>
	<AANLkTimZ9E4xm8qS_bb7-NcxY7X+u0Eg1nYaCXK0iuKj@mail.gmail.com>
Message-ID: <SNT112-DS15927A4805EBF6388A7ED6DADB0@phx.gbl>

Hi Dominic,

 

Below is my cluster.conf:

===================================

<?xml version="1.0"?>

<cluster alias="rhel5_cluster" config_version="21" name="rhel5_cluster">

        <fence_daemon post_fail_delay="0" post_join_delay="3"/>

        <clusternodes>

                <clusternode name="rhel5cln1.home.com" nodeid="1" votes="1">

                        <fence>

                                <method name="1">

                                        <device name="manual_fence"
nodename="rhel5cln1.home.com"/>

                                </method>

                        </fence>

                </clusternode>

                <clusternode name="rhel5cln2.home.com" nodeid="2" votes="1">

                        <fence>

                                <method name="1">

                                        <device name="manual_fence"
nodename="rhel5cln2.home.com"/>

                                </method>

                        </fence>

                </clusternode>

        </clusternodes>

        <cman expected_votes="1" two_node="1"/>

        <fencedevices>

                <fencedevice agent="fence_manual" name="manual_fence"/>

        </fencedevices>

        <rm log_level="7" log_facility="local3">

                <failoverdomains/>

                <resources>

                        <script file="/usr/local/httpd2.2.16/bin/apachectl"
name="Apache_Script"/>

                        <ip address="192.168.30.137" monitor_link="1"/>

                        <clusterfs device="/dev/sdc" force_unmount="0"
fsid="22440" fstype="gfs2" mountpoint="/usr/local/httpd2.2.16/htdocs/"
name="gfs2share" options=""/>

                </resources>

                <service autostart="1" name="Apache_Service"
recovery="restart">

                        <ip ref="192.168.30.137"/>

                        <script ref="Apache_Script"/>

                </service>

                <service autostart="1" name="gfs2share" recovery="relocate">

                        <clusterfs ref="gfs2share"/>

                </service>

        </rm>

<logging to_syslog="yes" to_logfile="yes" syslog_facility="local3">

<logging_daemon name="corosync" logfile="/var/log/cluster.log"/>

</logging>

</cluster>

=================================

 

One thing which I noticed is when I move the service on other node, it
generates the following logs:

 

Feb 20 21:50:48 rhel5cln1 clurgmgrd[13764]: <notice> Stopping service
service:gfs2share

Feb 20 21:50:48 rhel5cln1 clurgmgrd: [13764]: <debug> Not umounting /dev/sdc
(clustered file system)

Feb 20 21:50:48 rhel5cln1 clurgmgrd[13764]: <notice> Service
service:gfs2share is stopped

 

Cluster is configured in such that only one node should be mounting the GFS2
FS. When I start the cluster only one node mounts GFS2, however when service
is moved GFS2 gets mounted on both the node but it is still accessible. It
hangs when the owner node goes down and services move to other node
automatically.

 

 

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of dOminic
Sent: Sunday, February 13, 2011 8:03 PM
To: linux clustering
Subject: Re: [Linux-cluster] Cluster node hangs

 

Hi,

 

Whats the msg you are getting in logs ?. It would be great if you could
attach log mesgs along with cluster.conf 

 

-dominic 

 

On Sun, Feb 13, 2011 at 3:49 PM, Sachin Bhugra <sachinbhugra at hotmail.com>
wrote:

Thank for the reply and link. However, GFS2 is not listed in fstab, it is
only handled by cluster config.

  _____  

Date: Sun, 13 Feb 2011 10:52:51 +0100
From: ekuric at redhat.com
To: linux-cluster at redhat.com
Subject: Re: [Linux-cluster] Cluster node hangs



On 02/13/2011 10:41 AM, Elvir Kuric wrote: 

On 02/13/2011 10:14 AM, Sachin Bhugra wrote: 

Hi ,

I have setup a two node cluster in lab, with Vmware Server, and hence used
manual fencing. It includes a iSCSI GFS2 partition and it service Apache in
Active/Passive mode.

Cluster works and I am able to relocate service between nodes with no
issues. However, the problem comes when I shutdown the node, for testing,
which is presently holding the service. When the node becomes unavailable,
service gets relocated and GFS partition gets mounted on the other node,
however it is not accessible. If I try to do a "ls/du" on GFS partition, the
command hangs. On the other hand the node which was shutdown gets stuck at
"unmounting file system". 

I tried using fence_manual -n nodename and then fence_ack_manual -n
nodename, however it still remains the same.

Can someone please help me is what I am doing wrong?

Thanks, 




--


Linux-cluster mailing list


Linux-cluster at redhat.com


https://www.redhat.com/mailman/listinfo/linux-cluster

It would be good to see  /etc/fstab configuration used on cluster nodes. If
/gfs partition is mounted manually it will not be unmounted correctly in
case you restart node ( and not executing umount prior restart ), and will
hang during shutdown/reboot process.

More at:
http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/6/html-single/Glo
bal_File_System_2/index.html


Edit: above link, section 3.4 Special Considerations when Mounting GFS2 File
Systems 



Regards, 

Elvir 

 

 




--


Linux-cluster mailing list


Linux-cluster at redhat.com


https://www.redhat.com/mailman/listinfo/linux-cluster

 

-- Linux-cluster mailing list Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster 


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110224/3dd27c34/attachment.htm>

From rpeterso at redhat.com  Wed Feb 23 20:54:03 2011
From: rpeterso at redhat.com (Bob Peterson)
Date: Wed, 23 Feb 2011 15:54:03 -0500 (EST)
Subject: [Linux-cluster] multiple gfs2_tool shrinks cause hang?
In-Reply-To: <4D656B3C.20603@cgl.ucsf.edu>
Message-ID: <1529888616.155259.1298494443324.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>

----- Original Message -----
| Hi all,
| I recently had a hang on our cluster that I unwittingly caused and
| wondered if anyone else has seen anything similar. We were noticing a
| definitely slow-down in one filesystem and doing some investigation, I
| noticed that one of the nodes had a large number of locks gfs2_glock
| in
| /proc/slabinfo was very large. I decided to try doing a gfs2_tool
| shrink on the filesystem that was going to slow. I noticed some
| reduction in the number of locks, but not a lot, so I did it again.
| Everything dropped into D wait on that filesystem, as did several of
| the
| kernel threads. Has anyone else seen this behavior? Is this a known
| bug?
| 
| -- scooter

Hi Scooter,

I don't know of any problems with gfs2_tool shrink.
Since GFS2 is self-tuning, people don't use gfs2_tool shrink.
As a matter of fact, I'm not even sure if it's tested.

Feel free to open a bugzilla record (or a Red Hat support case if
applicable) for us to investigate the gfs2_tool shrink issue.

If the original problem occurs again, please consider compiling and
running the glock analyzer tool on my people page:

http://people.redhat.com/rpeterso/Experimental/RHEL5.x/gfs2/gfs2_hangalyzer.c

Instructions are contained in comments at the top.  Then post to a
bugzilla record: (1) the program's output, (2) the files it downloaded into
/tmp/ and (3) the output from sysrq-t showing the call traces of all
processes from syslog.

Regards,

Bob Peterson
Red Hat File Systems



From Colin.Simpson at iongeo.com  Wed Feb 23 21:01:34 2011
From: Colin.Simpson at iongeo.com (Colin Simpson)
Date: Wed, 23 Feb 2011 21:01:34 +0000
Subject: [Linux-cluster] RHEL6 CLVM doesn't activate the LV on other node?
Message-ID: <1298494894.5839.8.camel@cowie.iouk.ioroot.tld>

Hi,

I'm not sure if I've missed something in the discussion here or the
manual.

I have just been trying clvm on RHEL 6 for the first time.

On the first node if I run:
node1# lvcreate --size 1G --name lv00test cluvg00

node1# lvscan
  ACTIVE            '/dev/cluvg00/lv00test' [1.00 GiB] inherit

However on the second node I get:
node2# lvscan 
  inactive          '/dev/cluvg00/lv00test' [1.00 GiB] inherit

And (no surprise) I don't have /dev/cluvg00/lv00test. 

I can bring it to active on the second node with vgchange -ay , but that
doesn't seem right (and is that persistent)?

All I have changed is the locking in lvm.conf to 3 as ever. 

I'm pretty sure I didn't do anything else on RHEL 5, and it just
appeared ACTIVE with no fuss on both nodes.

Have I missed something?

Thanks

Colin



This email and any files transmitted with it are confidential and are intended solely for the use of the individual or entity to whom they are addressed.  If you are not the original recipient or the person responsible for delivering the email to the intended recipient, be advised that you have received this email in error, and that any use, dissemination, forwarding, printing, or copying of this email is strictly prohibited. If you received this email in error, please immediately notify the sender and delete the original.





From cfeist at redhat.com  Wed Feb 23 21:10:43 2011
From: cfeist at redhat.com (Chris Feist)
Date: Wed, 23 Feb 2011 15:10:43 -0600
Subject: [Linux-cluster] Announcing new command line cluster configuration
	tool
Message-ID: <4D6577D3.9090508@redhat.com>

We have finished updating fedora packages to include our new command line 
cluster configuration tool, ccs.

It's included in the ccs package in F14/F15 (yum install ccs) and can be used to 
setup clusters from scratch or modify an existing cluster and provides far more 
functionality than the old ccs_tool command.

You can also use ccs to synchronize your cluster.conf across all nodes in the 
cluster as well as start and stop specific nodes or the entire cluster.

More information can be found in the ccs man page.

Thanks,
Chris



From ajb2 at mssl.ucl.ac.uk  Wed Feb 23 21:48:03 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Wed, 23 Feb 2011 21:48:03 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D5E68C9.1000206@mssl.ucl.ac.uk>
References: <4D5BDB4E.3010006@mssl.ucl.ac.uk>	<20110216175848.GC2291@redhat.com>	<4D5D9219.8090002@mssl.ucl.ac.uk>	<20110217212948.GA9582@redhat.com>
	<4D5E68C9.1000206@mssl.ucl.ac.uk>
Message-ID: <4D658093.5010807@mssl.ucl.ac.uk>


After running several days with the larger table sizes I don't think 
it's made any difference to individual thread performance or overall 
throughput.

Likewise, the following changes have had no effect on access time for 
large directories (but they have improved caching and improved high load 
overall performance):

Increasing dentry and inode caches to the maximum size allowed by the 
kernel (about 128million entries. This is limited as a percentage of 
memory to about 10%)

This helped caching under load, but until I added the following change:

(sysctl)
vm.max_reclaims_in_progress=1
vm.zone_reclaim_mode=0

The cached dentry data would evaporate after a while.

(Switching to reeclaim_mode=0 is recommended for fileservers to enhance 
dentry/inode caching)


At the end of all that, the effect is only minor and the biggest bugbear 
- access to directories with more than ~150 files onboard is unusably 
slow - hasn't been addressed.

The change which had the largest effect on this problem - switching to 
lock_nolock - isn't practical in a production cluster environment (and 
defeats the purpose of using GFS2 anyway)

Iostat's showing that under heavy i/o load (1000-3000 requests/second 
but only 2-3Mb/s actual data), the kernel on one machine can sit on 
read/write equests for up to 3000ms before passing them to the storage 
devices - which usually respond within 2-5ms. It's sitting at 300ms most 
of the time and the machine concerned only has 5 FSes mounted.

The other 2 machines in the cluster not facing this kind of 
treatement(100-300 requests/second) have 30 mounts each, can easily read 
at 10-20Mb/s and have read delays of 2-10ms (mostly 3-4).

Users report that these 2 machines are _fast_ when not accessing 
directories with large numbers of files onboard...










From sdake at redhat.com  Wed Feb 23 22:27:04 2011
From: sdake at redhat.com (Steven Dake)
Date: Wed, 23 Feb 2011 15:27:04 -0700
Subject: [Linux-cluster] Announcing new command line cluster
 configuration tool
In-Reply-To: <4D6577D3.9090508@redhat.com>
References: <4D6577D3.9090508@redhat.com>
Message-ID: <4D6589B8.7080003@redhat.com>

On 02/23/2011 02:10 PM, Chris Feist wrote:
> We have finished updating fedora packages to include our new command
> line cluster configuration tool, ccs.
> 
> It's included in the ccs package in F14/F15 (yum install ccs) and can be
> used to setup clusters from scratch or modify an existing cluster and
> provides far more functionality than the old ccs_tool command.
> 
> You can also use ccs to synchronize your cluster.conf across all nodes
> in the cluster as well as start and stop specific nodes or the entire
> cluster.
> 
> More information can be found in the ccs man page.
> 
> Thanks,
> Chris
> 

nice work!

> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster



From fdinitto at redhat.com  Wed Feb 23 22:53:05 2011
From: fdinitto at redhat.com (Fabio M. Di Nitto)
Date: Wed, 23 Feb 2011 23:53:05 +0100
Subject: [Linux-cluster] [Cluster-devel] Announcing new command line
 cluster configuration tool
In-Reply-To: <4D6577D3.9090508@redhat.com>
References: <4D6577D3.9090508@redhat.com>
Message-ID: <4D658FD1.4040803@redhat.com>

On 02/23/2011 10:10 PM, Chris Feist wrote:
> We have finished updating fedora packages to include our new command
> line cluster configuration tool, ccs.
> 
> It's included in the ccs package in F14/F15 (yum install ccs)

It's also in Fedora 13 FYI.

Fabio



From jbrassow at redhat.com  Wed Feb 23 23:23:49 2011
From: jbrassow at redhat.com (Jonathan Brassow)
Date: Wed, 23 Feb 2011 17:23:49 -0600
Subject: [Linux-cluster] RHEL6 CLVM doesn't activate the LV on other
	node?
In-Reply-To: <1298494894.5839.8.camel@cowie.iouk.ioroot.tld>
References: <1298494894.5839.8.camel@cowie.iouk.ioroot.tld>
Message-ID: <ECA5D53E-189A-48EA-89D1-E9E145A4844B@redhat.com>


On Feb 23, 2011, at 3:01 PM, Colin Simpson wrote:

>
> All I have changed is the locking in lvm.conf to 3 as ever.

On all nodes?

Also, 'vgs' reports that the volume group is "clustered"?

  brassow



From thomas at sjolshagen.net  Wed Feb 23 23:38:38 2011
From: thomas at sjolshagen.net (Thomas Sjolshagen)
Date: Wed, 23 Feb 2011 18:38:38 -0500
Subject: [Linux-cluster] RHEL6 CLVM doesn't activate the LV on other
	node?
In-Reply-To: <1298494894.5839.8.camel@cowie.iouk.ioroot.tld>
References: <1298494894.5839.8.camel@cowie.iouk.ioroot.tld>
Message-ID: <F1065A16-83EB-49A3-849A-6C3F60FA6DC1@sjolshagen.net>

You seem to have forgotten to create the group as "clustered". Off the top of my head that would mean you need to vgchange the vg with the -cy option (but my recollection of the option could be fuzzy/dead wrong, so check the man page!)



On Feb 23, 2011, at 16:01, "Colin Simpson" <Colin.Simpson at iongeo.com> wrote:

> Hi,
> 
> I'm not sure if I've missed something in the discussion here or the
> manual.
> 
> I have just been trying clvm on RHEL 6 for the first time.
> 
> On the first node if I run:
> node1# lvcreate --size 1G --name lv00test cluvg00
> 
> node1# lvscan
>  ACTIVE            '/dev/cluvg00/lv00test' [1.00 GiB] inherit
> 
> However on the second node I get:
> node2# lvscan 
>  inactive          '/dev/cluvg00/lv00test' [1.00 GiB] inherit
> 
> And (no surprise) I don't have /dev/cluvg00/lv00test. 
> 
> I can bring it to active on the second node with vgchange -ay , but that
> doesn't seem right (and is that persistent)?
> 
> All I have changed is the locking in lvm.conf to 3 as ever. 
> 
> I'm pretty sure I didn't do anything else on RHEL 5, and it just
> appeared ACTIVE with no fuss on both nodes.
> 
> Have I missed something?
> 
> Thanks
> 
> Colin
> 
> 
> 
> This email and any files transmitted with it are confidential and are intended solely for the use of the individual or entity to whom they are addressed.  If you are not the original recipient or the person responsible for delivering the email to the intended recipient, be advised that you have received this email in error, and that any use, dissemination, forwarding, printing, or copying of this email is strictly prohibited. If you received this email in error, please immediately notify the sender and delete the original.
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster



From Colin.Simpson at iongeo.com  Thu Feb 24 00:48:08 2011
From: Colin.Simpson at iongeo.com (Colin Simpson)
Date: Thu, 24 Feb 2011 00:48:08 +0000
Subject: [Linux-cluster] RHEL6 CLVM doesn't activate the LV on othernode?
In-Reply-To: <F1065A16-83EB-49A3-849A-6C3F60FA6DC1@sjolshagen.net>
References: <1298494894.5839.8.camel@cowie.iouk.ioroot.tld>
	<F1065A16-83EB-49A3-849A-6C3F60FA6DC1@sjolshagen.net>
Message-ID: <1298508488.4051.5.camel@ascog.iouk.ioroot.tld>

It is indeed the issue that the vg isn't clustered.

# vgs
  VG      #PV #LV #SN Attr   VSize   VFree  
  cluvg00   1   1   0 wz--n- 954.94g 953.94g

I wasn't sure why it managed to get created this was when vgcreate
defaults to clustering. I've just found that one of my nodes has 
locking_type = 3

I'll retest with the correction.

Thanks

Colin

On Wed, 2011-02-23 at 23:38 +0000, Thomas Sjolshagen wrote:
> You seem to have forgotten to create the group as "clustered". Off the
> top of my head that would mean you need to vgchange the vg with the
> -cy option (but my recollection of the option could be fuzzy/dead
> wrong, so check the man page!)
> 
> 
> 
> On Feb 23, 2011, at 16:01, "Colin Simpson" <Colin.Simpson at iongeo.com>
> wrote:
> 
> > Hi,
> >
> > I'm not sure if I've missed something in the discussion here or the
> > manual.
> >
> > I have just been trying clvm on RHEL 6 for the first time.
> >
> > On the first node if I run:
> > node1# lvcreate --size 1G --name lv00test cluvg00
> >
> > node1# lvscan
> >  ACTIVE            '/dev/cluvg00/lv00test' [1.00 GiB] inherit
> >
> > However on the second node I get:
> > node2# lvscan
> >  inactive          '/dev/cluvg00/lv00test' [1.00 GiB] inherit
> >
> > And (no surprise) I don't have /dev/cluvg00/lv00test.
> >
> > I can bring it to active on the second node with vgchange -ay , but
> that
> > doesn't seem right (and is that persistent)?
> >
> > All I have changed is the locking in lvm.conf to 3 as ever.
> >
> > I'm pretty sure I didn't do anything else on RHEL 5, and it just
> > appeared ACTIVE with no fuss on both nodes.
> >
> > Have I missed something?
> >
> > Thanks
> >
> > Colin
> >
> >
> >
> > This email and any files transmitted with it are confidential and
> are intended solely for the use of the individual or entity to whom
> they are addressed.  If you are not the original recipient or the
> person responsible for delivering the email to the intended recipient,
> be advised that you have received this email in error, and that any
> use, dissemination, forwarding, printing, or copying of this email is
> strictly prohibited. If you received this email in error, please
> immediately notify the sender and delete the original.
> >
> >
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 




From ooolinux at 163.com  Thu Feb 24 06:38:55 2011
From: ooolinux at 163.com (yue)
Date: Thu, 24 Feb 2011 14:38:55 +0800 (CST)
Subject: [Linux-cluster] if gfs2 or clvm based on rhcs?
Message-ID: <3c66dfc8.1ba37.12e56644755.Coremail.ooolinux@163.com>

i want to  deploy iscsi+clvm+gfs2
 
i do not know if i have to install rhcs?
but i use fc12  ,not rhel.
 
what /etc/cluster/cluster.conf  do ?   how to configure the conf file?
 
thanks
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110224/ea8d48e6/attachment.htm>

From fdinitto at redhat.com  Thu Feb 24 07:29:08 2011
From: fdinitto at redhat.com (Fabio M. Di Nitto)
Date: Thu, 24 Feb 2011 08:29:08 +0100
Subject: [Linux-cluster] [Cluster-devel] if gfs2 or clvm based on rhcs?
In-Reply-To: <3c66dfc8.1ba37.12e56644755.Coremail.ooolinux@163.com>
References: <3c66dfc8.1ba37.12e56644755.Coremail.ooolinux@163.com>
Message-ID: <4D6608C4.6010506@redhat.com>

Please stop posting to all mailing lists at the same time.

User questions should be directed to linux-cluster mailing list only.

Fabio

On 02/24/2011 07:38 AM, yue wrote:
> i want to  deploy iscsi+clvm+gfs2
>  
> i do not know if i have to install rhcs?
> but i use fc12  ,not rhel.
>  
> what /etc/cluster/cluster.conf  do ?   how to configure the conf file?
>  
> thanks
> 
> 



From pieter.baele at gmail.com  Thu Feb 24 08:00:11 2011
From: pieter.baele at gmail.com (Pieter Baele)
Date: Thu, 24 Feb 2011 09:00:11 +0100
Subject: [Linux-cluster] if gfs2 or clvm based on rhcs?
In-Reply-To: <3c66dfc8.1ba37.12e56644755.Coremail.ooolinux@163.com>
References: <3c66dfc8.1ba37.12e56644755.Coremail.ooolinux@163.com>
Message-ID: <AANLkTin8KATGvJxbfgLUBanRjU-pUBWLvokkeafRzkwr@mail.gmail.com>

Start reading the documentation...
most of it applies to FC as well.

http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/index.html

--> start with Cluster Suite Overview, so at least you get a good
understanding about the components involved.


2011/2/24 yue <ooolinux at 163.com>:
> i want to? deploy iscsi+clvm+gfs2
>
> i do not know if i have to install rhcs?
> but i use fc12? ,not rhel.
>
> what /etc/cluster/cluster.conf? do ??? how to configure the conf file?
>
> thanks
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>



From xsanch at gmail.com  Thu Feb 24 08:41:22 2011
From: xsanch at gmail.com (jorge sanchez)
Date: Thu, 24 Feb 2011 02:41:22 -0600
Subject: [Linux-cluster]  qdiskd node evicted
Message-ID: <AANLkTin0PPy1Dk=dCzEgy8gCRq7eyaNNkac0MtKLTOcA@mail.gmail.com>

Hi,

can someone explain what is the meaning of  two lines ?


cluster:Feb 20 21:36:49 mcnvt11 qdiskd[5667]: <notice> Writing
eviction notice for node 1
cluster:Feb 20 21:36:50 mcnvt11 qdiskd[5667]: <notice> Node 1 evicted

Versin:
Red Hat Enterprise Linux AS release 4 (Nahant Update 8)

Thanks,

Jorge



From martijn.storck at gmail.com  Thu Feb 24 09:34:01 2011
From: martijn.storck at gmail.com (Martijn Storck)
Date: Thu, 24 Feb 2011 10:34:01 +0100
Subject: [Linux-cluster] High DLM CPU usage - low GFS/iSCSI performance
Message-ID: <AANLkTikDO534-5do_ziv4HikdFX3X7fjT2YCtcTPfztH@mail.gmail.com>

Hello everyone,

We currently have the following RHCS cluster in operation:

- 3 nodes, Xeon CPU, 12 GB hardware etc.
- 100mbit network between the cluster nodes
- Dell MD3200i iSCSI SAN, with 4 Gbit links (dm-multipath) to each server
(through two switches), 5 15k RPM spindles
- 1 GFS1 file system on the above mentioned SAN

2 of the nodes share a single GFS file system, which is used for hosting
virtual machine containers (for web serving, mail and light database work).
We've noticed that performance is suboptimal so we've started to
investigate. The load is not high (we previously ran the same containers on
a single, much cheaper server using local 7200rpm disks and ext3 fs without
issues), but there is a lot of small block I/O.

When I run iptraf (only monitoring the iSCSI traffic) and top side by side
on a single server I often see dlm_send using 100% CPU. During this time I/O
to our gfs filesystem seems to be blocked and container performance goes
down the drain.

My question is: what causes dlm_send to use 100% CPU and is this wat causes
the low GFS performance? Based on what the servers are doing I'm not
expecting any deadlocks (they're mostly accessing separate parts of the
filesystem), so I'm suspecting some other kind of limitation here. Could it
be the 100Mbit network?

I've looked into the waiters queue using the debug fs and it varies between
0 and 60 entries which doesn't seem to bad to me. The locks table has some
30.000 locks. All DLM and GFS settings are defaults. Any hints on where to
look are appreciated!

Regards,

Martijn Storck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110224/23fcb304/attachment.htm>

From swhiteho at redhat.com  Thu Feb 24 10:02:41 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Thu, 24 Feb 2011 10:02:41 +0000
Subject: [Linux-cluster] High DLM CPU usage - low GFS/iSCSI performance
In-Reply-To: <AANLkTikDO534-5do_ziv4HikdFX3X7fjT2YCtcTPfztH@mail.gmail.com>
References: <AANLkTikDO534-5do_ziv4HikdFX3X7fjT2YCtcTPfztH@mail.gmail.com>
Message-ID: <1298541761.2566.2.camel@dolmen>

Hi,

On Thu, 2011-02-24 at 10:34 +0100, Martijn Storck wrote:
> Hello everyone,
> 
> 
> We currently have the following RHCS cluster in operation:
> 
> 
> - 3 nodes, Xeon CPU, 12 GB hardware etc.
> - 100mbit network between the cluster nodes
> - Dell MD3200i iSCSI SAN, with 4 Gbit links (dm-multipath) to each
> server (through two switches), 5 15k RPM spindles
> - 1 GFS1 file system on the above mentioned SAN
> 
> 
> 2 of the nodes share a single GFS file system, which is used for
> hosting virtual machine containers (for web serving, mail and light
> database work). We've noticed that performance is suboptimal so we've
> started to investigate. The load is not high (we previously ran the
> same containers on a single, much cheaper server using local 7200rpm
> disks and ext3 fs without issues), but there is a lot of small block
> I/O.
> 
> 
> When I run iptraf (only monitoring the iSCSI traffic) and top side by
> side on a single server I often see dlm_send using 100% CPU. During
> this time I/O to our gfs filesystem seems to be blocked and container
> performance goes down the drain.
> 
Can you take a netstat -t while the cpu usage is at 100%, that will tell
us whether there is queued data at that point in time.

> 
> My question is: what causes dlm_send to use 100% CPU and is this wat
> causes the low GFS performance? Based on what the servers are doing
> I'm not expecting any deadlocks (they're mostly accessing separate
> parts of the filesystem), so I'm suspecting some other kind of
> limitation here. Could it be the 100Mbit network?
> 
Well, that depends on how much traffic there is... have you measured the
traffic when the problem is occurring?

> 
> I've looked into the waiters queue using the debug fs and it varies
> between 0 and 60 entries which doesn't seem to bad to me. The locks
> table has some 30.000 locks. All DLM and GFS settings are defaults.
> Any hints on where to look are appreciated!
> 
It does sounds like a performance issue, and it shouldn't be too hard to
get to the bottom of what is going on,

Steve.

> 
> Regards,
> 
> 
> Martijn Storck
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster




From swhiteho at redhat.com  Thu Feb 24 12:29:48 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Thu, 24 Feb 2011 12:29:48 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D658093.5010807@mssl.ucl.ac.uk>
References: <4D5BDB4E.3010006@mssl.ucl.ac.uk>
	<20110216175848.GC2291@redhat.com>	<4D5D9219.8090002@mssl.ucl.ac.uk>
	<20110217212948.GA9582@redhat.com> <4D5E68C9.1000206@mssl.ucl.ac.uk>
	<4D658093.5010807@mssl.ucl.ac.uk>
Message-ID: <1298550588.2566.16.camel@dolmen>

Hi,

On Wed, 2011-02-23 at 21:48 +0000, Alan Brown wrote:
> After running several days with the larger table sizes I don't think 
> it's made any difference to individual thread performance or overall 
> throughput.
> 
> Likewise, the following changes have had no effect on access time for 
> large directories (but they have improved caching and improved high load 
> overall performance):
> 
> Increasing dentry and inode caches to the maximum size allowed by the 
> kernel (about 128million entries. This is limited as a percentage of 
> memory to about 10%)
> 
> This helped caching under load, but until I added the following change:
> 
> (sysctl)
> vm.max_reclaims_in_progress=1
> vm.zone_reclaim_mode=0
> 
> The cached dentry data would evaporate after a while.
> 
> (Switching to reeclaim_mode=0 is recommended for fileservers to enhance 
> dentry/inode caching)
> 
> 
> At the end of all that, the effect is only minor and the biggest bugbear 
> - access to directories with more than ~150 files onboard is unusably 
> slow - hasn't been addressed.
> 
That doesn't sound like it is related to a DLM issue. 150 entries is not
a lot. What do you mean be "access" in this case? Just looking up a
single file in the directory, or create/delete files or an ls -l
(implying stats to each file) or what exactly?

> The change which had the largest effect on this problem - switching to 
> lock_nolock - isn't practical in a production cluster environment (and 
> defeats the purpose of using GFS2 anyway)
> 
> Iostat's showing that under heavy i/o load (1000-3000 requests/second 
> but only 2-3Mb/s actual data), the kernel on one machine can sit on 
> read/write equests for up to 3000ms before passing them to the storage 
> devices - which usually respond within 2-5ms. It's sitting at 300ms most 
> of the time and the machine concerned only has 5 FSes mounted.
> 
> The other 2 machines in the cluster not facing this kind of 
> treatement(100-300 requests/second) have 30 mounts each, can easily read 
> at 10-20Mb/s and have read delays of 2-10ms (mostly 3-4).
> 
> Users report that these 2 machines are _fast_ when not accessing 
> directories with large numbers of files onboard...
> 
> 
Again, figuring out the exact workload should help us get to the bottom
of what is going on here. How are you measuring the delays reported
above? Is the syscall service time, for example?

Steve.

> 
> 
> 
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster




From ajb2 at mssl.ucl.ac.uk  Thu Feb 24 12:56:10 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Thu, 24 Feb 2011 12:56:10 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <1298550588.2566.16.camel@dolmen>
References: <4D5BDB4E.3010006@mssl.ucl.ac.uk>	<20110216175848.GC2291@redhat.com>	<4D5D9219.8090002@mssl.ucl.ac.uk>	<20110217212948.GA9582@redhat.com>
	<4D5E68C9.1000206@mssl.ucl.ac.uk>	<4D658093.5010807@mssl.ucl.ac.uk>
	<1298550588.2566.16.camel@dolmen>
Message-ID: <4D66556A.6000509@mssl.ucl.ac.uk>

Steven Whitehouse wrote:

> That doesn't sound like it is related to a DLM issue. 150 entries is not
> a lot.

It isn't, but when the machine's being hammered by requests in other 
filesystems, things can get very slow, very quickly.

> What do you mean be "access" in this case? Just looking up a
> single file in the directory, or create/delete files or an ls -l
> (implying stats to each file) or what exactly?

ls -l and creation/deletion.


>>
> Again, figuring out the exact workload should help us get to the bottom
> of what is going on here. How are you measuring the delays reported
> above? Is the syscall service time, for example?

iostats.

This is a typcal example. Note the huge difference between await (total 
time in ms) and svctm (scsi command response time in ms)


avg-cpu:  %user   %nice %system %iowait  %steal   %idle
            0.62    0.00    2.12   27.03    0.00   70.22

Device:         rrqm/s   wrqm/s   r/s   w/s    rkB/s    wkB/s avgrq-sz 
avgqu-sz   await  svctm  %util
dm-6              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00
dm-7              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00
dm-8              0.00     0.00 16.00  1.00     8.00     0.50     1.00 
    0.20   11.47  11.47  19.50
dm-9              0.00     0.00 45.00 22.00   180.00    88.00     8.00 
   13.45  351.07   6.19  41.50
dm-10             0.00     0.00 61.00 30.00   244.00   120.00     8.00 
   22.19  307.56  10.19  92.70
dm-11             0.00     0.00 53.00  5.00   212.00    20.00     8.00 
  128.12 4332.69  17.26 100.10
dm-12             0.00     0.00 54.00  0.00   216.00     0.00     8.00 
  100.28 2951.09  18.54 100.10
dm-13             0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00
dm-14             0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00

For comparison here's a FC-attached box running Ext4 (on a slower set of 
arrays)


Device:         rrqm/s   wrqm/s   r/s   w/s    rkB/s    wkB/s avgrq-sz 
avgqu-sz   await  svctm  %util
dm-2              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00
dm-3              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00
dm-4              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00
dm-5              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00
dm-6              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00
dm-7              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00
dm-8              0.00     0.00 247.00  0.00   988.00     0.00     8.00 
    19.17   92.43   3.28  81.10
dm-9              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00
dm-10             0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00
dm-11             0.00     0.00 51.00 14.00 21548.00    56.00   664.74 
    1.93   29.91   4.14  26.90
dm-12             0.00     0.00 54.00 424.00 25800.00  1696.00   115.05 
     6.07   12.69   0.60  28.50
dm-13             0.00     0.00 55.00  0.00 23928.00     0.00   870.11 
    2.50   45.49   5.38  29.60
dm-14             0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00
dm-15             0.00     0.00 247.00  0.00   988.00     0.00     8.00 
    19.17   92.44   3.29  81.20
dm-16             0.00     0.00  0.00  0.00     0.00     0.00     0.00 
    0.00    0.00   0.00   0.00






From grimme at atix.de  Thu Feb 24 13:24:04 2011
From: grimme at atix.de (Marc Grimme)
Date: Thu, 24 Feb 2011 14:24:04 +0100 (CET)
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <972752730.1728.1298553674384.JavaMail.root@axgroupware01-1.gallien.atix>
Message-ID: <402467968.1730.1298553844674.JavaMail.root@axgroupware01-1.gallien.atix>

This is what I also observed.

The change of the different hashtable sizes didn't change the performance.

In my opinion another question is if the hash function is really good or if it does not create bucketlists of same size but few with long list holding all locks. Then a bigger size of the dlm hash tables is irrelevant.

Is there any way to examine how the buckets are being utilized?

Thanks
Marc.

----- "Alan Brown" <ajb2 at mssl.ucl.ac.uk> schrieb:

> After running several days with the larger table sizes I don't think 
> it's made any difference to individual thread performance or overall 
> throughput.
> 
> Likewise, the following changes have had no effect on access time for
> 
> large directories (but they have improved caching and improved high
> load 
> overall performance):
> 
> Increasing dentry and inode caches to the maximum size allowed by the
> 
> kernel (about 128million entries. This is limited as a percentage of 
> memory to about 10%)
> 
> This helped caching under load, but until I added the following
> change:
> 
> (sysctl)
> vm.max_reclaims_in_progress=1
> vm.zone_reclaim_mode=0
> 
> The cached dentry data would evaporate after a while.
> 
> (Switching to reeclaim_mode=0 is recommended for fileservers to
> enhance 
> dentry/inode caching)
> 
> 
> At the end of all that, the effect is only minor and the biggest
> bugbear 
> - access to directories with more than ~150 files onboard is unusably
> 
> slow - hasn't been addressed.
> 
> The change which had the largest effect on this problem - switching to
> 
> lock_nolock - isn't practical in a production cluster environment (and
> 
> defeats the purpose of using GFS2 anyway)
> 
> Iostat's showing that under heavy i/o load (1000-3000 requests/second
> 
> but only 2-3Mb/s actual data), the kernel on one machine can sit on 
> read/write equests for up to 3000ms before passing them to the storage
> 
> devices - which usually respond within 2-5ms. It's sitting at 300ms
> most 
> of the time and the machine concerned only has 5 FSes mounted.
> 
> The other 2 machines in the cluster not facing this kind of 
> treatement(100-300 requests/second) have 30 mounts each, can easily
> read 
> at 10-20Mb/s and have read delays of 2-10ms (mostly 3-4).
> 
> Users report that these 2 machines are _fast_ when not accessing 
> directories with large numbers of files onboard...
> 
> 
> 
> 
> 
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 

Marc Grimme

Tel: +49 89 4523538-14
Fax: +49 89 9901766-0
E-Mail: grimme at atix.de

ATIX Informationstechnologie und Consulting AG | Einsteinstrasse 10 |
85716 Unterschleissheim | www.atix.de

Registergericht: Amtsgericht Muenchen, Registernummer: HRB 168930, USt.-Id.:
DE209485962 | Vorstand: Thomas Merz (Vors.), Marc Grimme, Mark Hlawatschek, Jan R. Bergrath |
Vorsitzender des Aufsichtsrats: Dr. Martin Buss



From randy.brown at noaa.gov  Thu Feb 24 13:34:04 2011
From: randy.brown at noaa.gov (Randy Brown)
Date: Thu, 24 Feb 2011 08:34:04 -0500
Subject: [Linux-cluster] Problem with machines fencing one another in 2
 Node	NFS cluster
In-Reply-To: <4D593638.30802@alteeve.com>
References: <4D5933D1.10009@noaa.gov> <4D593638.30802@alteeve.com>
Message-ID: <4D665E4C.10107@noaa.gov>

Thanks for the response.  Sorry for the delay.  I had an issue that, 
unexpectedly, took me away from the office.  I am just getting back to 
this now.

Yes, the MAC addresses were all updated after the cloning.  According to 
my notes, here are sections of the log files at the time of a fence from 
each cluster node.

Feb 10 15:17:48 nfs2-cluster clurgmgrd[4280]:<notice>  Resource Group Manager Starting
Feb 10 15:18:17 nfs2-cluster rgmanager: [7580]:<notice>  Shutting down Cluster Service Manager...
Feb 10 15:18:17 nfs2-cluster clurgmgrd[4280]:<notice>  Shutting down
Feb 10 15:18:17 nfs2-cluster clurgmgrd[4280]:<notice>  Shutting down
Feb 10 15:18:17 nfs2-cluster clurgmgrd[4280]:<notice>  Shutdown complete, exiting
Feb 10 15:18:17 nfs2-cluster rgmanager: [7580]:<notice>  Cluster Service Manager is stopped.
Feb 10 15:18:23 nfs2-cluster ccsd[2989]: Stopping ccsd, SIGTERM received.
Feb 10 15:18:23 nfs2-cluster NAMC
Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading all openais components
Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais component: openais_confdb v0 (19/10)
Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais component: openais_cpg v0 (18/8)
Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais component: openais_cfg v0 (17/7)
Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais component: openais_msg v0 (16/6)
Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais component: openais_lck v0 (15/5)
Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais component: openais_evt v0 (14/4)
Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais component: openais_ckpt v0 (13/3)
Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais component: openais_amf v0 (12/2)
Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais component: openais_clm v0 (11/1)
Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais component: openais_evs v0 (10/0)
Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais component: openais_cman v0 (9/9)
Feb 10 15:18:23 nfs2-cluster gfs_controld[3077]: cluster is down, exiting
Feb 10 15:18:23 nfs2-cluster dlm_controld[3071]: cluster is down, exiting
Feb 10 15:18:23 nfs2-cluster fenced[3065]: cluster is down, exiting
Feb 10 15:18:23 nfs2-cluster kernel: dlm: closing connection to node 2
Feb 10 15:18:23 nfs2-cluster kernel: dlm: closing connection to node 1


Feb 10 15:17:34 nfs1-cluster ntpd[3765]: synchronized to LOCAL(0), stratum 10
Feb 10 15:18:17 nfs1-cluster clurgmgrd[4323]:<notice>  Member 2 shutting down
Feb 10 15:18:33 nfs1-cluster openais[3046]: [TOTEM] The token was lost in the OPERATIONAL state.
Feb 10 15:18:33 nfs1-cluster openais[3046]: [TOTEM] Receive multicast socket recv buffer size (320000 bytes).
Feb 10 15:18:33 nfs1-cluster openais[3046]: [TOTEM] Transmit multicast socket send buffer size (262142 bytes).
Feb 10 15:18:33 nfs1-cluster openais[3046]: [TOTEM] entering GATHER state from 2.
Feb 10 15:18:34 nfs1-cluster ntpd[3765]: synchronized to 132.236.56.250, stratum 2
Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] entering GATHER state from 0.
Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] Creating commit token because I am the rep.
Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] Saving state aru 230 high seq received 230
Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] Storing new sequence id for ring 1f80
Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] entering COMMIT state.
Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] entering RECOVERY state.
Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] position [0] member 140.90.91.240:
Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] previous ring seq 8060 rep 140.90.91.240
Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] aru 230 high delivered 230 received flag 1
Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] Did not need to originate any messages in recovery.
Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] Sending initial ORF token
Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] CLM CONFIGURATION CHANGE
Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] New Configuration:
Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ]     r(0) ip(140.90.91.240)
Feb 10 15:18:35 nfs1-cluster kernel: dlm: closing connection to node 2
Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] Members Left:
Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ]     r(0) ip(140.90.91.242)
Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] Members Joined:
Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] CLM CONFIGURATION CHANGE
Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] New Configuration:
Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ]     r(0) ip(140.90.91.240)
Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] Members Left:
Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] Members Joined:
Feb 10 15:18:35 nfs1-cluster openais[3046]: [SYNC ] This node is within the primary component and will provide service.
Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] entering OPERATIONAL state.
Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] got nodejoin message 140.90.91.240
Feb 10 15:18:35 nfs1-cluster openais[3046]: [CPG  ] got joinlist message from node 1


I was seeing a number of these messages but they stopped after upgrading openais

nfs2-cluster openais[3012]: [TOTEM] Retransmit List: 1df3

Yes, these are in managed switches.  I will try to run the tcpdump asap.  Unfortunately, that means I have to have it crash again to get what I need and my users are already annoyed by the downtime we've had.  I know this isn't the best solution for our needs, but given the lack of funding, this seemed like a good idea at the time.

Thanks for the help!

Randy





On 02/14/2011 09:03 AM, Digimer wrote:
> On 02/14/2011 08:53 AM, Randy Brown wrote:
>> Hello,
>>
>> I am running a 2 node cluster being used as a NAS head for a Lefthand
>> Networks iSCSI SAN to provide NFS mounts out to my network.  Things have
>> been OK for a while, but I recently lost one of the nodes as a result of
>> a patching problem.  In an effort to recreate the failed node, I imaged
>> the working node and installed that image on the failed node.  I set
>> it's hostname and IP settings correctly and the machine booted and
>> joined the cluster just fine.  Or at least it appeared so.  Things ran
>> OK for the last few weeks, but I recently started seeing a behavior
>> where the nodes start fencing each other.  I'm wondering if there is
>> something as a result of cloning the nodes that could be the problem.
>> Possibly something that should be different but isn't because of the
>> cloning?
>>
>> I am running CentOS 5.5 with the following package versions:
>>
>> Kernel - 2.6.18-194.11.3.el5 #1 SMP
>> cman-2.0.115-34.el5_5.4
>> lvm2-cluster-2.02.56-7.el5_5.4
>> gfs2-utils-0.1.62-20.el5
>> kmod-gfs-0.1.34-12.el5.centos
>> rgmanager-2.0.52-6.el5.centos.8
>>
>> I have a Qlogic qla4062 HBA in the node running: QLogic iSCSI HBA Driver
>> (f8b83000) v5.01.03.04
>>
>> I will gladly provide more information as needed.
>>
>> Thank you,
>> Randy
> Silly question, but are the NICs mapped to their MAC addresses? If so,
> did you update the MAC addresses after cloning the server to reflect the
> actual MAC addresses? Assuming so, do you have managed switches? If so,
> can you test by swapping out a simple, unmanaged switch?
>
> This sounds like a multicast issue at some level. Fencing happens once
> the totem ring is declared failed. Do you see anything interesting in
> the log files prior to the fence? Can you run tcpdump to see what is
> happening on the interface(s) prior to the fence?
>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: randy_brown.vcf
Type: text/x-vcard
Size: 313 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110224/5063820c/attachment.vcf>

From ajb2 at mssl.ucl.ac.uk  Thu Feb 24 15:00:42 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Thu, 24 Feb 2011 15:00:42 +0000
Subject: [Linux-cluster] bug: nfsclient.sh
Message-ID: <4D66729A.9010201@mssl.ucl.ac.uk>


If multiple NFS services are defined, a race condition exists with 
parallel invokations of /usr/share/cluster/nfsclient.sh

exportfs in add/remove mode reads the existing exports in from kernel 
(or/etab/xtab), applies the command and then writes a _complete_ 
exportlist back to the kernel, not just an incremental change.

Unfortunately exportfs takes no account of other running copies of 
itself and kernel exports or /var/lib/[e|x]tab can change while it's 
open without it noticing. Similarly it may write back to the same files 
without checking for locks set by other copies of itself (there aren't 
any to check for in any case)

This can result in differing copies of the exports being written back to 
the kerne. As an example, 5 exports are added at once using 5 
simultaneous exportfs commands - there's a good chance only 3 of the 5 
will make it into the kernel, with the others having been written by one 
process and then overwritten by another.

This primarily manifests at startup/failover and typically results in 
10-20% of filesystem export commands failing and the assciated service 
automatically restarting. (We have 84 separate NFS services. That's a 
LOT of exportfs parallelisation, giving a higher collision rate for 
reading/writing the e/xtab files.

The fix is easy - flock calls around every exportfs invokation to ensure 
only one copy is ever running at once.

See the attached modified version of nfsclient.sh




-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: nfsclient.sh-WITH-FLOCK
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110224/29c951f0/attachment.ksh>

From linux at alteeve.com  Thu Feb 24 15:13:28 2011
From: linux at alteeve.com (Digimer)
Date: Thu, 24 Feb 2011 10:13:28 -0500
Subject: [Linux-cluster] Problem with machines fencing one another in 2
 Node	NFS cluster
In-Reply-To: <4D665E4C.10107@noaa.gov>
References: <4D5933D1.10009@noaa.gov> <4D593638.30802@alteeve.com>
	<4D665E4C.10107@noaa.gov>
Message-ID: <4D667598.9080806@alteeve.com>

On 02/24/2011 08:34 AM, Randy Brown wrote:
> Thanks for the response.  Sorry for the delay.  I had an issue that,
> unexpectedly, took me away from the office.  I am just getting back to
> this now.
> 
> Yes, the MAC addresses were all updated after the cloning.  According to
> my notes, here are sections of the log files at the time of a fence from
> each cluster node.
> 
> Feb 10 15:17:48 nfs2-cluster clurgmgrd[4280]:<notice>  Resource Group
> Manager Starting
> Feb 10 15:18:17 nfs2-cluster rgmanager: [7580]:<notice>  Shutting down
> Cluster Service Manager...
> Feb 10 15:18:17 nfs2-cluster clurgmgrd[4280]:<notice>  Shutting down
> Feb 10 15:18:17 nfs2-cluster clurgmgrd[4280]:<notice>  Shutting down
> Feb 10 15:18:17 nfs2-cluster clurgmgrd[4280]:<notice>  Shutdown
> complete, exiting
> Feb 10 15:18:17 nfs2-cluster rgmanager: [7580]:<notice>  Cluster Service
> Manager is stopped.
> Feb 10 15:18:23 nfs2-cluster ccsd[2989]: Stopping ccsd, SIGTERM received.
> Feb 10 15:18:23 nfs2-cluster NAMC
> Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading all
> openais components
> Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais
> component: openais_confdb v0 (19/10)
> Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais
> component: openais_cpg v0 (18/8)
> Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais
> component: openais_cfg v0 (17/7)
> Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais
> component: openais_msg v0 (16/6)
> Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais
> component: openais_lck v0 (15/5)
> Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais
> component: openais_evt v0 (14/4)
> Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais
> component: openais_ckpt v0 (13/3)
> Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais
> component: openais_amf v0 (12/2)
> Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais
> component: openais_clm v0 (11/1)
> Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais
> component: openais_evs v0 (10/0)
> Feb 10 15:18:23 nfs2-cluster openais[3045]: [SERV ] Unloading openais
> component: openais_cman v0 (9/9)
> Feb 10 15:18:23 nfs2-cluster gfs_controld[3077]: cluster is down, exiting
> Feb 10 15:18:23 nfs2-cluster dlm_controld[3071]: cluster is down, exiting
> Feb 10 15:18:23 nfs2-cluster fenced[3065]: cluster is down, exiting
> Feb 10 15:18:23 nfs2-cluster kernel: dlm: closing connection to node 2
> Feb 10 15:18:23 nfs2-cluster kernel: dlm: closing connection to node 1
> 
> 
> Feb 10 15:17:34 nfs1-cluster ntpd[3765]: synchronized to LOCAL(0),
> stratum 10
> Feb 10 15:18:17 nfs1-cluster clurgmgrd[4323]:<notice>  Member 2 shutting
> down
> Feb 10 15:18:33 nfs1-cluster openais[3046]: [TOTEM] The token was lost
> in the OPERATIONAL state.
> Feb 10 15:18:33 nfs1-cluster openais[3046]: [TOTEM] Receive multicast
> socket recv buffer size (320000 bytes).
> Feb 10 15:18:33 nfs1-cluster openais[3046]: [TOTEM] Transmit multicast
> socket send buffer size (262142 bytes).
> Feb 10 15:18:33 nfs1-cluster openais[3046]: [TOTEM] entering GATHER
> state from 2.
> Feb 10 15:18:34 nfs1-cluster ntpd[3765]: synchronized to 132.236.56.250,
> stratum 2
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] entering GATHER
> state from 0.
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] Creating commit
> token because I am the rep.
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] Saving state aru 230
> high seq received 230
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] Storing new sequence
> id for ring 1f80
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] entering COMMIT state.
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] entering RECOVERY
> state.
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] position [0] member
> 140.90.91.240:
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] previous ring seq
> 8060 rep 140.90.91.240
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] aru 230 high
> delivered 230 received flag 1
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] Did not need to
> originate any messages in recovery.
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] Sending initial ORF
> token
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] CLM CONFIGURATION
> CHANGE
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] New Configuration:
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ]     r(0)
> ip(140.90.91.240)
> Feb 10 15:18:35 nfs1-cluster kernel: dlm: closing connection to node 2
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] Members Left:
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ]     r(0)
> ip(140.90.91.242)
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] Members Joined:
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] CLM CONFIGURATION
> CHANGE
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] New Configuration:
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ]     r(0)
> ip(140.90.91.240)
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] Members Left:
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] Members Joined:
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [SYNC ] This node is within
> the primary component and will provide service.
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [TOTEM] entering OPERATIONAL
> state.
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [CLM  ] got nodejoin message
> 140.90.91.240
> Feb 10 15:18:35 nfs1-cluster openais[3046]: [CPG  ] got joinlist message
> from node 1
> 
> 
> I was seeing a number of these messages but they stopped after upgrading
> openais
> 
> nfs2-cluster openais[3012]: [TOTEM] Retransmit List: 1df3
> 
> Yes, these are in managed switches.  I will try to run the tcpdump
> asap.  Unfortunately, that means I have to have it crash again to get
> what I need and my users are already annoyed by the downtime we've had. 
> I know this isn't the best solution for our needs, but given the lack of
> funding, this seemed like a good idea at the time.
> 
> Thanks for the help!
> 
> Randy

The logs you posted seem incomplete. For example, there is no messages
about fenced. Can I assume that nfs2 gets fenced first, comes back up
and then fences nfs1? Was nfs2 the failed and restored node?

In either case, it looks like nfs2 withdraws from the cluster by (trying
to?) shut down the cluster software. Normally, this leaving the cluster
is ordered and will not trigger a fence as the leaving is announced. If
a fence happens, it's because the node simply vanished from the point of
view of the remaining node(s).

This (which is odd in it's own right):

Feb 10 15:17:48 nfs2-cluster clurgmgrd[4280]:<notice>  Resource Group
Manager Starting
Feb 10 15:18:17 nfs2-cluster rgmanager: [7580]:<notice>  Shutting down
Cluster Service Manager...
Feb 10 15:18:17 nfs2-cluster clurgmgrd[4280]:<notice>  Shutting down

Shouldn't lead to this:

Feb 10 15:18:17 nfs1-cluster clurgmgrd[4323]:<notice>  Member 2 shutting
down
Feb 10 15:18:33 nfs1-cluster openais[3046]: [TOTEM] The token was lost
in the OPERATIONAL state.

The 'nfs1' node sees the member leaving, but still freaks out when totem
token stop arriving. I am less convinced now that this is a multicast
issue (but it doesn't hurt to keep watching it).

Can you post the full logs from both nodes well before until well after
the fencing? Can you also post your full cluster.conf and openais.conf
files (only obfuscating the password, leave everything else)? It might
be most effective to post these on http://pastebin.com for brevity.

Something is quite odd here... I'm almost thinking that the internal
node IDs aren't unique or something, but I am not entirely familiar with
the internals (either how IDs are created or how they are stored)... I'm
curious to sort this out now. :)

-- 
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin:  http://nodeassassin.org



From ra at ra.is  Thu Feb 24 15:44:39 2011
From: ra at ra.is (Richard Allen)
Date: Thu, 24 Feb 2011 10:44:39 -0500
Subject: [Linux-cluster] RHEL6 HA addon
Message-ID: <4D667CE7.1050501@ra.is>

Hi all

I notice in the Release Notes for RHEL6 that many changes have been made 
to the Cluster Suite (HA Addon) but I am unable to find any mention of 
how the new suite does heartbeat.
In previous versions the Cluster could only do heartbeats (node 
intercommunication) on one network link and for redundancy the only 
option was to use bonded network devices.
There was a way to add a second heartbeat using altnode directives in 
the XML config file but that always felt a bit hackish and was only 
limited to only one altnode, giving two heartbeat paths.

So I would like to ask how RHEL6 does this.  If I have nodes with 4 10Gb 
NIC's, one connected to an admin network, another to a Database network 
and one to the Application network and the last one connected directly 
to the other node with a crossover cable, can the cluster now use all 
possible paths to communicate to the other nodes or will one of those 
paths become a single point of failure in the cluster?

I'm used to using Clusters like HP's ServiceGuard where I can easily 
define which links to use as heartbeat.  It can even use a serial 
connection (in a two node cluster) as a additional heartbeat and I have 
always felt this is quite a big limitation in Red Hat's cluster suite up 
to RHEL6 atleast.

Thanks in advance
Richard.



From ashley at host365.com  Thu Feb 24 16:00:08 2011
From: ashley at host365.com (ashley at host365.com)
Date: 24 Feb 2011 16:00:08 +0000
Subject: [Linux-cluster] =?utf-8?q?RHEL6_HA_addon?=
Message-ID: <20110224160008.2186.qmail@psa101.host365.com>

Hi

I am away until 28/02/11. If you require support, please email support at host365.com or call +44 (0)207 610 9911 in my absence.

Regards
Ashley




From szhargrave at ybs.co.uk  Thu Feb 24 16:13:52 2011
From: szhargrave at ybs.co.uk (Simon Hargrave)
Date: Thu, 24 Feb 2011 16:13:52 +0000
Subject: [Linux-cluster] lvm2-cluster not syncing correctly?
Message-ID: <20110224161352225.00000004632@H04405>

Please read the warning at the end of this email
________________________________________________

Hi
 
I'm creating an (at present) 2 node cluster to share GFS2 filesystems.  Everything seems to be working reasonably OK apart from one issue which may or may not be a bug.
 
I have a cluster volume group called vgGWPOCSHARED mounted on 2 nodes.  I can "vgdisplay -v" and "vgdisplay -v vgGWPOCSHARED" to my hearts content on either node with no problem.
 
I now create a logical volume on the first node, e.g. lvcreate -n test -L 1G vgGWPOCSHARED.  The output of "vgdisplay -v" and "vgdisplay -v vgGWPOCSHARED" on the local node show the new logical volume as expected.
 
However, if I now perform the same 2 commands on the second node, something strange happens.
 
"vgdisplay -v" shows the state of vgGWPOCSHARED /before/ the new logical volume was created.  It also give the following in stderr: -
 
    Archiving volume group "vgGWPOCSHARED" metadata (seqno 21).
    Archiving volume group "vgGWPOCSHARED" metadata (seqno 20).
    Creating volume group backup "/etc/lvm/backup/vgGWPOCSHARED" (seqno 20).

However if I then do a "vgdisplay -v vgGWPOCSHARED", I see the new logical volume, but also the stderr output: -
 
    Archiving volume group "vgGWPOCSHARED" metadata (seqno 20).
    Archiving volume group "vgGWPOCSHARED" metadata (seqno 21).
    Creating volume group backup "/etc/lvm/backup/vgGWPOCSHARED" (seqno 21).

Alternating between the 2 commands repeats the output.  It's as if clvmd isn't "fully" updating something and that something's cached, but I'm not sure what.
 
If I unmount my gfs2 filesystems and restart clvmd,
 
service gfs2 stop
service clvmd stop
service clvmd start
service gfs2 start
 
Then everything returns to normal and the outputs match.
 
Is this something anyone has seen/can reproduce/has any idea about?
 
Versions are: -
 
lvm2-cluster-2.02.74-3.el5
lvm2-2.02.74-5.el5

Server is RHEL 5 x86_64 updated from the channel on 21st Feb this year.
 
Thanks
 
 
Simon
 
- 
Simon Hargrave szhargrave at ybs.co.uk <blocked::blocked::mailto:szhargrave at ybs.co.uk>  
Technical Services Team Leader x2831
Yorkshire Building Society 01274 472831
http://wwwtech/sysint/tsgcore.asp
 

________________________________________________
This email and any attachments are confidential and may contain privileged information.
If you are not the person for whom they are intended please return the email and then delete all material from any computer. You must not use the email or attachments for any purpose, nor disclose its contents to anyone other than the intended recipient.
Any statements made by an individual in this email do not necessarily reflect the views of the Yorkshire Building Society Group.
________________________________________________

Yorkshire Building Society, which is authorised and regulated by the Financial Services Authority, chooses to introduce its customers to Legal & General for the purposes of advising on and arranging life assurance and investment products bearing Legal & General?s name.

We are entered in the FSA Register and our FSA registration number is 106085 http://www.fsa.gov.uk/register

Head Office: Yorkshire Building Society, Yorkshire House, Yorkshire Drive, Bradford, BD5 8LJ
Tel: 0845 1 200 100

Visit Our Website
http://www.ybs.co.uk

All communications with us may be monitored/recorded to improve the quality of our service and for your protection and security.



________________________________________________________________________
This e-mail has been scanned for all viruses by Star. The
service is powered by MessageLabs. For more information on a proactive
anti-virus service working around the clock, around the globe, visit:
http://www.star.net.uk
________________________________________________________________________
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110224/4e225c6e/attachment.htm>

From linux at alteeve.com  Thu Feb 24 16:24:30 2011
From: linux at alteeve.com (Digimer)
Date: Thu, 24 Feb 2011 11:24:30 -0500
Subject: [Linux-cluster] RHEL6 HA addon
In-Reply-To: <4D667CE7.1050501@ra.is>
References: <4D667CE7.1050501@ra.is>
Message-ID: <4D66863E.3070304@alteeve.com>

On 02/24/2011 10:44 AM, Richard Allen wrote:
> Hi all
> 
> I notice in the Release Notes for RHEL6 that many changes have been made
> to the Cluster Suite (HA Addon) but I am unable to find any mention of
> how the new suite does heartbeat.
> In previous versions the Cluster could only do heartbeats (node
> intercommunication) on one network link and for redundancy the only
> option was to use bonded network devices.
> There was a way to add a second heartbeat using altnode directives in
> the XML config file but that always felt a bit hackish and was only
> limited to only one altnode, giving two heartbeat paths.
> 
> So I would like to ask how RHEL6 does this.  If I have nodes with 4 10Gb
> NIC's, one connected to an admin network, another to a Database network
> and one to the Application network and the last one connected directly
> to the other node with a crossover cable, can the cluster now use all
> possible paths to communicate to the other nodes or will one of those
> paths become a single point of failure in the cluster?
> 
> I'm used to using Clusters like HP's ServiceGuard where I can easily
> define which links to use as heartbeat.  It can even use a serial
> connection (in a two node cluster) as a additional heartbeat and I have
> always felt this is quite a big limitation in Red Hat's cluster suite up
> to RHEL6 atleast.
> 
> Thanks in advance
> Richard.

Hi Richard,

  Can I assume that you are talking about High Availability in general,
as opposed to Heartbeat specifically? If not, the rest won't be too
relevant.

  As you know, the 'altnode' parameter is how you assign a second link.
This is still the case (as is bonding to get more links, but that
requires common subnets which you don't have).

  Corosync is used as the cluster communication layer (as opposed to
openais from RHEL 5.x). It supports one or two interfaces for "totem"
communication. If the main fails, the second link will be used
automatically. However, when the main is restored, totem must be
manually moved back to the original link.

  So in short; as it was in 5, so it is in 6. That said, the 'altname'
is perfectly valid way of removing that SPF. :)

-- 
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin:  http://nodeassassin.org



From rpeterso at redhat.com  Thu Feb 24 16:49:33 2011
From: rpeterso at redhat.com (Bob Peterson)
Date: Thu, 24 Feb 2011 11:49:33 -0500 (EST)
Subject: [Linux-cluster] lvm2-cluster not syncing correctly?
In-Reply-To: <20110224161352225.00000004632@H04405>
Message-ID: <1151718125.172523.1298566173100.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>

----- Original Message -----
| Please read the warning at the end of this email
| ________________________________________________
| 
| Hi
| 
| I'm creating an (at present) 2 node cluster to share GFS2 filesystems.
| Everything seems to be working reasonably OK apart from one issue
| which may or may not be a bug.
| 
| I have a cluster volume group called vgGWPOCSHARED mounted on 2 nodes.
| I can "vgdisplay -v" and "vgdisplay -v vgGWPOCSHARED" to my hearts
| content on either node with no problem.
| 
| I now create a logical volume on the first node, e.g. lvcreate -n test
| -L 1G vgGWPOCSHARED. The output of "vgdisplay -v" and "vgdisplay -v
| vgGWPOCSHARED" on the local node show the new logical volume as
| expected.
| 
| However, if I now perform the same 2 commands on the second node,
| something strange happens.

Hi,

Are you sure the clustered bit is set on the VG?
http://sources.redhat.com/cluster/wiki/FAQ/CLVM#clvmd_clustered

Bob Peterson
Red Hat File Systems



From szhargrave at ybs.co.uk  Thu Feb 24 17:38:14 2011
From: szhargrave at ybs.co.uk (Simon Hargrave)
Date: Thu, 24 Feb 2011 17:38:14 +0000
Subject: [Linux-cluster] lvm2-cluster not syncing correctly?
In-Reply-To: <1151718125.172523.1298566173100.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
References: <20110224161352225.00000004632@H04405> 
	<1151718125.172523.1298566173100.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
Message-ID: <20110224173814596.00000004632@H04405>

Please read the warning at the end of this email
________________________________________________

> Hi,

> Are you sure the clustered bit is set on the VG?
> http://sources.redhat.com/cluster/wiki/FAQ/CLVM#clvmd_clustered

> Bob Peterson
> Red Hat File Systems

Yes, the volume group was created with vgcreate -cy, and the output of vgs shows the "c" flag: -

  VG            #PV #LV #SN Attr   VSize   VFree
  vg00            1  13   0 wz--n-  49.88G 27.16G
  vgGWPOCSHARED   1   8   0 wz--nc 199.98G 29.98G

You can see below an example using vgs and lvs to show the issue: -

Initial state: -

[root at ybsxlx89 ~]# vgs ; lvs
  VG            #PV #LV #SN Attr   VSize   VFree
  vg00            1  13   0 wz--n-  49.88G 27.16G
  vgGWPOCSHARED   1  10   0 wz--nc 199.98G 27.98G
  LV              VG            Attr   LSize   Origin Snap%  Move Log Copy%  Convert
  esmlv           vg00          -wi-ao 480.00M
  lvol1           vg00          -wi-ao   1.00G
  lvol2           vg00          -wi-ao   4.00G
  lvol3           vg00          -wi-ao   3.91G
  lvol4           vg00          -wi-ao   1.00G
  lvol5           vg00          -wi-ao   1.00G
  lvol6           vg00          -wi-ao 256.00M
  netbackuplv     vg00          -wi-ao 512.00M
  oraclelv        vg00          -wi-ao   5.00G
  tivolilv        vg00          -wi-ao  64.00M
  u001lv          vg00          -wi-ao   5.00G
  u003lv          vg00          -wi-ao 512.00M
  ybslv           vg00          -wi-ao  32.00M
  aserver         vgGWPOCSHARED -wi-a-  10.00G
  fmw1            vgGWPOCSHARED -wi-a-  50.00G
  fmw2            vgGWPOCSHARED -wi-a-  50.00G
  gwpoc_cluster   vgGWPOCSHARED -wi-a-  20.00G
  gwpoc_instance1 vgGWPOCSHARED -wi-a-  10.00G
  gwpoc_instance2 vgGWPOCSHARED -wi-a-  10.00G
  mserver1        vgGWPOCSHARED -wi-a-  10.00G
  mserver2        vgGWPOCSHARED -wi-a-  10.00G
  test            vgGWPOCSHARED -wi-a-   1.00G
  test2           vgGWPOCSHARED -wi-a-   1.00G


I next run "lvcreate -n test3 -L 1G vgGWPOCSHARED" on the other node, and then run the vgs and lvs command on this node again: -


[root at ybsxlx89 ~]# vgs ; lvs
  VG            #PV #LV #SN Attr   VSize   VFree
  vg00            1  13   0 wz--n-  49.88G 27.16G
  vgGWPOCSHARED   1  10   0 wz--nc 199.98G 27.98G
  LV              VG            Attr   LSize   Origin Snap%  Move Log Copy%  Convert
  esmlv           vg00          -wi-ao 480.00M
  lvol1           vg00          -wi-ao   1.00G
  lvol2           vg00          -wi-ao   4.00G
  lvol3           vg00          -wi-ao   3.91G
  lvol4           vg00          -wi-ao   1.00G
  lvol5           vg00          -wi-ao   1.00G
  lvol6           vg00          -wi-ao 256.00M
  netbackuplv     vg00          -wi-ao 512.00M
  oraclelv        vg00          -wi-ao   5.00G
  tivolilv        vg00          -wi-ao  64.00M
  u001lv          vg00          -wi-ao   5.00G
  u003lv          vg00          -wi-ao 512.00M
  ybslv           vg00          -wi-ao  32.00M
  aserver         vgGWPOCSHARED -wi-a-  10.00G
  fmw1            vgGWPOCSHARED -wi-a-  50.00G
  fmw2            vgGWPOCSHARED -wi-a-  50.00G
  gwpoc_cluster   vgGWPOCSHARED -wi-a-  20.00G
  gwpoc_instance1 vgGWPOCSHARED -wi-a-  10.00G
  gwpoc_instance2 vgGWPOCSHARED -wi-a-  10.00G
  mserver1        vgGWPOCSHARED -wi-a-  10.00G
  mserver2        vgGWPOCSHARED -wi-a-  10.00G
  test            vgGWPOCSHARED -wi-a-   1.00G
  test2           vgGWPOCSHARED -wi-a-   1.00G

test3 has not appeared, despite being visible in the other node on which it was created.  So I stop and start clvmd (no gfs2 filesystems mounted): -

[root at ybsxlx89 ~]# service clvmd stop
Deactivating clustered VG(s):   0 logical volume(s) in volume group "vgGWPOCSHARED" now active
                                                           [  OK  ]
Signaling clvmd to exit                                    [  OK  ]
clvmd terminated                                           [  OK  ]
[root at ybsxlx89 ~]# service clvmd start
Starting clvmd:
Activating VG(s):   11 logical volume(s) in volume group "vgGWPOCSHARED" now active
  13 logical volume(s) in volume group "vg00" now active
                                                           [  OK  ]

Now the test3 LV appears: -

[root at ybsxlx89 ~]# vgs ; lvs
  VG            #PV #LV #SN Attr   VSize   VFree
  vg00            1  13   0 wz--n-  49.88G 27.16G
  vgGWPOCSHARED   1  11   0 wz--nc 199.98G 26.98G
  LV              VG            Attr   LSize   Origin Snap%  Move Log Copy%  Convert
  esmlv           vg00          -wi-ao 480.00M
  lvol1           vg00          -wi-ao   1.00G
  lvol2           vg00          -wi-ao   4.00G
  lvol3           vg00          -wi-ao   3.91G
  lvol4           vg00          -wi-ao   1.00G
  lvol5           vg00          -wi-ao   1.00G
  lvol6           vg00          -wi-ao 256.00M
  netbackuplv     vg00          -wi-ao 512.00M
  oraclelv        vg00          -wi-ao   5.00G
  tivolilv        vg00          -wi-ao  64.00M
  u001lv          vg00          -wi-ao   5.00G
  u003lv          vg00          -wi-ao 512.00M
  ybslv           vg00          -wi-ao  32.00M
  aserver         vgGWPOCSHARED -wi-a-  10.00G
  fmw1            vgGWPOCSHARED -wi-a-  50.00G
  fmw2            vgGWPOCSHARED -wi-a-  50.00G
  gwpoc_cluster   vgGWPOCSHARED -wi-a-  20.00G
  gwpoc_instance1 vgGWPOCSHARED -wi-a-  10.00G
  gwpoc_instance2 vgGWPOCSHARED -wi-a-  10.00G
  mserver1        vgGWPOCSHARED -wi-a-  10.00G
  mserver2        vgGWPOCSHARED -wi-a-  10.00G
  test            vgGWPOCSHARED -wi-a-   1.00G
  test2           vgGWPOCSHARED -wi-a-   1.00G
  test3           vgGWPOCSHARED -wi-a-   1.00G

It is interesting that if I do an lvcreate on one node, then ls -l /dev/mapper/vgGWPOCSHARED* on the other node, the device mapper entry has been created, and I can therefore use it in filesystems etc.  It seems restricted to the lvm specific commands.


Simon


________________________________________________
This email and any attachments are confidential and may contain privileged information.
If you are not the person for whom they are intended please return the email and then delete all material from any computer. You must not use the email or attachments for any purpose, nor disclose its contents to anyone other than the intended recipient.
Any statements made by an individual in this email do not necessarily reflect the views of the Yorkshire Building Society Group.
________________________________________________

Yorkshire Building Society, which is authorised and regulated by the Financial Services Authority, chooses to introduce its customers to Legal & General for the purposes of advising on and arranging life assurance and investment products bearing Legal & General?s name.

We are entered in the FSA Register and our FSA registration number is 106085 http://www.fsa.gov.uk/register

Head Office: Yorkshire Building Society, Yorkshire House, Yorkshire Drive, Bradford, BD5 8LJ
Tel: 0845 1 200 100

Visit Our Website
http://www.ybs.co.uk

All communications with us may be monitored/recorded to improve the quality of our service and for your protection and security.



________________________________________________________________________
This e-mail has been scanned for all viruses by Star. The
service is powered by MessageLabs. For more information on a proactive
anti-virus service working around the clock, around the globe, visit:
http://www.star.net.uk
________________________________________________________________________



From swhiteho at redhat.com  Thu Feb 24 17:50:01 2011
From: swhiteho at redhat.com (Steven Whitehouse)
Date: Thu, 24 Feb 2011 17:50:01 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D66556A.6000509@mssl.ucl.ac.uk>
References: <4D5BDB4E.3010006@mssl.ucl.ac.uk>
	<20110216175848.GC2291@redhat.com>	<4D5D9219.8090002@mssl.ucl.ac.uk>
	<20110217212948.GA9582@redhat.com> <4D5E68C9.1000206@mssl.ucl.ac.uk>
	<4D658093.5010807@mssl.ucl.ac.uk> <1298550588.2566.16.camel@dolmen>
	<4D66556A.6000509@mssl.ucl.ac.uk>
Message-ID: <1298569801.2566.36.camel@dolmen>

Hi,

On Thu, 2011-02-24 at 12:56 +0000, Alan Brown wrote:
> Steven Whitehouse wrote:
> 
> > That doesn't sound like it is related to a DLM issue. 150 entries is not
> > a lot.
> 
> It isn't, but when the machine's being hammered by requests in other 
> filesystems, things can get very slow, very quickly.
> 
Depending on the exact mix of I/O, that is expected behaviour. That is
why it is so important to look at what can be done at the application
layer to mitigate such problems.

> > What do you mean be "access" in this case? Just looking up a
> > single file in the directory, or create/delete files or an ls -l
> > (implying stats to each file) or what exactly?
> 
> ls -l and creation/deletion.
> 
As soon as you mix creation/deletion on one node with accesses (of
whatever kind) from other nodes, you run this risk. Obviously you
wouldn't be using a cluster filesystem if you didn't intend to have this
kind of access from time to time, but anything that can be done at the
application level to help improve locality will pay big dividends
compared with any tuning that can be done at the fs/dlm level.

At first sight at least, this does not appear to be a dlm related
problem, so we need to be careful not to confuse two different issues,
even if the symptoms may appear the same.

Thanks for the iostat reports, I'll have a more detailed look and get
back to you,

Steve.

> 
> >>
> > Again, figuring out the exact workload should help us get to the bottom
> > of what is going on here. How are you measuring the delays reported
> > above? Is the syscall service time, for example?
> 
> iostats.
> 
> This is a typcal example. Note the huge difference between await (total 
> time in ms) and svctm (scsi command response time in ms)
> 
> 
> avg-cpu:  %user   %nice %system %iowait  %steal   %idle
>             0.62    0.00    2.12   27.03    0.00   70.22
> 
> Device:         rrqm/s   wrqm/s   r/s   w/s    rkB/s    wkB/s avgrq-sz 
> avgqu-sz   await  svctm  %util
> dm-6              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> dm-7              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> dm-8              0.00     0.00 16.00  1.00     8.00     0.50     1.00 
>     0.20   11.47  11.47  19.50
> dm-9              0.00     0.00 45.00 22.00   180.00    88.00     8.00 
>    13.45  351.07   6.19  41.50
> dm-10             0.00     0.00 61.00 30.00   244.00   120.00     8.00 
>    22.19  307.56  10.19  92.70
> dm-11             0.00     0.00 53.00  5.00   212.00    20.00     8.00 
>   128.12 4332.69  17.26 100.10
> dm-12             0.00     0.00 54.00  0.00   216.00     0.00     8.00 
>   100.28 2951.09  18.54 100.10
> dm-13             0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> dm-14             0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> 
> For comparison here's a FC-attached box running Ext4 (on a slower set of 
> arrays)
> 
> 
> Device:         rrqm/s   wrqm/s   r/s   w/s    rkB/s    wkB/s avgrq-sz 
> avgqu-sz   await  svctm  %util
> dm-2              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> dm-3              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> dm-4              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> dm-5              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> dm-6              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> dm-7              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> dm-8              0.00     0.00 247.00  0.00   988.00     0.00     8.00 
>     19.17   92.43   3.28  81.10
> dm-9              0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> dm-10             0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> dm-11             0.00     0.00 51.00 14.00 21548.00    56.00   664.74 
>     1.93   29.91   4.14  26.90
> dm-12             0.00     0.00 54.00 424.00 25800.00  1696.00   115.05 
>      6.07   12.69   0.60  28.50
> dm-13             0.00     0.00 55.00  0.00 23928.00     0.00   870.11 
>     2.50   45.49   5.38  29.60
> dm-14             0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> dm-15             0.00     0.00 247.00  0.00   988.00     0.00     8.00 
>     19.17   92.44   3.29  81.20
> dm-16             0.00     0.00  0.00  0.00     0.00     0.00     0.00 
>     0.00    0.00   0.00   0.00
> 
> 
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster




From thomas at sjolshagen.net  Thu Feb 24 17:57:14 2011
From: thomas at sjolshagen.net (Thomas Sjolshagen)
Date: Thu, 24 Feb 2011 12:57:14 -0500
Subject: [Linux-cluster]
 =?utf-8?q?lvm2-cluster_not_syncing_correctly=3F?=
In-Reply-To: <20110224173814596.00000004632@H04405>
References: <20110224161352225.00000004632@H04405>
	<1151718125.172523.1298566173100.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
	<20110224173814596.00000004632@H04405>
Message-ID: <09f63f66e569f668e219115b5e26f55f@sjolshagen.net>

  

On Thu, 24 Feb 2011 17:38:14 +0000, Simon Hargrave wrote: 

> Yes,
the volume group was created with vgcreate -cy, and the output of vgs
shows the "c" flag: -

As a rule of thumb, I always do a "vgscan" on all
of the other nodes in the cluster, after creating a LV on a clustered
VG.// Thomas
 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110224/732b90a5/attachment.htm>

From ajb2 at mssl.ucl.ac.uk  Thu Feb 24 18:37:24 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Thu, 24 Feb 2011 18:37:24 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <1298569801.2566.36.camel@dolmen>
References: <4D5BDB4E.3010006@mssl.ucl.ac.uk>	<20110216175848.GC2291@redhat.com>	<4D5D9219.8090002@mssl.ucl.ac.uk>	<20110217212948.GA9582@redhat.com>
	<4D5E68C9.1000206@mssl.ucl.ac.uk>	<4D658093.5010807@mssl.ucl.ac.uk>
	<1298550588.2566.16.camel@dolmen>	<4D66556A.6000509@mssl.ucl.ac.uk>
	<1298569801.2566.36.camel@dolmen>
Message-ID: <4D66A564.40609@mssl.ucl.ac.uk>

Steven Whitehouse wrote:

> As soon as you mix creation/deletion on one node with accesses (of
> whatever kind) from other nodes, you run this risk. 

_ALL_ the GFS2 filesystems (bar one 5Gb one for common config files, 
etc) are mounted one-node-only.

_ALL_ the GFS2 filesystems (with the same exception) are NFS exported.

NONE of the NFS exported filesystems have local processes accessing them 
except for backups(*) - because there's distinct and non-theoretrical 
risk of file coruption if anything other than NFSd touches a 
NFS-exported filesystem (We've experienced it and I've reproduced the 
corruption on non-cluster systems).

Even Samba is a re-export from a NFS client and I've been toying with 
the idea of moving backups to a NFS client despite the network penalties.

(*) Backups run on the node where the filesystem is NFS exported.

> Obviously you wouldn't be using a cluster filesystem if you didn't intend to have this
> kind of access from time to time, but anything that can be done at the
> application level to help improve locality will pay big dividends
> compared with any tuning that can be done at the fs/dlm level.

We originally installed this to run as pNFS/SAMBA/iscsi fileservers but 
after encountering the NFS corruption issues and finding out just how 
much slower it gets if other nodes mount/access the filesystems we just 
use GFS to ensure corruption-free failover.

Let's just say that what was promised was not what was delivered...






From sachinbhugra at hotmail.com  Thu Feb 24 21:32:10 2011
From: sachinbhugra at hotmail.com (sachin)
Date: Fri, 25 Feb 2011 03:02:10 +0530
Subject: [Linux-cluster] Cluster node hangs
In-Reply-To: <SNT112-DS15927A4805EBF6388A7ED6DADB0@phx.gbl>
References: <SNT112-W544548584AD789563BB1ECDAD10@phx.gbl>	<4D57A763.8030700@redhat.com>	<4D57A9F3.90408@redhat.com>	<SNT112-W456C3B1F45B101BABEDFDFDAD10@phx.gbl>	<AANLkTimZ9E4xm8qS_bb7-NcxY7X+u0Eg1nYaCXK0iuKj@mail.gmail.com>
	<SNT112-DS15927A4805EBF6388A7ED6DADB0@phx.gbl>
Message-ID: <SNT112-DS147132BD98D35D80F14062DADA0@phx.gbl>

Hi, 

 

Can someone please help me with this.

 

Thanks

 

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of sachin
Sent: Thursday, February 24, 2011 1:57 AM
To: 'linux clustering'
Subject: Re: [Linux-cluster] Cluster node hangs

 

Hi Dominic,

 

Below is my cluster.conf:

===================================

<?xml version="1.0"?>

<cluster alias="rhel5_cluster" config_version="21" name="rhel5_cluster">

        <fence_daemon post_fail_delay="0" post_join_delay="3"/>

        <clusternodes>

                <clusternode name="rhel5cln1.home.com" nodeid="1" votes="1">

                        <fence>

                                <method name="1">

                                        <device name="manual_fence"
nodename="rhel5cln1.home.com"/>

                                </method>

                        </fence>

                </clusternode>

                <clusternode name="rhel5cln2.home.com" nodeid="2" votes="1">

                        <fence>

                                <method name="1">

                                        <device name="manual_fence"
nodename="rhel5cln2.home.com"/>

                                </method>

                        </fence>

                </clusternode>

        </clusternodes>

        <cman expected_votes="1" two_node="1"/>

        <fencedevices>

                <fencedevice agent="fence_manual" name="manual_fence"/>

        </fencedevices>

        <rm log_level="7" log_facility="local3">

                <failoverdomains/>

                <resources>

                        <script file="/usr/local/httpd2.2.16/bin/apachectl"
name="Apache_Script"/>

                        <ip address="192.168.30.137" monitor_link="1"/>

                        <clusterfs device="/dev/sdc" force_unmount="0"
fsid="22440" fstype="gfs2" mountpoint="/usr/local/httpd2.2.16/htdocs/"
name="gfs2share" options=""/>

                </resources>

                <service autostart="1" name="Apache_Service"
recovery="restart">

                        <ip ref="192.168.30.137"/>

                        <script ref="Apache_Script"/>

                </service>

                <service autostart="1" name="gfs2share" recovery="relocate">

                        <clusterfs ref="gfs2share"/>

                </service>

        </rm>

<logging to_syslog="yes" to_logfile="yes" syslog_facility="local3">

<logging_daemon name="corosync" logfile="/var/log/cluster.log"/>

</logging>

</cluster>

=================================

 

One thing which I noticed is when I move the service on other node, it
generates the following logs:

 

Feb 20 21:50:48 rhel5cln1 clurgmgrd[13764]: <notice> Stopping service
service:gfs2share

Feb 20 21:50:48 rhel5cln1 clurgmgrd: [13764]: <debug> Not umounting /dev/sdc
(clustered file system)

Feb 20 21:50:48 rhel5cln1 clurgmgrd[13764]: <notice> Service
service:gfs2share is stopped

 

Cluster is configured in such that only one node should be mounting the GFS2
FS. When I start the cluster only one node mounts GFS2, however when service
is moved GFS2 gets mounted on both the node but it is still accessible. It
hangs when the owner node goes down and services move to other node
automatically.

 

 

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of dOminic
Sent: Sunday, February 13, 2011 8:03 PM
To: linux clustering
Subject: Re: [Linux-cluster] Cluster node hangs

 

Hi,

 

Whats the msg you are getting in logs ?. It would be great if you could
attach log mesgs along with cluster.conf 

 

-dominic 

 

On Sun, Feb 13, 2011 at 3:49 PM, Sachin Bhugra <sachinbhugra at hotmail.com>
wrote:

Thank for the reply and link. However, GFS2 is not listed in fstab, it is
only handled by cluster config.

  _____  

Date: Sun, 13 Feb 2011 10:52:51 +0100
From: ekuric at redhat.com
To: linux-cluster at redhat.com
Subject: Re: [Linux-cluster] Cluster node hangs



On 02/13/2011 10:41 AM, Elvir Kuric wrote: 

On 02/13/2011 10:14 AM, Sachin Bhugra wrote: 

Hi ,

I have setup a two node cluster in lab, with Vmware Server, and hence used
manual fencing. It includes a iSCSI GFS2 partition and it service Apache in
Active/Passive mode.

Cluster works and I am able to relocate service between nodes with no
issues. However, the problem comes when I shutdown the node, for testing,
which is presently holding the service. When the node becomes unavailable,
service gets relocated and GFS partition gets mounted on the other node,
however it is not accessible. If I try to do a "ls/du" on GFS partition, the
command hangs. On the other hand the node which was shutdown gets stuck at
"unmounting file system". 

I tried using fence_manual -n nodename and then fence_ack_manual -n
nodename, however it still remains the same.

Can someone please help me is what I am doing wrong?

Thanks, 








--






Linux-cluster mailing list






Linux-cluster at redhat.com






https://www.redhat.com/mailman/listinfo/linux-cluster

It would be good to see  /etc/fstab configuration used on cluster nodes. If
/gfs partition is mounted manually it will not be unmounted correctly in
case you restart node ( and not executing umount prior restart ), and will
hang during shutdown/reboot process.

More at:
http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/6/html-single/Glo
bal_File_System_2/index.html


Edit: above link, section 3.4 Special Considerations when Mounting GFS2 File
Systems 



Regards, 

Elvir 

 

 








--






Linux-cluster mailing list






Linux-cluster at redhat.com






https://www.redhat.com/mailman/listinfo/linux-cluster

 

-- Linux-cluster mailing list Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster 


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110225/14cf3793/attachment.htm>

From scooter at cgl.ucsf.edu  Thu Feb 24 22:40:25 2011
From: scooter at cgl.ucsf.edu (Scooter Morris)
Date: Thu, 24 Feb 2011 14:40:25 -0800
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <20110217212948.GA9582@redhat.com>
References: <4D5BDB4E.3010006@mssl.ucl.ac.uk>	<20110216175848.GC2291@redhat.com>	<4D5D9219.8090002@mssl.ucl.ac.uk>
	<20110217212948.GA9582@redhat.com>
Message-ID: <4D66DE59.6010103@cgl.ucsf.edu>

On 02/17/2011 01:29 PM, David Teigland wrote:
> On Thu, Feb 17, 2011 at 09:24:41PM +0000, Alan Brown wrote:
>> David Teigland wrote:
>>> Don't change the buffer size, but I'd increase all the hash table sizes to
>>> 4096 and see if anything changes.
>>>
>>> echo "4096">  /sys/kernel/config/dlm/cluster/rsbtbl_size
>>> echo "4096">  /sys/kernel/config/dlm/cluster/lkbtbl_size
>>> echo "4096">  /sys/kernel/config/dlm/cluster/dirtbl_size
>> Increasing rsbtbl_size to 4096 or higher results in FSes refusing to
>> mount and clvm refusing to start - both with "cannot allocate
>> memory"
>>
>> At 2048, it works, but gfs_controld and dlm_controld exited when I
>> tried to mount all FSes on one node as a test.
>>
>> At 1024 it seems stable.
>>
>> The other settings seemed to have applied OK. So far, reports are
>> positive (but it's quiet at the moment)
>>
>> I've got a strace of clvmd trying to start with rsbtbl_size set to
>> 4096. Should I post it here or would you prefer it mailed direct?
> Thanks for testing, you can post here.
Hi all.  After two tries, we've modified our cluster so that all nodes 
have increased their dlm hash table sizes to 1024.  Initially, I put the 
echos in /etc/init.d/gfs2, but it turns out that /etc/init.d/gfs2 is 
sort of a no-op: /etc/init.d/netfs mounts the gfs2 filesystems before 
/etc/init.d/gfs2 is ever called, so the echos need to be before netfs.

At any rate, we have noticed a significant perceived improvement in 
overall performance of the systems.  Where before, it was common to see 
imap process in D wait -- sometimes hanging for long periods of time -- 
we have not seen that at all since updating the hash table size.  So 
far, so good!

-- scooter



From ajb2 at mssl.ucl.ac.uk  Thu Feb 24 23:14:35 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Thu, 24 Feb 2011 23:14:35 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <4D66DE59.6010103@cgl.ucsf.edu>
References: <4D5BDB4E.3010006@mssl.ucl.ac.uk>	<20110216175848.GC2291@redhat.com>	<4D5D9219.8090002@mssl.ucl.ac.uk>	<20110217212948.GA9582@redhat.com>
	<4D66DE59.6010103@cgl.ucsf.edu>
Message-ID: <4D66E65B.6000605@mssl.ucl.ac.uk>

On 24/02/11 22:40, Scooter Morris wrote:
> Hi all.  After two tries, we've modified our cluster so that all nodes 
> have increased their dlm hash table sizes to 1024.  Initially, I put 
> the echos in /etc/init.d/gfs2, but it turns out that /etc/init.d/gfs2 
> is sort of a no-op: /etc/init.d/netfs mounts the gfs2 filesystems 
> before /etc/init.d/gfs2 is ever called, so the echos need to be before 
> netfs.
>

I created a new init.d script to apply the values immediately after cman 
is started - see attached files.

> At any rate, we have noticed a significant perceived improvement in 
> overall performance of the systems.  Where before, it was common to 
> see imap process in D wait -- sometimes hanging for long periods of 
> time -- we have not seen that at all since updating the hash table 
> size.  So far, so good!

How much ram have you got and which version of gfs2 are you running?

vermagic:       2.6.18-238.1.1.el5 SMP mod_unload gcc-4.1




-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: dlm_tune
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110224/4597b4b2/attachment.ksh>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: tweakdlmhash.sh
Type: application/x-shellscript
Size: 380 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110224/4597b4b2/attachment.bin>

From rossnick-lists at cybercat.ca  Fri Feb 25 03:18:06 2011
From: rossnick-lists at cybercat.ca (Nicolas Ross)
Date: Thu, 24 Feb 2011 22:18:06 -0500
Subject: [Linux-cluster] Fsck on GFS2
Message-ID: <480F2AC98B1840E4A0036BC897491575@Inspiron>

Hi !

We are in the testing phase of our setup, and I had to grow a GFS2 
filesystem under RH6, wich was full. It didn't work because of a bug, so I 
deleted some files to make room, and the the grow worked, but still I 
couldn't write more data. So I took the FS offline and made a FSCK on it. 
And it took something like 2 and a half hours to complete. The FS is now 941 
gigs on on a 1.8 tb lvm on a shared raid rack (active storage), via a 4 gb 
fibre channel switch. It contains millions (I don't have an exact count, but 
3 might be a good estimate) files.

So is it normal for fsck on a FS that size to took so long? On our curent 
XSan setup, xserve-raid, older drives and 2 gb fc, for the same size and 
file count it takes about 40 or 50 minutes to complete. 



From ashley at host365.com  Fri Feb 25 04:07:23 2011
From: ashley at host365.com (ashley at host365.com)
Date: 25 Feb 2011 04:07:23 +0000
Subject: [Linux-cluster] =?utf-8?q?Fsck_on_GFS2?=
Message-ID: <20110225040723.19434.qmail@psa101.host365.com>

Hi

I am away until 28/02/11. If you require support, please email support at host365.com or call +44 (0)207 610 9911 in my absence.

Regards
Ashley




From martijn.storck at gmail.com  Fri Feb 25 07:21:51 2011
From: martijn.storck at gmail.com (Martijn Storck)
Date: Fri, 25 Feb 2011 08:21:51 +0100
Subject: [Linux-cluster] High DLM CPU usage - low GFS/iSCSI performance
In-Reply-To: <1298541761.2566.2.camel@dolmen>
References: <AANLkTikDO534-5do_ziv4HikdFX3X7fjT2YCtcTPfztH@mail.gmail.com>
	<1298541761.2566.2.camel@dolmen>
Message-ID: <AANLkTi=6cSEWjb4i3m5nushK3eAa5B6V5WjdgpbPi19x@mail.gmail.com>

Thanks for your message.

Somehow the issue has not returned since yesterday when I applied some
tuning to our GFS, specifically:

glock_purge 50
demote_secs 100
scand_secs 5
statfs_fast 1

I'm not sure how this could be related to our DLM issues, but it seems to
have some sort effect. We're able to push the IOPS limit of our SAN without
problems now and system load is down to 1.5 from 6.

The cluster traffic is hard to judge since the cluster communicates over our
'Internet' interface (total traffic peaked at about 24Mbit/s). It might be
better to route it over our internal network (once I put a decent switch in
there), but how would I do that? I assume:

1. I should use hostnames that resolve to internal IP's in cluster.conf
2. I should change the bindnetaddr in openais.conf (it's now on the default
192.168.2.0, which is not a subnet we use)

Would that do the trick or am I missing something?

Cheers,
Martijn

On Thu, Feb 24, 2011 at 11:02 AM, Steven Whitehouse <swhiteho at redhat.com>wrote:

> Hi,
>
> On Thu, 2011-02-24 at 10:34 +0100, Martijn Storck wrote:
> > Hello everyone,
> >
> >
> > We currently have the following RHCS cluster in operation:
> >
> >
> > - 3 nodes, Xeon CPU, 12 GB hardware etc.
> > - 100mbit network between the cluster nodes
> > - Dell MD3200i iSCSI SAN, with 4 Gbit links (dm-multipath) to each
> > server (through two switches), 5 15k RPM spindles
> > - 1 GFS1 file system on the above mentioned SAN
> >
> >
> > 2 of the nodes share a single GFS file system, which is used for
> > hosting virtual machine containers (for web serving, mail and light
> > database work). We've noticed that performance is suboptimal so we've
> > started to investigate. The load is not high (we previously ran the
> > same containers on a single, much cheaper server using local 7200rpm
> > disks and ext3 fs without issues), but there is a lot of small block
> > I/O.
> >
> >
> > When I run iptraf (only monitoring the iSCSI traffic) and top side by
> > side on a single server I often see dlm_send using 100% CPU. During
> > this time I/O to our gfs filesystem seems to be blocked and container
> > performance goes down the drain.
> >
> Can you take a netstat -t while the cpu usage is at 100%, that will tell
> us whether there is queued data at that point in time.
>
> >
> > My question is: what causes dlm_send to use 100% CPU and is this wat
> > causes the low GFS performance? Based on what the servers are doing
> > I'm not expecting any deadlocks (they're mostly accessing separate
> > parts of the filesystem), so I'm suspecting some other kind of
> > limitation here. Could it be the 100Mbit network?
> >
> Well, that depends on how much traffic there is... have you measured the
> traffic when the problem is occurring?
>
> >
> > I've looked into the waiters queue using the debug fs and it varies
> > between 0 and 60 entries which doesn't seem to bad to me. The locks
> > table has some 30.000 locks. All DLM and GFS settings are defaults.
> > Any hints on where to look are appreciated!
> >
> It does sounds like a performance issue, and it shouldn't be too hard to
> get to the bottom of what is going on,
>
> Steve.
>
> >
> > Regards,
> >
> >
> > Martijn Storck
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110225/c59cfb0e/attachment.htm>

From szhargrave at ybs.co.uk  Fri Feb 25 08:55:19 2011
From: szhargrave at ybs.co.uk (Simon Hargrave)
Date: Fri, 25 Feb 2011 08:55:19 +0000
Subject: [Linux-cluster] lvm2-cluster not syncing correctly?
In-Reply-To: <09f63f66e569f668e219115b5e26f55f@sjolshagen.net>
References: <20110224161352225.00000004632@H04405>	<1151718125.172523.1298566173100.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>	<20110224173814596.00000004632@H04405>
	<09f63f66e569f668e219115b5e26f55f@sjolshagen.net>
Message-ID: <20110225085519297.00000004632@H04405>

Please read the warning at the end of this email
________________________________________________

 >  As a rule of thumb, I always do a "vgscan" on all of the other nodes in the cluster, after creating a LV on a clustered VG.
 >  // Thomas
 
In this instance however, vgscan doesn't appear to fix the issue.  After doing the vgdisplay -v vgGWPOCSHARED to see all the volumes, I then perform a vgscan and get the same Archiving volume group "vgGWPOCSHARED" metadata error in the verbose output.  A vgdisplay -v without the VG name once again shows the volumes as absent.
[root at ybsxlx89 ~]# vgscan -v
    Wiping cache of LVM-capable devices
    Wiping internal VG cache
  Reading all physical volumes.  This may take a while...
    Finding all volume groups
    Finding volume group "vgGWPOCSHARED"
  Found volume group "vgGWPOCSHARED" using metadata type lvm2
    Archiving volume group "vgGWPOCSHARED" metadata (seqno 25).
    Archiving volume group "vgGWPOCSHARED" metadata (seqno 23).
    Creating volume group backup "/etc/lvm/backup/vgGWPOCSHARED" (seqno 23).
    Finding volume group "vg00"
  Found volume group "vg00" using metadata type lvm2


________________________________________________
This email and any attachments are confidential and may contain privileged information.
If you are not the person for whom they are intended please return the email and then delete all material from any computer. You must not use the email or attachments for any purpose, nor disclose its contents to anyone other than the intended recipient.
Any statements made by an individual in this email do not necessarily reflect the views of the Yorkshire Building Society Group.
________________________________________________

Yorkshire Building Society, which is authorised and regulated by the Financial Services Authority, chooses to introduce its customers to Legal & General for the purposes of advising on and arranging life assurance and investment products bearing Legal & General?s name.

We are entered in the FSA Register and our FSA registration number is 106085 http://www.fsa.gov.uk/register

Head Office: Yorkshire Building Society, Yorkshire House, Yorkshire Drive, Bradford, BD5 8LJ
Tel: 0845 1 200 100

Visit Our Website
http://www.ybs.co.uk

All communications with us may be monitored/recorded to improve the quality of our service and for your protection and security.



________________________________________________________________________
This e-mail has been scanned for all viruses by Star. The
service is powered by MessageLabs. For more information on a proactive
anti-virus service working around the clock, around the globe, visit:
http://www.star.net.uk
________________________________________________________________________
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110225/9ca30832/attachment.htm>

From ajb2 at mssl.ucl.ac.uk  Fri Feb 25 10:03:07 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Fri, 25 Feb 2011 10:03:07 +0000
Subject: [Linux-cluster] High DLM CPU usage - low GFS/iSCSI performance
In-Reply-To: <AANLkTi=6cSEWjb4i3m5nushK3eAa5B6V5WjdgpbPi19x@mail.gmail.com>
References: <AANLkTikDO534-5do_ziv4HikdFX3X7fjT2YCtcTPfztH@mail.gmail.com>	<1298541761.2566.2.camel@dolmen>
	<AANLkTi=6cSEWjb4i3m5nushK3eAa5B6V5WjdgpbPi19x@mail.gmail.com>
Message-ID: <4D677E5B.8050207@mssl.ucl.ac.uk>

On 25/02/11 07:21, Martijn Storck wrote:
> Thanks for your message.
>
> Somehow the issue has not returned since yesterday when I applied some 
> tuning to our GFS, specifically:
>
> glock_purge 50
> demote_secs 100
> scand_secs 5
> statfs_fast 1
>

It's most likely the biggest contributor is the statfs_fast setting

For GFS1, We had the following settings (which gave pretty good performance)

for d in `mount | grep ^/dev | grep gfs | grep -v gfs2 | cut -f3 -d" "`; \
  do \
  /sbin/gfs_tool settune $d glock_purge 80 & \
  /sbin/gfs_tool settune $d demote_secs 25 & \
  /sbin/gfs_tool settune $d statfs_fast 1 & \
  /sbin/gfs_tool settune $d statfs_slots 512 & \
  /sbin/gfs_tool settune $d quota_account 1 & \
  /sbin/gfs_tool settune $d quota_enforce 0 & \
  /sbin/gfs_tool setflag inherit_jdata $d & \
  done







From rpeterso at redhat.com  Fri Feb 25 13:43:33 2011
From: rpeterso at redhat.com (Bob Peterson)
Date: Fri, 25 Feb 2011 08:43:33 -0500 (EST)
Subject: [Linux-cluster] Fsck on GFS2
In-Reply-To: <480F2AC98B1840E4A0036BC897491575@Inspiron>
Message-ID: <984986171.189718.1298641413061.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>

----- Original Message -----
| Hi !
| 
| We are in the testing phase of our setup, and I had to grow a GFS2
| filesystem under RH6, wich was full. It didn't work because of a bug,
| so I
| deleted some files to make room, and the the grow worked, but still I
| couldn't write more data. So I took the FS offline and made a FSCK on
| it.
| And it took something like 2 and a half hours to complete. The FS is
| now 941
| gigs on on a 1.8 tb lvm on a shared raid rack (active storage), via a
| 4 gb
| fibre channel switch. It contains millions (I don't have an exact
| count, but
| 3 might be a good estimate) files.
| 
| So is it normal for fsck on a FS that size to took so long? On our
| curent
| XSan setup, xserve-raid, older drives and 2 gb fc, for the same size
| and
| file count it takes about 40 or 50 minutes to complete.

Hi,

The speed and accuracy of fsck.gfs2 varies based on a lot of things,
including the amount of metadata, if there is any damage to fix,
and which version of fsck.gfs2 you are using.  The tool is improving
all the time.

I try to put the latest fixes for x86_64 on my people page:

http://people.redhat.com/rpeterso/Experimental/RHEL5.x/gfs2/fsck.gfs2

All of the fixes going into RHEL5.7 are in that version, and
it is faster and more accurate than the version shipped with RHEL5.6.
In fact we've made about 75+ patches since RHEL5.6 to improve it.
The upstream git repos for gfs2-utils and cluster have the latest
source code as well.

Regards,

Bob Peterson
Red Hat File Systems



From niks at logik-internet.rs  Sat Feb 26 02:33:29 2011
From: niks at logik-internet.rs (Nikola Savic)
Date: Sat, 26 Feb 2011 03:33:29 +0100
Subject: [Linux-cluster] How fast can rsync be on GFS2?
In-Reply-To: <984986171.189718.1298641413061.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
References: <984986171.189718.1298641413061.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
Message-ID: <4D686679.40104@logik-internet.rs>


  Hello,

  I have started building my first cluster with 3 nodes using CentOS
5.4. Two nodes have disk mirrored using DRBD, and third node is
accessing it using GNBD.

  We're still not in production, but I wanted to test backup procedures.
All web files will be copied to the remote server using rsync. There are
about 3 milion files, mostly small. Rsync is very slow in creating file
list, little faster than 100files/s. I have added:
  <dlm plock_ownership="1" plock_rate_limit="0"/>
  <gfs_controld plock_rate_limit="0"/>
to cluster.conf. Also tried to manually add options to gfs_controld in
cman init script. Nothing made rsync faster :(

  Is there any other tuning I can use to make GFS2 faster in this
scenario? Does anyone use rsync in similar scenario, and how fast is
file list created?

  I assume it has something to do with locking, but I couldn't find
suitable settings, except plock_ownership and plock_rate_limit, which
have already been included in cluster configuration file.

  Thanks,
  Nikola



From niks at logik-internet.rs  Sat Feb 26 02:41:45 2011
From: niks at logik-internet.rs (Nikola Savic)
Date: Sat, 26 Feb 2011 03:41:45 +0100
Subject: [Linux-cluster] Restarting cluster node, without other nodes hanging
In-Reply-To: <984986171.189718.1298641413061.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
References: <984986171.189718.1298641413061.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
Message-ID: <4D686869.6070207@logik-internet.rs>


  Hello,

  As mentioned in the previous post, I have setup my first cluster with
3 nodes running CentOS 5.4. Two nodes are mirrored using DRBD, and that
is used as shared storage for cluster. Third node is accessing that
shared storage using GNBD. When all three nodes are up and running,
everything works well. However, when I need to shutdown one of nodes, I
usually can't do it without hanging one or all of nodes, and in most
cases I have to use hardware reset :( I tried stopping all services
using shared storage (apache web server), and then stopping all cluster
related services (cman, clvmd, rgmanager, gnbd_imports, etc.). For some
reason, it doesn't work.

  Can anyone tell me what steps should I take when one of the cluster
nodes needs to be restarted?

  Thanks,
  Nikola

 



From ashley at host365.com  Sat Feb 26 02:37:17 2011
From: ashley at host365.com (ashley at host365.com)
Date: 26 Feb 2011 02:37:17 +0000
Subject: [Linux-cluster] =?utf-8?q?How_fast_can_rsync_be_on_GFS2=3F?=
Message-ID: <20110226023717.13816.qmail@psa101.host365.com>

Hi

I am away until 28/02/11. If you require support, please email support at host365.com or call +44 (0)207 610 9911 in my absence.

Regards
Ashley




From rpeterso at redhat.com  Sat Feb 26 03:32:31 2011
From: rpeterso at redhat.com (Bob Peterson)
Date: Fri, 25 Feb 2011 22:32:31 -0500 (EST)
Subject: [Linux-cluster] How fast can rsync be on GFS2?
In-Reply-To: <4D686679.40104@logik-internet.rs>
Message-ID: <1658019716.200321.1298691151489.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>

----- Original Message -----
| Is there any other tuning I can use to make GFS2 faster in this
| scenario? Does anyone use rsync in similar scenario, and how fast is
| file list created?

Hi Nikola,

GFS2 is mostly self-tuning.  However, we are making improvements all
the time.  The performance depends a lot on what version of GFS2 you
are running.  In general, newer is better.

On the positive side:
(1) There have been recent improvements to the DLM lock manager that
speed things up somewhat.
(2) I've got a set of patches that haven't been released yet that
should help performance for bugzilla bug 650494.
(3) I recently wrote a performance enhancement to the unlink code.
All three of these patches have been posted to the public cluster-devel
mailing list and are in different stages finding their way to the kernels.

On the negative side:
Some people have reported some more performance problems that we're
currently investigating (haven't solved yet).

Regards,

Bob Peterson
Red Hat File Systems



From niks at logik-internet.rs  Sat Feb 26 11:37:14 2011
From: niks at logik-internet.rs (Nikola Savic)
Date: Sat, 26 Feb 2011 12:37:14 +0100
Subject: [Linux-cluster] How fast can rsync be on GFS2?
In-Reply-To: <1658019716.200321.1298691151489.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
References: <1658019716.200321.1298691151489.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
Message-ID: <4D68E5EA.4030006@logik-internet.rs>


  Hi Bob,

  I see that packages available for CentOS 5.x are very old :( Do you
expect any problems with compiling cluster3 on CentOS 5.4 or 5.5?

  Did any try to install the cluster3 on CentOS? Are there available RPMS?

  Thanks,
  Nikola

Bob Peterson wrote:
> ----- Original Message -----
> | Is there any other tuning I can use to make GFS2 faster in this
> | scenario? Does anyone use rsync in similar scenario, and how fast is
> | file list created?
>
> Hi Nikola,
>
> GFS2 is mostly self-tuning.  However, we are making improvements all
> the time.  The performance depends a lot on what version of GFS2 you
> are running.  In general, newer is better.
>
> On the positive side:
> (1) There have been recent improvements to the DLM lock manager that
> speed things up somewhat.
> (2) I've got a set of patches that haven't been released yet that
> should help performance for bugzilla bug 650494.
> (3) I recently wrote a performance enhancement to the unlink code.
> All three of these patches have been posted to the public cluster-devel
> mailing list and are in different stages finding their way to the kernels.
>
> On the negative side:
> Some people have reported some more performance problems that we're
> currently investigating (haven't solved yet).
>
> Regards,
>
> Bob Peterson
> Red Hat File Systems
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>   



From ajb2 at mssl.ucl.ac.uk  Sun Feb 27 14:41:46 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Sun, 27 Feb 2011 14:41:46 +0000
Subject: [Linux-cluster] optimising DLM speed?
In-Reply-To: <1298569801.2566.36.camel@dolmen>
References: <4D5BDB4E.3010006@mssl.ucl.ac.uk>	<20110216175848.GC2291@redhat.com>	<4D5D9219.8090002@mssl.ucl.ac.uk>	<20110217212948.GA9582@redhat.com>
	<4D5E68C9.1000206@mssl.ucl.ac.uk>	<4D658093.5010807@mssl.ucl.ac.uk>
	<1298550588.2566.16.camel@dolmen>	<4D66556A.6000509@mssl.ucl.ac.uk>
	<1298569801.2566.36.camel@dolmen>
Message-ID: <4D6A62AA.2030404@mssl.ucl.ac.uk>

On 24/02/11 17:50, Steven Whitehouse wrote:
>
> Depending on the exact mix of I/O, that is expected behaviour. That is
> why it is so important to look at what can be done at the application
> layer to mitigate such problems.
>

This is an academic environment.

Telling users to adjust the way they do things is an exercise in 
futility. It's hard enough trying to get them not to put 100k of small 
files in one directory and then complain about access speeds.

Finding things like a massive IDL program being written to simply call 
wget 20 times is common (never mind that what was wanted could be done 
with one wget command)...






From ajb2 at mssl.ucl.ac.uk  Sun Feb 27 14:49:54 2011
From: ajb2 at mssl.ucl.ac.uk (Alan Brown)
Date: Sun, 27 Feb 2011 14:49:54 +0000
Subject: [Linux-cluster] Fsck on GFS2
In-Reply-To: <984986171.189718.1298641413061.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
References: <984986171.189718.1298641413061.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
Message-ID: <4D6A6492.4040501@mssl.ucl.ac.uk>

On 25/02/11 13:43, Bob Peterson wrote:
> All of the fixes going into RHEL5.7 are in that version, and
> it is faster and more accurate than the version shipped with RHEL5.6.

Will it be backported to 5.6?






From ashley at host365.com  Sun Feb 27 15:00:36 2011
From: ashley at host365.com (ashley at host365.com)
Date: 27 Feb 2011 15:00:36 +0000
Subject: [Linux-cluster] =?utf-8?q?Fsck_on_GFS2?=
Message-ID: <20110227150036.24710.qmail@psa101.host365.com>

Hi

I am away until 28/02/11. If you require support, please email support at host365.com or call +44 (0)207 610 9911 in my absence.

Regards
Ashley




From laszlo.budai at gmail.com  Sun Feb 27 16:08:41 2011
From: laszlo.budai at gmail.com (Budai Laszlo)
Date: Sun, 27 Feb 2011 18:08:41 +0200
Subject: [Linux-cluster] Service location (colocation)
Message-ID: <4D6A7709.6060108@gmail.com>

Hi all,

is there a way to define location dependencies among services? for
instance how can I define that Service A should run on the same node as
service B? Or the opposite: Service C should run on a different node
than service D?

Thank you,
Laszlo



From rpeterso at redhat.com  Sun Feb 27 21:31:06 2011
From: rpeterso at redhat.com (Bob Peterson)
Date: Sun, 27 Feb 2011 16:31:06 -0500 (EST)
Subject: [Linux-cluster] Fsck on GFS2
In-Reply-To: <4D6A6492.4040501@mssl.ucl.ac.uk>
Message-ID: <1835014936.204921.1298842266546.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>

----- Original Message -----
| On 25/02/11 13:43, Bob Peterson wrote:
| > All of the fixes going into RHEL5.7 are in that version, and
| > it is faster and more accurate than the version shipped with
| > RHEL5.6.
| 
| Will it be backported to 5.6?

Hi,

I don't know yet.  I don't control what goes into the z-stream
updates like 5.6.z.  Someone in Red Hat's support organization
can make a request for it and someone in management has to make
that call.

Regards,

Bob Peterson
Red Hat File Systems



From laszlo.budai at gmail.com  Mon Feb 28 00:33:51 2011
From: laszlo.budai at gmail.com (Budai Laszlo)
Date: Mon, 28 Feb 2011 02:33:51 +0200
Subject: [Linux-cluster] GFS2
Message-ID: <4D6AED6F.9000203@gmail.com>

Hi all,

in which version of RHEL GFS2 is considered production ready? 5.3?

Thank you,
Laszlo



From ashley at host365.com  Mon Feb 28 00:45:26 2011
From: ashley at host365.com (ashley at host365.com)
Date: 28 Feb 2011 00:45:26 +0000
Subject: [Linux-cluster] =?utf-8?q?GFS2?=
Message-ID: <20110228004526.24404.qmail@psa101.host365.com>

Hi

I am away until 28/02/11. If you require support, please email support at host365.com or call +44 (0)207 610 9911 in my absence.

Regards
Ashley




From parvez.h.shaikh at gmail.com  Mon Feb 28 07:13:10 2011
From: parvez.h.shaikh at gmail.com (Parvez Shaikh)
Date: Mon, 28 Feb 2011 12:43:10 +0530
Subject: [Linux-cluster] SNMP support with IBM Blade Center Fence Agent
Message-ID: <AANLkTiksPPr8o8yML7rd148-mdkXNUROsVbwP-g6NEpG@mail.gmail.com>

Hi all,

I have a question related to fence agents and SNMP alarms.

Fence Agent can fail to fence the failed node for various reason; e.g. with
my bladecenter fencing agent, I sometimes get message saying bladecenter
fencing failed because of timeout or fence device IP address/user
credentials are incorrect.

In such a situation is it possible to generate SNMP trap?

My cluster config file looks like below and in my case if bladecenter
fencing fails, manual fencing kicks in and requires user to do
fence_ack_manual, for this user must at least be notified via SNMP (or any
other mechanism?) to intervene  -

  <clusternodes>
    <clusternode name="blade2" nodeid="2" votes="1">
      <fence>
        <method name="1">
          <device blade="2" name="BladeCenterFencing"/>
        </method>
        <method name="2">
          <device name="ManualFencing" nodename="blade2"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="blade1" nodeid="1" votes="1">
      <fence>
        <method name="1">
          <device blade="1" name="BladeCenterFencing"/>
        </method>
        <method name="2">
          <device name="ManualFencing" nodename="blade1"/>
        </method>
      </fence>
    </clusternode>
  </clusternodes>
  <cman expected_votes="1" two_node="1"/>
  <fencedevices>
    <fencedevice agent="fence_bladecenter" ipaddr="blade-mm.com"
login="USERID" name="BladeCenterFencing" passwd="PASSW0RD"/>
    <fencedevice agent="fence_manual" name="ManualFencing"/>
  </fencedevices>

Thanks,
Parvez
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110228/38d541c2/attachment.htm>

From rohara at redhat.com  Mon Feb 28 16:14:06 2011
From: rohara at redhat.com (Ryan O'Hara)
Date: Mon, 28 Feb 2011 10:14:06 -0600
Subject: [Linux-cluster] SNMP support with IBM Blade Center Fence Agent
In-Reply-To: <AANLkTiksPPr8o8yML7rd148-mdkXNUROsVbwP-g6NEpG@mail.gmail.com>
References: <AANLkTiksPPr8o8yML7rd148-mdkXNUROsVbwP-g6NEpG@mail.gmail.com>
Message-ID: <20110228161406.GA14120@redhat.com>

On Mon, Feb 28, 2011 at 12:43:10PM +0530, Parvez Shaikh wrote:
> Hi all,
> 
> I have a question related to fence agents and SNMP alarms.
> 
> Fence Agent can fail to fence the failed node for various reason; e.g. with
> my bladecenter fencing agent, I sometimes get message saying bladecenter
> fencing failed because of timeout or fence device IP address/user
> credentials are incorrect.
> 
> In such a situation is it possible to generate SNMP trap?

This feature will be in RHEL6.1. There is a new project called
'foghorn' that creates SNMPv2 traps from dbus signals.

git://git.fedorahosted.org/foghorn.git

In RHEL6.1 (and the latest upstream release), certain cluster
components will emit dbus signals when certain events occurs. This
includes fencing. So when a node is fenced a dbus signal is generated
by fenced. The foghorn service catches this signal and generated
SNMPv2 trap.

Note that foghorn runs as an AgentX subagent, so snmpd must be running
as the master agentx.

Ryan

> My cluster config file looks like below and in my case if bladecenter
> fencing fails, manual fencing kicks in and requires user to do
> fence_ack_manual, for this user must at least be notified via SNMP (or any
> other mechanism?) to intervene  -
> 
>   <clusternodes>
>     <clusternode name="blade2" nodeid="2" votes="1">
>       <fence>
>         <method name="1">
>           <device blade="2" name="BladeCenterFencing"/>
>         </method>
>         <method name="2">
>           <device name="ManualFencing" nodename="blade2"/>
>         </method>
>       </fence>
>     </clusternode>
>     <clusternode name="blade1" nodeid="1" votes="1">
>       <fence>
>         <method name="1">
>           <device blade="1" name="BladeCenterFencing"/>
>         </method>
>         <method name="2">
>           <device name="ManualFencing" nodename="blade1"/>
>         </method>
>       </fence>
>     </clusternode>
>   </clusternodes>
>   <cman expected_votes="1" two_node="1"/>
>   <fencedevices>
>     <fencedevice agent="fence_bladecenter" ipaddr="blade-mm.com"
> login="USERID" name="BladeCenterFencing" passwd="PASSW0RD"/>
>     <fencedevice agent="fence_manual" name="ManualFencing"/>
>   </fencedevices>
> 
> Thanks,
> Parvez

> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster



From ooolinux at 163.com  Mon Feb 28 17:16:05 2011
From: ooolinux at 163.com (yue)
Date: Tue, 1 Mar 2011 01:16:05 +0800 (CST)
Subject: [Linux-cluster] if there is no cman.ko anymore
Message-ID: <3f1dd29a.50.12e6d450d86.Coremail.ooolinux@163.com>

my kernel 2.6.32,fc12
cman 3.0.17
i install cman.rpm
but i search no cman.ko, redhat cluster can work .
if there is not cman.ko anymore?
 
thanks
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20110301/cd843d42/attachment.htm>

From adrew at redhat.com  Mon Feb 28 17:36:46 2011
From: adrew at redhat.com (Adam Drew)
Date: Mon, 28 Feb 2011 12:36:46 -0500 (EST)
Subject: [Linux-cluster] if there is no cman.ko anymore
In-Reply-To: <3f1dd29a.50.12e6d450d86.Coremail.ooolinux@163.com>
Message-ID: <289115570.283817.1298914606691.JavaMail.root@zmail01.collab.prod.int.phx2.redhat.com>

Hi Yue,

The cman was moved out of kernel space in Cluster 2 (RHEL 5+). Hope this helps.

Thanks,
Adam

----- Original Message -----
From: "yue" <ooolinux at 163.com>
To: "linux-cluster" <linux-cluster at redhat.com>
Sent: Monday, February 28, 2011 12:16:05 PM
Subject: [Linux-cluster] if there is no cman.ko anymore



my kernel 2.6.32,fc12 
cman 3.0.17 
i install cman.rpm 
but i search no cman.ko, redhat cluster can work . 
if there is not cman.ko anymore? 

thanks 


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster



From linux at alteeve.com  Mon Feb 28 17:36:37 2011
From: linux at alteeve.com (Digimer)
Date: Mon, 28 Feb 2011 12:36:37 -0500
Subject: [Linux-cluster] if there is no cman.ko anymore
In-Reply-To: <3f1dd29a.50.12e6d450d86.Coremail.ooolinux@163.com>
References: <3f1dd29a.50.12e6d450d86.Coremail.ooolinux@163.com>
Message-ID: <4D6BDD25.8040506@alteeve.com>

On 02/28/2011 12:16 PM, yue wrote:
> my kernel 2.6.32,fc12
> cman 3.0.17
> i install cman.rpm
> but i search no cman.ko, redhat cluster can work .
> if there is not cman.ko anymore?
>  
> thanks

I can't speak to this specific version, but I can confirm that CMAN is
going away (in fact, I think it is already gone from 3.1).

-- 
Digimer
E-Mail: digimer at alteeve.com
AN!Whitepapers: http://alteeve.com
Node Assassin:  http://nodeassassin.org



From bturner at redhat.com  Mon Feb 28 20:22:31 2011
From: bturner at redhat.com (Ben Turner)
Date: Mon, 28 Feb 2011 15:22:31 -0500 (EST)
Subject: [Linux-cluster] GFS2
In-Reply-To: <4D6AED6F.9000203@gmail.com>
Message-ID: <1818755651.156254.1298924551514.JavaMail.root@zmail07.collab.prod.int.phx2.redhat.com>

GFS2 became officially supported with the release of RHEL 5.3.  As with any piece of software bugs have been found and resolved since it was officially released and I recommend running on the latest bits.

-Ben

----- Original Message -----
> Hi all,
> 
> in which version of RHEL GFS2 is considered production ready? 5.3?
> 
> Thank you,
> Laszlo
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster