From cluster at squiz.net  Wed Mar  1 03:02:11 2006
From: cluster at squiz.net (oly)
Date: Wed, 01 Mar 2006 14:02:11 +1100
Subject: [Linux-cluster] GFS = filesystem consistency error
Message-ID: <1141182131.29086.46.camel@sunrise.squiz.net>

Hi there
        I've got a 4nodes RHEL4 cluster with GFS version  6.1.0 (built
        Jun  7
        2005 12:46:04).
        The shared disk is a NAS detected by aoe as /dev/etherd/e0.0.
        ANd i have problem on few files on teh file system : if i tried
        to
        modify the inodes o this files (delete the file, or unlink the
        inode)
        the cluster nodes where i launch the command lost the GFS and
        the GFS
        modules stay busy and cannot be remove from the kernel. my nodes
        is so
        stuck and the only solution is only to hardware restart this
        nodes.
         All the GFS journal seems to work fine ...i can even get stat
        of the
        DEAD file.
         Is GFS got problem to manipulate file in a 'more than 1 million
        files'
        folder ?
         IS anyone got a solution to remove this dead files or delete
        teh fodler
        that content all these dead files ?
         Is a gfs.fsck can resolv my problem ?
         Is there any later version that fix this problem ?
        
        Thanks in advance.
        PS = see below all the details
         
        The error i get when i try to unlink the file inode:
        ===========ERROR============
        GFS: fsid=entcluster:sataide.2: fatal: filesystem consistency
        error
        GFS: fsid=entcluster:sataide.2:   inode = 8516674/8516674
        GFS: fsid=entcluster:sataide.2:   function = gfs_change_nlink
        GFS: fsid=entcluster:sataide.2:   file
        = /usr/src/build/574067-i686/BUILD/smp/src/gfs/inode.c, line =
        843
        GFS: fsid=entcluster:sataide.2:   time = 1141080134
        GFS: fsid=entcluster:sataide.2: about to withdraw from the
        cluster
        GFS: fsid=entcluster:sataide.2: waiting for outstanding I/O
        GFS: fsid=entcluster:sataide.2: telling LM to withdraw
        lock_dlm: withdraw abandoned memory
        GFS: fsid=entcluster:sataide.2: withdrawn
          mh_magic = 0x01161970
          mh_type = 4
          mh_generation = 68
          mh_format = 400
          mh_incarn = 6
          no_formal_ino = 8516674
          no_addr = 8516674
          di_mode = 0664
          di_uid = 500
          di_gid = 500
          di_nlink = 0
          di_size = 0
          di_blocks = 1
          di_atime = 1141042636
          di_mtime = 1140001370
          di_ctime = 1140001370
          di_major = 0
          di_minor = 0
          di_rgrp = 8513987
          di_goal_rgrp = 8513987
          di_goal_dblk = 2682
          di_goal_mblk = 2682
          di_flags = 0x00000004
          di_payload_format = 0
          di_type = 1
          di_height = 0
          di_incarn = 0
          di_pad = 0
          di_depth = 0
          di_entries = 0
          no_formal_ino = 0
          no_addr = 0
          di_eattr = 0
          di_reserved =
        00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
        00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
        00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
        00 00 00 00 00 00 00 00
        ========END OF ERROR==========
        
        My cman status:
        ==========STATUS============
        Protocol version: 5.0.1
        Config version: 4
        Cluster name: entcluster
        Cluster ID: 42548
        Cluster Member: Yes
        Membership state: Cluster-Member
        Nodes: 4
        Expected_votes: 1
        Total_votes: 4
        Quorum: 3
        Active subsystems: 5
        Node name: XXX.domainX.tld
        Node addresses: x.x.x.x
        ========END CMAN=========
        
        My gfs_tool df :
        ============DF=========
        /home:
          SB lock proto = "lock_dlm"
          SB lock table = "entcluster:sataide"
          SB ondisk format = 1309
          SB multihost format = 1401
          Block size = 4096
          Journals = 4
          Resource Groups = 274
          Mounted lock proto = "lock_dlm"
          Mounted lock table = "entcluster:sataide"
          Mounted host data = ""
          Journal number = 0
          Lock module flags =
          Local flocks = FALSE
          Local caching = FALSE
          Oopses OK = FALSE
        
          Type           Total          Used           Free
        use%
        
        ------------------------------------------------------------------------
          inodes         100642         100642         0
        100%
          metadata       3842538        8527           3834011        0%
          data           13999476       2760327        11239149
        20%
        =============END DF =========
        Version of my modules :
        ========modules========
        CMAN 2.6.9-36.0 (built May 31 2005 12:15:02) installed
        DLM 2.6.9-34.0 (built Jun  2 2005 15:17:56) installed
        Lock_Harness 2.6.9-35.5 (built Jun  7 2005 12:42:30) installed
        GFS 2.6.9-35.5 (built Jun  7 2005 12:42:49) installed
        aoe: aoe_init: AoE v2.6-11 initialised.
        Lock_DLM (built Jun  7 2005 12:42:32) installed
        ========end modules========
        
        
        -- 
        Aurelien Lemaire (oly)
        http://www.squiz.net
        Sydney | Canberra | London
        92 Jarrett St Leichhardt, Sydney, NSW 2040
        T:+61 2 9568 6866 
        F:+61 2 9568 6733    


From cjk at techma.com  Wed Mar  1 19:06:30 2006
From: cjk at techma.com (Kovacs, Corey J.)
Date: Wed, 1 Mar 2006 14:06:30 -0500
Subject: [Linux-cluster] NFS exports, RHCS3 and Autofs.
Message-ID: <FF2CE0D593AEE34B955FEC77BD5AFBE0079E17@tmaemail.techma.com>

Folks, question regarding the subject line...


We have a 3 node cluster running RHEL3u6+RHCS3u6. Each system has a nfs 
service with two exports, one system has an additional nfs service with two
exports. 
This last service starts and mounts locally to the cluster, the problem is
the two 
exports never get listed in to exportfs list.

We can issue clusvcadm -R clu_srv2 and the exports will appear.


Any thoughts?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060301/94966541/attachment.htm>

From pcaulfie at redhat.com  Thu Mar  2 08:25:33 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Thu, 02 Mar 2006 08:25:33 +0000
Subject: [Linux-cluster] where to find user documentation for DLM?
In-Reply-To: <uslq4sbrn.fsf@yahoo.com>
References: <uslq4sbrn.fsf@yahoo.com>
Message-ID: <4406ABFD.5050502@redhat.com>

jalmeter_99 at yahoo.com wrote:
> Hello,
> 
> I have googled everything I can think of, but haven't found any
> documentation for using DLM as a developer.  Would someone please
> point me at a tutorial of some kind?
> 
> Background:
> 
> My employer has recently (last week) set up a GFS cluster for
> evaluation.  I am trying to set up a locking test that mimics a system
> that we use on VMS.
> 

If you download or checout the sources there is documentation in
cluster/dlm/doc as well as several example programs in cluster/dlm/test/usertest.

-- 

patrick


From Frank.Weyns at ordina.nl  Thu Mar  2 10:19:02 2006
From: Frank.Weyns at ordina.nl (Weyns, Frank)
Date: Thu, 2 Mar 2006 11:19:02 +0100
Subject: [Linux-cluster] Any recommentdations for Oracle on a Netapp filer ?
Message-ID: <4D30FCF33FE1FC4DB79C18A73D46C6730859A3@BA12-0013.work.local>

I'm designing a  very simple oracle cluster with a NetApp filer.
Just two nodes, one oracle production instance falling over to the second node if needed.
Second node running the "test-acceptance" oracle instance, which is brought down if needed.
 
The Oracle filesystems ( binary, database and archive logs) will be nfs mounted.
(I worked with Fiber SANs before, not a NetApp I have my doubts but you can take them away ;-)
 
Any caveats ? Any best practices. Why should I avoid nfs or why is it good ? Versions to have or avoid.)
 
If you  don't want to fill the mailing list with unneeded materials: Frank*Weyns.net   (*=@)
 
Regards,
Frank
Disclaimer

Dit bericht met eventuele bijlagen is vertrouwelijk en uitsluitend bestemd voor de geadresseerde. Indien u niet de bedoelde ontvanger bent, wordt u verzocht de afzender te waarschuwen en dit bericht met eventuele bijlagen direct te verwijderen en/of te vernietigen. Het is niet toegestaan dit bericht en eventuele bijlagen te vermenigvuldigen, door te sturen, openbaar te maken, op te slaan of op andere wijze te gebruiken. Ordina N.V. en/of haar groepsmaatschappijen accepteren geen verantwoordelijkheid of aansprakelijkheid voor schade die voortvloeit uit de inhoud en/of de verzending van dit bericht.

This e-mail and any attachments are confidential and is solely intended for the addressee only. If you are not the intended recipient, please notify the sender and delete and/or destroy this message and any attachments immediately. It is prohibited to copy, to distribute, to disclose or to use this e-mail and any attachments in any other way. Ordina N.V. and/or its group companies do not accept any responsibility nor liability for any damage resulting from the content of and/or the transmission of this message.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 3544 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060302/ae91e2d0/attachment.bin>

From deval.kulshrestha at progression.com  Thu Mar  2 12:47:36 2006
From: deval.kulshrestha at progression.com (Deval kulshrestha)
Date: Thu, 2 Mar 2006 18:17:36 +0530
Subject: [Linux-cluster] Buffer I/O error on dev cciss c0d0p9,
	lost pagewrite due to io error
Message-ID: <001401c63df7$7b660da0$cf00a8c0@PROGRESSION>

Hi 

Whenever I write some large file on SAN Logical Volume, after writing some
data it gives me error in /var/log/message.

 
as follows:

 
          Feb 28 11:08:53 s1_new kernel: cciss: cmd f7400000 timedout

          Feb 28 11:08:54 s1_new kernel: cciss: cmd f7436fb4 timedout

          Feb 28 11:08:54 s1_new kernel: printk: 232 messages suppressed.

          Feb 28 11:08:54 s1_new kernel: Buffer I/O error on device
cciss/c0d0p9, logical block 2162845

          Feb 28 11:08:54 s1_new kernel: lost page write due to I/O error on
cciss/c0d0p9

 
Also it is noticed that when started with clean reboot of systems servers
including SAN device)

I/O on SAN works very fast for the first few seconds then it starts timeout
and page lost errors

 
This is HP MSA 500G2 connected with HP DL 360 G4 , RHEL 4 ES U1(2.6.9-11)
RHCS4

 
With Regard

Deval K.

Progression Infonet Pvt. Ltd. 
55, Independent Electronic Modules, 
Sector - 18, Electronic City, 
Gurgaon - 122015 
Tel          : - 0124 - 2455070, Ext. 215, Fax: 91-124-2398647
Mobile   : - 98186 -82509 
URL        : - www.progression.com 

 
===========================================================
Privileged or confidential information may be contained
in this message. If you are not the addressee indicated
in this message (or responsible for delivery of the 
message to such person), please delete this message and
kindly notify the sender by an emailed reply. Opinions,
conclusions and other information in this message that
do not relate to the official business of Progression
and its associate entities shall be understood as neither
given nor endorsed by them.
  

-------------------------------------------------------------
Progression Infonet Private Limited, Gurgaon (Haryana), India
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060302/ccf163fb/attachment.htm>

From Fabrizio.Lippolis at AurigaInformatica.it  Thu Mar  2 16:37:04 2006
From: Fabrizio.Lippolis at AurigaInformatica.it (Fabrizio Lippolis)
Date: Thu, 02 Mar 2006 17:37:04 +0100
Subject: [Linux-cluster] cluster.conf reference
Message-ID: <44071F30.9060500@aurigainformatica.it>

I am configuring a cluster of two Linux machines and I would like to 
configure a service so that, should a machine fail, the over can start 
the service and go on. I have read I have to configure a failover domain 
containing the two machines and the service so that the task can be 
accomplished.

Unfortunately the cluster.conf man page just documents a very basic 
configuration and shows nothing about failover domains and so on.

Can anybody point me to better resources because even googling I wasn't 
able to find anything around and with the graphical tool 
(system-config-cluster) I am unable to save the configuration.

Thanks in advance.

-- 
Fabrizio Lippolis
fabrizio.lippolis at aurigainformatica.it
Auriga Informatica s.r.l.
Via Don Guanella 15/B - 70124 Bari
Tel.: 080/5025414  Fax: 080/5027448


From devrim at gunduz.org  Thu Mar  2 16:50:03 2006
From: devrim at gunduz.org (Devrim GUNDUZ)
Date: Thu, 2 Mar 2006 18:50:03 +0200 (EET)
Subject: [Linux-cluster] GFS: "transport endpoint is not connected" error
In-Reply-To: <200602281014.15703.hlawatschek@atix.de>
References: <Pine.LNX.4.63.0602280049230.11714@mail.kivi.com.tr>
	<200602281014.15703.hlawatschek@atix.de>
Message-ID: <Pine.LNX.4.63.0602281238410.19134@mail.kivi.com.tr>


Hi,

On Tue, 28 Feb 2006, Mark Hlawatschek wrote:

> have you already started your cluster environment (CMAN, fenced, DLM/Gulm) ?
> What are the exact steps you have done ?

Sorry for the delay in response.

After double,triple checking the configuration, I make CMAN and DLM 
working. Thanks, and now I can mount the filesystems. It was a 
misconfiguration in the cluster.conf

And also thanks to Michael Will who helped me off-list for the LVM thing.

Regards,
--
Devrim GUNDUZ
Kivi Bili?im Teknolojileri         -          http://www.kivi.com.tr
devrim~gunduz.org, devrim~PostgreSQL.org, devrim.gunduz~linux.org.tr
                       http://www.gunduz.org

From devrim at gunduz.org  Thu Mar  2 17:14:44 2006
From: devrim at gunduz.org (Devrim GUNDUZ)
Date: Thu, 2 Mar 2006 19:14:44 +0200 (EET)
Subject: [Linux-cluster] Using GFS on a hybrid system
Message-ID: <Pine.LNX.4.63.0603021850130.10819@mail.kivi.com.tr>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


Hi,

We have a RHEL hybrid system, which has 8 servers in it.

First of all, let me draw a picture of the system:

Due to a binary driver problem (IBM!), we had to install RHEL ES 4 U1 to 4 
servers (Let's call them S1, S2, S3 and S4) . The other ones have RHEL AS 
4 U2 (S5,S6,S7,S8) . The ESU1 ones have GFS 6.0 and the other ones have 
6.1. They are connected to a SAN.

2 of the ASU2 ones are using a seperate partition in SAN, and I had no 
problem in clustering and mounting the systems.

S3 and S4 will work as a Cluster. S1, S2, S5 and S6 are standalone 
servers.

S1,S2,S5 and S6 needs shared access to the LVM#1.
S1,S2,S3 and S4 needs shared access to another partition in SAN.
S1,S2,S5 and S6 needs shared access to the LVM#2.

The problem arose when we wanted to share LVM#1. We mkfs'ed LVM#1 using 
GFS 6.1 from S6. It is ok when we mount the LVM from S5 and S6. As we want 
to access data from S1 and S2, S5 and S6 ooopes and we need to reboot the 
servers, even if we mount with  -o oopses_ok.

Now the questions:

* What should be the cluster.conf files for S1...S6? Should they have the 
same cluster name?
* Is using GFS 6.0 and 6.1 dangerous? I have to use 6.0 in ESU1 servers. 
Should I rollback to RHEL AS 4 U1 on the U2 systems?

I wanted to ask the list before getting help from Red Hat, for Google to 
catch the answer and possibly help other people who may need it.

Any help/comment is appreciated.

Regards,
- --
Devrim GUNDUZ
Kivi Bili?im Teknolojileri         -          http://www.kivi.com.tr
devrim~gunduz.org, devrim~PostgreSQL.org, devrim.gunduz~linux.org.tr
                       http://www.gunduz.org
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.1 (GNU/Linux)

iD8DBQFEBygJ4zE8DGqpiZARAgdoAKCKaTtq1RsaRUee6rI6rQzdlroChQCePomf
0A5B4mQP1Zdw84dearDMdQw=
=H/v9
-----END PGP SIGNATURE-----

From devrim at gunduz.org  Thu Mar  2 17:41:19 2006
From: devrim at gunduz.org (Devrim GUNDUZ)
Date: Thu, 2 Mar 2006 19:41:19 +0200 (EET)
Subject: [Linux-cluster] Using GFS on a hybrid system
In-Reply-To: <Pine.LNX.4.63.0603021850130.10819@mail.kivi.com.tr>
References: <Pine.LNX.4.63.0603021850130.10819@mail.kivi.com.tr>
Message-ID: <Pine.LNX.4.63.0603021938180.10819@mail.kivi.com.tr>


Hi,

On Thu, 2 Mar 2006, Devrim GUNDUZ wrote:

> Due to a binary driver problem (IBM!), we had to install RHEL ES 4 U1 to 4 
> servers (Let's call them S1, S2, S3 and S4) . The other ones have RHEL AS 4 
> U2 (S5,S6,S7,S8) . The ESU1 ones have GFS 6.0 and the other ones have 6.1.

Oops, there is a typo: The ESU1 ones have GFS 6.1 and the other ones have 
6.1.3.

> * Is using GFS 6.0 and 6.1 dangerous? I have to use 6.0 in ESU1 servers. 
> Should I rollback to RHEL AS 4 U1 on the U2 systems?

This should also be " Is using GFS 6.1.0 and 6.1.3 dangerous?"

Regards,
--
Devrim GUNDUZ
Kivi Bili?im Teknolojileri         -          http://www.kivi.com.tr
devrim~gunduz.org, devrim~PostgreSQL.org, devrim.gunduz~linux.org.tr
                       http://www.gunduz.org

From lhh at redhat.com  Thu Mar  2 19:29:43 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 02 Mar 2006 14:29:43 -0500
Subject: [Linux-cluster] Any recommentdations for Oracle on a Netapp
	filer ?
In-Reply-To: <4D30FCF33FE1FC4DB79C18A73D46C6730859A3@BA12-0013.work.local>
References: <4D30FCF33FE1FC4DB79C18A73D46C6730859A3@BA12-0013.work.local>
Message-ID: <1141327783.13130.163.camel@ayanami.boston.redhat.com>

On Thu, 2006-03-02 at 11:19 +0100, Weyns, Frank wrote:
> I'm designing a  very simple oracle cluster with a NetApp filer.
> Just two nodes, one oracle production instance falling over to the second node if needed.
> Second node running the "test-acceptance" oracle instance, which is brought down if needed.
>  
> The Oracle filesystems ( binary, database and archive logs) will be nfs mounted.
> (I worked with Fiber SANs before, not a NetApp I have my doubts but you can take them away ;-)
>  
> Any caveats ? Any best practices. Why should I avoid nfs or why is it good ? Versions to have or avoid.)

I wrote a howto on how to do it with SAN storage for 10g Release 2.
It's fairly similar, I suspect, to how one might do it with NFS; it's
attached to Bugzilla 182423 if you want to give it a peek and/or make
comments.	

-- Lon


From clusterbuilder at gmail.com  Thu Mar  2 21:59:44 2006
From: clusterbuilder at gmail.com (Nick I)
Date: Thu, 2 Mar 2006 14:59:44 -0700
Subject: [Linux-cluster] GFS
Message-ID: <e073f9120603021359l4cf404b8uf1c0dcb9ed45f51@mail.gmail.com>

Hi,

I help maintain a Web site called www.clusterbuilder.org.  We have a
question and answer section to help those involved in clustering.  We are
developing a knowledgebase of cluster questions and responses so people with
similar problems might be able to find answers to their question.

I received a question concerning Red Hat and wanted to see what the opinions
are of everyone here.

"How to configure a two node GFS Cluster?"
I have found some documentation on Red Hats' site, but  wanted to see if
anyone has any advice for this user.
You can respond to this email or submit a response at
www.clusterbuilder.org/FAQ

Any response is greatly appreciated.


Thanks,
Nick
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060302/e873e966/attachment.htm>

From cluster at squiz.net  Thu Mar  2 22:13:41 2006
From: cluster at squiz.net (oly)
Date: Fri, 03 Mar 2006 09:13:41 +1100
Subject: [Linux-cluster] GFS = filesystem consistency error
In-Reply-To: <1141182131.29086.46.camel@sunrise.squiz.net>
References: <1141182131.29086.46.camel@sunrise.squiz.net>
Message-ID: <1141337621.26844.4.camel@sunrise.squiz.net>

Hi there,
I would like to give an update to my ticket. That will maybe help people
who've got similar trouble :
I resolved my problem by doing:
- gfs_tool shrink /home (supposed to reclaim but did not)
- gfs_tool reclaim /home (still not enough )
unmount the /home on all my nodes
-gfs_fsck -y /dev/etherd/e0.0
-remount my /home 
VICTORY = i lost all the broken inode files
ADVICE= avoid 1 million file folder in the future

Cheers, Oly


On Wed, 2006-03-01 at 14:02 +1100, oly wrote:
> Hi there
>         I've got a 4nodes RHEL4 cluster with GFS version  6.1.0 (built
>         Jun  7
>         2005 12:46:04).
>         The shared disk is a NAS detected by aoe as /dev/etherd/e0.0.
>         ANd i have problem on few files on teh file system : if i tried
>         to
>         modify the inodes o this files (delete the file, or unlink the
>         inode)
>         the cluster nodes where i launch the command lost the GFS and
>         the GFS
>         modules stay busy and cannot be remove from the kernel. my nodes
>         is so
>         stuck and the only solution is only to hardware restart this
>         nodes.
>          All the GFS journal seems to work fine ...i can even get stat
>         of the
>         DEAD file.
>          Is GFS got problem to manipulate file in a 'more than 1 million
>         files'
>         folder ?
>          IS anyone got a solution to remove this dead files or delete
>         teh fodler
>         that content all these dead files ?
>          Is a gfs.fsck can resolv my problem ?
>          Is there any later version that fix this problem ?
>         
>         Thanks in advance.
>         PS = see below all the details
>          
>         The error i get when i try to unlink the file inode:
>         ===========ERROR============
>         GFS: fsid=entcluster:sataide.2: fatal: filesystem consistency
>         error
>         GFS: fsid=entcluster:sataide.2:   inode = 8516674/8516674
>         GFS: fsid=entcluster:sataide.2:   function = gfs_change_nlink
>         GFS: fsid=entcluster:sataide.2:   file
>         = /usr/src/build/574067-i686/BUILD/smp/src/gfs/inode.c, line =
>         843
>         GFS: fsid=entcluster:sataide.2:   time = 1141080134
>         GFS: fsid=entcluster:sataide.2: about to withdraw from the
>         cluster
>         GFS: fsid=entcluster:sataide.2: waiting for outstanding I/O
>         GFS: fsid=entcluster:sataide.2: telling LM to withdraw
>         lock_dlm: withdraw abandoned memory
>         GFS: fsid=entcluster:sataide.2: withdrawn
>           mh_magic = 0x01161970
>           mh_type = 4
>           mh_generation = 68
>           mh_format = 400
>           mh_incarn = 6
>           no_formal_ino = 8516674
>           no_addr = 8516674
>           di_mode = 0664
>           di_uid = 500
>           di_gid = 500
>           di_nlink = 0
>           di_size = 0
>           di_blocks = 1
>           di_atime = 1141042636
>           di_mtime = 1140001370
>           di_ctime = 1140001370
>           di_major = 0
>           di_minor = 0
>           di_rgrp = 8513987
>           di_goal_rgrp = 8513987
>           di_goal_dblk = 2682
>           di_goal_mblk = 2682
>           di_flags = 0x00000004
>           di_payload_format = 0
>           di_type = 1
>           di_height = 0
>           di_incarn = 0
>           di_pad = 0
>           di_depth = 0
>           di_entries = 0
>           no_formal_ino = 0
>           no_addr = 0
>           di_eattr = 0
>           di_reserved =
>         00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
>         00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
>         00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
>         00 00 00 00 00 00 00 00
>         ========END OF ERROR==========
>         
>         My cman status:
>         ==========STATUS============
>         Protocol version: 5.0.1
>         Config version: 4
>         Cluster name: entcluster
>         Cluster ID: 42548
>         Cluster Member: Yes
>         Membership state: Cluster-Member
>         Nodes: 4
>         Expected_votes: 1
>         Total_votes: 4
>         Quorum: 3
>         Active subsystems: 5
>         Node name: XXX.domainX.tld
>         Node addresses: x.x.x.x
>         ========END CMAN=========
>         
>         My gfs_tool df :
>         ============DF=========
>         /home:
>           SB lock proto = "lock_dlm"
>           SB lock table = "entcluster:sataide"
>           SB ondisk format = 1309
>           SB multihost format = 1401
>           Block size = 4096
>           Journals = 4
>           Resource Groups = 274
>           Mounted lock proto = "lock_dlm"
>           Mounted lock table = "entcluster:sataide"
>           Mounted host data = ""
>           Journal number = 0
>           Lock module flags =
>           Local flocks = FALSE
>           Local caching = FALSE
>           Oopses OK = FALSE
>         
>           Type           Total          Used           Free
>         use%
>         
>         ------------------------------------------------------------------------
>           inodes         100642         100642         0
>         100%
>           metadata       3842538        8527           3834011        0%
>           data           13999476       2760327        11239149
>         20%
>         =============END DF =========
>         Version of my modules :
>         ========modules========
>         CMAN 2.6.9-36.0 (built May 31 2005 12:15:02) installed
>         DLM 2.6.9-34.0 (built Jun  2 2005 15:17:56) installed
>         Lock_Harness 2.6.9-35.5 (built Jun  7 2005 12:42:30) installed
>         GFS 2.6.9-35.5 (built Jun  7 2005 12:42:49) installed
>         aoe: aoe_init: AoE v2.6-11 initialised.
>         Lock_DLM (built Jun  7 2005 12:42:32) installed
>         ========end modules========
>         
>         
>         
>         -- 
>         Aurelien Lemaire (oly)
>         http://www.squiz.net
>         Sydney | Canberra | London
>         92 Jarrett St Leichhardt, Sydney, NSW 2040
>         T:+61 2 9568 6866 
>         F:+61 2 9568 6733    
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From wcheng at redhat.com  Thu Mar  2 20:15:58 2006
From: wcheng at redhat.com (Wendy Cheng)
Date: Thu, 02 Mar 2006 15:15:58 -0500
Subject: [Linux-cluster] GFS = filesystem consistency error
In-Reply-To: <1141337621.26844.4.camel@sunrise.squiz.net>
References: <1141182131.29086.46.camel@sunrise.squiz.net>
	<1141337621.26844.4.camel@sunrise.squiz.net>
Message-ID: <1141330558.6362.5.camel@localhost.localdomain>

On Fri, 2006-03-03 at 09:13 +1100, oly wrote:
> Hi there,
> I would like to give an update to my ticket. That will maybe help people
> who've got similar trouble :
> I resolved my problem by doing:
> - gfs_tool shrink /home (supposed to reclaim but did not)
> - gfs_tool reclaim /home (still not enough )
> unmount the /home on all my nodes
> -gfs_fsck -y /dev/etherd/e0.0
> -remount my /home 
> VICTORY = i lost all the broken inode files
> ADVICE= avoid 1 million file folder in the future

Sorry, this is probably a late reply but out of curiosity ..

Look to me that the filesystem had been corrupted before you unlinked
the file. Is there any other errors *before* the filesystem consistency
errors ? Out of memory warning ? How much memory had you put on this
machine ? And could I assume that was an i686 machine ?

-- Wendy 


> 
> 
> On Wed, 2006-03-01 at 14:02 +1100, oly wrote:
> > Hi there
> >         I've got a 4nodes RHEL4 cluster with GFS version  6.1.0 (built
> >         Jun  7
> >         2005 12:46:04).
> >         The shared disk is a NAS detected by aoe as /dev/etherd/e0.0.
> >         ANd i have problem on few files on teh file system : if i tried
> >         to
> >         modify the inodes o this files (delete the file, or unlink the
> >         inode)
> >         the cluster nodes where i launch the command lost the GFS and
> >         the GFS
> >         modules stay busy and cannot be remove from the kernel. my nodes
> >         is so
> >         stuck and the only solution is only to hardware restart this
> >         nodes.
> >          All the GFS journal seems to work fine ...i can even get stat
> >         of the
> >         DEAD file.
> >          Is GFS got problem to manipulate file in a 'more than 1 million
> >         files'
> >         folder ?
> >          IS anyone got a solution to remove this dead files or delete
> >         teh fodler
> >         that content all these dead files ?
> >          Is a gfs.fsck can resolv my problem ?
> >          Is there any later version that fix this problem ?
> >         
> >         Thanks in advance.
> >         PS = see below all the details
> >          
> >         The error i get when i try to unlink the file inode:
> >         ===========ERROR============
> >         GFS: fsid=entcluster:sataide.2: fatal: filesystem consistency
> >         error
> >         GFS: fsid=entcluster:sataide.2:   inode = 8516674/8516674
> >         GFS: fsid=entcluster:sataide.2:   function = gfs_change_nlink
> >         GFS: fsid=entcluster:sataide.2:   file
> >         = /usr/src/build/574067-i686/BUILD/smp/src/gfs/inode.c, line =
> >         843
> >         GFS: fsid=entcluster:sataide.2:   time = 1141080134
> >         GFS: fsid=entcluster:sataide.2: about to withdraw from the
> >         cluster
> >         GFS: fsid=entcluster:sataide.2: waiting for outstanding I/O
> >         GFS: fsid=entcluster:sataide.2: telling LM to withdraw
> >         lock_dlm: withdraw abandoned memory
> >         GFS: fsid=entcluster:sataide.2: withdrawn
> >           mh_magic = 0x01161970
> >           mh_type = 4
> >           mh_generation = 68
> >           mh_format = 400
> >           mh_incarn = 6
> >           no_formal_ino = 8516674
> >           no_addr = 8516674
> >           di_mode = 0664
> >           di_uid = 500
> >           di_gid = 500
> >           di_nlink = 0
> >           di_size = 0
> >           di_blocks = 1
> >           di_atime = 1141042636
> >           di_mtime = 1140001370
> >           di_ctime = 1140001370
> >           di_major = 0
> >           di_minor = 0
> >           di_rgrp = 8513987
> >           di_goal_rgrp = 8513987
> >           di_goal_dblk = 2682
> >           di_goal_mblk = 2682
> >           di_flags = 0x00000004
> >           di_payload_format = 0
> >           di_type = 1
> >           di_height = 0
> >           di_incarn = 0
> >           di_pad = 0
> >           di_depth = 0
> >           di_entries = 0
> >           no_formal_ino = 0
> >           no_addr = 0
> >           di_eattr = 0
> >           di_reserved =
> >         00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> >         00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> >         00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> >         00 00 00 00 00 00 00 00
> >         ========END OF ERROR==========
> >         
> >         My cman status:
> >         ==========STATUS============
> >         Protocol version: 5.0.1
> >         Config version: 4
> >         Cluster name: entcluster
> >         Cluster ID: 42548
> >         Cluster Member: Yes
> >         Membership state: Cluster-Member
> >         Nodes: 4
> >         Expected_votes: 1
> >         Total_votes: 4
> >         Quorum: 3
> >         Active subsystems: 5
> >         Node name: XXX.domainX.tld
> >         Node addresses: x.x.x.x
> >         ========END CMAN=========
> >         
> >         My gfs_tool df :
> >         ============DF=========
> >         /home:
> >           SB lock proto = "lock_dlm"
> >           SB lock table = "entcluster:sataide"
> >           SB ondisk format = 1309
> >           SB multihost format = 1401
> >           Block size = 4096
> >           Journals = 4
> >           Resource Groups = 274
> >           Mounted lock proto = "lock_dlm"
> >           Mounted lock table = "entcluster:sataide"
> >           Mounted host data = ""
> >           Journal number = 0
> >           Lock module flags =
> >           Local flocks = FALSE
> >           Local caching = FALSE
> >           Oopses OK = FALSE
> >         
> >           Type           Total          Used           Free
> >         use%
> >         
> >         ------------------------------------------------------------------------
> >           inodes         100642         100642         0
> >         100%
> >           metadata       3842538        8527           3834011        0%
> >           data           13999476       2760327        11239149
> >         20%
> >         =============END DF =========
> >         Version of my modules :
> >         ========modules========
> >         CMAN 2.6.9-36.0 (built May 31 2005 12:15:02) installed
> >         DLM 2.6.9-34.0 (built Jun  2 2005 15:17:56) installed
> >         Lock_Harness 2.6.9-35.5 (built Jun  7 2005 12:42:30) installed
> >         GFS 2.6.9-35.5 (built Jun  7 2005 12:42:49) installed
> >         aoe: aoe_init: AoE v2.6-11 initialised.
> >         Lock_DLM (built Jun  7 2005 12:42:32) installed
> >         ========end modules========
> >         
> >         
> >         
> >         -- 
> >         Aurelien Lemaire (oly)
> >         http://www.squiz.net
> >         Sydney | Canberra | London
> >         92 Jarrett St Leichhardt, Sydney, NSW 2040
> >         T:+61 2 9568 6866 
> >         F:+61 2 9568 6733    
> > 
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From suvankar_moitra at yahoo.com  Fri Mar  3 05:14:09 2006
From: suvankar_moitra at yahoo.com (SUVANKAR MOITRA)
Date: Thu, 2 Mar 2006 21:14:09 -0800 (PST)
Subject: [Linux-cluster] Any recommentdations for Oracle on a Netapp filer
	?
In-Reply-To: <1141327783.13130.163.camel@ayanami.boston.redhat.com>
Message-ID: <20060303051409.90698.qmail@web52302.mail.yahoo.com>

dear lon,

On 6th i will be in customer place and apply the whole
thing again which u have put.If i am facing any
problem i will mail u again and pl help me at that
time.

with warm regsrds

Suvankar
kolkata, india


--- Lon Hohberger <lhh at redhat.com> wrote:

> On Thu, 2006-03-02 at 11:19 +0100, Weyns, Frank
> wrote:
> > I'm designing a  very simple oracle cluster with a
> NetApp filer.
> > Just two nodes, one oracle production instance
> falling over to the second node if needed.
> > Second node running the "test-acceptance" oracle
> instance, which is brought down if needed.
> >  
> > The Oracle filesystems ( binary, database and
> archive logs) will be nfs mounted.
> > (I worked with Fiber SANs before, not a NetApp I
> have my doubts but you can take them away ;-)
> >  
> > Any caveats ? Any best practices. Why should I
> avoid nfs or why is it good ? Versions to have or
> avoid.)
> 
> I wrote a howto on how to do it with SAN storage for
> 10g Release 2.
> It's fairly similar, I suspect, to how one might do
> it with NFS; it's
> attached to Bugzilla 182423 if you want to give it a
> peek and/or make
> comments.	
> 
> -- Lon
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
>
https://www.redhat.com/mailman/listinfo/linux-cluster
> 


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From omer at faruk.net  Fri Mar  3 07:35:56 2006
From: omer at faruk.net (Omer Faruk Sen)
Date: Fri, 3 Mar 2006 09:35:56 +0200 (EET)
Subject: [Linux-cluster] RH 4.3 release date?
Message-ID: <52405.193.140.74.2.1141371356.squirrel@193.140.74.2>


A few weeks ago it has been stated in this list that there are problems
with rgmanager (clurgmgrd) that causes clurgmgrd to die suddenly. I hope
this will be fixed in RH 4.3 so when will the 4.3 come out? There is also
a bug in initscripts that causes service relocation fail (stop-after-stop
problem) this one will also be fixed in 4.3 right?


-- 
Omer Faruk Sen
http://www.faruk.net


From deval.kulshrestha at progression.com  Fri Mar  3 11:13:17 2006
From: deval.kulshrestha at progression.com (Deval kulshrestha)
Date: Fri, 3 Mar 2006 16:43:17 +0530
Subject: [Linux-cluster] Is anybody using MSA 500 G2 with HP Server's
Message-ID: <000601c63eb3$790fbb20$cf00a8c0@PROGRESSION>

Hi

 
Now it s another problem have started coming up.

 
Whenever I try to create a partition using mke2fs -j /dev/cciss/c0d0p5

 
Screen shows that its creating file system on SAN partitions, but in
var/log/messages it continuously keeps on showing 

      Feb 28 11:08:53 s1_new kernel: cciss: cmd f7400000 timedout

      Feb 28 11:08:54 s1_new kernel: cciss: cmd f7436fb4 timedout

      Feb 28 11:08:54 s1_new kernel: printk: 232 messages suppressed.

      Feb 28 11:08:54 s1_new kernel: Buffer I/O error on device
cciss/c0d0p9, logical block 2162845

      Feb 28 11:08:54 s1_new kernel: lost page write due to I/O error on
cciss/c0d0p9

 
If partitions size is small i.e 5 GB it anyway get created using some 20-25
minutes, but if I create large file system than it used to give error , it
simply stuck up at "Writing Superblock.." and above messages keeps on coming
in /var/log/messages.

 
If I write any data in partitions, than first few bytes get stored on
partitions but later it also shows Buffer I/O error.

 
I am using one MSA 500 G2 , two no. of HP DL360 G4P server with HP's HBA
642, Server installed with RHEL 4 ES U1 and RHCS4

 
Any Help Would be highly appreciable.

 
With regard

Deval K.

===========================================================
Privileged or confidential information may be contained
in this message. If you are not the addressee indicated
in this message (or responsible for delivery of the 
message to such person), please delete this message and
kindly notify the sender by an emailed reply. Opinions,
conclusions and other information in this message that
do not relate to the official business of Progression
and its associate entities shall be understood as neither
given nor endorsed by them.
  

-------------------------------------------------------------
Progression Infonet Private Limited, Gurgaon (Haryana), India
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060303/38baf429/attachment.htm>

From E.H.Beekman at amc.nl  Fri Mar  3 12:57:28 2006
From: E.H.Beekman at amc.nl (Ewald Beekman)
Date: Fri, 3 Mar 2006 13:57:28 +0100
Subject: [Linux-cluster] Any recommentdations for Oracle on a Netapp	filer
	?
In-Reply-To: <1141327783.13130.163.camel@ayanami.boston.redhat.com>
References: <4D30FCF33FE1FC4DB79C18A73D46C6730859A3@BA12-0013.work.local>
	<1141327783.13130.163.camel@ayanami.boston.redhat.com>
Message-ID: <20060303125728.GU6090@core.amc.uva.nl>

Hi Lon,

Is the howto available on the net? I would like to play around with
GFS and i suspect a NFS shared storage is the simplest way to try
it out.

best regards,
Ewald...

On Thu, Mar 02, 2006 at 02:29:43PM -0500, Lon Hohberger wrote:
> On Thu, 2006-03-02 at 11:19 +0100, Weyns, Frank wrote:
> > I'm designing a  very simple oracle cluster with a NetApp filer.
> > Just two nodes, one oracle production instance falling over to the second node if needed.
> > Second node running the "test-acceptance" oracle instance, which is brought down if needed.
> >  
> > The Oracle filesystems ( binary, database and archive logs) will be nfs mounted.
> > (I worked with Fiber SANs before, not a NetApp I have my doubts but you can take them away ;-)
> >  
> > Any caveats ? Any best practices. Why should I avoid nfs or why is it good ? Versions to have or avoid.)
> 
> I wrote a howto on how to do it with SAN storage for 10g Release 2.
> It's fairly similar, I suspect, to how one might do it with NFS; it's
> attached to Bugzilla 182423 if you want to give it a peek and/or make
> comments.	
> 
> -- Lon
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
Ewald Beekman, Security Engineer, Academic Medical Center,
dept. ADB/ICT Computer & Network Services, The Netherlands
## Your mind-mint is:
The IRS spends God knows how much of your tax money on these toll-free
information hot lines staffed by IRS employees, whose idea of a dynamite tax
tip is that you should print neatly.  If you ask them a real tax question,
such as how you can cheat, they're useless.

So, for guidance, you want to look to big business.  Big business never pays
a nickel in taxes, according to Ralph Nader, who represents a big consumer
organization that never pays a nickel in taxes...
		-- Dave Barry, "Sweating Out Taxes"


From lhh at redhat.com  Fri Mar  3 15:13:19 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 03 Mar 2006 10:13:19 -0500
Subject: [Linux-cluster] Any recommentdations for Oracle on a
	Netapp	filer ?
In-Reply-To: <20060303125728.GU6090@core.amc.uva.nl>
References: <4D30FCF33FE1FC4DB79C18A73D46C6730859A3@BA12-0013.work.local>
	<1141327783.13130.163.camel@ayanami.boston.redhat.com>
	<20060303125728.GU6090@core.amc.uva.nl>
Message-ID: <1141398799.13130.184.camel@ayanami.boston.redhat.com>

On Fri, 2006-03-03 at 13:57 +0100, Ewald Beekman wrote:

> Is the howto available on the net? I would like to play around with
> GFS and i suspect a NFS shared storage is the simplest way to try
> it out.

It is only in bugzilla, because there is not enough feedback to even
make the claim that it works ;)

Here is a link to the tar.gz though (howto, some screen captures, and an
agent), though:

https://bugzilla.redhat.com/bugzilla/attachment.cgi?id=125371

Once there is enough feedback to get it working (read: send comments to
the bugzilla as to what worked and what did not work for you, please), I
will add it to CVS.

-- Lon


From lhh at redhat.com  Fri Mar  3 15:23:20 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 03 Mar 2006 10:23:20 -0500
Subject: [Linux-cluster] Any recommentdations for Oracle on a Netapp
	filer ?
In-Reply-To: <20060303051409.90698.qmail@web52302.mail.yahoo.com>
References: <20060303051409.90698.qmail@web52302.mail.yahoo.com>
Message-ID: <1141399400.13130.195.camel@ayanami.boston.redhat.com>

On Thu, 2006-03-02 at 21:14 -0800, SUVANKAR MOITRA wrote:
> dear lon,
> 
> On 6th i will be in customer place and apply the whole
> thing again which u have put.If i am facing any
> problem i will mail u again and pl help me at that
> time.

Suvankar,

You should really test the HOWTO in your lab *before* trying to use it
to deploy anything remotely close to a production environment.  It is a
beta-quality HOWTO right now.

-- Lon


From magobin at gmail.com  Fri Mar  3 16:23:09 2006
From: magobin at gmail.com (Alessandro Binarelli)
Date: Fri, 3 Mar 2006 17:23:09 +0100
Subject: [Linux-cluster] HELP: Newbe on service cluster configuration!
Message-ID: <44086d7e.2afce7ef.6883.7291@mx.gmail.com>

Hi, I'm a Newbe about cluster configuration and I've some problem
understannding how to set up a service in cluster scenario....
For my test I use an nfs partition as shared storage for 2 server in cluster
mode, so this is scenario:

ServerA: 192.168.1.10
ServerB: 192.168.2.20
ServerC(nfs export) : 192.168.1.50  (hostname: san)


Initially I tried to set up a dns in cluster, so I mounted nfs partition as
/var/named/
If I try to run named normally on serverA it works...but when I try to start
named in cluster it failed

I set up a resources as NFS mount with this parameters:

Name: Dns
Mount point: /var/named
Host: san
Export Path: /SAN/DNS

...then I congigured a service with this resource and I attached an Ip
address (10.23.5.240) and a script (etc/init.d/named/) as Private Resource.

When I boot the server always I have an Service Failed, but when I check the
logs I find only "clurgmgrd: <err> #43: Service DNS has failed; can not
start." and a "#13: Service DNS failed to stop cleanly"

I suppose that thi is a problem how I configured the service...

Any suggestion??

Thanks in advance!

Alex


From wcheng at redhat.com  Fri Mar  3 14:07:15 2006
From: wcheng at redhat.com (Wendy Cheng)
Date: Fri, 03 Mar 2006 09:07:15 -0500
Subject: [Linux-cluster] GFS = filesystem consistency error
In-Reply-To: <1141330558.6362.5.camel@localhost.localdomain>
References: <1141182131.29086.46.camel@sunrise.squiz.net>
	<1141337621.26844.4.camel@sunrise.squiz.net>
	<1141330558.6362.5.camel@localhost.localdomain>
Message-ID: <1141394836.3705.11.camel@localhost.localdomain>

On Thu, 2006-03-02 at 15:15 -0500, Wendy Cheng wrote:
> On Fri, 2006-03-03 at 09:13 +1100, oly wrote:
> > Hi there,
> > I would like to give an update to my ticket. That will maybe help people
> > who've got similar trouble :
> > I resolved my problem by doing:
> > - gfs_tool shrink /home (supposed to reclaim but did not)
> > - gfs_tool reclaim /home (still not enough )
> > unmount the /home on all my nodes
> > -gfs_fsck -y /dev/etherd/e0.0
> > -remount my /home 
> > VICTORY = i lost all the broken inode files
> > ADVICE= avoid 1 million file folder in the future
> 

One more question since this "file folder" confuses me. What's the max
file count you have within one directory (excluding files in any
subdirectory) ? 

This could be a bug from our end so the input is highly appreciated.

-- Wendy


From baesso at ksolutions.it  Fri Mar  3 17:48:32 2006
From: baesso at ksolutions.it (Baesso Mirko)
Date: Fri, 3 Mar 2006 18:48:32 +0100
Subject: [Linux-cluster] sun cluster ccp for redhat
Message-ID: <984C9DBB29704B47B7AAD308F2C95A3B04DE71@kmail.ksolutions.it>

Hi,
i would like to known if there is a tool like cluster console panel to
manage cluster node as sun cluster do
Thanks in advance

Baesso Mirko - System Engineer
KSolutions.S.p.A.
Via Lenin 132/26
56017  S.Martino Ulmiano (PI) - Italy
tel.+ 39 0 50 898369 fax. + 39 0 50 861200
baesso at ksolutions.it   http//www.ksolutions.it


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060303/c1f38d82/attachment.htm>

From suvankar_moitra at yahoo.com  Sat Mar  4 06:35:04 2006
From: suvankar_moitra at yahoo.com (SUVANKAR MOITRA)
Date: Fri, 3 Mar 2006 22:35:04 -0800 (PST)
Subject: [Linux-cluster] Any recommentdations for Oracle on a Netapp filer
	?
In-Reply-To: <1141399400.13130.195.camel@ayanami.boston.redhat.com>
Message-ID: <20060304063504.30937.qmail@web52308.mail.yahoo.com>

dear lon,

I have some question about the script and the
installation :--
1> Can i install RHCS4 after install the Oracle 10g?
2> The /mnt/oracle mount point is temporary for the
oracle installation or should  i write on /etc/fstab ?
   
3> Can i mention ORACLE_HOME,ORACLE_BASE,ORACLE_SID
etc on .bash_profile of every node or leave it as it
is only create oracle user and group?
4> Where should i place oracledb.sh file? I think its
required in every node, am i write ?
5>What is the exact use of oracledb.sh file?
6> How can i shutdown the oracle, should i write
script for that, like orastop and orastart for up the
oracle?  

thanks and warm regards

Suvankar
--- Lon Hohberger <lhh at redhat.com> wrote:

> On Thu, 2006-03-02 at 21:14 -0800, SUVANKAR MOITRA
> wrote:
> > dear lon,
> > 
> > On 6th i will be in customer place and apply the
> whole
> > thing again which u have put.If i am facing any
> > problem i will mail u again and pl help me at that
> > time.
> 
> Suvankar,
> 
> You should really test the HOWTO in your lab
> *before* trying to use it
> to deploy anything remotely close to a production
> environment.  It is a
> beta-quality HOWTO right now.
> 
> -- Lon
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
>
https://www.redhat.com/mailman/listinfo/linux-cluster
> 


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From tekion at gmail.com  Sun Mar  5 16:11:08 2006
From: tekion at gmail.com (Screaming Eagle)
Date: Sun, 5 Mar 2006 11:11:08 -0500
Subject: [Linux-cluster] FC4 and LVM GFS ...
Message-ID: <ee9c961f0603050811v7a031091x32809880d75bbd59@mail.gmail.com>

All,
I got this to work with FC4. However, rebooting the server does not mount
the gfs file system (lvm + gfs). Here's what I have for modprobe.conf:
alias eth0 e1000
alias eth1 e1000
alias usb-controller ohci-hcd
alias block-major-152 aoe
alias char-major-152 aoe


any idea? Thanks.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060305/e0d9c598/attachment.htm>

From tekion at gmail.com  Sun Mar  5 16:12:11 2006
From: tekion at gmail.com (Screaming Eagle)
Date: Sun, 5 Mar 2006 11:12:11 -0500
Subject: [Linux-cluster] limits on fs size for GFS ...
Message-ID: <ee9c961f0603050812o2149341ev3e932888521be94d@mail.gmail.com>

Does any one know what is the limit of file system size on GFS, in
particular LVM + GFS.
Thanks.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060305/5839b846/attachment.htm>

From s.bridgwater at sinergy.it  Mon Mar  6 09:24:40 2006
From: s.bridgwater at sinergy.it (Simon Bridgwater)
Date: Mon, 06 Mar 2006 10:24:40 +0100
Subject: [Linux-cluster] vsftpd clusterized in virtual IP + iscsi initiator
	question
Message-ID: <3d16136b7b5caf4d9be1d00bd1a36bc0@sinergy.it>

Hi
I am new to the red hat cluster suite and have recently set up a two
node HA cluster (Red Hat Cluster Suite 4) with iscsi inititiatior
software (iscsid) and a netapp as the iscsi storage target.
I have a problem with clusterizing vsftpd on a virtual IP that belongs
to the cluster (not the real node's IP) . If I bind vsftpd standalone on
the real IP of the node it works fine (listen_address ip_real). If I
bind vsftpd standalone on a virtual IP defined as a cluster resource it
starts to give me problems. Ftp clients (even from the localhost)
connect but after a few "ls" commands it starts to get a "passive mode
refused". Has anybody any suggestions ?
Also I have the following question. I have two network cards in each
node in an active-failover ethernet channel bond. The two cards are
connected to two seperate switches for fault tollerance.  If RHCS 4
doesn't use a quorum partition and the heartbeat is exclusively via
network, how can the cluster tell when the iscsi initiator connection is
down (while the network is up) and failover to the other node ?

Simon Bridgwater


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060306/8b3c6d0a/attachment.htm>

From jan.kudjak at snt.sk  Sun Mar  5 21:47:40 2006
From: jan.kudjak at snt.sk (Kudjak Jan)
Date: Sun, 5 Mar 2006 22:47:40 +0100
Subject: [Linux-cluster] 4 node gfs cluster, quorum needs 3
Message-ID: <139607FAAB7E0F46AD1A9BA381EC815A012C2516@KLEIO.snt.sk>


Hello,

I have at this time 4 node gfs cluster using RLM.
Two nodes (node1, node2) have mounted gfs filesystem and other two (node3, node4)
are working as loadballancers and as redundant lock servers (no gfs fs mounted on node3 or node4).
(i am using GFS-6.0.2.20-2, GFS-modules-smp-6.0.2.20-2, kernel-smp-2.4.21-32.0.1.EL)

So when all nodes are up there is:

quorum_has = 4
quorum_needs = 3

I tried to stop lock_gulm on node3 and node4.
Although the cluster was in state

quorum_has = 2
quorum_needs = 3

the gfs filesystem on node1 or node2 still
remained read/write accessible. 
Is this behaviour correct ?

----

nodes   quorum_needs	quorum_has	filesystem

3		>=2		2		r/w

4		>=3		2		r/w ?????

5		>=3		3		r/w


Can anybody help me out to correct or even extend the table above?
Where is the truth ? :)

Thanks a lot for your answers.

--
J?n Kudj?k
UNIX/Linux Consultant


From saju8 at rediffmail.com  Mon Mar  6 06:47:49 2006
From: saju8 at rediffmail.com (saju  john)
Date: 6 Mar 2006 06:47:49 -0000
Subject: [Linux-cluster] Cluster service restarting Locally
Message-ID: <20060306064749.22036.qmail@webmail50.rediffmail.com>

  
Dear All,

I have a 2 node cluster with RHAS3 update 3.
Kernel : 2.4.21-20.Elsmp
Clumanager : clumanager-1.2.16-1

For more than a year everyting had been fine. Suddenly it started showing the follwing and restarted the service locally

clusvcmgrd[1388]: <err> Unable to obtain cluster lock: Connection timed out
clulockd[1378]: <warning> Denied A.B.C.D: Broken pipe
clulockd[1378]: <err> select error: Broken pipe
clusvcmgrd: [1625]: <notice> service notice: Stopping service postgresql ...
clusvcmgrd: [1625]: <notice> service notice: Running user script '/etc/init.d/postgresql stop'
clusvcmgrd: [1625]: <notice> service notice: Stopped service postgresql
clusvcmgrd: [1625]: <notice> service notice: Starting service postgresql ...
clusvcmgrd: [1625]: <notice> service notice: Running user script '/etc/init.d/postgresql start'
clusvcmgrd: [1625]: <notice> service notice: Started service postgresql ...

I saw the same problem already reported by Mr. Anu Matthew 

Is there any solution to this reported problem

Thanks in advance

Saju John
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060306/19c14bcb/attachment.htm>

From kabobofpug at yahoo.com  Mon Mar  6 06:22:13 2006
From: kabobofpug at yahoo.com (paul raymond)
Date: Sun, 5 Mar 2006 22:22:13 -0800 (PST)
Subject: [Linux-cluster] Trouble with RHCS 3.0
Message-ID: <20060306062213.85577.qmail@web36107.mail.mud.yahoo.com>

Greetings Lon,
  I am trying to setup a simple two cluster system using RedHat Cluster Manager!
  The problem is that I can not get Quorum to start unless I run the  command "cluforce"! But after viewing clustat commands on systems c11  and c12, it looks like c11 and c12 cant see each other status due to  some issue with the raw partitions I believe? I am using a Mylex Fiber  Channel Box with QLogic 2300  interface card! The raw devices are  setup on a 2 mirror drives, Raid 1. Can you please shed any good ideas  what might be wrong here? The vidals are below!
  Thanks!
  Paul  R
  Linux System Admin
  
  c11*****************************************
  [root at c11 root] # clustat -i 5
  Cluster Status -  TESTVNSHA                                              21:42:16
  Cluster Quorum Incarnation #2
  Shared State: Shared Raw Device Driver v1.2
  
    Member             Status
    ------------------ ----------
     c11                 Active     <-- You are here
    c12                Inactive
  
    Service         Status   Owner (Last)     Last Transition  Chk Restarts
    -------------- -------- ---------------- --------------- --- --------
     vns             started   c12               19:25:12 Mar 05   2         0
  [root at c11 root]# shutil -p /cluster/header
  /cluster/header is 144 bytes long
  SharedStateHeader {
          ss_magic = 0x39119fcd
          ss_timestamp = 0x000000004408c23c (14:25:00 Mar 03 2006)
          ss_updateHost = c11.wf.ibm.com
  }
  [root at c11 root]# fdisk -l
  
  Disk /dev/sda: 146.8 GB, 146814976000 bytes
  255 heads, 63 sectors/track, 17849 cylinders
  Units = cylinders of 16065 * 512 = 8225280 bytes
  
     Device Boot     Start       End     Blocks   Id  System
  /dev/sda1   *          1        16     128488+  83  Linux
  /dev/sda2             17       526    4096575   83  Linux
  /dev/sda3            527      1036   4096575    83  Linux
  /dev/sda4           1037     17849 135050422+   f  Win95  Ext'd (LBA)
  /dev/sda5           1037      1546   4096543+   83  Linux
  /dev/sda6           1547      1801   2048256    82  Linux swap
  /dev/sda7           1802      1930   1036161    83  Linux
  /dev/sda8           1931      1993     506016   83  Linux
  
  Disk /dev/sdb: 72.7 GB, 72796340224 bytes
  255 heads, 63 sectors/track, 8850 cylinders
  Units = cylinders of 16065 * 512 = 8225280 bytes
  
     Device Boot     Start       End     Blocks   Id  System
  /dev/sdb1   *          1      8850  71087593+  83   Linux
  
  Disk /dev/sdc: 72.7 GB, 72796340224 bytes
  255 heads, 63 sectors/track, 8850 cylinders
  Units = cylinders of 16065 * 512 = 8225280 bytes
  
     Device Boot     Start       End     Blocks   Id  System
  /dev/sdc1   *          1          5     40131   83  Linux
  /dev/sdc2              6        10      40162+  83  Linux
  
  [root at c11 root]# raw -qa
  /dev/raw/raw1:  bound to major 8, minor 33
  /dev/raw/raw2:  bound to major 8, minor 34
  
  [root at c11 root]# rpm -qa |grep clu
  redhat-config-cluster-1.0.2-2.0
  clumanager-1.2.22-2
  
  [root at c11 root]# uname -r
  2.4.21-32.0.1.ELsmp
  
  [root at c11 root]# lsmod
  Module                   Size  Used by    Not tainted
  soundcore                7012   0  (autoclean)
  ide-cd                  34016   0  (autoclean)
  cdrom                   32864   0  (autoclean) [ide-cd]
  iptable_filter          2412   0  (autoclean) (unused)
  ip_tables               16544   1  [iptable_filter]
  softdog                 2972   1
  lp                       9124   0  (autoclean)
  parport                 38816   0  (autoclean) [lp]
  autofs                  13620   0  (autoclean) (unused)
  tg3                     69768   2
  floppy                  57520   0  (autoclean)
  microcode                6848   0  (autoclean)
  keybdev                  2976   0  (unused)
  mousedev                5624   1
  hid                     22500   0  (unused)
  input                    6144   0  [keybdev mousedev hid]
  usb-ohci               23208   0  (unused)
  usbcore                 81120   1  [hid usb-ohci]
  ext3                    89928   7
  jbd                     55124   7  [ext3]
  qla2300               696284   5
  mptscsih               42384   7
  mptbase                 42816   3  [mptscsih]
  diskdumplib             5228   0  [mptscsih mptbase]
  sd_mod                 14096  24
  scsi_mod               115368   3  [qla2300 mptscsih sd_mod]
  
  c12*****************************************
  [root at c12 root]# clustat -i 5
  Cluster Status -  TESTVNSHA                                              21:56:30
  Cluster Quorum Incarnation #4
  Shared State: Shared Raw Device Driver v1.2
  
    Member             Status
    ------------------ ----------
    c11                Inactive
     c12                 Active     <-- You are here
  
    Service         Status   Owner (Last)     Last Transition  Chk Restarts
    -------------- -------- ---------------- --------------- --- --------
     vns             started   c12               19:25:12 Mar 05   2         0
  [root at c12 root]#  shutil -p /cluster/header
  /cluster/header is 144 bytes long
  SharedStateHeader {
          ss_magic = 0x39119fcd
          ss_timestamp = 0x000000004408c23c (14:25:00 Mar 03 2006)
          ss_updateHost = c11.wf.ibm.com
  }
  [root at c12 root]#  shutil -p /cluster/header
  /cluster/header is 144 bytes long
  SharedStateHeader {
          ss_magic = 0x39119fcd
          ss_timestamp = 0x000000004408c23c (14:25:00 Mar 03 2006)
          ss_updateHost = c11.wf.ibm.com
  }
  [root at c12 root]#  fdisk -l
  
  Disk /dev/sda: 146.8 GB, 146814976000 bytes
  255 heads, 63 sectors/track, 17849 cylinders
  Units = cylinders of 16065 * 512 = 8225280 bytes
  
     Device Boot     Start       End     Blocks   Id  System
  /dev/sda1   *          1        16     128488+  83  Linux
  /dev/sda2             17       526    4096575   83  Linux
  /dev/sda3            527      1036   4096575    83  Linux
  /dev/sda4           1037     17849 135050422+   f  Win95  Ext'd (LBA)
  /dev/sda5           1037      1546   4096543+   83  Linux
  /dev/sda6           1547      1801   2048256    82  Linux swap
  /dev/sda7           1802      1930   1036161    83  Linux
  /dev/sda8           1931      1993     506016   83  Linux
  
  Disk /dev/sdb: 72.7 GB, 72796340224 bytes
  255 heads, 63 sectors/track, 8850 cylinders
  Units = cylinders of 16065 * 512 = 8225280 bytes
  
     Device Boot     Start       End     Blocks   Id  System
  /dev/sdb1   *          1      8850  71087593+  83   Linux
  
  Disk /dev/sdc: 72.7 GB, 72796340224 bytes
  255 heads, 63 sectors/track, 8850 cylinders
  Units = cylinders of 16065 * 512 = 8225280 bytes
  
     Device Boot     Start       End     Blocks   Id  System
  /dev/sdc1   *          1          5     40131   83  Linux
  /dev/sdc2              6        10      40162+  83  Linux
  
   [root at c12 root]#  raw -qa
  /dev/raw/raw1:  bound to major 8, minor 33
  /dev/raw/raw2:  bound to major 8, minor 34
  
  [root at c12 root]# rpm -qa |grep clu
  redhat-config-cluster-1.0.2-2.0
  clumanager-1.2.22-2
  
  [root at c12 root]#  uname -r
  2.4.21-32.0.1.ELsmp
  
  [root at c12 root]# lsmod
  Module                   Size  Used by    Not tainted
  soundcore                7012   0  (autoclean)
  ide-cd                  34016   0  (autoclean)
  cdrom                   32864   0  (autoclean) [ide-cd]
  iptable_filter          2412   0  (autoclean) (unused)
  ip_tables               16544   1  [iptable_filter]
  softdog                 2972   1
  lp                       9124   0  (autoclean)
  parport                 38816   0  (autoclean) [lp]
  autofs                  13620   0  (autoclean) (unused)
  tg3                     69768   2
  floppy                  57520   0  (autoclean)
  microcode                6848   0  (autoclean)
  keybdev                  2976   0  (unused)
  mousedev                5624   1
  hid                     22500   0  (unused)
  input                    6144   0  [keybdev mousedev hid]
  usb-ohci               23208   0  (unused)
  usbcore                 81120   1  [hid usb-ohci]
  ext3                    89928   7
  jbd                     55124   7  [ext3]
  qla2300               696284   5
  mptscsih               42384   7
  mptbase                 42816   3  [mptscsih]
  diskdumplib             5228   0  [mptscsih mptbase]
  sd_mod                 14096  24
  scsi_mod               115368   3  [qla2300 mptscsih sd_mod]
  
  
---------------------------------
 Yahoo! Mail
 Use Photomail to share photos without annoying attachments.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060305/9b76483a/attachment.htm>

From basv at sara.nl  Mon Mar  6 19:59:26 2006
From: basv at sara.nl (Bas van der Vlies)
Date: Mon, 6 Mar 2006 20:59:26 +0100
Subject: [Linux-cluster] (no subject)
Message-ID: <05AF3B6E-B88A-45CC-83E4-1353291E6347@sara.nl>

Out setup is:
  * We are using GFS from cvs stable branch  on our 2.6.14.7  
cluster.  Just updated today to the
    newest CVS version. Only had to change the mutex() calls.
* The 4 nodes are running debian sarge;
* The 4 nodes act as NFS-servers for +/- 640  client-nodes
* brocade switch with SGI TP9300 4 controllers (15 TB)

We did a lot of testing an we could not crash the cluster, bonnie/ 
iozone and other tools/jobs. Now the cluster is in production we
get a lot of nfsd crashed with EIP is at fda_create. We had it with  
our previous kernel 2.16.4.4 and with this one and "latest"
CVS stable version. The server still runs ++ the load is high and it  
does not respond any more. If we are luckly only one NFS
thread is gone and rest is still up. The rest of the nodes still work.

Have users experienced this kind of problems and maybe have a  
solution for this problem?


Regards,


Here is a oops message:
Unable to handle kernel NULL pointer dereference at virtual address  
00000038
printing eip:
f89bf999
*pde = 37bff001
*pte = 00000000
Oops: 0000 [#1]
SMP
Modules linked in: lock_dlm dlm cman dm_round_robin dm_multipath sg  
ide_floppy ide_cd cdrom qla2300 qla2xxx_conf qla2xxx firmware_class  
siimage piix e1000 gfs lock_harness dm_mod
CPU:    0
EIP:    0060:[<f89bf999>]    Tainted: GF     VLI
EFLAGS: 00010246   (2.6.14.7-sara1)
EIP is at gfs_create+0xa9/0x1e0 [gfs]
eax: ffffffef   ebx: ffffffef   ecx: 00000001   edx: 00000000
esi: f296e24c   edi: ebf01e18   ebp: ebf01e84   esp: ebf01df8
ds: 007b   es: 007b   ss: 0068
Process nfsd (pid: 16924, threadinfo=ebf00000 task=ebe84540)
Stack: ebf01e48 f296e24c 00000001 00008180 ebf01e18 00000001 f8cb9000  
dd042254
        ebf01e18 ebf01e18 00000000 ebe84540 00000001 00000120  
00000000 000000c2
        00000000 00000001 ebf01e40 ebf01e40 ebf01e48 ebf01e48  
df0bd858 ebe84540
Call Trace:
[<c0103e5f>] show_stack+0x7f/0xa0
[<c0104012>] show_registers+0x162/0x1d0
[<c0104224>] die+0xf4/0x180
[<c035f697>] do_page_fault+0x2e7/0x6b2
[<c0103b03>] error_code+0x4f/0x54
[<c016b663>] vfs_create+0x83/0xf0
[<c01b81ce>] nfsd_create_v3+0x40e/0x550
[<c01bed2d>] nfsd3_proc_create+0x11d/0x180
[<c01b2f87>] nfsd_dispatch+0xd7/0x200
[<c0353a96>] svc_process+0x536/0x670
[<c01b2d1d>] nfsd+0x1bd/0x350
[<c010127d>] kernel_thread_helper+0x5/0x18
Code: 24 08 8d 45 c4 89 54 24 0c 89 74 24 04 89 04 24 e8 1d c3 fe ff  
85 c0 89 c3 0f 84 2e 01 00 00 83 f8 ef 0f 85 13 01 00 00 8b 55 14  
<80> 7a 38 00 0f 88 06 01 00 00 89 7c 24 0c 31 c0 8d 55 c4 89 44


--
Bas van der Vlies
basv at sara.nl


From basv at sara.nl  Mon Mar  6 20:10:12 2006
From: basv at sara.nl (Bas van der Vlies)
Date: Mon, 6 Mar 2006 21:10:12 +0100
Subject: [Linux-cluster] gfs + nfsd crash
In-Reply-To: <05AF3B6E-B88A-45CC-83E4-1353291E6347@sara.nl>
References: <05AF3B6E-B88A-45CC-83E4-1353291E6347@sara.nl>
Message-ID: <C273A968-DE83-451D-93E4-589F137E5F27@sara.nl>

Sorry no subject ;-(

On Mar 6, 2006, at 8:59 PM, Bas van der Vlies wrote:

> Out setup is:
>  * We are using GFS from cvs stable branch  on our 2.6.14.7  
> cluster.  Just updated today to the
>    newest CVS version. Only had to change the mutex() calls.
> * The 4 nodes are running debian sarge;
> * The 4 nodes act as NFS-servers for +/- 640  client-nodes
> * brocade switch with SGI TP9300 4 controllers (15 TB)
>
> We did a lot of testing an we could not crash the cluster, bonnie/ 
> iozone and other tools/jobs. Now the cluster is in production we
> get a lot of nfsd crashed with EIP is at fda_create. We had it with  
> our previous kernel 2.16.4.4 and with this one and "latest"
> CVS stable version. The server still runs ++ the load is high and  
> it does not respond any more. If we are luckly only one NFS
> thread is gone and rest is still up. The rest of the nodes still work.
>
> Have users experienced this kind of problems and maybe have a  
> solution for this problem?
>
>
> Regards,
>
>
> Here is a oops message:
> Unable to handle kernel NULL pointer dereference at virtual address  
> 00000038
> printing eip:
> f89bf999
> *pde = 37bff001
> *pte = 00000000
> Oops: 0000 [#1]
> SMP
> Modules linked in: lock_dlm dlm cman dm_round_robin dm_multipath sg  
> ide_floppy ide_cd cdrom qla2300 qla2xxx_conf qla2xxx firmware_class  
> siimage piix e1000 gfs lock_harness dm_mod
> CPU:    0
> EIP:    0060:[<f89bf999>]    Tainted: GF     VLI
> EFLAGS: 00010246   (2.6.14.7-sara1)
> EIP is at gfs_create+0xa9/0x1e0 [gfs]
> eax: ffffffef   ebx: ffffffef   ecx: 00000001   edx: 00000000
> esi: f296e24c   edi: ebf01e18   ebp: ebf01e84   esp: ebf01df8
> ds: 007b   es: 007b   ss: 0068
> Process nfsd (pid: 16924, threadinfo=ebf00000 task=ebe84540)
> Stack: ebf01e48 f296e24c 00000001 00008180 ebf01e18 00000001  
> f8cb9000 dd042254
>        ebf01e18 ebf01e18 00000000 ebe84540 00000001 00000120  
> 00000000 000000c2
>        00000000 00000001 ebf01e40 ebf01e40 ebf01e48 ebf01e48  
> df0bd858 ebe84540
> Call Trace:
> [<c0103e5f>] show_stack+0x7f/0xa0
> [<c0104012>] show_registers+0x162/0x1d0
> [<c0104224>] die+0xf4/0x180
> [<c035f697>] do_page_fault+0x2e7/0x6b2
> [<c0103b03>] error_code+0x4f/0x54
> [<c016b663>] vfs_create+0x83/0xf0
> [<c01b81ce>] nfsd_create_v3+0x40e/0x550
> [<c01bed2d>] nfsd3_proc_create+0x11d/0x180
> [<c01b2f87>] nfsd_dispatch+0xd7/0x200
> [<c0353a96>] svc_process+0x536/0x670
> [<c01b2d1d>] nfsd+0x1bd/0x350
> [<c010127d>] kernel_thread_helper+0x5/0x18
> Code: 24 08 8d 45 c4 89 54 24 0c 89 74 24 04 89 04 24 e8 1d c3 fe  
> ff 85 c0 89 c3 0f 84 2e 01 00 00 83 f8 ef 0f 85 13 01 00 00 8b 55  
> 14 <80> 7a 38 00 0f 88 06 01 00 00 89 7c 24 0c 31 c0 8d 55 c4 89 44
>
>
>
>
>
>
> --
> Bas van der Vlies
> basv at sara.nl
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

--
Bas van der Vlies
basv at sara.nl


From hong.zheng at wsdtx.org  Mon Mar  6 20:02:52 2006
From: hong.zheng at wsdtx.org (Hong Zheng)
Date: Mon, 6 Mar 2006 14:02:52 -0600
Subject: [Linux-cluster] Cluster service restarting Locally
Message-ID: <E2FAC907AF594A43B22185B5253DFCA0F34D@mx1.wsd.edu>

I'm having the same problem. My system configuration is as follows:

2-node cluster: RH ES3, GFS6.0, clumanager-1.2.28-1 and
redhat-config-cluster-1.0.8-1

Kernel: 2.4.21-37.EL

Linux-iscsi-3.6.3 initiator: connections to iSCSI shared storage server

 
I just noticed that whenever I have a heavy IO access this problem
happens.

 
Any suggestion I would really appreciate.

 
Hong

________________________________

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of saju john
Sent: Monday, March 06, 2006 12:48 AM
To: linux-cluster at redhat.com
Subject: [Linux-cluster] Cluster service restarting Locally

 
Dear All,

I have a 2 node cluster with RHAS3 update 3.
Kernel : 2.4.21-20.Elsmp
Clumanager : clumanager-1.2.16-1

For more than a year everyting had been fine. Suddenly it started
showing the follwing and restarted the service locally

clusvcmgrd[1388]: <err> Unable to obtain cluster lock: Connection timed
out
clulockd[1378]: <warning> Denied A.B.C.D: Broken pipe
clulockd[1378]: <err> select error: Broken pipe
clusvcmgrd: [1625]: <notice> service notice: Stopping service postgresql
...
clusvcmgrd: [1625]: <notice> service notice: Running user script
'/etc/init.d/postgresql stop'
clusvcmgrd: [1625]: <notice> service notice: Stopped service postgresql
clusvcmgrd: [1625]: <notice> service notice: Starting service postgresql
...
clusvcmgrd: [1625]: <notice> service notice: Running user script
'/etc/init.d/postgresql start'
clusvcmgrd: [1625]: <notice> service notice: Started service postgresql
...

I saw the same problem already reported by Mr. Anu Matthew 

Is there any solution to this reported problem

Thanks in advance

Saju John


<http://adworks.rediff.com/cgi-bin/AdWorks/sigclick.cgi/www.rediff.com/s
ignature-home.htm/1507191490 at Middle5?PARTNER=3> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060306/4ee7d937/attachment.htm>

From jan.kudjak at snt.sk  Tue Mar  7 09:15:50 2006
From: jan.kudjak at snt.sk (Kudjak Jan)
Date: Tue, 7 Mar 2006 10:15:50 +0100
Subject: [Linux-cluster] 4 node gfs cluster, quorum needs 3
Message-ID: <139607FAAB7E0F46AD1A9BA381EC815A012C2668@KLEIO.snt.sk>


Hello,

I have at this time 4 node gfs cluster using RLM.
Two nodes (node1, node2) have mounted gfs filesystem and other two (node3, node4) are working as loadballancers and as redundant lock servers (no gfs fs mounted on node3 or node4).
(i am using GFS-6.0.2.20-2, GFS-modules-smp-6.0.2.20-2, kernel-smp-2.4.21-32.0.1.EL)

So when all nodes are up there is:

quorum_has = 4
quorum_needs = 3

I tried to stop lock_gulm on node3 and node4.
Although the cluster was in state

quorum_has = 2
quorum_needs = 3

the gfs filesystem on node1 or node2 still remained read/write accessible. 
Is this behaviour correct ?

----

nodes   quorum_needs	quorum_has	filesystem

3		>=2		2		r/w

4		>=3		2		r/w ?????

5		>=3		3		r/w


Can anybody help me out to correct or even extend the table above?
Where is the truth ? :) or have I misunderstood something ?

Thanks a lot for your answers.

--
J?n Kudj?k
UNIX/Linux Consultant


From sebastien.didier at gmail.com  Tue Mar  7 10:50:25 2006
From: sebastien.didier at gmail.com (=?ISO-8859-1?Q?S=E9bastien_DIDIER?=)
Date: Tue, 7 Mar 2006 11:50:25 +0100
Subject: [Linux-cluster] Httpd Process io blocked
Message-ID: <e5c14df60603070250v6063dccj@mail.gmail.com>

Hi,

I'm running a two-nodes GFS cluster which hosts web sites. The GFS
partition is over a Iscsi device and by now, i'm using manual fencing.

Today, I got 5 httpd process on both nodes which got stuck in IO
blocking state. I suspected a GFS filesystem corruption but I haven't
got any output from the kernel. I ran a fsck two days ago after a
power chute.

Here's the wait state of the process. (idem for the other node)

# ps -o pid,tt,user,fname,wchan -C apache
  PID TT       USER     COMMAND  WCHAN
 4426 ?        root     apache   -
14970 ?        www-data apache   glock_wait_internal
15103 ?        www-data apache   glock_wait_internal
16780 ?        www-data apache   glock_wait_internal
16959 ?        www-data apache   glock_wait_internal
14936 ?        www-data apache   finish_stop
12859 ?        www-data apache   -
13005 ?        www-data apache   -
13311 ?        www-data apache   semtimedop
13390 ?        www-data apache   semtimedop

How can I debug further this problem ? And how can I bring back home
my httpd processes without a reboot ?

Many thanks for your help.

Regards,
S?bastien DIDIER


From grimme at atix.de  Tue Mar  7 11:12:13 2006
From: grimme at atix.de (Marc Grimme)
Date: Tue, 7 Mar 2006 12:12:13 +0100
Subject: [Linux-cluster] Httpd Process io blocked
In-Reply-To: <e5c14df60603070250v6063dccj@mail.gmail.com>
References: <e5c14df60603070250v6063dccj@mail.gmail.com>
Message-ID: <200603071212.13600.grimme@atix.de>

Hi,
to debug you could use strace. E.g. executing strace -p 14970 will probably 
show you that the process is waiting for a lock. As the ps already does. My 
first guess would be, that you use apache with php and sessions. 

If so, the phplib uses flocks for locking the session-ids. Normally it happens 
that one process locks a session. If another process comes along to get an 
flock on that session it has to wait until the further flock is closed. It 
very often happens that the other process gets that flock when the client and 
session are not available any more. Then the flock is held until the apache 
process timesout. 

We have made a patch for a better locking with php which you can find on 
http:/www.open-sharedroot.org in the downloads section.
Hope that helps
Regards Marc.

On Tuesday 07 March 2006 11:50, S?bastien DIDIER wrote:
> Hi,
>
> I'm running a two-nodes GFS cluster which hosts web sites. The GFS
> partition is over a Iscsi device and by now, i'm using manual fencing.
>
> Today, I got 5 httpd process on both nodes which got stuck in IO
> blocking state. I suspected a GFS filesystem corruption but I haven't
> got any output from the kernel. I ran a fsck two days ago after a
> power chute.
>
> Here's the wait state of the process. (idem for the other node)
>
> # ps -o pid,tt,user,fname,wchan -C apache
>   PID TT       USER     COMMAND  WCHAN
>  4426 ?        root     apache   -
> 14970 ?        www-data apache   glock_wait_internal
> 15103 ?        www-data apache   glock_wait_internal
> 16780 ?        www-data apache   glock_wait_internal
> 16959 ?        www-data apache   glock_wait_internal
> 14936 ?        www-data apache   finish_stop
> 12859 ?        www-data apache   -
> 13005 ?        www-data apache   -
> 13311 ?        www-data apache   semtimedop
> 13390 ?        www-data apache   semtimedop
>
> How can I debug further this problem ? And how can I bring back home
> my httpd processes without a reboot ?
>
> Many thanks for your help.
>
> Regards,
> S?bastien DIDIER
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
Gruss / Regards,

Marc Grimme
Phone: +49-89 121 409-54
http://www.atix.de/               http://www.open-sharedroot.org/

**
ATIX - Ges. fuer Informationstechnologie und Consulting mbH
Einsteinstr. 10 - 85716 Unterschleissheim - Germany


From sebastien.didier at gmail.com  Tue Mar  7 11:35:09 2006
From: sebastien.didier at gmail.com (=?ISO-8859-1?Q?S=E9bastien_DIDIER?=)
Date: Tue, 7 Mar 2006 12:35:09 +0100
Subject: [Linux-cluster] Httpd Process io blocked
In-Reply-To: <200603071212.13600.grimme@atix.de>
References: <e5c14df60603070250v6063dccj@mail.gmail.com>
	<200603071212.13600.grimme@atix.de>
Message-ID: <e5c14df60603070335r6d723375v@mail.gmail.com>

2006/3/7, Marc Grimme <grimme at atix.de>:
> Hi,
> to debug you could use strace. E.g. executing strace -p 14970 will probably
> show you that the process is waiting for a lock. As the ps already does. My
> first guess would be, that you use apache with php and sessions.

Thanks. But strace doesnt output anything and became Ctrl-C imune. It
needs a sigkill to exit and the traced process stays in T state. I
seems that it doesnt manage to get last system call where the process
is in D state.

>
> If so, the phplib uses flocks for locking the session-ids. Normally it happens
> that one process locks a session. If another process comes along to get an
> flock on that session it has to wait until the further flock is closed. It
> very often happens that the other process gets that flock when the client and
> session are not available any more. Then the flock is held until the apache
> process timesout.
>

I don't think it is session related because I store sessions file
outside the GFS mount point (/tmp) and I run a load balancer based
upon the source adress (to always send requests to the same server and
then keep sessions)

But, we are using mysql query caching (with some libraries like AdoDb)
inside the GFS mount point. Do you think it could be the cache files
which are dead-locked ?

> We have made a patch for a better locking with php which you can find on
> http:/www.open-sharedroot.org in the downloads section.
> Hope that helps
> Regards Marc.
>
> On Tuesday 07 March 2006 11:50, S?bastien DIDIER wrote:
> > Hi,
> >
> > I'm running a two-nodes GFS cluster which hosts web sites. The GFS
> > partition is over a Iscsi device and by now, i'm using manual fencing.
> >
> > Today, I got 5 httpd process on both nodes which got stuck in IO
> > blocking state. I suspected a GFS filesystem corruption but I haven't
> > got any output from the kernel. I ran a fsck two days ago after a
> > power chute.
> >
> > Here's the wait state of the process. (idem for the other node)
> >
> > # ps -o pid,tt,user,fname,wchan -C apache
> >   PID TT       USER     COMMAND  WCHAN
> >  4426 ?        root     apache   -
> > 14970 ?        www-data apache   glock_wait_internal
> > 15103 ?        www-data apache   glock_wait_internal
> > 16780 ?        www-data apache   glock_wait_internal
> > 16959 ?        www-data apache   glock_wait_internal
> > 14936 ?        www-data apache   finish_stop
> > 12859 ?        www-data apache   -
> > 13005 ?        www-data apache   -
> > 13311 ?        www-data apache   semtimedop
> > 13390 ?        www-data apache   semtimedop
> >
> > How can I debug further this problem ? And how can I bring back home
> > my httpd processes without a reboot ?
> >
> > Many thanks for your help.
> >
> > Regards,
> > S?bastien DIDIER
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Gruss / Regards,
>
> Marc Grimme
> Phone: +49-89 121 409-54
> http://www.atix.de/               http://www.open-sharedroot.org/
>
> **
> ATIX - Ges. fuer Informationstechnologie und Consulting mbH
> Einsteinstr. 10 - 85716 Unterschleissheim - Germany
>
>


From grimme at atix.de  Tue Mar  7 12:43:20 2006
From: grimme at atix.de (Marc Grimme)
Date: Tue, 7 Mar 2006 13:43:20 +0100
Subject: [Linux-cluster] Httpd Process io blocked
In-Reply-To: <e5c14df60603070335r6d723375v@mail.gmail.com>
References: <e5c14df60603070250v6063dccj@mail.gmail.com>
	<200603071212.13600.grimme@atix.de>
	<e5c14df60603070335r6d723375v@mail.gmail.com>
Message-ID: <200603071343.21212.grimme@atix.de>

Sebastien,
On Tuesday 07 March 2006 12:35, S?bastien DIDIER wrote:
> 2006/3/7, Marc Grimme <grimme at atix.de>:
> > Hi,
> > to debug you could use strace. E.g. executing strace -p 14970 will
> > probably show you that the process is waiting for a lock. As the ps
> > already does. My first guess would be, that you use apache with php and
> > sessions.
>
> Thanks. But strace doesnt output anything and became Ctrl-C imune. It
> needs a sigkill to exit and the traced process stays in T state. I
> seems that it doesnt manage to get last system call where the process
> is in D state.
Hmm, sounds like I've heard that already. If you trace the root httpd with -f 
and -t and lookout for great timeslices you'll propably find processes 
waiting for locks. The D state is a good indicator (ps ax | grep " D " and 
look at the pids). Do the pids of the D processes change from time to time or 
do they stay the same pids? 
>
> > If so, the phplib uses flocks for locking the session-ids. Normally it
> > happens that one process locks a session. If another process comes along
> > to get an flock on that session it has to wait until the further flock is
> > closed. It very often happens that the other process gets that flock when
> > the client and session are not available any more. Then the flock is held
> > until the apache process timesout.
>
> I don't think it is session related because I store sessions file
> outside the GFS mount point (/tmp) and I run a load balancer based
> upon the source adress (to always send requests to the same server and
> then keep sessions)
Yes, I agree. Sessions get lost if the the node fails, right?
>
> But, we are using mysql query caching (with some libraries like AdoDb)
> inside the GFS mount point. Do you think it could be the cache files
> which are dead-locked ?
It depends on how those files are locked and how and when the locks are set 
and released. If a lock is set at apache-child forktime and released at 
process terminate time, then yes that could happen. If only accesses to data 
of those files are protected with flocks then it should perform quite well.

Is that query caching part of perl-adodb or is it implemented by yourselves?

Have a look and play with strace and watch out for great times and the 
syscalls concerned with that. I would expect you ending up with 
flock-timeouts.

Hope that helps,
regards Marc.
>
> > We have made a patch for a better locking with php which you can find on
> > http:/www.open-sharedroot.org in the downloads section.
> > Hope that helps
> > Regards Marc.
> >
> > On Tuesday 07 March 2006 11:50, S?bastien DIDIER wrote:
> > > Hi,
> > >
> > > I'm running a two-nodes GFS cluster which hosts web sites. The GFS
> > > partition is over a Iscsi device and by now, i'm using manual fencing.
> > >
> > > Today, I got 5 httpd process on both nodes which got stuck in IO
> > > blocking state. I suspected a GFS filesystem corruption but I haven't
> > > got any output from the kernel. I ran a fsck two days ago after a
> > > power chute.
> > >
> > > Here's the wait state of the process. (idem for the other node)
> > >
> > > # ps -o pid,tt,user,fname,wchan -C apache
> > >   PID TT       USER     COMMAND  WCHAN
> > >  4426 ?        root     apache   -
> > > 14970 ?        www-data apache   glock_wait_internal
> > > 15103 ?        www-data apache   glock_wait_internal
> > > 16780 ?        www-data apache   glock_wait_internal
> > > 16959 ?        www-data apache   glock_wait_internal
> > > 14936 ?        www-data apache   finish_stop
> > > 12859 ?        www-data apache   -
> > > 13005 ?        www-data apache   -
> > > 13311 ?        www-data apache   semtimedop
> > > 13390 ?        www-data apache   semtimedop
> > >
> > > How can I debug further this problem ? And how can I bring back home
> > > my httpd processes without a reboot ?
> > >
> > > Many thanks for your help.
> > >
> > > Regards,
> > > S?bastien DIDIER
> > >
> > > --
> > > Linux-cluster mailing list
> > > Linux-cluster at redhat.com
> > > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
> > --
> > Gruss / Regards,
> >
> > Marc Grimme
> > Phone: +49-89 121 409-54
> > http://www.atix.de/               http://www.open-sharedroot.org/
> >
> > **
> > ATIX - Ges. fuer Informationstechnologie und Consulting mbH
> > Einsteinstr. 10 - 85716 Unterschleissheim - Germany
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
Gruss / Regards,

Marc Grimme
Phone: +49-89 121 409-54
http://www.atix.de/               http://www.open-sharedroot.org/

**
ATIX - Ges. fuer Informationstechnologie und Consulting mbH
Einsteinstr. 10 - 85716 Unterschleissheim - Germany


From sebastien.didier at gmail.com  Tue Mar  7 14:12:36 2006
From: sebastien.didier at gmail.com (=?ISO-8859-1?Q?S=E9bastien_DIDIER?=)
Date: Tue, 7 Mar 2006 15:12:36 +0100
Subject: [Linux-cluster] Httpd Process io blocked
In-Reply-To: <200603071343.21212.grimme@atix.de>
References: <e5c14df60603070250v6063dccj@mail.gmail.com>
	<200603071212.13600.grimme@atix.de>
	<e5c14df60603070335r6d723375v@mail.gmail.com>
	<200603071343.21212.grimme@atix.de>
Message-ID: <e5c14df60603070612s70b36f61v@mail.gmail.com>

2006/3/7, Marc Grimme <grimme at atix.de>:
> Sebastien,
> On Tuesday 07 March 2006 12:35, S?bastien DIDIER wrote:
> > 2006/3/7, Marc Grimme <grimme at atix.de>:
> > > Hi,
> > > to debug you could use strace. E.g. executing strace -p 14970 will
> > > probably show you that the process is waiting for a lock. As the ps
> > > already does. My first guess would be, that you use apache with php and
> > > sessions.
> >
> > Thanks. But strace doesnt output anything and became Ctrl-C imune. It
> > needs a sigkill to exit and the traced process stays in T state. I
> > seems that it doesnt manage to get last system call where the process
> > is in D state.
> Hmm, sounds like I've heard that already. If you trace the root httpd with -f
> and -t and lookout for great timeslices you'll propably find processes
> waiting for locks. The D state is a good indicator (ps ax | grep " D " and
> look at the pids). Do the pids of the D processes change from time to time or
> do they stay the same pids?

Marc,

All the blocked processes have the same pid since the beginning of
this issue. (22 hours by now)

> >
> > > If so, the phplib uses flocks for locking the session-ids. Normally it
> > > happens that one process locks a session. If another process comes along
> > > to get an flock on that session it has to wait until the further flock is
> > > closed. It very often happens that the other process gets that flock when
> > > the client and session are not available any more. Then the flock is held
> > > until the apache process timesout.
> >
> > I don't think it is session related because I store sessions file
> > outside the GFS mount point (/tmp) and I run a load balancer based
> > upon the source adress (to always send requests to the same server and
> > then keep sessions)
> Yes, I agree. Sessions get lost if the the node fails, right?

Yes. That may be a problem for some apps... But it is easier (and more
efficient) than storing session data into SQL.

> >
> > But, we are using mysql query caching (with some libraries like AdoDb)
> > inside the GFS mount point. Do you think it could be the cache files
> > which are dead-locked ?
> It depends on how those files are locked and how and when the locks are set
> and released. If a lock is set at apache-child forktime and released at
> process terminate time, then yes that could happen. If only accesses to data
> of those files are protected with flocks then it should perform quite well.
>
> Is that query caching part of perl-adodb or is it implemented by yourselves?

It appears that we are using a very common PHP AdoDB abstact class
without any change in the code.

When I run a "lsof -p" on each blocked process on the two nodes, each
one has exactly the same file open :
apache  23327 www-data   10r   REG  253,0      2128  5053927
/home/sites/website/web/queryCache/ca/adodb_cad1702c2e5d18a71d765e95bf55ea3b.cache
(deleted)

>
> Have a look and play with strace and watch out for great times and the
> syscalls concerned with that. I would expect you ending up with
> flock-timeouts.
>
> Hope that helps,
> regards Marc.
> >
> > > We have made a patch for a better locking with php which you can find on
> > > http:/www.open-sharedroot.org in the downloads section.
> > > Hope that helps
> > > Regards Marc.
> > >
> > > On Tuesday 07 March 2006 11:50, S?bastien DIDIER wrote:
> > > > Hi,
> > > >
> > > > I'm running a two-nodes GFS cluster which hosts web sites. The GFS
> > > > partition is over a Iscsi device and by now, i'm using manual fencing.
> > > >
> > > > Today, I got 5 httpd process on both nodes which got stuck in IO
> > > > blocking state. I suspected a GFS filesystem corruption but I haven't
> > > > got any output from the kernel. I ran a fsck two days ago after a
> > > > power chute.
> > > >
> > > > Here's the wait state of the process. (idem for the other node)
> > > >
> > > > # ps -o pid,tt,user,fname,wchan -C apache
> > > >   PID TT       USER     COMMAND  WCHAN
> > > >  4426 ?        root     apache   -
> > > > 14970 ?        www-data apache   glock_wait_internal
> > > > 15103 ?        www-data apache   glock_wait_internal
> > > > 16780 ?        www-data apache   glock_wait_internal
> > > > 16959 ?        www-data apache   glock_wait_internal
> > > > 14936 ?        www-data apache   finish_stop
> > > > 12859 ?        www-data apache   -
> > > > 13005 ?        www-data apache   -
> > > > 13311 ?        www-data apache   semtimedop
> > > > 13390 ?        www-data apache   semtimedop
> > > >
> > > > How can I debug further this problem ? And how can I bring back home
> > > > my httpd processes without a reboot ?
> > > >
> > > > Many thanks for your help.
> > > >
> > > > Regards,
> > > > S?bastien DIDIER
> > > >
> > > > --
> > > > Linux-cluster mailing list
> > > > Linux-cluster at redhat.com
> > > > https://www.redhat.com/mailman/listinfo/linux-cluster
> > >
> > > --
> > > Gruss / Regards,
> > >
> > > Marc Grimme
> > > Phone: +49-89 121 409-54
> > > http://www.atix.de/               http://www.open-sharedroot.org/
> > >
> > > **
> > > ATIX - Ges. fuer Informationstechnologie und Consulting mbH
> > > Einsteinstr. 10 - 85716 Unterschleissheim - Germany
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Gruss / Regards,
>
> Marc Grimme
> Phone: +49-89 121 409-54
> http://www.atix.de/               http://www.open-sharedroot.org/
>
> **
> ATIX - Ges. fuer Informationstechnologie und Consulting mbH
> Einsteinstr. 10 - 85716 Unterschleissheim - Germany
>
>


From cjk at techma.com  Tue Mar  7 15:52:57 2006
From: cjk at techma.com (Kovacs, Corey J.)
Date: Tue, 7 Mar 2006 10:52:57 -0500
Subject: [Linux-cluster] Trouble with RHCS 3.0
Message-ID: <FF2CE0D593AEE34B955FEC77BD5AFBE0079E1D@tmaemail.techma.com>

Do your host files have entries for the node names that are NOT loopback? The
default for RedHat is to have the nodename in the loopback line. Correct that
if it is the case and you might get better results.
 
 
Cheers
 
 
Corey

________________________________

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of paul raymond
Sent: Monday, March 06, 2006 1:22 AM
To: lhh at redhat.com; linux-cluster at redhat.com
Subject: [Linux-cluster] Trouble with RHCS 3.0


Greetings Lon,
I am trying to setup a simple two cluster system using RedHat Cluster
Manager!
The problem is that I can not get Quorum to start unless I run the command
"cluforce"! But after viewing clustat commands on systems c11 and c12, it
looks like c11 and c12 cant see each other status due to some issue with the
raw partitions I believe? I am using a Mylex Fiber Channel Box with QLogic
2300  interface card! The raw devices are setup on a 2 mirror drives, Raid 1.
Can you please shed any good ideas what might be wrong here? The vidals are
below!
Thanks!
Paul  R
Linux System Admin

c11*****************************************
[root at c11 root] # clustat -i 5
Cluster Status - TESTVNSHA
21:42:16
Cluster Quorum Incarnation #2
Shared State: Shared Raw Device Driver v1.2

  Member             Status
  ------------------ ----------
  c11                Active     <-- You are here
  c12                Inactive

  Service        Status   Owner (Last)     Last Transition Chk Restarts
  -------------- -------- ---------! ------- --------------- --- --------
  vns            started  c12              19:25:12 Mar 05   2        0
[root at c11 root]# shutil -p /cluster/header
/cluster/header is 144 bytes long
SharedStateHeader {
        ss_magic = 0x39119fcd
        ss_timestamp = 0x000000004408c23c (14:25:00 Mar 03 2006)
        ss_updateHost = c11.wf.ibm.com
}
[root at c11 root]# fdisk -l

Disk /dev/sda: 146.8 GB, 146814976000 bytes
255 heads, 63 sectors/track, 17849 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

   Device Boot    Start       End    Blocks   ! Id  System
/dev/sda1   *         1        16    128488+  83  Linux
/dev/sda2            17       526   4096575   83  Linux
/dev/sda3           527      1036   4096575   83  Linux
/dev/sda4          1037     17849 135050422+   f  Win95 Ext'd (LBA)
/dev/sda5          1037      1546   4096543+  83  Linux
/dev/sda6          1547      1801   2048256   82  Linux swap /dev/sda7
1802      1930   1036161   83  Linux
/dev/sda8          1931      1993    506016   83  Linux

Disk /dev/sdb: 72.7 GB, 72796340224 bytes
255 heads, 63 sectors/track, 8850 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

   Device Boot    Start       End    Blocks   Id  System
/dev/sdb1   *         1      8850  71087593+  83  Linux

Disk /dev/sdc: 72.7 GB, 72796340224 bytes
255 heads, 63 sectors/track, 8850 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

   Device Boot   ! ; Start       End    Blocks   Id  System
/dev/sdc1   *         1         5     40131   83  Linux
/dev/sdc2             6        10     40162+  83  Linux

[root at c11 root]# raw -qa
/dev/raw/raw1:  bound to major 8, minor 33
/dev/raw/raw2:  bound to major 8, minor 34

[root at c11 root]# rpm -qa |grep clu
redhat-config-cluster-1.0.2-2.0
clumanager-1.2.22-2

[root at c11 root]# uname -r
2.4.21-32.0.1.ELsmp

[root at c11 root]# lsmod
Module                  Size  Used by    Not tainted soundcore
7012   0  (autoclean)
ide-cd                 34016   0  (autoclean)
cdrom                  32864   0  (autoclean) [ide-cd]
iptable_filter          2412   0  (autoclean) (unused)
ip_tables              16544   1  [iptable_filter]
softdog                 2972   1
lp                      9124   0  (autoclean)
parport                38816   0  (autoclean) [lp]
autofs                 13620   0  (autoclean) (unused)
tg3                    69768   2
floppy                 57520   0  (autoclean)
microcode               6848   0  (autoclean)
keybdev                 2976   0  (unused)
mousedev              &nbs! p; 5624   1
hid                    22500   0  (unused)
input                   6144   0  [keybdev mousedev hid]
usb-ohci               23208   0  (unused)
usbcore                81120   1  [hid usb-ohci]
ext3                   89928   7
jbd                    55124   7  [ext3]
qla2300               696284   5
mptscsih               42384   7
mptbase                42816   3  [mptscsih]
diskdumplib             5228   0  [mptscsih mptbase]
sd_mod                 14096  24
scsi_mod              115368   3  [qla2300 mptscsih sd_mod]

c12*****************************************
[root at c12 root]# clustat -i 5
Cluster Status - TESTVNSHA
21:56:30
Cluster Quorum Incarnation #4
Shared State: Shared Raw Device Driver v1.2

  Member             Status
  ------------------ ----------
  c11                Inactive
  c12                Active     <-- You are here

  Service        Status   Owner (Last)     Last Transition Chk Restarts
  -------------- -------- ---------! ------- --------------- --- --------
  vns            started  c12              19:25:12 Mar 05   2        0
[root at c12 root]#  shutil -p /cluster/header
/cluster/header is 144 bytes long
SharedStateHeader {
        ss_magic = 0x39119fcd
        ss_timestamp = 0x000000004408c23c (14:25:00 Mar 03 2006)
        ss_updateHost = c11.wf.ibm.com
}
[root at c12 root]#  shutil -p /cluster/header
/cluster/header is 144 bytes long
SharedStateHeader {
        ss_magic = 0x39119fcd
        ss_timestamp = 0x000000004408c23c (14:25:00 Mar 03 2006)
        ss_updateHost = c11.wf.ibm.com
}
[root at c12 root]#  fdisk -l

Disk /dev/sda: 146.8 GB, 146814976000 bytes
255 heads, 63 sectors/track, 17849 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

   Device Boot    Start       End    Blocks   Id  System
/dev/sda1   *         1        16    128488+  83  Linux
/dev/sda2            17       526   4096575   83  Linux
/dev/sda3           527      1036   4096575   83  Linux
/dev/sda4          1037     17849 135050422+   f  Win95 Ext'd (LBA)
/dev/sda5          1037      1546   4096543+  83  Linux
/dev/sda6          1547      1801   2048256   82  Linux swap
/dev/sda7          1802      1930   1036161   83  Linux
/dev/sda8          1931      1993    506016   83  Linux

Disk /dev/sdb: 72.7 GB, 72796340224 bytes
255 heads, 63 sectors/track, 8850 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

   Device Boot &nbs! p;  Start       End    Blocks   Id  System
/dev/sdb1   *         1      8850  71087593+  83  Linux

Disk /dev/sdc: 72.7 GB, 72796340224 bytes
255 heads, 63 sectors/track, 8850 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

   Device Boot    Start       End    Blocks   Id  System
/dev/sdc1   *         1         5     40131   83  Linux
/dev/sdc2             6        10     40162+  83  Linux

 [root at c12 root]#  raw -qa /dev/raw/raw1:  bound to major 8, minor 33
/dev/raw/raw2:  bound to major 8, minor 34

[root at c12 root]# rpm -qa |grep clu
redhat-config-cluster-1.0.2-2.0
clumanager-1.2.22-2

[root at c12 root]#  uname -r
2.4.21-32.0.1.ELsmp

[root at c12 root]# lsmod
Module                  Size  Used by    Not tainted
soundcore               7012   0  (autoclean)
ide-cd                 34016   0  (autoclean)
cdrom                  32864   0  (autoclean) [ide-cd]
iptable_filter        &! nbsp; 2412   0  (autoclean) (unused)
ip_tables              16544   1  [iptable_filter]
softdog                 2972   1
lp                      9124   0  (autoclean)
parport                38816   0  (autoclean) [lp]
autofs                 13620   0  (autoclean) (unused)
tg3                    69768   2
floppy                 57520   0  (autoclean)
microcode               6848   0  (autoclean)
keybdev                 2976   0  (unused)
mousedev                5624   1
hid                    22500   0  (unused)
input                   6144   0  [keybdev mousedev hid]
usb-ohci               23208  ! 0  (unused)
usbcore                81120   1  [hid usb-ohci]
ext3                   89928   7
jbd                    55124   7  [ext3]
qla2300               696284   5
mptscsih               42384   7
mptbase                42816   3  [mptscsih]
diskdumplib             5228   0  [mptscsih mptbase]
sd_mod                 14096  24
scsi_mod              115368   3  [qla2300 mptscsih sd_mod]


________________________________

Yahoo! Mail
Use Photomail
<http://pa.yahoo.com/*http://us.rd.yahoo.com/evt=38867/*http://photomail.mail
.yahoo.com>  to share photos without annoying attachments.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060307/358a60b2/attachment.htm>

From m.catanese at kinetikon.com  Tue Mar  7 16:04:06 2006
From: m.catanese at kinetikon.com (Matteo Catanese)
Date: Tue, 7 Mar 2006 17:04:06 +0100
Subject: [Linux-cluster] More CS4 fencing fun
Message-ID: <590F0181-7330-408F-B693-E182780DE8A7@kinetikon.com>

Hi, im doing failover tests on a CS4 cluster.

I have 2 HP dl380 + HP msa1000 (aka dl380 packaged cluster).

I already read this post
https://www.redhat.com/archives/linux-cluster/2006-January/msg00195.html

Im clustering a single oracle instance using active/passive. I don't  
use GFS.

I use fence_ilo

I have a fully working clustered oracle, i tried to migrate oracle  
instance from a node to another using system-config-cluster and  
everything works perfectly.

I tried some more "rude" failover tests with this setup:

node1 = active node
node2 = passive node

and those are the results:

Situation 1:

I rudely disconnect the powercable(s) from node1, so that node1 is  
_completely_ turned off, no current flows in it. ILO is down.

I have redundant powerunits but i wanted to simulate short circuit or  
motherboard failure

Node2, using fence, tries to poweroff node1

Fence_ilo tries to connect to node1_ilo_ip_address, but ilo is down  
because of power failure so fencing fails and starts retrying forever.

Result: One node perfectly up but cluster service stalled


Situation2:

I push the on/off button on node1. It  stops in 4 seconds, but power  
is still on, so ILO is up and working.

node2, using fence, tries to poweroff the node1.

ilo is working so fence_ilo correctly connects to  
node1_ilo_ip_address, it tries for some time to poweroff the already  
poweroff'd server, then it finally decides that server  is off.

Oracle is STILL down, no virtual ip, no storage mounted bla bla bla

Now node2 tries to wake up the turned_off_but_still_powered_ node1.

Node1 wakes up, then it does bootstrap (cluster is still stalled)  
then joins fence_domain. Fence on node2 completes succesfully and  
unlocks cluster and everything is up again

Switch time: 55 seconds (+ oracle startup time).


Situation 3:

This is not a real failover test.

Everything is off. I turn on the msa1000 and wait for its bootstrap.  
Then i turn on node1 but i still have node2 electrically disconnected.

Node1 tries to turn on node2 to complete the fence_domain, node2 is  
disconnected from power current so it will never wake up.

Cluster is stalled

Can you change fence behaviour to be less "radical" ?

If ILO is unreachable means that machine is already off and could not  
be powered on so fence shold spit out a warning and let the failover  
happen

If ILO is reachable then check its status to avoid pointless poweroff/ 
poweron

As of today fence is really dangerous in a production environment,  
for now i will turn it off

Matteo


From alfeijoo at cesga.es  Tue Mar  7 18:40:48 2006
From: alfeijoo at cesga.es (Alejandro Feijoo)
Date: Tue, 7 Mar 2006 19:40:48 +0100 (CET)
Subject: [Linux-cluster] cman for CS
Message-ID: <46998.193.144.44.59.1141756848.squirrel@webmail.cesga.es>


hi i have a linux kernel version 2.6.9.22.0.2 (the lastest!) buttt the cman
for dowload is cman-kernel-2.6.9-39.8.src.rpm....

there are any problem if i install that cman ????? and where is rpm for
kernel 2.6.9-39.8???


Tanks!

++-------------------------++
Alejandro Feij?o Fraga
Tecnico de Sistemas.
Centro de supercomputaci?n de Galicia
Avda. de Vigo s/n. Campus Sur.
15705 - Santiago de Compostela. Spain
Tlfn.: 981 56 98 10 Extension: 216
Fax: 981 59 46 16


From lhh at redhat.com  Tue Mar  7 19:12:39 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 07 Mar 2006 14:12:39 -0500
Subject: [Linux-cluster] More CS4 fencing fun
In-Reply-To: <590F0181-7330-408F-B693-E182780DE8A7@kinetikon.com>
References: <590F0181-7330-408F-B693-E182780DE8A7@kinetikon.com>
Message-ID: <1141758759.25169.120.camel@ayanami.boston.redhat.com>

On Tue, 2006-03-07 at 17:04 +0100, Matteo Catanese wrote:

> Result: One node perfectly up but cluster service stalled

Fencing never completes because iLO does not have power.  This an
architectural limitation to using iLO (or IPMI, actually) in a cluster
environment as the sole fencing method.  Compare to RSA - which can have
its own external power supply - even though it is an integrated solution
like iLO.

With redundant power supplies, the expectation is that different
circuits (or preferably - different power sources entirely) are used,
which should make the tested case significantly less likely to occur.


> Switch time: 55 seconds (+ oracle startup time).

Hrm, the backup node should take over the service after the primary node
is confirmed 'dead', i.e. after fencing is complete.  It should
certainly not be waiting around for the other node to come back to life.
What does your fence + service configuration look like, and were there
any obvious log messages which might explain the odd behavior?


> Cluster is stalled
> 
> Can you change fence behaviour to be less "radical" ?
> 
> If ILO is unreachable means that machine is already off and could not  
> be powered on so fence shold spit out a warning and let the failover  
> happen

iLO being unreachable means iLO is unreachable, and assumptions as to
why should probably not be limited to lack of power.  Routing problems,
bad network cable, disconnected cable, and the occasional infinite
iLO-DHCP loop will all make iLO unreachable, but in no way confirm that
the node is dead.

More to the point, though, you can get around this particular behavior
(fencing on startup -> hang because fencing fails) by starting fenced
with the clean start parameter.  In a two node cluster, this is useful
to start things up in a controlled way when you know you won't be able
to fence the other node.  I think it's:

   fence_tool join -c

If you (the administrator) are sure that the node is dead and does not
have any services running, it will cause fenced to not fence the other
node on startup, thereby avoiding the hang entirely.  However,
automatically doing this is unsafe if both nodes are booting while a
network partition exists between the nodes, the cluster will end up with
a split brain.

-- Lon


From milis at ogs-id.com  Wed Mar  8 04:13:57 2006
From: milis at ogs-id.com (Milis)
Date: Wed, 8 Mar 2006 11:13:57 +0700
Subject: [Linux-cluster] Clustering RHEL 4 with EXP400
Message-ID: <751353696.20060308111357@sur.ogs-id.com>

Dear All,

does any one here have experince to cluster RHEL 4  with 2 IBM x346
and 1 EXP 400,
I need to do this for build Oracle10g On Rac, whether I need driver
for build and startup of this device.
may I know what I need for this requirement?
and what should I do to get driver of IBM EXP400 ?
I've been success install IBM X346 with Raid 1 option on RHEL 4, but I
really confuse what next to do to show up cluster on EXP400 (whether I
have no driver for this device to share storage)
Thanks for your share knowledge.

-- 
Tks & Best regards,
Andi EP
IT Engineer
mailto:milis at ogs-id.com


From francisco_javier.pena at roche.com  Wed Mar  8 08:49:27 2006
From: francisco_javier.pena at roche.com (Pena, Francisco Javier)
Date: Wed, 8 Mar 2006 09:49:27 +0100
Subject: [Linux-cluster] cman for CS
Message-ID: <C0C1791E8EC6F249B5570F01409BD3EE91622D@rmamsem1.emea.roche.com>

Hi Alejandro,

The cman version after the dash sign does not have to be the same as the kernel version. Just do a "rpm -qp --requires cman-kernel-2.6.9-39.8.src.rpm", and it should tell you which kernel version is required.

Cheers,

Javier

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of 
> Alejandro Feijoo
> Sent: Tuesday, March 07, 2006 7:41 PM
> To: linux-cluster at redhat.com
> Subject: [Linux-cluster] cman for CS
> 
> 
> 
> 
> hi i have a linux kernel version 2.6.9.22.0.2 (the lastest!) 
> buttt the cman for dowload is cman-kernel-2.6.9-39.8.src.rpm....
> 
> there are any problem if i install that cman ????? and where 
> is rpm for kernel 2.6.9-39.8???
> 
> 
> Tanks!
> 
> ++-------------------------++
> Alejandro Feij?o Fraga
> Tecnico de Sistemas.
> Centro de supercomputaci?n de Galicia
> Avda. de Vigo s/n. Campus Sur.
> 15705 - Santiago de Compostela. Spain
> Tlfn.: 981 56 98 10 Extension: 216
> Fax: 981 59 46 16
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com 
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From mykleb at no.ibm.com  Wed Mar  8 09:11:57 2006
From: mykleb at no.ibm.com (Jan-Frode Myklebust)
Date: Wed, 8 Mar 2006 10:11:57 +0100
Subject: [Linux-cluster] Re: Clustering RHEL 4 with EXP400
References: <751353696.20060308111357@sur.ogs-id.com>
Message-ID: <slrne0t7ut.qu6.mykleb@99RXZYP.ibm.com>

On 2006-03-08, Milis <milis at ogs-id.com> wrote:
>
> does any one here have experince to cluster RHEL 4  with 2 IBM x346
> and 1 EXP 400,

I've clustered RHEL3 with 2x Dell PowerEdge 2650/ServeRAID 6M and 
1 EXP400.  This was an active/passive cluster, using the ServeRAID's 
hardware fencing to prevent more than one node from accessing the 
volumes on the EXP400. x346+RHEL4 shouldn't be any difference.

Now, I'm not much familiar with Oracle10g On Rac, but is it really an
active/passive solution you want there? AFAIK you can't have both
nodes active against the volumes on the EXP400 at the same time.

> I've been success install IBM X346 with Raid 1 option on RHEL 4, but I
> really confuse what next to do to show up cluster on EXP400 (whether I
> have no driver for this device to share storage)


  -jf


From carlopmart at gmail.com  Wed Mar  8 12:35:17 2006
From: carlopmart at gmail.com (carlopmart)
Date: Wed, 08 Mar 2006 13:35:17 +0100
Subject: [Linux-cluster] Postfix under cluster suite
Message-ID: <440ECF85.2050807@gmail.com>

Hi all,

  Somebody have tried to setup a postfix cluster service under RHCS 4? 
Is  it possible to mantain two postfix instances (one for node and 
another to the cluster)?

Thanks.

-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From basv at sara.nl  Wed Mar  8 15:35:25 2006
From: basv at sara.nl (Bas van der Vlies)
Date: Wed, 08 Mar 2006 16:35:25 +0100
Subject: [Linux-cluster] gfs + nfsd crash
In-Reply-To: <C273A968-DE83-451D-93E4-589F137E5F27@sara.nl>
References: <05AF3B6E-B88A-45CC-83E4-1353291E6347@sara.nl>
	<C273A968-DE83-451D-93E4-589F137E5F27@sara.nl>
Message-ID: <440EF9BD.4030108@sara.nl>

We just upgraded to 2.6.16-rc5 and cvs stable gfs. We still have
gfs_create crashes.

=== Ooops =====
Unable to handle kernel NULL pointer dereference at virtual address 00000038
  printing eip:
f89a4be3
*pde = 37809001
*pte = 00000000
Oops: 0000 [#1]
SMP
Modules linked in: lock_dlm dlm cman dm_round_robin dm_multipath sg 
ide_floppy ide_cd cdrom qla2xxx siimage piix e1000 gfs lock_harness dm_mod
CPU:    0
EIP:    0060:[<f89a4be3>]    Tainted: GF     VLI
EFLAGS: 00010246   (2.6.16-rc5-sara3 #1)
EIP is at gfs_create+0x6f/0x153 [gfs]
eax: 00000000   ebx: ffffffef   ecx: f27d0d98   edx: ffffffef
esi: f2f84690   edi: f8b93000   ebp: f34a5e98   esp: f34a5e20
ds: 007b   es: 007b   ss: 0068
Process nfsd (pid: 8973, threadinfo=f34a4000 task=f3462a70)
Stack: <0>f092a530 00000001 f34a5e48 00000000 f34a5e84 f89a6628 f34a5e48 
ee1fc324
        00000003 00000000 f34a5e48 f34a5e48 00000000 f3462a70 00000003 
f34a5e5c
        f34a5e5c f27d0d98 f3462a70 00000001 00000020 00000000 000000c2 
00000000
Call Trace:
  [<c0103599>] show_stack_log_lvl+0xad/0xb5
  [<c01036db>] show_registers+0x10d/0x176
  [<c01038ad>] die+0xf2/0x16d
  [<c010f668>] do_page_fault+0x3dd/0x57a
  [<c010322f>] error_code+0x4f/0x54
  [<c01585f2>] vfs_create+0x6a/0xa7
  [<c0195e1c>] nfsd_create_v3+0x2b1/0x48a
  [<c019af2f>] nfsd3_proc_create+0x116/0x123
  [<c019229f>] nfsd_dispatch+0xbe/0x17f
  [<c02e0a52>] svc_process+0x381/0x5c7
  [<c019208c>] nfsd+0x18d/0x2e2
  [<c0100ed9>] kernel_thread_helper+0x5/0xb
Code: 94 50 8b 45 0c ff 75 10 83 c0 1c 6a 01 89 45 88 50 8d 45 c4 50 e8 
70 08 ff ff 83 c4 14 89 c3 85 c0 74 4883 f8 ef 75 33 8b 45 14 <80> 78 38 
00 78 2a 8d 45 94 50 8d 45 c4 6a 00 ff 75 88 50 e8 3c
  BUG: nfsd/8973, lock held at task exit time!
  [ee1fc398] {inode_init_once}
.. held by:              nfsd: 8973 [f3462a70, 115]
... acquired at:               nfsd_create_v3+0x127/0x48a


-- 
--
********************************************************************
*                                                                  *
*  Bas van der Vlies                     e-mail: basv at sara.nl      *
*  SARA - Academic Computing Services    phone:  +31 20 592 8012   *
*  Kruislaan 415                         fax:    +31 20 6683167    *
*  1098 SJ Amsterdam                                               *
*                                                                  *
********************************************************************


From lhh at redhat.com  Wed Mar  8 15:48:30 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 08 Mar 2006 10:48:30 -0500
Subject: [Linux-cluster] Postfix under cluster suite
In-Reply-To: <440ECF85.2050807@gmail.com>
References: <440ECF85.2050807@gmail.com>
Message-ID: <1141832910.25169.151.camel@ayanami.boston.redhat.com>

On Wed, 2006-03-08 at 13:35 +0100, carlopmart wrote:
> Hi all,
> 
>   Somebody have tried to setup a postfix cluster service under RHCS 4? 
> Is  it possible to mantain two postfix instances (one for node and 
> another to the cluster)?

I have not tried it, and I am not familiar with configuring Postfix in
the least, but here are some hints and gotchas which might exist ;)

* Both instances will want to (by default) bind to INADDR_ANY.  The two
instances must be binding to specific IP addresses - i.e., one instance
must bind to the host's IP, the other to the cluster floating IP - in
order for two instances to start in the first place.

* Both instances will want to deliver stuff in /var/mail ... and email
readers want to read from /var/mail.

Maybe the "RightThing(tm)" to do is something weird like the following.
This is a complete shot in the dark... ;)

- Cluster-mounted /var/mail (either GFS or not, it shouldn't matter)

- Node-specific Postfix instances never deliver mail directly, but
rather, they both forward to the cluster-instance Postfix IP.

- All postfix instances may accept mail for sending off-site.

This way, you don't have two instances of postfix both trying to manage
the contents of /var/mail (do they play nicely together?), all instances
of postfix can send mail, but only one does the ultimate receiving of
mail.

Also, if you use GFS for /var/mail, you may be able to run imapd on
multiple cluster nodes, but I've never tried this either.  As long as
multiple people aren't accessing the same imap mailbox, I am guessing it
would "just work" (famous last words, I know ;) ).

-- Lon


From gstaltari at arnet.net.ar  Wed Mar  8 18:05:27 2006
From: gstaltari at arnet.net.ar (German Staltari)
Date: Wed, 08 Mar 2006 15:05:27 -0300
Subject: [Linux-cluster] missing services
Message-ID: <440F1CE7.2010104@arnet.net.ar>

Hi, we have a 6 node cluster, each one mounts 6 GFS partitions. When I 
ask for the services to cman, there is always a mount point missing. Is 
this correct?
FC 4
kernel-smp-2.6.15-1.1831_FC4
dlm-kernel-smp-2.6.11.5-20050601.152643.FC4.21
GFS-kernel-smp-2.6.11.8-20050601.152643.FC4.24
cman-kernel-smp-2.6.11.5-20050601.152643.FC4.22

TIA
German Staltari

# df -h
Filesystem            Size  Used Avail Use% Mounted on
/dev/sda1              59G  2.4G   54G   5% /
/dev/shm              2.0G     0  2.0G   0% /dev/shm
/dev/mapper/vg1-store1    399G  184K  399G   1% /store/1
/dev/mapper/vg2-store2    399G  2.8M  399G   1% /store/2
/dev/mapper/vg3-store3    399G  180K  399G   1% /store/3
/dev/mapper/vg4-store4    399G  180K  399G   1% /store/4
/dev/mapper/vg5-store5    399G  180K  399G   1% /store/5
/dev/mapper/vg6-store6    399G  180K  399G   1% /store/6

# cman_tool services
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           1   2 run       -
[1 3]
DLM Lock Space:  "clvmd"                             7   3 run       -
[1 4 3]
DLM Lock Space:  "mailstore01"                      20   4 run       -
[1 3]
DLM Lock Space:  "mailstore02"                      22   6 run       -
[1 3]
DLM Lock Space:  "mailstore03"                      24   8 run       -
[1 3]
DLM Lock Space:  "mailstore04"                      26  10 run       -
[1 3]
DLM Lock Space:  "mailstore05"                      28  12 run       -
[1 3]
DLM Lock Space:  "mailstore06"                      30  14 run       -
[1 3]
GFS Mount Group: "mailstore01"                      21   5 run       -
[1 3]
GFS Mount Group: "mailstore02"                      23   7 run       -
[1 3]
GFS Mount Group: "mailstore03"                      25   9 run       -
[1 3]
GFS Mount Group: "mailstore04"                      27  11 run       -
[1 3]
GFS Mount Group: "mailstore05"                      29  13 run       -
[1 3]


From bobby.m.dalton at nasa.gov  Wed Mar  8 18:43:07 2006
From: bobby.m.dalton at nasa.gov (Dalton, Maurice)
Date: Wed, 8 Mar 2006 12:43:07 -0600
Subject: [Linux-cluster] RHEL4.0 CS and Ldap
Message-ID: <EB190CD1E73E1146ACB7694746E205A801655E0C@hx1.ums.msfc.nasa.gov>

Is there a way to create an Ldap cluster that can do replication with
RHEL 4.0 CS?

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060308/1f92bdb6/attachment.htm>

From Jon.Stanley at savvis.net  Wed Mar  8 18:54:22 2006
From: Jon.Stanley at savvis.net (Stanley, Jon)
Date: Wed, 8 Mar 2006 12:54:22 -0600
Subject: [Linux-cluster] GFS load average and locking
Message-ID: <9A6FE0FCC2B29846824C5CD81C6647B90143604F@s228130hz1ew08.apptix-01.savvis.net>

I have a 7 node GFS cluster, plus 3 lock servers (RH AS3U5, GULM
locking) that do not mount the filesystem.  I have a problem whereby the
load average on the system is extremely high (occasionally
astronomical), eventually leading to a complete site outage, via
inability to access the shared filesystem.  I have a couple questions
about the innards of GFS that I would be most grateful for someone to
answer:

The application is written in PHP, and the PHP sessioning is handled via
the GFS filesystem as well, if that's important.

1)  I notice that I have a lot of processes in uninterruptible sleep.
When I attached strace to one of these processes, I obviously found it
doing nothing for a period of ~30-60 seconds.  An excerpt of the strace
(using -r) follows:

     0.001224
stat64("/media/files/global/2/6/26c4f61c69117d55b352ce328babbff4.jpg",
{st_mode=S_IFREG|0644, st_size=9072, ...}) = 0
     0.000251
open("/media/files/global/2/6/26c4f61c69117d55b352ce328babbff4.jpg",
O_RDONLY) = 5
     0.000108 mmap2(NULL, 9072, PROT_READ, MAP_PRIVATE, 5, 0) =
0xaf381000
     0.000069 writev(4, [{"HTTP/1.1 200 OK\r\nDate: Wed, 08 M"..., 318},
{"\377\330\377\340\0\20JFIF\0\1\2\0\0d\0d\0\0\377\354\0\21"..., 9072}],
2) = 9390
     0.000630 close(5)                  = 0
     0.000049 munmap(0xaf381000, 9072)  = 0
     0.000052 rt_sigaction(SIGUSR1, {0x81ef474, [],
SA_RESTORER|SA_INTERRUPT, 0x1b2eb8}, {SIG_IGN}, 8) = 0
     0.000068 read(4, 0xa239b3c, 4096)  = ? ERESTARTSYS (To be
restarted)
     6.546891 --- SIGALRM (Alarm clock) @ 0 (0) ---
     0.000119 close(4)                  = 0

What it looks like is it hangs out in read() for a period of time, thus
leading to the uninterruptible sleep.  This particular example was 6
seconds, however it seems that the time is variable.  The particular
file in this instance is not large, only 9k.

I've never seen ERESTARTSYS before, and some googling tells me that it's
basically telling the kernel to interrupt the current syscall in order
to handle a signal (SIGALRM in this case, which I'm not sure the
function of).  I could be *way* off base here - I'm not a programmer by
any stretch of the imagination.

2)  The locking statistics seems to be a huge mystery.  The lock total
doesn't seem to correspond to the number of open files that I have (I
hope!).  Here's the output of a 'cat /proc/gulm/lockspace - I can't
imagine that I have 300,000+ files open on this system at this point -
when are the locks released, or is this even an indication of how many
locks that are active at the current time?  What does the 'pending'
number mean?

[svadmin at s259830hz1sl01 gulm]$ cat lockspace

lock counts:
  total: 369822
    unl: 176518
    exl: 1555
    shd: 191501
    dfr: 0
pending: 5
   lvbs: 2000
   lops: 21467433

[svadmin at s259830hz1sl01 gulm]$

Thanks for any help that anyone can provide on this!

Thanks!
-Jon


From cjk at techma.com  Wed Mar  8 19:06:10 2006
From: cjk at techma.com (Kovacs, Corey J.)
Date: Wed, 8 Mar 2006 14:06:10 -0500
Subject: [Linux-cluster] GFS load average and locking
Message-ID: <FF2CE0D593AEE34B955FEC77BD5AFBE0079E22@tmaemail.techma.com>

There is a condition (known) where locks are not being 
released as they should be. In a forthcoming patch, there
is a tunable parameter which allows the purging of unused,
yet retained locks by a percentage. I've tested this under
conditions which affect my ststem and it was rock solid 
afterwards. At the time I tested it, you had to make the 
change after the system was up and running (ie, not a config 
setting). Hopefully this will make it into update 7.

Regards,


Corey

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Stanley, Jon
Sent: Wednesday, March 08, 2006 1:54 PM
To: linux-cluster at redhat.com
Subject: [Linux-cluster] GFS load average and locking

I have a 7 node GFS cluster, plus 3 lock servers (RH AS3U5, GULM
locking) that do not mount the filesystem.  I have a problem whereby the load
average on the system is extremely high (occasionally astronomical),
eventually leading to a complete site outage, via inability to access the
shared filesystem.  I have a couple questions about the innards of GFS that I
would be most grateful for someone to
answer:

The application is written in PHP, and the PHP sessioning is handled via the
GFS filesystem as well, if that's important.

1)  I notice that I have a lot of processes in uninterruptible sleep.
When I attached strace to one of these processes, I obviously found it doing
nothing for a period of ~30-60 seconds.  An excerpt of the strace (using -r)
follows:

     0.001224
stat64("/media/files/global/2/6/26c4f61c69117d55b352ce328babbff4.jpg",
{st_mode=S_IFREG|0644, st_size=9072, ...}) = 0
     0.000251
open("/media/files/global/2/6/26c4f61c69117d55b352ce328babbff4.jpg",
O_RDONLY) = 5
     0.000108 mmap2(NULL, 9072, PROT_READ, MAP_PRIVATE, 5, 0) = 0xaf381000
     0.000069 writev(4, [{"HTTP/1.1 200 OK\r\nDate: Wed, 08 M"..., 318},
{"\377\330\377\340\0\20JFIF\0\1\2\0\0d\0d\0\0\377\354\0\21"..., 9072}],
2) = 9390
     0.000630 close(5)                  = 0
     0.000049 munmap(0xaf381000, 9072)  = 0
     0.000052 rt_sigaction(SIGUSR1, {0x81ef474, [], SA_RESTORER|SA_INTERRUPT,
0x1b2eb8}, {SIG_IGN}, 8) = 0
     0.000068 read(4, 0xa239b3c, 4096)  = ? ERESTARTSYS (To be
restarted)
     6.546891 --- SIGALRM (Alarm clock) @ 0 (0) ---
     0.000119 close(4)                  = 0

What it looks like is it hangs out in read() for a period of time, thus
leading to the uninterruptible sleep.  This particular example was 6 seconds,
however it seems that the time is variable.  The particular file in this
instance is not large, only 9k.

I've never seen ERESTARTSYS before, and some googling tells me that it's
basically telling the kernel to interrupt the current syscall in order to
handle a signal (SIGALRM in this case, which I'm not sure the function of).
I could be *way* off base here - I'm not a programmer by any stretch of the
imagination.

2)  The locking statistics seems to be a huge mystery.  The lock total
doesn't seem to correspond to the number of open files that I have (I hope!).
Here's the output of a 'cat /proc/gulm/lockspace - I can't imagine that I
have 300,000+ files open on this system at this point - when are the locks
released, or is this even an indication of how many locks that are active at
the current time?  What does the 'pending'
number mean?

[svadmin at s259830hz1sl01 gulm]$ cat lockspace

lock counts:
  total: 369822
    unl: 176518
    exl: 1555
    shd: 191501
    dfr: 0
pending: 5
   lvbs: 2000
   lops: 21467433

[svadmin at s259830hz1sl01 gulm]$

Thanks for any help that anyone can provide on this!

Thanks!
-Jon

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From wcheng at redhat.com  Wed Mar  8 19:20:32 2006
From: wcheng at redhat.com (Wendy Cheng)
Date: Wed, 08 Mar 2006 14:20:32 -0500
Subject: [Linux-cluster] GFS load average and locking
In-Reply-To: <9A6FE0FCC2B29846824C5CD81C6647B90143604F@s228130hz1ew08.apptix-01.savvis.net>
References: <9A6FE0FCC2B29846824C5CD81C6647B90143604F@s228130hz1ew08.apptix-01.savvis.net>
Message-ID: <440F2E80.9030507@redhat.com>

Stanley, Jon wrote:

>2)  The locking statistics seems to be a huge mystery.  The lock total
>doesn't seem to correspond to the number of open files that I have (I
>hope!).  Here's the output of a 'cat /proc/gulm/lockspace - I can't
>imagine that I have 300,000+ files open on this system at this point -
>when are the locks released, or is this even an indication of how many
>locks that are active at the current time?  What does the 'pending'
>number mean?
>  
>

GFS caches locks and normally won't release them (for performance 
reason). However, we do find this could cause latency issue, 
particularly after back up and/or tar command where lots of locks are 
accumulated into one single node that previously issued the backup 
command. Judging by your description of read latency and number of 
"shared" locks in your lockspace output, we do have a new tunable in 
to-be-released-soon RHEL3 Update 7 that allows admin to purge the locks. 
This seems to help several of (beta) customers to resolve their latency 
issues.

Other than this, do you find any error messages in your 
/var/log/messages file ?

-- Wendy


From Jon.Stanley at savvis.net  Wed Mar  8 19:36:00 2006
From: Jon.Stanley at savvis.net (Stanley, Jon)
Date: Wed, 8 Mar 2006 13:36:00 -0600
Subject: [Linux-cluster] GFS load average and locking
Message-ID: <9A6FE0FCC2B29846824C5CD81C6647B901436104@s228130hz1ew08.apptix-01.savvis.net>

We're not doing any tar/backup of the filesystem, so I don't think that
this is the issue.  There are a *large* number of small files (but the
files per directory are kept small).  I'm not sure if that has anything
to do with this.

There are no abnormal messages in /var/log/messages.  The lockspace
output that I gave you is from a client, not the lock master.  Let me
know if there is any more information that I might be able to provide.

We have a GFS service request open, but it doesn't seem to be getting
very far :-( 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Wendy Cheng
Sent: Wednesday, March 08, 2006 1:21 PM
To: linux clustering
Subject: Re: [Linux-cluster] GFS load average and locking

Stanley, Jon wrote:

>2)  The locking statistics seems to be a huge mystery.  The lock total 
>doesn't seem to correspond to the number of open files that I have (I 
>hope!).  Here's the output of a 'cat /proc/gulm/lockspace - I can't 
>imagine that I have 300,000+ files open on this system at this point - 
>when are the locks released, or is this even an indication of how many 
>locks that are active at the current time?  What does the 'pending'
>number mean?
>  
>

GFS caches locks and normally won't release them (for performance
reason). However, we do find this could cause latency issue,
particularly after back up and/or tar command where lots of locks are
accumulated into one single node that previously issued the backup
command. Judging by your description of read latency and number of
"shared" locks in your lockspace output, we do have a new tunable in
to-be-released-soon RHEL3 Update 7 that allows admin to purge the locks.

This seems to help several of (beta) customers to resolve their latency
issues.

Other than this, do you find any error messages in your
/var/log/messages file ?

-- Wendy

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From wcheng at redhat.com  Wed Mar  8 19:51:59 2006
From: wcheng at redhat.com (Wendy Cheng)
Date: Wed, 08 Mar 2006 14:51:59 -0500
Subject: [Linux-cluster] GFS load average and locking
In-Reply-To: <9A6FE0FCC2B29846824C5CD81C6647B901436104@s228130hz1ew08.apptix-01.savvis.net>
References: <9A6FE0FCC2B29846824C5CD81C6647B901436104@s228130hz1ew08.apptix-01.savvis.net>
Message-ID: <440F35DF.1070604@redhat.com>

Stanley, Jon wrote:

>We have a GFS service request open, 
>
Could you pass your ticket number so we can check into this ?

-- Wendy


From Britt.Treece at savvis.net  Wed Mar  8 19:58:23 2006
From: Britt.Treece at savvis.net (Treece, Britt)
Date: Wed, 8 Mar 2006 13:58:23 -0600
Subject: [Linux-cluster] GFS load average and locking
Message-ID: <9A6FE0FCC2B29846824C5CD81C6647B901436169@s228130hz1ew08.apptix-01.savvis.net>

Wendy,

The ticket number is 836516.  We've been told that it has been escalated
to the GFS engineers.

Here is lockdump information that we've pulled on an inode whose httpd
process is in the "D" state...

$ sudo lsof /data02 | grep 14060
httpd   14060 nobody  cwd    DIR 254,66  3864 93725927
/data02/resources/htdocs/SVVS-2006-03-02-06-17-20
httpd   14060 nobody    5u   REG 254,66 15624 52992012
/data02/sessions/6/c/2/4/sess_6c249351c42e2c19c669b068433db9a8

$ ps -auxwww | grep 14060
root     24901  0.0  0.0  1700  432 pts/2    S    12:57   0:00 strace
-rp 14060
nobody   14060  0.1  0.5 150564 42348 ?      D    09:47   0:19
/usr/local/apache/bin/httpd -DSSL


following parsed from a lockdump of /data02...

Glock (7, 52992012)
  gl_flags =
  gl_count = 2
  gl_state = 3
  lvb_count = 0
  object = yes
  dependencies = no
  reclaim = no
  Holder
    owner = -1
    gh_state = 3
    gh_flags = 5 7
    error = 0
    gh_iflags = 1 5 6

Glock (8, 52992012)
  gl_flags =
  gl_count = 4
  gl_state = 1
  lvb_count = 0
  object = no
  dependencies = no
  reclaim = no
  Holder
    owner = -1
    gh_state = 1
    gh_flags = 5 7
    error = 0
    gh_iflags = 1 5 6
  Waiter2
    owner = -1
    gh_state = 0
    gh_flags = 0
    error = 0
    gh_iflags = 2 3 4
  Waiter2
    owner = -1
    gh_state = 1
    gh_flags = 5 7
    error = 0
    gh_iflags = 1

Glock (4, 52992012)
  gl_flags =
  gl_count = 3
  gl_state = 3
  lvb_count = 0
  object = yes
  dependencies = no
  reclaim = no
  Inode:
    num = 52992012/52992012
    type = 1
    i_count = 1
    i_flags =
    vnode = yes


-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Wendy Cheng
Sent: Wednesday, March 08, 2006 1:52 PM
To: linux clustering
Subject: Re: [Linux-cluster] GFS load average and locking

Stanley, Jon wrote:

>We have a GFS service request open, 
>
Could you pass your ticket number so we can check into this ?

-- Wendy

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From rstevens at vitalstream.com  Wed Mar  8 21:05:55 2006
From: rstevens at vitalstream.com (Rick Stevens)
Date: Wed, 08 Mar 2006 13:05:55 -0800
Subject: [Linux-cluster] Clustering RHEL 4 with EXP400
In-Reply-To: <751353696.20060308111357@sur.ogs-id.com>
References: <751353696.20060308111357@sur.ogs-id.com>
Message-ID: <1141851956.890.268.camel@prophead.corp.publichost.com>

On Wed, 2006-03-08 at 11:13 +0700, Milis wrote:
> Dear All,
> 
> does any one here have experince to cluster RHEL 4  with 2 IBM x346
> and 1 EXP 400,
> I need to do this for build Oracle10g On Rac, whether I need driver
> for build and startup of this device.
> may I know what I need for this requirement?
> and what should I do to get driver of IBM EXP400 ?
> I've been success install IBM X346 with Raid 1 option on RHEL 4, but I
> really confuse what next to do to show up cluster on EXP400 (whether I
> have no driver for this device to share storage)

It's rather too much to go into on the mailing list.  I first recommend
you google "linux +cluster" for some background information.  You
probably should also join the linux-cluster mailing list for details
on this.  If you purchased the RHEL HA package or GFS system, check
your manuals.

Generally, you need a SAN of some sort to provide the storage
(fiberchannel disk array, iSCSI array, something).  You then need to
install the cluster software and kernel patches, decide on what kind
of device management you need (gulm, dlm, etc.) and fire it up.

As I said, it's too complicated to give you a tutorial on a mailing
list.

----------------------------------------------------------------------
- Rick Stevens, Senior Systems Engineer     rstevens at vitalstream.com -
- VitalStream, Inc.                       http://www.vitalstream.com -
-                                                                    -
- I never drink water because of the disgusting things that fish do  -
-                                  in it.                            -
-                                                      -- WC. Fields -
----------------------------------------------------------------------


From lhh at redhat.com  Wed Mar  8 22:32:37 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 08 Mar 2006 17:32:37 -0500
Subject: [Linux-cluster] RHEL4.0 CS and Ldap
In-Reply-To: <EB190CD1E73E1146ACB7694746E205A801655E0C@hx1.ums.msfc.nasa.gov>
References: <EB190CD1E73E1146ACB7694746E205A801655E0C@hx1.ums.msfc.nasa.gov>
Message-ID: <1141857157.25169.153.camel@ayanami.boston.redhat.com>

On Wed, 2006-03-08 at 12:43 -0600, Dalton, Maurice wrote:
> Is there a way to create an Ldap cluster that can do replication with
> RHEL 4.0 CS?

For some reason, I though OpenLDAP had built-in replication?

-- Lon


From rainer at ultra-secure.de  Wed Mar  8 22:43:29 2006
From: rainer at ultra-secure.de (Rainer Duffner)
Date: Wed, 08 Mar 2006 23:43:29 +0100
Subject: [Linux-cluster] RHEL4.0 CS and Ldap
In-Reply-To: <1141857157.25169.153.camel@ayanami.boston.redhat.com>
References: <EB190CD1E73E1146ACB7694746E205A801655E0C@hx1.ums.msfc.nasa.gov>
	<1141857157.25169.153.camel@ayanami.boston.redhat.com>
Message-ID: <440F5E11.4040406@ultra-secure.de>

Lon Hohberger wrote:

>On Wed, 2006-03-08 at 12:43 -0600, Dalton, Maurice wrote:
>  
>
>>Is there a way to create an Ldap cluster that can do replication with
>>RHEL 4.0 CS?
>>    
>>
>
>For some reason, I though OpenLDAP had built-in replication?
>  
>


Not multi-master (which I assume is what the original poster wants).

http://www.redhat.com/en_us/USA/home/solutions/directoryserver/

exists for a reason...


cheers,
Rainer


From lhh at redhat.com  Wed Mar  8 22:48:10 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 08 Mar 2006 17:48:10 -0500
Subject: [Linux-cluster] Re: Trouble with RHCS 3.0
In-Reply-To: <20060306062213.85577.qmail@web36107.mail.mud.yahoo.com>
References: <20060306062213.85577.qmail@web36107.mail.mud.yahoo.com>
Message-ID: <1141858090.25169.166.camel@ayanami.boston.redhat.com>

On Sun, 2006-03-05 at 22:22 -0800, paul raymond wrote:
> Greetings Lon,

Hi, sorry I'm late responding to this.

> The problem is that I can not get Quorum to start unless I run the
> command "cluforce"! But after viewing clustat commands on systems c11
> and c12, it looks like c11 and c12 cant see each other status due to
> some issue with the raw partitions I believe?

If you're using an IP tiebreaker, they won't be looking for each other
on the shared partitions.  The nodes communicate with each other
primarily over the network - if they don't see each other, they will not
form a quorum.

You can try this if you want more detailed information:

  # service clumanager stop (on both nodes)
  # clumembd -fd (on both nodes)

It will give you all sorts of information, but the most important one
you should be looking for is:

  [PID] info: Membership View #1:0x00000001

If you see both nodes, it will show 0x00000003 (it's a bitmap).  If the
nodes can't see each other over the network, they will show 1 or 2.  If
this happens, you should check your network configuration and
clumanager's settings - you might want to try using broadcast instead of
multicast, etc.


>  I am using a Mylex Fiber Channel Box with QLogic 2300  interface
> card! The raw devices are setup on a 2 mirror drives, Raid 1. Can you
> please shed any good ideas what might be wrong here? The vidals are
> below!

Note that for one node to *start* without the other when using an IP
tiebreaker, having to run 'cluforce' is the default behavior.  If you
wish to change this, please check the man page for the 'cluforce'
command and the 'cludb' command.

The IP tiebreaker is typically used to *maintain* a quorum after a node
failure, because there are certain network faults in which two nodes may
see the tiebreaker - but not each other.

-- Lon


From lhh at redhat.com  Wed Mar  8 22:48:41 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 08 Mar 2006 17:48:41 -0500
Subject: [Linux-cluster] sun cluster ccp for redhat
In-Reply-To: <984C9DBB29704B47B7AAD308F2C95A3B04DE71@kmail.ksolutions.it>
References: <984C9DBB29704B47B7AAD308F2C95A3B04DE71@kmail.ksolutions.it>
Message-ID: <1141858121.25169.168.camel@ayanami.boston.redhat.com>

On Fri, 2006-03-03 at 18:48 +0100, Baesso Mirko wrote:
> Hi,
> 
> i would like to known if there is a tool like cluster console panel to
> manage cluster node as sun cluster do

I don't know what CCP is, but there's system-config-cluster

-- Lon


From lhh at redhat.com  Wed Mar  8 22:50:52 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 08 Mar 2006 17:50:52 -0500
Subject: [Linux-cluster] Cluster service restarting Locally
In-Reply-To: <20060306064749.22036.qmail@webmail50.rediffmail.com>
References: <20060306064749.22036.qmail@webmail50.rediffmail.com>
Message-ID: <1141858252.25169.170.camel@ayanami.boston.redhat.com>

On Mon, 2006-03-06 at 06:47 +0000, saju john wrote:
>   
>   
> Dear All,
> 
> I have a 2 node cluster with RHAS3 update 3.
> Kernel : 2.4.21-20.Elsmp
> Clumanager : clumanager-1.2.16-1
> 
> For more than a year everyting had been fine. Suddenly it started
> showing the follwing and restarted the service locally
> 
> clusvcmgrd[1388]: <err> Unable to obtain cluster lock: Connection
> timed out
> clulockd[1378]: <warning> Denied A.B.C.D: Broken pipe
> clulockd[1378]: <err> select error: Broken pipe
> clusvcmgrd: [1625]: <notice> service notice: Stopping service
> postgresql ...
> clusvcmgrd: [1625]: <notice> service notice: Running user script
> '/etc/init.d/postgresql stop'
> clusvcmgrd: [1625]: <notice> service notice: Stopped service
> postgresql
> clusvcmgrd: [1625]: <notice> service notice: Starting service
> postgresql ...
> clusvcmgrd: [1625]: <notice> service notice: Running user script
> '/etc/init.d/postgresql start'
> clusvcmgrd: [1625]: <notice> service notice: Started service
> postgresql ...

It should be fixed in RHCS3U7

-- Lon


From lhh at redhat.com  Wed Mar  8 22:51:58 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 08 Mar 2006 17:51:58 -0500
Subject: [Linux-cluster] Cluster service restarting Locally
In-Reply-To: <E2FAC907AF594A43B22185B5253DFCA0F34D@mx1.wsd.edu>
References: <E2FAC907AF594A43B22185B5253DFCA0F34D@mx1.wsd.edu>
Message-ID: <1141858318.25169.172.camel@ayanami.boston.redhat.com>

On Mon, 2006-03-06 at 14:02 -0600, Hong Zheng wrote:
> I?m having the same problem. My system configuration is as follows:
> 
> 2-node cluster: RH ES3, GFS6.0, clumanager-1.2.28-1 and
> redhat-config-cluster-1.0.8-1
> 
> Kernel: 2.4.21-37.EL
> 
> Linux-iscsi-3.6.3 initiator: connections to iSCSI shared storage
> server

If it's not fixed in U7 (which I think it should be), please file a
bugzilla... It sounds like the lock traffic is getting network-starved.

-- Lon


From lhh at redhat.com  Wed Mar  8 22:53:46 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 08 Mar 2006 17:53:46 -0500
Subject: [Linux-cluster] RHEL4.0 CS and Ldap
In-Reply-To: <440F5E11.4040406@ultra-secure.de>
References: <EB190CD1E73E1146ACB7694746E205A801655E0C@hx1.ums.msfc.nasa.gov>
	<1141857157.25169.153.camel@ayanami.boston.redhat.com>
	<440F5E11.4040406@ultra-secure.de>
Message-ID: <1141858426.25169.175.camel@ayanami.boston.redhat.com>

On Wed, 2006-03-08 at 23:43 +0100, Rainer Duffner wrote:
> Lon Hohberger wrote:
> 
> >On Wed, 2006-03-08 at 12:43 -0600, Dalton, Maurice wrote:
> >  
> >
> >>Is there a way to create an Ldap cluster that can do replication with
> >>RHEL 4.0 CS?
> >>    
> >>
> >
> >For some reason, I though OpenLDAP had built-in replication?
> >  
> >
> 
> 
> Not multi-master (which I assume is what the original poster wants).

Indeed.

-- Lon


From lhh at redhat.com  Wed Mar  8 23:22:29 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 08 Mar 2006 18:22:29 -0500
Subject: [Linux-cluster] Any recommentdations for Oracle on a Netapp
	filer ?
In-Reply-To: <20060304063504.30937.qmail@web52308.mail.yahoo.com>
References: <20060304063504.30937.qmail@web52308.mail.yahoo.com>
Message-ID: <1141860149.25169.204.camel@ayanami.boston.redhat.com>

On Fri, 2006-03-03 at 22:35 -0800, SUVANKAR MOITRA wrote:
> dear lon,
> 
> I have some question about the script and the
> installation :--
> 1> Can i install RHCS4 after install the Oracle 10g?

Yes, but that was not how the document was written.  The major
difference is that you have to manually test everything rather than
using the cluster tools to help you.

e.g. to stop (being consistent with the presented example in the howto,
and assuming all environment vars are set correctly):

   /usr/share/cluster/oracledb.sh stop
   umount /mnt/oracle
   ip addr del 192.168.1.20/22 dev eth0

or to start the service:

   ip addr add 192.168.1.20/22 dev eth0
   mount -t ext3 /dev/sdb7 /mnt/oracle
   /usr/share/cluster/oracledb.sh start

If the cluster is set up correctly, this will work on both nodes (note
that you must stop it on one before starting on another).


> 2> The /mnt/oracle mount point is temporary for the
> oracle installation or should  i write on /etc/fstab ?

It is mounted by the cluster when you start the service.  Do not place
it in /etc/fstab, as mounting an ext3 file system on multiple nodes will
cause you to have a corrupt file system *very* quickly!


> 3> Can i mention ORACLE_HOME,ORACLE_BASE,ORACLE_SID
> etc on .bash_profile of every node or leave it as it
> is only create oracle user and group?

You can, and it will help the testing/debugging phase.  However, it is
not used by the cluster software when starting/stopping Oracle;
everything must be in the cluster configuration.

Don't forget to set ORACLE_HOSTNAME (which is used by the script to
trick Oracle in to using the service IP address/hostname that you set in
Part 2 - step 2), since apparently OUI_HOSTNAME does not seem to work
the way I expected it should.


> 4> Where should i place oracledb.sh file? I think its
> required in every node, am i write ?

Part 1, step 6 (steps to take on all nodes):
Install the oracledb.sh resource agent in to /usr/share/cluster


> 5>What is the exact use of oracledb.sh file?

It is called by the cluster software to start/stop/check status of the
Oracle instance.

Additionally, if you have the environment variables set up correctly, it
will start/stop Oracle outside of the cluster environment, too (just
like a normal initscript...).


> 6> How can i shutdown the oracle, should i write
> script for that, like orastop and orastart for up the
> oracle?

If your environment variables are set correctly and the cluster is not
running:

  /usr/share/cluster/oracledb.sh stop

Once the instance is managed by RHCS (and RHCS is running!), you can use
'clusvcadm' to disable and enable the now failover-capable Oracle
instance, and move it around (see the clusvcadm man page for more
details).

-- Lon


From bobby.m.dalton at nasa.gov  Wed Mar  8 23:40:21 2006
From: bobby.m.dalton at nasa.gov (Dalton, Maurice)
Date: Wed, 8 Mar 2006 17:40:21 -0600
Subject: [Linux-cluster] RHEL4.0 CS and Ldap
References: <EB190CD1E73E1146ACB7694746E205A801655E0C@hx1.ums.msfc.nasa.gov>
	<1141857157.25169.153.camel@ayanami.boston.redhat.com>
Message-ID: <EB190CD1E73E1146ACB7694746E205A8017AA470@hx1.ums.msfc.nasa.gov>

Yes multi-master is what I am looking for. 

________________________________

From: linux-cluster-bounces at redhat.com on behalf of Lon Hohberger
Sent: Wed 3/8/2006 4:32 PM
To: linux clustering
Subject: Re: [Linux-cluster] RHEL4.0 CS and Ldap


On Wed, 2006-03-08 at 12:43 -0600, Dalton, Maurice wrote:
> Is there a way to create an Ldap cluster that can do replication with
> RHEL 4.0 CS?

For some reason, I though OpenLDAP had built-in replication?

-- Lon

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 3676 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060308/2f178239/attachment.bin>

From suvankar_moitra at yahoo.com  Thu Mar  9 05:38:24 2006
From: suvankar_moitra at yahoo.com (SUVANKAR MOITRA)
Date: Wed, 8 Mar 2006 21:38:24 -0800 (PST)
Subject: [Linux-cluster] Any recommentdations for Oracle on a Netapp filer
	?
In-Reply-To: <1141860149.25169.204.camel@ayanami.boston.redhat.com>
Message-ID: <20060309053824.82455.qmail@web52301.mail.yahoo.com>

dear lon,

I am thankful to u .Its a great opertunity for me to
take your guidence .I was sucessfully load the oracle
in virtual ip and the script is runing .But one
problem is there the script is not runing from cluster
if i mension the path /usr/share/cluster/oracledb.sh
start.When i want to run   the oracle i am using the
following command :- ./oracledb.sh start or stop or
status etc....
How can i put the thing in cluster suite?
Lon can i install oracle rac on rhcs4?Is it possible?
Because oracle rac using ocfs.

Thanks & regards

Suvankar Moitra
Kolkata , India


--- Lon Hohberger <lhh at redhat.com> wrote:

> On Fri, 2006-03-03 at 22:35 -0800, SUVANKAR MOITRA
> wrote:
> > dear lon,
> > 
> > I have some question about the script and the
> > installation :--
> > 1> Can i install RHCS4 after install the Oracle
> 10g?
> 
> Yes, but that was not how the document was written. 
> The major
> difference is that you have to manually test
> everything rather than
> using the cluster tools to help you.
> 
> e.g. to stop (being consistent with the presented
> example in the howto,
> and assuming all environment vars are set
> correctly):
> 
>    /usr/share/cluster/oracledb.sh stop
>    umount /mnt/oracle
>    ip addr del 192.168.1.20/22 dev eth0
> 
> or to start the service:
> 
>    ip addr add 192.168.1.20/22 dev eth0
>    mount -t ext3 /dev/sdb7 /mnt/oracle
>    /usr/share/cluster/oracledb.sh start
> 
> If the cluster is set up correctly, this will work
> on both nodes (note
> that you must stop it on one before starting on
> another).
> 
> 
> 
> 
> > 2> The /mnt/oracle mount point is temporary for
> the
> > oracle installation or should  i write on
> /etc/fstab ?
> 
> It is mounted by the cluster when you start the
> service.  Do not place
> it in /etc/fstab, as mounting an ext3 file system on
> multiple nodes will
> cause you to have a corrupt file system *very*
> quickly!
> 
> 
> > 3> Can i mention
> ORACLE_HOME,ORACLE_BASE,ORACLE_SID
> > etc on .bash_profile of every node or leave it as
> it
> > is only create oracle user and group?
> 
> You can, and it will help the testing/debugging
> phase.  However, it is
> not used by the cluster software when
> starting/stopping Oracle;
> everything must be in the cluster configuration.
> 
> Don't forget to set ORACLE_HOSTNAME (which is used
> by the script to
> trick Oracle in to using the service IP
> address/hostname that you set in
> Part 2 - step 2), since apparently OUI_HOSTNAME does
> not seem to work
> the way I expected it should.
> 
> 
> > 4> Where should i place oracledb.sh file? I think
> its
> > required in every node, am i write ?
> 
> Part 1, step 6 (steps to take on all nodes):
> Install the oracledb.sh resource agent in to
> /usr/share/cluster
> 
> 
> > 5>What is the exact use of oracledb.sh file?
> 
> It is called by the cluster software to
> start/stop/check status of the
> Oracle instance.
> 
> Additionally, if you have the environment variables
> set up correctly, it
> will start/stop Oracle outside of the cluster
> environment, too (just
> like a normal initscript...).
> 
> 
> > 6> How can i shutdown the oracle, should i write
> > script for that, like orastop and orastart for up
> the
> > oracle?
> 
> If your environment variables are set correctly and
> the cluster is not
> running:
> 
>   /usr/share/cluster/oracledb.sh stop
> 
> Once the instance is managed by RHCS (and RHCS is
> running!), you can use
> 'clusvcadm' to disable and enable the now
> failover-capable Oracle
> instance, and move it around (see the clusvcadm man
> page for more
> details).
> 
> -- Lon
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
>
https://www.redhat.com/mailman/listinfo/linux-cluster
> 


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam
protection around 
http://mail.yahoo.com 

__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From basv at sara.nl  Thu Mar  9 06:34:50 2006
From: basv at sara.nl (Bas van der Vlies)
Date: Thu, 09 Mar 2006 07:34:50 +0100
Subject: [Linux-cluster] gfs + nfsd crash
In-Reply-To: <440EF9BD.4030108@sara.nl>
References: <05AF3B6E-B88A-45CC-83E4-1353291E6347@sara.nl>	<C273A968-DE83-451D-93E4-589F137E5F27@sara.nl>
	<440EF9BD.4030108@sara.nl>
Message-ID: <440FCC8A.7040803@sara.nl>

I have checked the CVS source and found this Changelog below.
Does this fix my GFS crashes with NFS? It describes my kind of
problems.

Thanks


====== Changelog
CVSROOT:	/cvs/cluster
Module name:	cluster
Branch: 	STABLE
Changes by:	bmarzins at sourceware.org	2006-03-08 20:47:09

Modified files:
	gfs-kernel/src/gfs: ops_inode.c

Log message:
  Really gross hack!!!
  This is a workaround for one of the bugs the got lumped into 166701. It
  Breaks POSIX behavior in a corner case to avoid crashing... It's icky.
  when NFS opens a file with O_CREAT, the kernel nfs daemon checks to see
  if the file exists. If it does, nfsd does the *right thing* (either
  opens the file, or if the file was opened with O_EXCL, returns an
  error).  If the file doesn't exist, it passes the request down to the
  underlying file system. Unfortunately, since nfs *knows* that the file
  doesn't exist, it doesn't bother to pass a nameidata structure, which
  would include the intent information. However since gfs is a cluster
  file system, the file could have been created on another node after nfs
  checks for it. If this is the case, gfs needs the intent information to
  do the *right thing*.  It panics when it finds a NULL pointer, instead
  of the nameidata. Now, instead of panicing, if gfs finds a NULL
  nameidata pointer. It assumes that the file was not created with
  O_EXCL.
	
  This assumption could be wrong, with the result that an application
  could thing that it has created a new file, when in fact, it has opened
  an existing one.

=== End Changelog ===

Bas van der Vlies wrote:
> We just upgraded to 2.6.16-rc5 and cvs stable gfs. We still have
> gfs_create crashes.
> 
> === Ooops =====
> Unable to handle kernel NULL pointer dereference at virtual address 
> 00000038
>  printing eip:
> f89a4be3
> *pde = 37809001
> *pte = 00000000
> Oops: 0000 [#1]
> SMP
> Modules linked in: lock_dlm dlm cman dm_round_robin dm_multipath sg 
> ide_floppy ide_cd cdrom qla2xxx siimage piix e1000 gfs lock_harness dm_mod
> CPU:    0
> EIP:    0060:[<f89a4be3>]    Tainted: GF     VLI
> EFLAGS: 00010246   (2.6.16-rc5-sara3 #1)
> EIP is at gfs_create+0x6f/0x153 [gfs]
> eax: 00000000   ebx: ffffffef   ecx: f27d0d98   edx: ffffffef
> esi: f2f84690   edi: f8b93000   ebp: f34a5e98   esp: f34a5e20
> ds: 007b   es: 007b   ss: 0068
> Process nfsd (pid: 8973, threadinfo=f34a4000 task=f3462a70)
> Stack: <0>f092a530 00000001 f34a5e48 00000000 f34a5e84 f89a6628 f34a5e48 
> ee1fc324
>        00000003 00000000 f34a5e48 f34a5e48 00000000 f3462a70 00000003 
> f34a5e5c
>        f34a5e5c f27d0d98 f3462a70 00000001 00000020 00000000 000000c2 
> 00000000
> Call Trace:
>  [<c0103599>] show_stack_log_lvl+0xad/0xb5
>  [<c01036db>] show_registers+0x10d/0x176
>  [<c01038ad>] die+0xf2/0x16d
>  [<c010f668>] do_page_fault+0x3dd/0x57a
>  [<c010322f>] error_code+0x4f/0x54
>  [<c01585f2>] vfs_create+0x6a/0xa7
>  [<c0195e1c>] nfsd_create_v3+0x2b1/0x48a
>  [<c019af2f>] nfsd3_proc_create+0x116/0x123
>  [<c019229f>] nfsd_dispatch+0xbe/0x17f
>  [<c02e0a52>] svc_process+0x381/0x5c7
>  [<c019208c>] nfsd+0x18d/0x2e2
>  [<c0100ed9>] kernel_thread_helper+0x5/0xb
> Code: 94 50 8b 45 0c ff 75 10 83 c0 1c 6a 01 89 45 88 50 8d 45 c4 50 e8 
> 70 08 ff ff 83 c4 14 89 c3 85 c0 74 4883 f8 ef 75 33 8b 45 14 <80> 78 38 
> 00 78 2a 8d 45 94 50 8d 45 c4 6a 00 ff 75 88 50 e8 3c
>  BUG: nfsd/8973, lock held at task exit time!
>  [ee1fc398] {inode_init_once}
> .. held by:              nfsd: 8973 [f3462a70, 115]
> ... acquired at:               nfsd_create_v3+0x127/0x48a
> 
> 
> 


-- 
--
********************************************************************
*                                                                  *
*  Bas van der Vlies                     e-mail: basv at sara.nl      *
*  SARA - Academic Computing Services    phone:  +31 20 592 8012   *
*  Kruislaan 415                         fax:    +31 20 6683167    *
*  1098 SJ Amsterdam                                               *
*                                                                  *
********************************************************************


From grimme at atix.de  Thu Mar  9 08:48:01 2006
From: grimme at atix.de (Marc Grimme)
Date: Thu, 9 Mar 2006 09:48:01 +0100
Subject: [Linux-cluster] GFS load average and locking
In-Reply-To: <9A6FE0FCC2B29846824C5CD81C6647B90143604F@s228130hz1ew08.apptix-01.savvis.net>
References: <9A6FE0FCC2B29846824C5CD81C6647B90143604F@s228130hz1ew08.apptix-01.savvis.net>
Message-ID: <200603090948.03134.grimme@atix.de>

Hi,
On Wednesday 08 March 2006 19:54, Stanley, Jon wrote:
> I have a 7 node GFS cluster, plus 3 lock servers (RH AS3U5, GULM
> locking) that do not mount the filesystem.  I have a problem whereby the
> load average on the system is extremely high (occasionally
> astronomical), eventually leading to a complete site outage, via
> inability to access the shared filesystem.  I have a couple questions
> about the innards of GFS that I would be most grateful for someone to
> answer:
>
> The application is written in PHP, and the PHP sessioning is handled via
> the GFS filesystem as well, if that's important.
>
> 1)  I notice that I have a lot of processes in uninterruptible sleep.
> When I attached strace to one of these processes, I obviously found it
> doing nothing for a period of ~30-60 seconds.  An excerpt of the strace
> (using -r) follows:
>
>      0.001224
> stat64("/media/files/global/2/6/26c4f61c69117d55b352ce328babbff4.jpg",
> {st_mode=S_IFREG|0644, st_size=9072, ...}) = 0
>      0.000251
> open("/media/files/global/2/6/26c4f61c69117d55b352ce328babbff4.jpg",
> O_RDONLY) = 5
>      0.000108 mmap2(NULL, 9072, PROT_READ, MAP_PRIVATE, 5, 0) =
> 0xaf381000
>      0.000069 writev(4, [{"HTTP/1.1 200 OK\r\nDate: Wed, 08 M"..., 318},
> {"\377\330\377\340\0\20JFIF\0\1\2\0\0d\0d\0\0\377\354\0\21"..., 9072}],
> 2) = 9390
>      0.000630 close(5)                  = 0
>      0.000049 munmap(0xaf381000, 9072)  = 0
>      0.000052 rt_sigaction(SIGUSR1, {0x81ef474, [],
> SA_RESTORER|SA_INTERRUPT, 0x1b2eb8}, {SIG_IGN}, 8) = 0
>      0.000068 read(4, 0xa239b3c, 4096)  = ? ERESTARTSYS (To be
> restarted)
>      6.546891 --- SIGALRM (Alarm clock) @ 0 (0) ---
>      0.000119 close(4)                  = 0

>
> What it looks like is it hangs out in read() for a period of time, thus
> leading to the uninterruptible sleep.  This particular example was 6
> seconds, however it seems that the time is variable.  The particular
> file in this instance is not large, only 9k.
Although the strace does not show the output I know of the problem description 
sounds like a deja vu.
We had loads of problems with having sessions on GFS and httpd s ending up 
with "D" state for some time (at high load times we had ServerLimit httpd in 
D per node which ended up in the service not being available). 
As I posted already we think it is because of the "bad" locking of sessions 
with php (as php sessions are on gfs and strace showed those timeouts with 
the session files). When you issue a "session_start" or what ever that 
function is called, the session_file is locked via an flock syscall. That 
lock is held until you end the session which is implicitly done when the tcp 
connection to the client is ended. Now comes another http process (on 
whatever node) and calls a "session start" and trys an flock on that session 
while another process already holds that lock. The process might end up in 
the seen timeouts (30-60secs) which (as far as I remember relates to the 
timeout of the tcp connection defined in the httpd.conf or some timeout in 
the php.ini) - there is an explanation on this but I cannot rember ;-) ). 
Nevertheless in our scenario the problems were the "bad" session handling by 
php. We have made a patch for the phplib where you can disable the locking, 
or just implicitly do locking and therefore keep consitency while session 
data is read or written. We could make apache work as expected and now we 
don't see any "D" process anymore since a year.
Oh yes the patch can be found at
www.opensharedroot.org in the download section.

Besides: You will never encounter this on a localfilesystem or nfs (as nfs 
ignores flocks). As nfs does not support flocks and silently ignores them.

Hope that helps and let us know about problems.
Regards Marc.
>
> I've never seen ERESTARTSYS before, and some googling tells me that it's
> basically telling the kernel to interrupt the current syscall in order
> to handle a signal (SIGALRM in this case, which I'm not sure the
> function of).  I could be *way* off base here - I'm not a programmer by
> any stretch of the imagination.
>
> 2)  The locking statistics seems to be a huge mystery.  The lock total
> doesn't seem to correspond to the number of open files that I have (I
> hope!).  Here's the output of a 'cat /proc/gulm/lockspace - I can't
> imagine that I have 300,000+ files open on this system at this point -
> when are the locks released, or is this even an indication of how many
> locks that are active at the current time?  What does the 'pending'
> number mean?
>
> [svadmin at s259830hz1sl01 gulm]$ cat lockspace
>
> lock counts:
>   total: 369822
>     unl: 176518
>     exl: 1555
>     shd: 191501
>     dfr: 0
> pending: 5
>    lvbs: 2000
>    lops: 21467433
>
> [svadmin at s259830hz1sl01 gulm]$
>
> Thanks for any help that anyone can provide on this!
>
> Thanks!
> -Jon
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
Gruss / Regards,

Marc Grimme
Phone: +49-89 121 409-54
http://www.atix.de/               http://www.open-sharedroot.org/

**
ATIX - Ges. fuer Informationstechnologie und Consulting mbH
Einsteinstr. 10 - 85716 Unterschleissheim - Germany


From nick at sqrt.co.uk  Thu Mar  9 09:48:39 2006
From: nick at sqrt.co.uk (Nick Burrett)
Date: Thu, 09 Mar 2006 09:48:39 +0000
Subject: [Linux-cluster] RHEL4.0 CS and Ldap
In-Reply-To: <440F5E11.4040406@ultra-secure.de>
References: <EB190CD1E73E1146ACB7694746E205A801655E0C@hx1.ums.msfc.nasa.gov>	<1141857157.25169.153.camel@ayanami.boston.redhat.com>
	<440F5E11.4040406@ultra-secure.de>
Message-ID: <440FF9F7.2060500@sqrt.co.uk>

Rainer Duffner wrote:
> Lon Hohberger wrote:
> 
>> On Wed, 2006-03-08 at 12:43 -0600, Dalton, Maurice wrote:
>>  
>>
>>> Is there a way to create an Ldap cluster that can do replication with
>>> RHEL 4.0 CS?
>>>   
>>
>>
>> For some reason, I though OpenLDAP had built-in replication?
>>  
>>
> 
> 
> Not multi-master (which I assume is what the original poster wants).
> 
> http://www.redhat.com/en_us/USA/home/solutions/directoryserver/
> 
> exists for a reason...

As I understand it, multi-master can be accomplished in OpenLDAP 2.3 
using sync-replication between the two nodes.  This is not true 
multi-master, but the effect is near-enough.

Regards,

Nick.


From cjk at techma.com  Thu Mar  9 13:43:03 2006
From: cjk at techma.com (Kovacs, Corey J.)
Date: Thu, 9 Mar 2006 08:43:03 -0500
Subject: [Linux-cluster] RHEL4.0 CS and Ldap
Message-ID: <FF2CE0D593AEE34B955FEC77BD5AFBE0079E23@tmaemail.techma.com>

Multi master LDAP is not all that it's cracked up to be. There are few
benefits 
in being able to write to different servers. The problem is that like many
things,
the rule is "Last write wins". Writing in LDAP dirs is not something that is 
"typically" done in high volumes anyway, but reading is. If you just need to
load balance traffic, then you might look into an LVS implementation. That
way
you can set up several replicas and spread the load across them. If your
Master
gets blown away, you can promote of the replicas to a master.. Just takes
some
manual intervention.


Just my two cents...


Corey 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Nick Burrett
Sent: Thursday, March 09, 2006 4:49 AM
To: linux clustering
Subject: Re: [Linux-cluster] RHEL4.0 CS and Ldap

Rainer Duffner wrote:
> Lon Hohberger wrote:
> 
>> On Wed, 2006-03-08 at 12:43 -0600, Dalton, Maurice wrote:
>>  
>>
>>> Is there a way to create an Ldap cluster that can do replication 
>>> with RHEL 4.0 CS?
>>>   
>>
>>
>> For some reason, I though OpenLDAP had built-in replication?
>>  
>>
> 
> 
> Not multi-master (which I assume is what the original poster wants).
> 
> http://www.redhat.com/en_us/USA/home/solutions/directoryserver/
> 
> exists for a reason...

As I understand it, multi-master can be accomplished in OpenLDAP 2.3 using
sync-replication between the two nodes.  This is not true multi-master, but
the effect is near-enough.

Regards,

Nick.

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From baesso at ksolutions.it  Thu Mar  9 13:59:36 2006
From: baesso at ksolutions.it (Baesso Mirko)
Date: Thu, 9 Mar 2006 14:59:36 +0100
Subject: R: [Linux-cluster] RHEL4.0 CS and Ldap
Message-ID: <984C9DBB29704B47B7AAD308F2C95A3B04DEF4@kmail.ksolutions.it>

Sorry but i try to use sync-replica either then slurpd on openldap 2.3, can i setup a multi-master environment?


Baesso Mirko - System Engineer
KSolutions.S.p.A.
Via Lenin 132/26
56017  S.Martino Ulmiano (PI) - Italy
tel.+ 39 0 50 898369 fax. + 39 0 50 861200
baesso at ksolutions.it   http//www.ksolutions.it

-----Messaggio originale-----
Da: Kovacs, Corey J. [mailto:cjk at techma.com] 
Inviato: gioved? 9 marzo 2006 14.43
A: linux clustering
Oggetto: RE: [Linux-cluster] RHEL4.0 CS and Ldap

Multi master LDAP is not all that it's cracked up to be. There are few
benefits 
in being able to write to different servers. The problem is that like many
things,
the rule is "Last write wins". Writing in LDAP dirs is not something that is 
"typically" done in high volumes anyway, but reading is. If you just need to
load balance traffic, then you might look into an LVS implementation. That
way
you can set up several replicas and spread the load across them. If your
Master
gets blown away, you can promote of the replicas to a master.. Just takes
some
manual intervention.


Just my two cents...


Corey 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Nick Burrett
Sent: Thursday, March 09, 2006 4:49 AM
To: linux clustering
Subject: Re: [Linux-cluster] RHEL4.0 CS and Ldap

Rainer Duffner wrote:
> Lon Hohberger wrote:
> 
>> On Wed, 2006-03-08 at 12:43 -0600, Dalton, Maurice wrote:
>>  
>>
>>> Is there a way to create an Ldap cluster that can do replication 
>>> with RHEL 4.0 CS?
>>>   
>>
>>
>> For some reason, I though OpenLDAP had built-in replication?
>>  
>>
> 
> 
> Not multi-master (which I assume is what the original poster wants).
> 
> http://www.redhat.com/en_us/USA/home/solutions/directoryserver/
> 
> exists for a reason...

As I understand it, multi-master can be accomplished in OpenLDAP 2.3 using
sync-replication between the two nodes.  This is not true multi-master, but
the effect is near-enough.

Regards,

Nick.

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From cjk at techma.com  Thu Mar  9 14:07:34 2006
From: cjk at techma.com (Kovacs, Corey J.)
Date: Thu, 9 Mar 2006 09:07:34 -0500
Subject: [Linux-cluster] RHEL4.0 CS and Ldap
Message-ID: <FF2CE0D593AEE34B955FEC77BD5AFBE0079E24@tmaemail.techma.com>

You might be able to, but it's not really cluster related. You'll get
better information on the openldap lists.


Cheers.


Corey 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Baesso Mirko
Sent: Thursday, March 09, 2006 9:00 AM
To: linux clustering
Subject: R: [Linux-cluster] RHEL4.0 CS and Ldap

Sorry but i try to use sync-replica either then slurpd on openldap 2.3, can i
setup a multi-master environment?


Baesso Mirko - System Engineer
KSolutions.S.p.A.
Via Lenin 132/26
56017  S.Martino Ulmiano (PI) - Italy
tel.+ 39 0 50 898369 fax. + 39 0 50 861200
baesso at ksolutions.it   http//www.ksolutions.it

-----Messaggio originale-----
Da: Kovacs, Corey J. [mailto:cjk at techma.com]
Inviato: gioved? 9 marzo 2006 14.43
A: linux clustering
Oggetto: RE: [Linux-cluster] RHEL4.0 CS and Ldap

Multi master LDAP is not all that it's cracked up to be. There are few
benefits in being able to write to different servers. The problem is that
like many things, the rule is "Last write wins". Writing in LDAP dirs is not
something that is "typically" done in high volumes anyway, but reading is. If
you just need to load balance traffic, then you might look into an LVS
implementation. That way you can set up several replicas and spread the load
across them. If your Master gets blown away, you can promote of the replicas
to a master.. Just takes some manual intervention.


Just my two cents...


Corey 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Nick Burrett
Sent: Thursday, March 09, 2006 4:49 AM
To: linux clustering
Subject: Re: [Linux-cluster] RHEL4.0 CS and Ldap

Rainer Duffner wrote:
> Lon Hohberger wrote:
> 
>> On Wed, 2006-03-08 at 12:43 -0600, Dalton, Maurice wrote:
>>  
>>
>>> Is there a way to create an Ldap cluster that can do replication 
>>> with RHEL 4.0 CS?
>>>   
>>
>>
>> For some reason, I though OpenLDAP had built-in replication?
>>  
>>
> 
> 
> Not multi-master (which I assume is what the original poster wants).
> 
> http://www.redhat.com/en_us/USA/home/solutions/directoryserver/
> 
> exists for a reason...

As I understand it, multi-master can be accomplished in OpenLDAP 2.3 using
sync-replication between the two nodes.  This is not true multi-master, but
the effect is near-enough.

Regards,

Nick.

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From bobby.m.dalton at nasa.gov  Thu Mar  9 14:17:05 2006
From: bobby.m.dalton at nasa.gov (Dalton, Maurice)
Date: Thu, 9 Mar 2006 08:17:05 -0600
Subject: [Linux-cluster] RHEL4.0 CS and Ldap
Message-ID: <EB190CD1E73E1146ACB7694746E205A801655E1A@hx1.ums.msfc.nasa.gov>

I am currently using Heartbeat to solve my need for Highly Available Ldap servers. I was just trying to figure another way to provide HA, replication with RHEL4.0 CS.  It would be nice to build a RHEL4.0 CS service for ldap that would cover all of my requirements.

Just a thought.
Thanks for the replies...
 

-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Kovacs, Corey J.
Sent: Thursday, March 09, 2006 8:08 AM
To: linux clustering
Subject: RE: [Linux-cluster] RHEL4.0 CS and Ldap

You might be able to, but it's not really cluster related. You'll get better information on the openldap lists.


Cheers.


Corey 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Baesso Mirko
Sent: Thursday, March 09, 2006 9:00 AM
To: linux clustering
Subject: R: [Linux-cluster] RHEL4.0 CS and Ldap

Sorry but i try to use sync-replica either then slurpd on openldap 2.3, can i setup a multi-master environment?


Baesso Mirko - System Engineer
KSolutions.S.p.A.
Via Lenin 132/26
56017  S.Martino Ulmiano (PI) - Italy
tel.+ 39 0 50 898369 fax. + 39 0 50 861200
baesso at ksolutions.it   http//www.ksolutions.it

-----Messaggio originale-----
Da: Kovacs, Corey J. [mailto:cjk at techma.com]
Inviato: gioved? 9 marzo 2006 14.43
A: linux clustering
Oggetto: RE: [Linux-cluster] RHEL4.0 CS and Ldap

Multi master LDAP is not all that it's cracked up to be. There are few benefits in being able to write to different servers. The problem is that like many things, the rule is "Last write wins". Writing in LDAP dirs is not something that is "typically" done in high volumes anyway, but reading is. If you just need to load balance traffic, then you might look into an LVS implementation. That way you can set up several replicas and spread the load across them. If your Master gets blown away, you can promote of the replicas to a master.. Just takes some manual intervention.


Just my two cents...


Corey 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Nick Burrett
Sent: Thursday, March 09, 2006 4:49 AM
To: linux clustering
Subject: Re: [Linux-cluster] RHEL4.0 CS and Ldap

Rainer Duffner wrote:
> Lon Hohberger wrote:
> 
>> On Wed, 2006-03-08 at 12:43 -0600, Dalton, Maurice wrote:
>>  
>>
>>> Is there a way to create an Ldap cluster that can do replication 
>>> with RHEL 4.0 CS?
>>>   
>>
>>
>> For some reason, I though OpenLDAP had built-in replication?
>>  
>>
> 
> 
> Not multi-master (which I assume is what the original poster wants).
> 
> http://www.redhat.com/en_us/USA/home/solutions/directoryserver/
> 
> exists for a reason...

As I understand it, multi-master can be accomplished in OpenLDAP 2.3 using sync-replication between the two nodes.  This is not true multi-master, but the effect is near-enough.

Regards,

Nick.

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From cjk at techma.com  Thu Mar  9 14:24:55 2006
From: cjk at techma.com (Kovacs, Corey J.)
Date: Thu, 9 Mar 2006 09:24:55 -0500
Subject: [Linux-cluster] RHEL4.0 CS and Ldap
Message-ID: <FF2CE0D593AEE34B955FEC77BD5AFBE0079E25@tmaemail.techma.com>

Dalton, I figured as much and you could indeed use the CS portion to do what
Heartbeat
does. I do that same thing on RHEL3CS. It's the replication part that doesn't
fall into
the clusterring category, that's all. The cluster serverices (resource
manager) does, in
effect, what hearbeat does tho so you can either have a single instence
bounce aroung
from node to node as needed, or you could have a master + replicas all
running at the 
same time (with differnet data stores) and not have to worry about the
service failing
over, just flip IP address, which is what it sounds like you are doing
already. For better
performance tho, if that's needed, you'll prolly want the LVS (piranha)
solution.


Good luck


Corey 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Dalton, Maurice
Sent: Thursday, March 09, 2006 9:17 AM
To: linux clustering
Subject: RE: [Linux-cluster] RHEL4.0 CS and Ldap

I am currently using Heartbeat to solve my need for Highly Available Ldap
servers. I was just trying to figure another way to provide HA, replication
with RHEL4.0 CS.  It would be nice to build a RHEL4.0 CS service for ldap
that would cover all of my requirements.

Just a thought.
Thanks for the replies...
 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Kovacs, Corey J.
Sent: Thursday, March 09, 2006 8:08 AM
To: linux clustering
Subject: RE: [Linux-cluster] RHEL4.0 CS and Ldap

You might be able to, but it's not really cluster related. You'll get better
information on the openldap lists.


Cheers.


Corey 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Baesso Mirko
Sent: Thursday, March 09, 2006 9:00 AM
To: linux clustering
Subject: R: [Linux-cluster] RHEL4.0 CS and Ldap

Sorry but i try to use sync-replica either then slurpd on openldap 2.3, can i
setup a multi-master environment?


Baesso Mirko - System Engineer
KSolutions.S.p.A.
Via Lenin 132/26
56017  S.Martino Ulmiano (PI) - Italy
tel.+ 39 0 50 898369 fax. + 39 0 50 861200
baesso at ksolutions.it   http//www.ksolutions.it

-----Messaggio originale-----
Da: Kovacs, Corey J. [mailto:cjk at techma.com]
Inviato: gioved? 9 marzo 2006 14.43
A: linux clustering
Oggetto: RE: [Linux-cluster] RHEL4.0 CS and Ldap

Multi master LDAP is not all that it's cracked up to be. There are few
benefits in being able to write to different servers. The problem is that
like many things, the rule is "Last write wins". Writing in LDAP dirs is not
something that is "typically" done in high volumes anyway, but reading is. If
you just need to load balance traffic, then you might look into an LVS
implementation. That way you can set up several replicas and spread the load
across them. If your Master gets blown away, you can promote of the replicas
to a master.. Just takes some manual intervention.


Just my two cents...


Corey 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Nick Burrett
Sent: Thursday, March 09, 2006 4:49 AM
To: linux clustering
Subject: Re: [Linux-cluster] RHEL4.0 CS and Ldap

Rainer Duffner wrote:
> Lon Hohberger wrote:
> 
>> On Wed, 2006-03-08 at 12:43 -0600, Dalton, Maurice wrote:
>>  
>>
>>> Is there a way to create an Ldap cluster that can do replication 
>>> with RHEL 4.0 CS?
>>>   
>>
>>
>> For some reason, I though OpenLDAP had built-in replication?
>>  
>>
> 
> 
> Not multi-master (which I assume is what the original poster wants).
> 
> http://www.redhat.com/en_us/USA/home/solutions/directoryserver/
> 
> exists for a reason...

As I understand it, multi-master can be accomplished in OpenLDAP 2.3 using
sync-replication between the two nodes.  This is not true multi-master, but
the effect is near-enough.

Regards,

Nick.

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From bobby.m.dalton at nasa.gov  Thu Mar  9 14:33:18 2006
From: bobby.m.dalton at nasa.gov (Dalton, Maurice)
Date: Thu, 9 Mar 2006 08:33:18 -0600
Subject: [Linux-cluster] RHEL4.0 CS and Ldap
Message-ID: <EB190CD1E73E1146ACB7694746E205A801655E1E@hx1.ums.msfc.nasa.gov>

Thanks Corey.

That's exactly what I want.

At least 2 servers doing replication and the failover part is just moving the virtual IP to the slave server..


Thanks..
 

-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Kovacs, Corey J.
Sent: Thursday, March 09, 2006 8:25 AM
To: linux clustering
Subject: RE: [Linux-cluster] RHEL4.0 CS and Ldap

Dalton, I figured as much and you could indeed use the CS portion to do what Heartbeat does. I do that same thing on RHEL3CS. It's the replication part that doesn't fall into the clusterring category, that's all. The cluster serverices (resource
manager) does, in
effect, what hearbeat does tho so you can either have a single instence bounce aroung from node to node as needed, or you could have a master + replicas all running at the same time (with differnet data stores) and not have to worry about the service failing over, just flip IP address, which is what it sounds like you are doing already. For better performance tho, if that's needed, you'll prolly want the LVS (piranha) solution.


Good luck


Corey 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Dalton, Maurice
Sent: Thursday, March 09, 2006 9:17 AM
To: linux clustering
Subject: RE: [Linux-cluster] RHEL4.0 CS and Ldap

I am currently using Heartbeat to solve my need for Highly Available Ldap
servers. I was just trying to figure another way to provide HA, replication
with RHEL4.0 CS.  It would be nice to build a RHEL4.0 CS service for ldap
that would cover all of my requirements.

Just a thought.
Thanks for the replies...
 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Kovacs, Corey J.
Sent: Thursday, March 09, 2006 8:08 AM
To: linux clustering
Subject: RE: [Linux-cluster] RHEL4.0 CS and Ldap

You might be able to, but it's not really cluster related. You'll get better
information on the openldap lists.


Cheers.


Corey 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Baesso Mirko
Sent: Thursday, March 09, 2006 9:00 AM
To: linux clustering
Subject: R: [Linux-cluster] RHEL4.0 CS and Ldap

Sorry but i try to use sync-replica either then slurpd on openldap 2.3, can i
setup a multi-master environment?


Baesso Mirko - System Engineer
KSolutions.S.p.A.
Via Lenin 132/26
56017  S.Martino Ulmiano (PI) - Italy
tel.+ 39 0 50 898369 fax. + 39 0 50 861200
baesso at ksolutions.it   http//www.ksolutions.it

-----Messaggio originale-----
Da: Kovacs, Corey J. [mailto:cjk at techma.com]
Inviato: gioved? 9 marzo 2006 14.43
A: linux clustering
Oggetto: RE: [Linux-cluster] RHEL4.0 CS and Ldap

Multi master LDAP is not all that it's cracked up to be. There are few
benefits in being able to write to different servers. The problem is that
like many things, the rule is "Last write wins". Writing in LDAP dirs is not
something that is "typically" done in high volumes anyway, but reading is. If
you just need to load balance traffic, then you might look into an LVS
implementation. That way you can set up several replicas and spread the load
across them. If your Master gets blown away, you can promote of the replicas
to a master.. Just takes some manual intervention.


Just my two cents...


Corey 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Nick Burrett
Sent: Thursday, March 09, 2006 4:49 AM
To: linux clustering
Subject: Re: [Linux-cluster] RHEL4.0 CS and Ldap

Rainer Duffner wrote:
> Lon Hohberger wrote:
> 
>> On Wed, 2006-03-08 at 12:43 -0600, Dalton, Maurice wrote:
>>  
>>
>>> Is there a way to create an Ldap cluster that can do replication 
>>> with RHEL 4.0 CS?
>>>   
>>
>>
>> For some reason, I though OpenLDAP had built-in replication?
>>  
>>
> 
> 
> Not multi-master (which I assume is what the original poster wants).
> 
> http://www.redhat.com/en_us/USA/home/solutions/directoryserver/
> 
> exists for a reason...

As I understand it, multi-master can be accomplished in OpenLDAP 2.3 using
sync-replication between the two nodes.  This is not true multi-master, but
the effect is near-enough.

Regards,

Nick.

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From lhh at redhat.com  Thu Mar  9 14:35:37 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 09 Mar 2006 09:35:37 -0500
Subject: [Linux-cluster] Any recommentdations for Oracle on a Netapp
	filer ?
In-Reply-To: <20060309053824.82455.qmail@web52301.mail.yahoo.com>
References: <20060309053824.82455.qmail@web52301.mail.yahoo.com>
Message-ID: <1141914937.25169.220.camel@ayanami.boston.redhat.com>

On Wed, 2006-03-08 at 21:38 -0800, SUVANKAR MOITRA wrote:
> dear lon,
> 
> I am thankful to u .Its a great opertunity for me to
> take your guidence .I was sucessfully load the oracle
> in virtual ip and the script is runing .But one
> problem is there the script is not runing from cluster
> if i mension the path /usr/share/cluster/oracledb.sh
> start.When i want to run   the oracle i am using the
> following command :- ./oracledb.sh start or stop or
> status etc....
> How can i put the thing in cluster suite?
> Lon can i install oracle rac on rhcs4?Is it possible?
> Because oracle rac using ocfs.

The HOWTO was written to show how to do a single instance Oracle 10g R2
database failover configuration using RHCS4.

RAC is a *very* different operational model.

It has its own notion of membership and quorum, which is redundant with
RHCS.  Oracle 10g RAC should not need RHCS to work properly or fail over
- RAC should handle all of this for you if you have set it up correctly.

Basically, if you have both instances of RAC running, you're done.  You
don't need to make it work with RHCS at all...!

On a side note, I am surprised the oracledb.sh script started RAC
correctly, since it doesn't start ocssd or any of the other Oracle
Clusterware components...

-- Lon


From teigland at redhat.com  Thu Mar  9 14:43:51 2006
From: teigland at redhat.com (David Teigland)
Date: Thu, 9 Mar 2006 08:43:51 -0600
Subject: [Linux-cluster] gfs + nfsd crash
In-Reply-To: <440FCC8A.7040803@sara.nl>
References: <05AF3B6E-B88A-45CC-83E4-1353291E6347@sara.nl>
	<C273A968-DE83-451D-93E4-589F137E5F27@sara.nl>
	<440EF9BD.4030108@sara.nl> <440FCC8A.7040803@sara.nl>
Message-ID: <20060309144351.GA22258@redhat.com>

On Thu, Mar 09, 2006 at 07:34:50AM +0100, Bas van der Vlies wrote:
> I have checked the CVS source and found this Changelog below.
> Does this fix my GFS crashes with NFS? It describes my kind of
> problems.

Yes, we hope so, your problem looks very similar.

Dave


From basv at sara.nl  Thu Mar  9 15:00:48 2006
From: basv at sara.nl (Bas van der Vlies)
Date: Thu, 09 Mar 2006 16:00:48 +0100
Subject: [Linux-cluster] gfs + nfsd crash
In-Reply-To: <20060309144351.GA22258@redhat.com>
References: <05AF3B6E-B88A-45CC-83E4-1353291E6347@sara.nl>
	<C273A968-DE83-451D-93E4-589F137E5F27@sara.nl>
	<440EF9BD.4030108@sara.nl> <440FCC8A.7040803@sara.nl>
	<20060309144351.GA22258@redhat.com>
Message-ID: <44104320.7060404@sara.nl>

David Teigland wrote:
> On Thu, Mar 09, 2006 at 07:34:50AM +0100, Bas van der Vlies wrote:
>> I have checked the CVS source and found this Changelog below.
>> Does this fix my GFS crashes with NFS? It describes my kind of
>> problems.
> 
> Yes, we hope so, your problem looks very similar.
> 

I have installed the newest GFS version from cvs STABLE and did not 
encounter any nfsd crashes ;-). Just to inform the progress.


-- 
--
********************************************************************
*                                                                  *
*  Bas van der Vlies                     e-mail: basv at sara.nl      *
*  SARA - Academic Computing Services    phone:  +31 20 592 8012   *
*  Kruislaan 415                         fax:    +31 20 6683167    *
*  1098 SJ Amsterdam                                               *
*                                                                  *
********************************************************************


From hong.zheng at wsdtx.org  Thu Mar  9 17:02:54 2006
From: hong.zheng at wsdtx.org (Hong Zheng)
Date: Thu, 9 Mar 2006 11:02:54 -0600
Subject: [Linux-cluster] Cluster service restarting Locally
Message-ID: <E2FAC907AF594A43B22185B5253DFCA0F363@mx1.wsd.edu>

Lon,

Thanks for your reply. In my system I don't use any lock system like
lock_gulm or lock_dlm, I use no_lock because our applications'
limitation. Do you think no_lock will also bring some lock traffic or
not? When I tried lock_gulm before, our application had very bad
performance, so I choose no_lock.

And I'm not sure which update we have right now. Do you know the
versions for clumanager and redhat-config-cluster of RHCS3U7?

Hong

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lon Hohberger
Sent: Wednesday, March 08, 2006 4:52 PM
To: linux clustering
Subject: RE: [Linux-cluster] Cluster service restarting Locally

On Mon, 2006-03-06 at 14:02 -0600, Hong Zheng wrote:
> I'm having the same problem. My system configuration is as follows:
> 
> 2-node cluster: RH ES3, GFS6.0, clumanager-1.2.28-1 and
> redhat-config-cluster-1.0.8-1
> 
> Kernel: 2.4.21-37.EL
> 
> Linux-iscsi-3.6.3 initiator: connections to iSCSI shared storage
> server

If it's not fixed in U7 (which I think it should be), please file a
bugzilla... It sounds like the lock traffic is getting network-starved.

-- Lon


--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From kabobofpug at yahoo.com  Thu Mar  9 18:49:31 2006
From: kabobofpug at yahoo.com (paul raymond)
Date: Thu, 9 Mar 2006 10:49:31 -0800 (PST)
Subject: [Linux-cluster] Thank You! Trouble with RHCS 3.0
In-Reply-To: <1141858090.25169.166.camel@ayanami.boston.redhat.com>
Message-ID: <20060309184931.73576.qmail@web36108.mail.mud.yahoo.com>

Greetings Lon,
  
  Thank you very much in pointing me in the correct direct on this! I  thought it was a rawdevice issue, but it was network issue with several  wrong network settings! Yikes! 
  
  We are system testing the cluster now but still have one issue:  Sometimes when I stop Cluster Manager service on the member #0, with  this command: "service clumanager stop", the system reboots it's self  automatically! Do you have any clues why this happens? Thanks!
  
  Warm Regards,
  Paul 
  Linux/AIX/Windows System Admin

Lon Hohberger <lhh at redhat.com> wrote:  On Sun, 2006-03-05 at 22:22 -0800, paul raymond wrote:
> Greetings Lon,

Hi, sorry I'm late responding to this.

> The problem is that I can not get Quorum to start unless I run the
> command "cluforce"! But after viewing clustat commands on systems c11
> and c12, it looks like c11 and c12 cant see each other status due to
> some issue with the raw partitions I believe?

If you're using an IP tiebreaker, they won't be looking for each other
on the shared partitions.  The nodes communicate with each other
primarily over the network - if they don't see each other, they will not
form a quorum.

You can try this if you want more detailed information:

  # service clumanager stop (on both nodes)
  # clumembd -fd (on both nodes)

It will give you all sorts of information, but the most important one
you should be looking for is:

  [PID] info: Membership View #1:0x00000001

If you see both nodes, it will show 0x00000003 (it's a bitmap).  If the
nodes can't see each other over the network, they will show 1 or 2.  If
this happens, you should check your network configuration and
clumanager's settings - you might want to try using broadcast instead of
multicast, etc.


>  I am using a Mylex Fiber Channel Box with QLogic 2300  interface
> card! The raw devices are setup on a 2 mirror drives, Raid 1. Can you
> please shed any good ideas what might be wrong here? The vidals are
> below!

Note that for one node to *start* without the other when using an IP
tiebreaker, having to run 'cluforce' is the default behavior.  If you
wish to change this, please check the man page for the 'cluforce'
command and the 'cludb' command.

The IP tiebreaker is typically used to *maintain* a quorum after a node
failure, because there are certain network faults in which two nodes may
see the tiebreaker - but not each other.

-- Lon


---------------------------------
 Yahoo! Mail
 Use Photomail to share photos without annoying attachments.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060309/7850285c/attachment.htm>

From wcheng at redhat.com  Thu Mar  9 20:32:56 2006
From: wcheng at redhat.com (Wendy Cheng)
Date: Thu, 09 Mar 2006 15:32:56 -0500
Subject: [Linux-cluster] GFS load average and locking
In-Reply-To: <200603090948.03134.grimme@atix.de>
References: <9A6FE0FCC2B29846824C5CD81C6647B90143604F@s228130hz1ew08.apptix-01.savvis.net>
	<200603090948.03134.grimme@atix.de>
Message-ID: <441090F8.4050308@redhat.com>

Marc Grimme wrote:

>Although the strace does not show the output I know of the problem description 
>sounds like a deja vu.
>We had loads of problems with having sessions on GFS and httpd s ending up 
>with "D" state for some time (at high load times we had ServerLimit httpd in 
>D per node which ended up in the service not being available). 
>As I posted already we think it is because of the "bad" locking of sessions 
>with php (as php sessions are on gfs and strace showed those timeouts with 
>the session files). When you issue a "session_start" or what ever that 
>function is called, the session_file is locked via an flock syscall. That 
>lock is held until you end the session which is implicitly done when the tcp 
>connection to the client is ended. Now comes another http process (on 
>whatever node) and calls a "session start" and trys an flock on that session 
>while another process already holds that lock. The process might end up in 
>the seen timeouts (30-60secs) which (as far as I remember relates to the 
>timeout of the tcp connection defined in the httpd.conf or some timeout in 
>the php.ini) - there is an explanation on this but I cannot rember ;-) ). 
>Nevertheless in our scenario the problems were the "bad" session handling by 
>php. We have made a patch for the phplib where you can disable the locking, 
>or just implicitly do locking and therefore keep consitency while session 
>data is read or written. We could make apache work as expected and now we 
>don't see any "D" process anymore since a year.
>Oh yes the patch can be found at
>www.opensharedroot.org in the download section.
>
>Besides: You will never encounter this on a localfilesystem or nfs (as nfs 
>ignores flocks). As nfs does not support flocks and silently ignores them.
>
>  
>
Hi,

This does look like the problem description sent out by savvis.net folks 
during our off-list email exchanges. However, without actually looking 
at the thread traces (when they are in D state), it is difficult to be 
sure. One way to obtain the exact thread trace is using "crash" tool to 
do a back trace (e.g. "bt <pid>", you need kernel debuginfo RPM though). 
Britt, do let us know whether this php patch helps and/or using crash 
command to obtain the thread trace output.

On the other hand, I don't understand how a local (non-cluster) 
filesystem can be immune from this problem ?

-- Wendy


From tekion at gmail.com  Thu Mar  9 20:40:28 2006
From: tekion at gmail.com (Screaming Eagle)
Date: Thu, 9 Mar 2006 15:40:28 -0500
Subject: [Linux-cluster] GFS and extend attribute ...
Message-ID: <ee9c961f0603091240i774aac8ex35a17fdd19765948@mail.gmail.com>

Hi,
I am running GFS with Coraid. I tried using extended attribute on GFS, but
it err out with (using setfacl )message: "Operation not supported". Does
anyone know for sure that GFS does not support extended attribute options?
Thanks.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060309/d1c81901/attachment.htm>

From erling.nygaard at gmail.com  Thu Mar  9 21:52:14 2006
From: erling.nygaard at gmail.com (Erling Nygaard)
Date: Thu, 9 Mar 2006 22:52:14 +0100
Subject: [Linux-cluster] Cluster service restarting Locally
In-Reply-To: <E2FAC907AF594A43B22185B5253DFCA0F363@mx1.wsd.edu>
References: <E2FAC907AF594A43B22185B5253DFCA0F363@mx1.wsd.edu>
Message-ID: <adb721b40603091352n3ae9640by6ffd8aae23e2307f@mail.gmail.com>

I am sorry if this sounds a little harsh, but I'm not sure if laughing
or crying is the correct reaction to this email.

Let us get one thing straight.
You are currently mounting a GFS filesystem _concurrently_ on multiple
nodes using lock_nolock?

If this is the case I can tell you that this will _not_ work. You
_will_ corrupt your filesystem.

Mounting a GFS filesystem with lock_nolock for all practical purposes
turns the GFS filesystem into a local filesystem. There is _no_
locking done anymore.
With this setup there is no longer any coordination done among the
nodes to control the filesystem access, so they are all going to step
on each others toes.
You might as well use ext3, the end result will be the same ;-)

The purpose of lock_nolock is to (temporarily) be able to mount a GFS
filesystem on a single node in such cases where the entire locking
infrastructure is unavailable. (Something like a massive cluster
failure)

So you should really look into setting up one of the lock services :-)

E.


On 3/9/06, Hong Zheng <hong.zheng at wsdtx.org> wrote:
> Lon,
>
> Thanks for your reply. In my system I don't use any lock system like
> lock_gulm or lock_dlm, I use no_lock because our applications'
> limitation. Do you think no_lock will also bring some lock traffic or
> not? When I tried lock_gulm before, our application had very bad
> performance, so I choose no_lock.
>
> And I'm not sure which update we have right now. Do you know the
> versions for clumanager and redhat-config-cluster of RHCS3U7?
>
> Hong
>
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lon Hohberger
> Sent: Wednesday, March 08, 2006 4:52 PM
> To: linux clustering
> Subject: RE: [Linux-cluster] Cluster service restarting Locally
>
> On Mon, 2006-03-06 at 14:02 -0600, Hong Zheng wrote:
> > I'm having the same problem. My system configuration is as follows:
> >
> > 2-node cluster: RH ES3, GFS6.0, clumanager-1.2.28-1 and
> > redhat-config-cluster-1.0.8-1
> >
> > Kernel: 2.4.21-37.EL
> >
> > Linux-iscsi-3.6.3 initiator: connections to iSCSI shared storage
> > server
>
> If it's not fixed in U7 (which I think it should be), please file a
> bugzilla... It sounds like the lock traffic is getting network-starved.
>
> -- Lon
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


--
-
Mac OS X. Because making Unix user-friendly is easier than debugging Windows


From hong.zheng at wsdtx.org  Thu Mar  9 22:26:07 2006
From: hong.zheng at wsdtx.org (Hong Zheng)
Date: Thu, 9 Mar 2006 16:26:07 -0600
Subject: [Linux-cluster] Cluster service restarting Locally
Message-ID: <E2FAC907AF594A43B22185B5253DFCA0F365@mx1.wsd.edu>

I understand no_lock won't work for multiple nodes, so I never mount GFS
w/ no_lock to multiple nodes, our cluster is two-node active-passive
cluster. So every time only active node has GFS mount. I could use iSCSI
disk only, but just want to test if GFS has better performance than
iSCSI.

Hong

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Erling Nygaard
Sent: Thursday, March 09, 2006 3:52 PM
To: linux clustering
Subject: Re: [Linux-cluster] Cluster service restarting Locally

I am sorry if this sounds a little harsh, but I'm not sure if laughing
or crying is the correct reaction to this email.

Let us get one thing straight.
You are currently mounting a GFS filesystem _concurrently_ on multiple
nodes using lock_nolock?

If this is the case I can tell you that this will _not_ work. You
_will_ corrupt your filesystem.

Mounting a GFS filesystem with lock_nolock for all practical purposes
turns the GFS filesystem into a local filesystem. There is _no_
locking done anymore.
With this setup there is no longer any coordination done among the
nodes to control the filesystem access, so they are all going to step
on each others toes.
You might as well use ext3, the end result will be the same ;-)

The purpose of lock_nolock is to (temporarily) be able to mount a GFS
filesystem on a single node in such cases where the entire locking
infrastructure is unavailable. (Something like a massive cluster
failure)

So you should really look into setting up one of the lock services :-)

E.


On 3/9/06, Hong Zheng <hong.zheng at wsdtx.org> wrote:
> Lon,
>
> Thanks for your reply. In my system I don't use any lock system like
> lock_gulm or lock_dlm, I use no_lock because our applications'
> limitation. Do you think no_lock will also bring some lock traffic or
> not? When I tried lock_gulm before, our application had very bad
> performance, so I choose no_lock.
>
> And I'm not sure which update we have right now. Do you know the
> versions for clumanager and redhat-config-cluster of RHCS3U7?
>
> Hong
>
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lon Hohberger
> Sent: Wednesday, March 08, 2006 4:52 PM
> To: linux clustering
> Subject: RE: [Linux-cluster] Cluster service restarting Locally
>
> On Mon, 2006-03-06 at 14:02 -0600, Hong Zheng wrote:
> > I'm having the same problem. My system configuration is as follows:
> >
> > 2-node cluster: RH ES3, GFS6.0, clumanager-1.2.28-1 and
> > redhat-config-cluster-1.0.8-1
> >
> > Kernel: 2.4.21-37.EL
> >
> > Linux-iscsi-3.6.3 initiator: connections to iSCSI shared storage
> > server
>
> If it's not fixed in U7 (which I think it should be), please file a
> bugzilla... It sounds like the lock traffic is getting
network-starved.
>
> -- Lon
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


--
-
Mac OS X. Because making Unix user-friendly is easier than debugging
Windows

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From erling.nygaard at gmail.com  Thu Mar  9 22:34:29 2006
From: erling.nygaard at gmail.com (Erling Nygaard)
Date: Thu, 9 Mar 2006 23:34:29 +0100
Subject: [Linux-cluster] Cluster service restarting Locally
In-Reply-To: <E2FAC907AF594A43B22185B5253DFCA0F365@mx1.wsd.edu>
References: <E2FAC907AF594A43B22185B5253DFCA0F365@mx1.wsd.edu>
Message-ID: <adb721b40603091434u5acdcca8k1a6386ac8466d81e@mail.gmail.com>

oh, thats good to hear :-)
Multiple lock_nolock nodes would be... interesting...

However, you are saying you want to compare the performance of GFS
with the performance of iSCSI.
GFS is a filesystem, iSCSI is a block level device.
May I ask how you intend to "compare" the performance of the two?

Erling

On 3/9/06, Hong Zheng <hong.zheng at wsdtx.org> wrote:
> I understand no_lock won't work for multiple nodes, so I never mount GFS
> w/ no_lock to multiple nodes, our cluster is two-node active-passive
> cluster. So every time only active node has GFS mount. I could use iSCSI
> disk only, but just want to test if GFS has better performance than
> iSCSI.
>
> Hong
>
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Erling Nygaard
> Sent: Thursday, March 09, 2006 3:52 PM
> To: linux clustering
> Subject: Re: [Linux-cluster] Cluster service restarting Locally
>
> I am sorry if this sounds a little harsh, but I'm not sure if laughing
> or crying is the correct reaction to this email.
>
> Let us get one thing straight.
> You are currently mounting a GFS filesystem _concurrently_ on multiple
> nodes using lock_nolock?
>
> If this is the case I can tell you that this will _not_ work. You
> _will_ corrupt your filesystem.
>
> Mounting a GFS filesystem with lock_nolock for all practical purposes
> turns the GFS filesystem into a local filesystem. There is _no_
> locking done anymore.
> With this setup there is no longer any coordination done among the
> nodes to control the filesystem access, so they are all going to step
> on each others toes.
> You might as well use ext3, the end result will be the same ;-)
>
> The purpose of lock_nolock is to (temporarily) be able to mount a GFS
> filesystem on a single node in such cases where the entire locking
> infrastructure is unavailable. (Something like a massive cluster
> failure)
>
> So you should really look into setting up one of the lock services :-)
>
> E.
>
>
>
>
>
>
> On 3/9/06, Hong Zheng <hong.zheng at wsdtx.org> wrote:
> > Lon,
> >
> > Thanks for your reply. In my system I don't use any lock system like
> > lock_gulm or lock_dlm, I use no_lock because our applications'
> > limitation. Do you think no_lock will also bring some lock traffic or
> > not? When I tried lock_gulm before, our application had very bad
> > performance, so I choose no_lock.
> >
> > And I'm not sure which update we have right now. Do you know the
> > versions for clumanager and redhat-config-cluster of RHCS3U7?
> >
> > Hong
> >
> > -----Original Message-----
> > From: linux-cluster-bounces at redhat.com
> > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lon Hohberger
> > Sent: Wednesday, March 08, 2006 4:52 PM
> > To: linux clustering
> > Subject: RE: [Linux-cluster] Cluster service restarting Locally
> >
> > On Mon, 2006-03-06 at 14:02 -0600, Hong Zheng wrote:
> > > I'm having the same problem. My system configuration is as follows:
> > >
> > > 2-node cluster: RH ES3, GFS6.0, clumanager-1.2.28-1 and
> > > redhat-config-cluster-1.0.8-1
> > >
> > > Kernel: 2.4.21-37.EL
> > >
> > > Linux-iscsi-3.6.3 initiator: connections to iSCSI shared storage
> > > server
> >
> > If it's not fixed in U7 (which I think it should be), please file a
> > bugzilla... It sounds like the lock traffic is getting
> network-starved.
> >
> > -- Lon
> >
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
>
>
> --
> -
> Mac OS X. Because making Unix user-friendly is easier than debugging
> Windows
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


--
-
Mac OS X. Because making Unix user-friendly is easier than debugging Windows


From hong.zheng at wsdtx.org  Thu Mar  9 22:45:26 2006
From: hong.zheng at wsdtx.org (Hong Zheng)
Date: Thu, 9 Mar 2006 16:45:26 -0600
Subject: [Linux-cluster] Cluster service restarting Locally
Message-ID: <E2FAC907AF594A43B22185B5253DFCA0F366@mx1.wsd.edu>

We have iSCSI external storage server and on the cluster node we use
software initiator connect to iSCSI target. One way is to format that
iSCSI disk to ext3, another test is to format it to GFS filesystem. I
thought ext3 should be better than GFS, but the benchmark result shows
GFS is better. That's what we are testing for.


-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Erling Nygaard
Sent: Thursday, March 09, 2006 4:34 PM
To: linux clustering
Subject: Re: [Linux-cluster] Cluster service restarting Locally

oh, thats good to hear :-)
Multiple lock_nolock nodes would be... interesting...

However, you are saying you want to compare the performance of GFS
with the performance of iSCSI.
GFS is a filesystem, iSCSI is a block level device.
May I ask how you intend to "compare" the performance of the two?

Erling

On 3/9/06, Hong Zheng <hong.zheng at wsdtx.org> wrote:
> I understand no_lock won't work for multiple nodes, so I never mount
GFS
> w/ no_lock to multiple nodes, our cluster is two-node active-passive
> cluster. So every time only active node has GFS mount. I could use
iSCSI
> disk only, but just want to test if GFS has better performance than
> iSCSI.
>
> Hong
>
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Erling Nygaard
> Sent: Thursday, March 09, 2006 3:52 PM
> To: linux clustering
> Subject: Re: [Linux-cluster] Cluster service restarting Locally
>
> I am sorry if this sounds a little harsh, but I'm not sure if laughing
> or crying is the correct reaction to this email.
>
> Let us get one thing straight.
> You are currently mounting a GFS filesystem _concurrently_ on multiple
> nodes using lock_nolock?
>
> If this is the case I can tell you that this will _not_ work. You
> _will_ corrupt your filesystem.
>
> Mounting a GFS filesystem with lock_nolock for all practical purposes
> turns the GFS filesystem into a local filesystem. There is _no_
> locking done anymore.
> With this setup there is no longer any coordination done among the
> nodes to control the filesystem access, so they are all going to step
> on each others toes.
> You might as well use ext3, the end result will be the same ;-)
>
> The purpose of lock_nolock is to (temporarily) be able to mount a GFS
> filesystem on a single node in such cases where the entire locking
> infrastructure is unavailable. (Something like a massive cluster
> failure)
>
> So you should really look into setting up one of the lock services :-)
>
> E.
>
>
>
>
>
>
> On 3/9/06, Hong Zheng <hong.zheng at wsdtx.org> wrote:
> > Lon,
> >
> > Thanks for your reply. In my system I don't use any lock system like
> > lock_gulm or lock_dlm, I use no_lock because our applications'
> > limitation. Do you think no_lock will also bring some lock traffic
or
> > not? When I tried lock_gulm before, our application had very bad
> > performance, so I choose no_lock.
> >
> > And I'm not sure which update we have right now. Do you know the
> > versions for clumanager and redhat-config-cluster of RHCS3U7?
> >
> > Hong
> >
> > -----Original Message-----
> > From: linux-cluster-bounces at redhat.com
> > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lon Hohberger
> > Sent: Wednesday, March 08, 2006 4:52 PM
> > To: linux clustering
> > Subject: RE: [Linux-cluster] Cluster service restarting Locally
> >
> > On Mon, 2006-03-06 at 14:02 -0600, Hong Zheng wrote:
> > > I'm having the same problem. My system configuration is as
follows:
> > >
> > > 2-node cluster: RH ES3, GFS6.0, clumanager-1.2.28-1 and
> > > redhat-config-cluster-1.0.8-1
> > >
> > > Kernel: 2.4.21-37.EL
> > >
> > > Linux-iscsi-3.6.3 initiator: connections to iSCSI shared storage
> > > server
> >
> > If it's not fixed in U7 (which I think it should be), please file a
> > bugzilla... It sounds like the lock traffic is getting
> network-starved.
> >
> > -- Lon
> >
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
>
>
> --
> -
> Mac OS X. Because making Unix user-friendly is easier than debugging
> Windows
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


--
-
Mac OS X. Because making Unix user-friendly is easier than debugging
Windows

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From Britt.Treece at savvis.net  Thu Mar  9 23:04:59 2006
From: Britt.Treece at savvis.net (Treece, Britt)
Date: Thu, 9 Mar 2006 17:04:59 -0600
Subject: [Linux-cluster] GFS load average and locking
Message-ID: <9A6FE0FCC2B29846824C5CD81C6647B9014836E9@s228130hz1ew08.apptix-01.savvis.net>

Wendy,

Unfortunately our customer has (for the time being) moved their PHP
sessions off of the GFS filesystem because of the instability.  Our GFS
performance has returned to normal, but our customer expects us to fix
GFS so that they can have the PHP sessions on GFS.  I'm *attempting* to
reproduce the behavior on a lab GFS cluster.  Assuming I can
successfully do this I will send strace's of the issue as it occurs.

Is Redhat aware of any issues with GFS and flock syscalls?  

Regarding the U7 kernel suggestion you made previously, is this going to
help with the flock issue or is it strictly for keeping the number of
cached locks down?

Britt


-----Original Message-----
From: Wendy Cheng [mailto:wcheng at redhat.com] 
Sent: Thursday, March 09, 2006 2:33 PM
To: linux clustering
Cc: Stanley, Jon; Treece, Britt
Subject: Re: [Linux-cluster] GFS load average and locking

Marc Grimme wrote:

>Although the strace does not show the output I know of the problem
description 
>sounds like a deja vu.
>We had loads of problems with having sessions on GFS and httpd s ending
up 
>with "D" state for some time (at high load times we had ServerLimit
httpd in 
>D per node which ended up in the service not being available). 
>As I posted already we think it is because of the "bad" locking of
sessions 
>with php (as php sessions are on gfs and strace showed those timeouts
with 
>the session files). When you issue a "session_start" or what ever that 
>function is called, the session_file is locked via an flock syscall.
That 
>lock is held until you end the session which is implicitly done when
the tcp 
>connection to the client is ended. Now comes another http process (on 
>whatever node) and calls a "session start" and trys an flock on that
session 
>while another process already holds that lock. The process might end up
in 
>the seen timeouts (30-60secs) which (as far as I remember relates to
the 
>timeout of the tcp connection defined in the httpd.conf or some timeout
in 
>the php.ini) - there is an explanation on this but I cannot rember ;-)
). 
>Nevertheless in our scenario the problems were the "bad" session
handling by 
>php. We have made a patch for the phplib where you can disable the
locking, 
>or just implicitly do locking and therefore keep consitency while
session 
>data is read or written. We could make apache work as expected and now
we 
>don't see any "D" process anymore since a year.
>Oh yes the patch can be found at
>www.opensharedroot.org in the download section.
>
>Besides: You will never encounter this on a localfilesystem or nfs (as
nfs 
>ignores flocks). As nfs does not support flocks and silently ignores
them.
>
>  
>
Hi,

This does look like the problem description sent out by savvis.net folks

during our off-list email exchanges. However, without actually looking 
at the thread traces (when they are in D state), it is difficult to be 
sure. One way to obtain the exact thread trace is using "crash" tool to 
do a back trace (e.g. "bt <pid>", you need kernel debuginfo RPM though).

Britt, do let us know whether this php patch helps and/or using crash 
command to obtain the thread trace output.

On the other hand, I don't understand how a local (non-cluster) 
filesystem can be immune from this problem ?

-- Wendy


From lhh at redhat.com  Thu Mar  9 23:09:16 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 09 Mar 2006 18:09:16 -0500
Subject: [Linux-cluster] Cluster service restarting Locally
In-Reply-To: <E2FAC907AF594A43B22185B5253DFCA0F363@mx1.wsd.edu>
References: <E2FAC907AF594A43B22185B5253DFCA0F363@mx1.wsd.edu>
Message-ID: <1141945756.25169.292.camel@ayanami.boston.redhat.com>

On Thu, 2006-03-09 at 11:02 -0600, Hong Zheng wrote:
> Lon,
> 
> Thanks for your reply. In my system I don't use any lock system like
> lock_gulm or lock_dlm, I use no_lock because our applications'
> limitation. Do you think no_lock will also bring some lock traffic or
> not? 

No, but if you mount the file system on more than one node, say "good
bye" to your data.

> When I tried lock_gulm before, our application had very bad
> performance, so I choose no_lock.
> 
> And I'm not sure which update we have right now. Do you know the
> versions for clumanager and redhat-config-cluster of RHCS3U7?

1.2.28-1 is U6.  U7 will be out soon.  You can contact Red Hat Support
if you want an earlier version.

Another way to make things work a little better for you is to separate
the cluster communication path from the iSCSI path so they're not
contending for the same network.

-- Lon


From wcheng at redhat.com  Thu Mar  9 23:21:39 2006
From: wcheng at redhat.com (Wendy Cheng)
Date: Thu, 09 Mar 2006 18:21:39 -0500
Subject: [Linux-cluster] GFS load average and locking
In-Reply-To: <9A6FE0FCC2B29846824C5CD81C6647B9014836E9@s228130hz1ew08.apptix-01.savvis.net>
References: <9A6FE0FCC2B29846824C5CD81C6647B9014836E9@s228130hz1ew08.apptix-01.savvis.net>
Message-ID: <4410B883.30608@redhat.com>

Treece, Britt wrote:

>Wendy,
>
>Unfortunately our customer has (for the time being) moved their PHP
>sessions off of the GFS filesystem because of the instability.  Our GFS
>performance has returned to normal, but our customer expects us to fix
>GFS so that they can have the PHP sessions on GFS.  I'm *attempting* to
>reproduce the behavior on a lab GFS cluster.  Assuming I can
>successfully do this I will send strace's of the issue as it occurs.
>  
>

So this problem doesn't show up in local filesystem ? Is it ext3 ? Also 
I prefer thread back trace in kernel mode (sysrq-t and/or crash output) 
to strace - since thread kernel back trace can really show where it gets 
stuck. If you plan to recreate this in your lab, turn the fencing off 
(make heart beat interval very long) so we can get a decent sysrq-t output.

>Is Redhat aware of any issues with GFS and flock syscalls?  
>  
>

Will check but I don't recall such issues from top of my head.

>Regarding the U7 kernel suggestion you made previously, is this going to
>help with the flock issue or is it strictly for keeping the number of
>cached locks down?
>
>  
>
The new tuning parameters added into U7 do help with several lock 
latency issues. Based on your lockspace output, I strongly believe they 
can help. However, they can't do much if the bottleneck of your 
customer's application is in flock as described in previous post.

-- Wendy


From wcheng at redhat.com  Fri Mar 10 03:30:02 2006
From: wcheng at redhat.com (Wendy Cheng)
Date: Thu, 09 Mar 2006 22:30:02 -0500
Subject: [Linux-cluster] GFS load average and locking
In-Reply-To: <9A6FE0FCC2B29846824C5CD81C6647B9014836E9@s228130hz1ew08.apptix-01.savvis.net>
References: <9A6FE0FCC2B29846824C5CD81C6647B9014836E9@s228130hz1ew08.apptix-01.savvis.net>
Message-ID: <1141961403.3409.18.camel@localhost.localdomain>

On Thu, 2006-03-09 at 17:04 -0600, Treece, Britt wrote:

> Is Redhat aware of any issues with GFS and flock syscalls?  

Just checked kernel source and got a rough idea what could go wrong. In
RHEL 3 (linux 2.4 based) kernel, flock has the following logic:

1. lock_kernel (Big Kernel Lock - BKL)
2. call filesystem-specific supplemental lock
3. handle linux vfs flock
4. unlock_kernel

There are two issues here:

* performance

Step 2 is a noop for most of the local filesystems (e.g. ext3) and the
code path of step 3 is relatively short. So you won't see much impacts
of BKL. For GFS, if step 2 is run concurrently (as in other cases such
as read, write, etc), it is reasonably "fast" unless you need the lock
for the very same file and/or the lock network traffic is congested.
However, adding BKL on top of that would have a big impact - it
virtually serializes *every* flock attempt. 

* deadlock

I'm a little bit fuzzy how Linux's BKL is implemented. In theory, the
above sequence would get into deadlock (unless when process goes to
sleep, it'll drop BKL), regardless whether step 2 is a noop or not. Will
ask our base kernel folks about this.

In any case, I think we need to remove that BKL if we can. At the mean
time, to work around this issue, you have to either:

* use previous mentioned PHP patch to turn off flock if you can; or
* get GFS U7 RPMs where we have two tuning parameters that could speed
up the lock process. However, I don't have quantitative data at this
moment to know how effective they'll be in this kind of situation.


-- Wendy


From pcaulfie at redhat.com  Fri Mar 10 08:51:32 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Fri, 10 Mar 2006 08:51:32 +0000
Subject: [Linux-cluster] missing services
In-Reply-To: <440F1CE7.2010104@arnet.net.ar>
References: <440F1CE7.2010104@arnet.net.ar>
Message-ID: <44113E14.9030802@redhat.com>

German Staltari wrote:
> Hi, we have a 6 node cluster, each one mounts 6 GFS partitions. When I
> ask for the services to cman, there is always a mount point missing. Is
> this correct?
> FC 4
> kernel-smp-2.6.15-1.1831_FC4
> dlm-kernel-smp-2.6.11.5-20050601.152643.FC4.21
> GFS-kernel-smp-2.6.11.8-20050601.152643.FC4.24
> cman-kernel-smp-2.6.11.5-20050601.152643.FC4.22
> 
> TIA
> German Staltari
> 
> # df -h
> Filesystem            Size  Used Avail Use% Mounted on
> /dev/sda1              59G  2.4G   54G   5% /
> /dev/shm              2.0G     0  2.0G   0% /dev/shm
> /dev/mapper/vg1-store1    399G  184K  399G   1% /store/1
> /dev/mapper/vg2-store2    399G  2.8M  399G   1% /store/2
> /dev/mapper/vg3-store3    399G  180K  399G   1% /store/3
> /dev/mapper/vg4-store4    399G  180K  399G   1% /store/4
> /dev/mapper/vg5-store5    399G  180K  399G   1% /store/5
> /dev/mapper/vg6-store6    399G  180K  399G   1% /store/6
> 
> # cman_tool services
> Service          Name                              GID LID State     Code
> Fence Domain:    "default"                           1   2 run       -
> [1 3]
> DLM Lock Space:  "clvmd"                             7   3 run       -
> [1 4 3]
> DLM Lock Space:  "mailstore01"                      20   4 run       -
> [1 3]
> DLM Lock Space:  "mailstore02"                      22   6 run       -
> [1 3]
> DLM Lock Space:  "mailstore03"                      24   8 run       -
> [1 3]
> DLM Lock Space:  "mailstore04"                      26  10 run       -
> [1 3]
> DLM Lock Space:  "mailstore05"                      28  12 run       -
> [1 3]
> DLM Lock Space:  "mailstore06"                      30  14 run       -
> [1 3]
> GFS Mount Group: "mailstore01"                      21   5 run       -
> [1 3]
> GFS Mount Group: "mailstore02"                      23   7 run       -
> [1 3]
> GFS Mount Group: "mailstore03"                      25   9 run       -
> [1 3]
> GFS Mount Group: "mailstore04"                      27  11 run       -
> [1 3]
> GFS Mount Group: "mailstore05"                      29  13 run       -
> [1 3]
> 

It's possible this is a (now fixed) bug in the /proc code.  See
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=175372

Does "cat /proc/cluster/services" show the same thing ?

-- 

patrick


From magobin at gmail.com  Fri Mar 10 09:38:22 2006
From: magobin at gmail.com (Alex aka Magobin)
Date: Fri, 10 Mar 2006 10:38:22 +0100
Subject: [Linux-cluster] Strange behaviour of the services in cluster!!
In-Reply-To: <1141961403.3409.18.camel@localhost.localdomain>
References: <9A6FE0FCC2B29846824C5CD81C6647B9014836E9@s228130hz1ew08.apptix-01.savvis.net>
	<1141961403.3409.18.camel@localhost.localdomain>
Message-ID: <1141983502.9580.19.camel@localhost.localdomain>

Hi, I configured as first Service in cluster a DNS....it works fine and
from cluster console I can move service from serverA to ServerB without
problem.
According with documentation I've configured Apache...exactly !
The problem is that with 2 services in cluster I'm not able to switch
they from server to another server anymore...Console say thats this is
an error...but there isn't any error in /var/log/messages..only a
warning #70

NOTICE that if I disabled a service and then restart to other server,
services run correctly...(both services), but if I want to switch on the
fly...doesn't work!...It seems that Dns hung up !

any help is greatly appreciated!

Alex

below...tail -40 /var/log/messages trying to start DNS from serverB to
ServerA while apache is running on serverA:


Mar 10 10:24:57 nodo2 clurgmgrd: [2569]: <info>
Executing /etc/init.d/named status 
Mar 10 10:25:14 nodo2 clurgmgrd[2569]: <notice> Stopping service dns 
Mar 10 10:25:14 nodo2 clurgmgrd: [2569]: <info>
Executing /etc/init.d/named stop 
Mar 10 10:25:14 nodo2 named[13926]: shutting down: flushing changes
Mar 10 10:25:14 nodo2 named:  succeeded
Mar 10 10:25:14 nodo2 named[13926]: stopping command channel on
127.0.0.1#953
Mar 10 10:25:14 nodo2 named[13926]: no longer listening on 127.0.0.1#53
Mar 10 10:25:14 nodo2 named[13926]: no longer listening on
10.23.5.253#53
Mar 10 10:25:14 nodo2 named[13926]: no longer listening on
10.23.5.240#53
Mar 10 10:25:14 nodo2 named[13926]: no longer listening on
192.168.254.3#53
Mar 10 10:25:14 nodo2 named[13926]: exiting
Mar 10 10:25:14 nodo2 clurgmgrd: [2569]: <info> Removing IPv4 address
10.23.5.240 from eth0 
Mar 10 10:25:24 nodo2 clurgmgrd: [2569]: <info> unmounting san:/SAN/DNS
(/var/named) 
Mar 10 10:25:24 nodo2 clurgmgrd[2569]: <notice> Service dns is stopped 
Mar 10 10:25:24 nodo2 clurgmgrd[2569]: <warning> #70: Attempting to
restart service dns locally. 
Mar 10 10:25:24 nodo2 clurgmgrd[2569]: <notice> Starting stopped service
dns 
Mar 10 10:25:24 nodo2 clurgmgrd: [2569]: <info> Adding IPv4 address
10.23.5.240 to eth0 
Mar 10 10:25:25 nodo2 clurgmgrd: [2569]: <info>
Executing /etc/init.d/named start 
Mar 10 10:25:25 nodo2 named: Avvio named succeeded
Mar 10 10:25:25 nodo2 named[14145]: starting BIND 9.2.4 -u named
-t /var/named/chroot
Mar 10 10:25:25 nodo2 named[14145]: using 1 CPU
Mar 10 10:25:25 nodo2 named[14145]: loading configuration from
'/etc/named.conf'
Mar 10 10:25:25 nodo2 named[14145]: listening on IPv4 interface lo,
127.0.0.1#53
Mar 10 10:25:25 nodo2 named[14145]: listening on IPv4 interface eth0,
10.23.5.253#53
Mar 10 10:25:25 nodo2 named[14145]: listening on IPv4 interface eth0,
10.23.5.240#53
Mar 10 10:25:25 nodo2 named[14145]: listening on IPv4 interface eth1,
192.168.254.3#53
Mar 10 10:25:25 nodo2 clurgmgrd[2569]: <notice> Service dns started 
Mar 10 10:25:25 nodo2 named[14145]: command channel listening on
127.0.0.1#953
Mar 10 10:25:25 nodo2 named[14145]: zone 5.23.10.in-addr.arpa/IN: loaded
serial 199609206
Mar 10 10:25:25 nodo2 named[14145]: zone 0.0.127.in-addr.arpa/IN: loaded
serial 199609206
Mar 10 10:25:25 nodo2 named[14145]: zone linux.testing/IN: loaded serial
199609206
Mar 10 10:25:25 nodo2 named[14145]: running
Mar 10 10:25:56 nodo2 clurgmgrd: [2569]: <info>
Executing /etc/init.d/named status 
Mar 10 10:26:01 nodo2 crond(pam_unix)[14196]: session opened for user
root by (uid=0)
Mar 10 10:26:03 nodo2 crond(pam_unix)[14196]: session closed for user
root
Mar 10 10:26:26 nodo2 clurgmgrd: [2569]: <info>
Executing /etc/init.d/named status 
Mar 10 10:26:57 nodo2 clurgmgrd: [2569]: <info>
Executing /etc/init.d/named status 
Mar 10 10:27:58 nodo2 last message repeated 2 times
Mar 10 10:28:01 nodo2 crond(pam_unix)[14526]: session opened for user
root by (uid=0)
Mar 10 10:28:03 nodo2 crond(pam_unix)[14526]: session closed for user
root


From magobin at gmail.com  Fri Mar 10 09:45:21 2006
From: magobin at gmail.com (Alex aka Magobin)
Date: Fri, 10 Mar 2006 10:45:21 +0100
Subject: [Linux-cluster] Strange behaviour of the services in cluster!!
Message-ID: <1141983922.9580.21.camel@localhost.localdomain>

Hi, I configured as first Service in cluster a DNS....it works fine and
from cluster console I can move service from serverA to ServerB without
problem.
According with documentation I've configured Apache...exactly !
The problem is that with 2 services in cluster I'm not able to switch
they from server to another server anymore...Console say thats this is
an error...but there isn't any error in /var/log/messages..only a
warning #70

NOTICE that if I disabled a service and then restart to other server,
services run correctly...(both services), but if I want to switch on the
fly...doesn't work!...It seems that Dns hung up !

any help is greatly appreciated!

Alex

below...tail -40 /var/log/messages trying to start DNS from serverB to
ServerA while apache is running on serverA:


Mar 10 10:24:57 nodo2 clurgmgrd: [2569]: <info>
Executing /etc/init.d/named status 
Mar 10 10:25:14 nodo2 clurgmgrd[2569]: <notice> Stopping service dns 
Mar 10 10:25:14 nodo2 clurgmgrd: [2569]: <info>
Executing /etc/init.d/named stop 
Mar 10 10:25:14 nodo2 named[13926]: shutting down: flushing changes
Mar 10 10:25:14 nodo2 named:  succeeded
Mar 10 10:25:14 nodo2 named[13926]: stopping command channel on
127.0.0.1#953
Mar 10 10:25:14 nodo2 named[13926]: no longer listening on 127.0.0.1#53
Mar 10 10:25:14 nodo2 named[13926]: no longer listening on
10.23.5.253#53
Mar 10 10:25:14 nodo2 named[13926]: no longer listening on
10.23.5.240#53
Mar 10 10:25:14 nodo2 named[13926]: no longer listening on
192.168.254.3#53
Mar 10 10:25:14 nodo2 named[13926]: exiting
Mar 10 10:25:14 nodo2 clurgmgrd: [2569]: <info> Removing IPv4 address
10.23.5.240 from eth0 
Mar 10 10:25:24 nodo2 clurgmgrd: [2569]: <info> unmounting san:/SAN/DNS
(/var/named) 
Mar 10 10:25:24 nodo2 clurgmgrd[2569]: <notice> Service dns is stopped 
Mar 10 10:25:24 nodo2 clurgmgrd[2569]: <warning> #70: Attempting to
restart service dns locally. 
Mar 10 10:25:24 nodo2 clurgmgrd[2569]: <notice> Starting stopped service
dns 
Mar 10 10:25:24 nodo2 clurgmgrd: [2569]: <info> Adding IPv4 address
10.23.5.240 to eth0 
Mar 10 10:25:25 nodo2 clurgmgrd: [2569]: <info>
Executing /etc/init.d/named start 
Mar 10 10:25:25 nodo2 named: Avvio named succeeded
Mar 10 10:25:25 nodo2 named[14145]: starting BIND 9.2.4 -u named
-t /var/named/chroot
Mar 10 10:25:25 nodo2 named[14145]: using 1 CPU
Mar 10 10:25:25 nodo2 named[14145]: loading configuration from
'/etc/named.conf'
Mar 10 10:25:25 nodo2 named[14145]: listening on IPv4 interface lo,
127.0.0.1#53
Mar 10 10:25:25 nodo2 named[14145]: listening on IPv4 interface eth0,
10.23.5.253#53
Mar 10 10:25:25 nodo2 named[14145]: listening on IPv4 interface eth0,
10.23.5.240#53
Mar 10 10:25:25 nodo2 named[14145]: listening on IPv4 interface eth1,
192.168.254.3#53
Mar 10 10:25:25 nodo2 clurgmgrd[2569]: <notice> Service dns started 
Mar 10 10:25:25 nodo2 named[14145]: command channel listening on
127.0.0.1#953
Mar 10 10:25:25 nodo2 named[14145]: zone 5.23.10.in-addr.arpa/IN: loaded
serial 199609206
Mar 10 10:25:25 nodo2 named[14145]: zone 0.0.127.in-addr.arpa/IN: loaded
serial 199609206
Mar 10 10:25:25 nodo2 named[14145]: zone linux.testing/IN: loaded serial
199609206
Mar 10 10:25:25 nodo2 named[14145]: running
Mar 10 10:25:56 nodo2 clurgmgrd: [2569]: <info>
Executing /etc/init.d/named status 
Mar 10 10:26:01 nodo2 crond(pam_unix)[14196]: session opened for user
root by (uid=0)
Mar 10 10:26:03 nodo2 crond(pam_unix)[14196]: session closed for user
root
Mar 10 10:26:26 nodo2 clurgmgrd: [2569]: <info>
Executing /etc/init.d/named status 
Mar 10 10:26:57 nodo2 clurgmgrd: [2569]: <info>
Executing /etc/init.d/named status 
Mar 10 10:27:58 nodo2 last message repeated 2 times
Mar 10 10:28:01 nodo2 crond(pam_unix)[14526]: session opened for user
root by (uid=0)
Mar 10 10:28:03 nodo2 crond(pam_unix)[14526]: session closed for user
root


From adingman at cookgroup.com  Fri Mar 10 13:04:57 2006
From: adingman at cookgroup.com (Andrew C. Dingman)
Date: Fri, 10 Mar 2006 08:04:57 -0500
Subject: [Linux-cluster] Any recommentdations for Oracle on a Netapp
	filer ?
In-Reply-To: <1141914937.25169.220.camel@ayanami.boston.redhat.com>
References: <20060309053824.82455.qmail@web52301.mail.yahoo.com>
	<1141914937.25169.220.camel@ayanami.boston.redhat.com>
Message-ID: <1141995897.23733.2.camel@adingman.cin.cook>

On Thu, 2006-03-09 at 09:35 -0500, Lon Hohberger wrote:
> On a side note, I am surprised the oracledb.sh script started RAC
> correctly, since it doesn't start ocssd or any of the other Oracle
> Clusterware components...

Oracle's RAC installation puts the cluster system into /etc/inittab.
They're probably already running when the shell script is called.


-- 
Andrew C. Dingman
Unix Administrator
Cook Incorporated
(812)339-2235 x2131
adingman at cookgroup.com


From hong.zheng at wsdtx.org  Fri Mar 10 13:37:40 2006
From: hong.zheng at wsdtx.org (Hong Zheng)
Date: Fri, 10 Mar 2006 07:37:40 -0600
Subject: [Linux-cluster] Cluster service restarting Locally
Message-ID: <E2FAC907AF594A43B22185B5253DFCA0F367@mx1.wsd.edu>

We didn't mount multiple nodes to that file system and also the iSCSI
channel is in a separate subnet.


-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lon Hohberger
Sent: Thursday, March 09, 2006 5:09 PM
To: linux clustering
Subject: RE: [Linux-cluster] Cluster service restarting Locally

On Thu, 2006-03-09 at 11:02 -0600, Hong Zheng wrote:
> Lon,
> 
> Thanks for your reply. In my system I don't use any lock system like
> lock_gulm or lock_dlm, I use no_lock because our applications'
> limitation. Do you think no_lock will also bring some lock traffic or
> not? 

No, but if you mount the file system on more than one node, say "good
bye" to your data.

> When I tried lock_gulm before, our application had very bad
> performance, so I choose no_lock.
> 
> And I'm not sure which update we have right now. Do you know the
> versions for clumanager and redhat-config-cluster of RHCS3U7?

1.2.28-1 is U6.  U7 will be out soon.  You can contact Red Hat Support
if you want an earlier version.

Another way to make things work a little better for you is to separate
the cluster communication path from the iSCSI path so they're not
contending for the same network.

-- Lon

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From Britt.Treece at savvis.net  Fri Mar 10 14:25:48 2006
From: Britt.Treece at savvis.net (Treece, Britt)
Date: Fri, 10 Mar 2006 08:25:48 -0600
Subject: [Linux-cluster] GFS load average and locking
Message-ID: <9A6FE0FCC2B29846824C5CD81C6647B901483935@s228130hz1ew08.apptix-01.savvis.net>

Wendy,

Did the sysrq-t's that I sent illustrate this problem further?  I'm
hoping that they corroborate the situation that you described below.

Britt


-----Original Message-----
From: Wendy Cheng [mailto:wcheng at redhat.com] 
Sent: Thursday, March 09, 2006 9:30 PM
To: Treece, Britt
Cc: linux clustering; Stanley, Jon
Subject: RE: [Linux-cluster] GFS load average and locking

On Thu, 2006-03-09 at 17:04 -0600, Treece, Britt wrote:

> Is Redhat aware of any issues with GFS and flock syscalls?  

Just checked kernel source and got a rough idea what could go wrong. In
RHEL 3 (linux 2.4 based) kernel, flock has the following logic:

1. lock_kernel (Big Kernel Lock - BKL)
2. call filesystem-specific supplemental lock
3. handle linux vfs flock
4. unlock_kernel

There are two issues here:

* performance

Step 2 is a noop for most of the local filesystems (e.g. ext3) and the
code path of step 3 is relatively short. So you won't see much impacts
of BKL. For GFS, if step 2 is run concurrently (as in other cases such
as read, write, etc), it is reasonably "fast" unless you need the lock
for the very same file and/or the lock network traffic is congested.
However, adding BKL on top of that would have a big impact - it
virtually serializes *every* flock attempt. 

* deadlock

I'm a little bit fuzzy how Linux's BKL is implemented. In theory, the
above sequence would get into deadlock (unless when process goes to
sleep, it'll drop BKL), regardless whether step 2 is a noop or not. Will
ask our base kernel folks about this.

In any case, I think we need to remove that BKL if we can. At the mean
time, to work around this issue, you have to either:

* use previous mentioned PHP patch to turn off flock if you can; or
* get GFS U7 RPMs where we have two tuning parameters that could speed
up the lock process. However, I don't have quantitative data at this
moment to know how effective they'll be in this kind of situation.


-- Wendy


From wcheng at redhat.com  Fri Mar 10 15:18:10 2006
From: wcheng at redhat.com (Wendy Cheng)
Date: Fri, 10 Mar 2006 10:18:10 -0500
Subject: [Linux-cluster] GFS load average and locking
In-Reply-To: <9A6FE0FCC2B29846824C5CD81C6647B901483935@s228130hz1ew08.apptix-01.savvis.net>
References: <9A6FE0FCC2B29846824C5CD81C6647B901483935@s228130hz1ew08.apptix-01.savvis.net>
Message-ID: <441198B2.1000209@redhat.com>

Treece, Britt wrote:

>Wendy,
>
>Did the sysrq-t's that I sent illustrate this problem further?  I'm
>hoping that they corroborate the situation that you described below.
>
>  
>
Funny thing is that the sysrq-t shows the symptom we addressed in RHEL3 
U7 - so look to me like a combination of serveral issues.

I believe a conf. call via support has been scheduled. Let's discuss 
this off-list.

-- Wendy


From gstaltari at arnet.net.ar  Fri Mar 10 15:50:14 2006
From: gstaltari at arnet.net.ar (German Staltari)
Date: Fri, 10 Mar 2006 12:50:14 -0300
Subject: [Linux-cluster] missing services
In-Reply-To: <44113E14.9030802@redhat.com>
References: <440F1CE7.2010104@arnet.net.ar> <44113E14.9030802@redhat.com>
Message-ID: <4411A036.3040200@arnet.net.ar>

Patrick Caulfield wrote:
> German Staltari wrote:
>   
>> Hi, we have a 6 node cluster, each one mounts 6 GFS partitions. When I
>> ask for the services to cman, there is always a mount point missing. Is
>> this correct?
>> FC 4
>> kernel-smp-2.6.15-1.1831_FC4
>> dlm-kernel-smp-2.6.11.5-20050601.152643.FC4.21
>> GFS-kernel-smp-2.6.11.8-20050601.152643.FC4.24
>> cman-kernel-smp-2.6.11.5-20050601.152643.FC4.22
>>
>> TIA
>> German Staltari
>>
>> # df -h
>> Filesystem            Size  Used Avail Use% Mounted on
>> /dev/sda1              59G  2.4G   54G   5% /
>> /dev/shm              2.0G     0  2.0G   0% /dev/shm
>> /dev/mapper/vg1-store1    399G  184K  399G   1% /store/1
>> /dev/mapper/vg2-store2    399G  2.8M  399G   1% /store/2
>> /dev/mapper/vg3-store3    399G  180K  399G   1% /store/3
>> /dev/mapper/vg4-store4    399G  180K  399G   1% /store/4
>> /dev/mapper/vg5-store5    399G  180K  399G   1% /store/5
>> /dev/mapper/vg6-store6    399G  180K  399G   1% /store/6
>>
>> # cman_tool services
>> Service          Name                              GID LID State     Code
>> Fence Domain:    "default"                           1   2 run       -
>> [1 3]
>> DLM Lock Space:  "clvmd"                             7   3 run       -
>> [1 4 3]
>> DLM Lock Space:  "mailstore01"                      20   4 run       -
>> [1 3]
>> DLM Lock Space:  "mailstore02"                      22   6 run       -
>> [1 3]
>> DLM Lock Space:  "mailstore03"                      24   8 run       -
>> [1 3]
>> DLM Lock Space:  "mailstore04"                      26  10 run       -
>> [1 3]
>> DLM Lock Space:  "mailstore05"                      28  12 run       -
>> [1 3]
>> DLM Lock Space:  "mailstore06"                      30  14 run       -
>> [1 3]
>> GFS Mount Group: "mailstore01"                      21   5 run       -
>> [1 3]
>> GFS Mount Group: "mailstore02"                      23   7 run       -
>> [1 3]
>> GFS Mount Group: "mailstore03"                      25   9 run       -
>> [1 3]
>> GFS Mount Group: "mailstore04"                      27  11 run       -
>> [1 3]
>> GFS Mount Group: "mailstore05"                      29  13 run       -
>> [1 3]
>>
>>     
>
> It's possible this is a (now fixed) bug in the /proc code.  See
> https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=175372
>
> Does "cat /proc/cluster/services" show the same thing ?
>
>   
Yes it does. 
Is there a way to make updated packages for the FC4 cluster/gfs rpm's 
based on the RHEL4 rpm's?
I'm asking this because I see some bugs fixes, that definitely need to 
be in the FC4 rpm's.
Thanks
German


From kanderso at redhat.com  Fri Mar 10 18:14:30 2006
From: kanderso at redhat.com (Kevin Anderson)
Date: Fri, 10 Mar 2006 12:14:30 -0600
Subject: [Linux-cluster] missing services
In-Reply-To: <4411A036.3040200@arnet.net.ar>
References: <440F1CE7.2010104@arnet.net.ar> <44113E14.9030802@redhat.com>
	<4411A036.3040200@arnet.net.ar>
Message-ID: <1142014470.2932.28.camel@localhost.localdomain>

On Fri, 2006-03-10 at 12:50 -0300, German Staltari wrote:

> >
> >   
> Yes it does. 
> Is there a way to make updated packages for the FC4 cluster/gfs rpm's 
> based on the RHEL4 rpm's?
> I'm asking this because I see some bugs fixes, that definitely need to 
> be in the FC4 rpm's.

Which fixes?  We try to make sure that everything that gets checked into
RHEL4 also goes under the STABLE tag which is what FC4 gets built.  If
we are missing something, then we need to get the source base fixed.

Kevin


From epeelea at gmail.com  Fri Mar 10 19:01:34 2006
From: epeelea at gmail.com (Daniel EPEE LEA)
Date: Fri, 10 Mar 2006 11:01:34 -0800
Subject: [Linux-cluster] Help: Cannot mount GFS partition in cluster
Message-ID: <df22854e0603101101o55064d2dj29380a86c9ca19f1@mail.gmail.com>

Hello,

After installing application on one of my 2 node cluster, the system
won't mount the GFS partition automatically as usual.

- clustat shows both nodes are still in the cluster,
- clvmd started correctly on both nodes, but no luch in mounting the partition
- lvscan gives this error
lvscan: symbol lookup error: /usr/lib/liblvm2clusterlock.so: undefined
symbol: malloc_aux

How can I get out of this ?

Waiting for answers.

Best Regards
--
--------------------------
Daniel Epee Lea


From e.tano at palazzochigi.it  Fri Mar 10 19:12:48 2006
From: e.tano at palazzochigi.it (Tano Enzo)
Date: Fri, 10 Mar 2006 20:12:48 +0100
Subject: [Linux-cluster] Resource Shared
Message-ID: <6C5FB8EA05488B44B62544668AB4EE9C1BD870@PCM-EXCH-VIRT2.pcm.it>


Hi,

I have a cluster with 2 nodes RHEL 4 U3, my cluster not use GFS, I have
a ECM2 shared storage. I have some shared resource: ip address and file
system. Can I use the shared resource in my service more then one times?
For example I have a file system /mnt/san/web where more then one
services access to it. I have created 2 service Test and Test2 which use
the only file system as resource, but only Test use it, Test2 ignore it.

When Test2 become enable, it not mount the file system. Why?

Thanks for help
Enzo


_______________________________________________________
Messaggio analizzato e protetto da tecnologia antivirus

Servizio erogato dal sistema informativo della
Presidenza del Consiglio dei Ministri


From cjkovacs at verizon.net  Fri Mar 10 22:28:47 2006
From: cjkovacs at verizon.net (Corey Kovacs)
Date: Fri, 10 Mar 2006 17:28:47 -0500
Subject: [Linux-cluster] GFS load average and locking
In-Reply-To: <441198B2.1000209@redhat.com>
References: <9A6FE0FCC2B29846824C5CD81C6647B901483935@s228130hz1ew08.apptix-01.savvis.net>
	<441198B2.1000209@redhat.com>
Message-ID: <1142029727.28510.0.camel@ronin.home.net>

Is there a target date for the release of RHELu7? I'm expecting
it will be soon since RHEL4u3 was just released. Any solid date?


Regards


Corey

On Fri, 2006-03-10 at 10:18 -0500, Wendy Cheng wrote:
> Treece, Britt wrote:
> 
> >Wendy,
> >
> >Did the sysrq-t's that I sent illustrate this problem further?  I'm
> >hoping that they corroborate the situation that you described below.
> >
> >  
> >
> Funny thing is that the sysrq-t shows the symptom we addressed in RHEL3 
> U7 - so look to me like a combination of serveral issues.
> 
> I believe a conf. call via support has been scheduled. Let's discuss 
> this off-list.
> 
> -- Wendy
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From brilong at cisco.com  Sat Mar 11 01:55:30 2006
From: brilong at cisco.com (Brian Long)
Date: Fri, 10 Mar 2006 20:55:30 -0500
Subject: [Linux-cluster] GFS load average and locking
In-Reply-To: <1142029727.28510.0.camel@ronin.home.net>
References: <9A6FE0FCC2B29846824C5CD81C6647B901483935@s228130hz1ew08.apptix-01.savvis.net>	<441198B2.1000209@redhat.com>
	<1142029727.28510.0.camel@ronin.home.net>
Message-ID: <44122E12.2030704@cisco.com>

Corey Kovacs wrote:

>Is there a target date for the release of RHELu7? I'm expecting
>it will be soon since RHEL4u3 was just released. Any solid date?
>
>  
>
I was told next week.  It's been pushed out a few times since it was 
originally due around 3/1.

/Brian/


From saju8 at rediffmail.com  Sat Mar 11 10:50:07 2006
From: saju8 at rediffmail.com (saju  john)
Date: 11 Mar 2006 10:50:07 -0000
Subject: [Linux-cluster] Cluster service restarting Locally
Message-ID: <20060311105007.27577.qmail@webmail50.rediffmail.com>

  
Dear Mr. Hohberger,

Thanx for the replay.

I saw your comments for the problem I reported. ie lock traffic is getting network-starved.

But I think differently. Because when I stop clumanager on one of the node, the frequency of service restart is very very less compared to that was earlier when clumanager is running on both nodes .My assumption is that, the problem is due to some curruption of meta data information writing to the quroum partition ,as both nodes writing to quroum cuncurrently. May be due to bug in the rawdeivce driver.I am not sure.Then interesting question is ,how the cluster worked all these days(for me around one year with out any major problem).

Could you pelase consider this also when releasing the RHCS3U7.


Thank You,
Saju John
Linux System Administrator,
Thuraya Satellite Telicommunications Company
UAE,Sharjah

On Thu, 09 Mar 2006 Lon Hohberger wrote :
>On Mon, 2006-03-06 at 06:47 +0000, saju john wrote:
> >
> >
> > Dear All,
> >
> > I have a 2 node cluster with RHAS3 update 3.
> > Kernel : 2.4.21-20.Elsmp
> > Clumanager : clumanager-1.2.16-1
> >
> > For more than a year everyting had been fine. Suddenly it started
> > showing the follwing and restarted the service locally
> >
> > clusvcmgrd[1388]: <err> Unable to obtain cluster lock: Connection
> > timed out
> > clulockd[1378]: <warning> Denied A.B.C.D: Broken pipe
> > clulockd[1378]: <err> select error: Broken pipe
> > clusvcmgrd: [1625]: <notice> service notice: Stopping service
> > postgresql ...
> > clusvcmgrd: [1625]: <notice> service notice: Running user script
> > '/etc/init.d/postgresql stop'
> > clusvcmgrd: [1625]: <notice> service notice: Stopped service
> > postgresql
> > clusvcmgrd: [1625]: <notice> service notice: Starting service
> > postgresql ...
> > clusvcmgrd: [1625]: <notice> service notice: Running user script
> > '/etc/init.d/postgresql start'
> > clusvcmgrd: [1625]: <notice> service notice: Started service
> > postgresql ...
>
>It should be fixed in RHCS3U7
>
>-- Lon
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060311/da3e91af/attachment.htm>

From alban.crequy at seanodes.com  Mon Mar 13 09:46:40 2006
From: alban.crequy at seanodes.com (Alban Crequy)
Date: Mon, 13 Mar 2006 10:46:40 +0100
Subject: [Linux-cluster] GFS locks granularity (DLM or GULM)
Message-ID: <44153F80.2060302@seanodes.com>

Hello,

What is the locking granularity in GFS? Can GFS do range locks? Is the 
granularity of DLM different than the GULM one?

The only explanation I found are:

    ?Locking in GFS is closely tied to physical storage. Earlier versions of 
GFS [21] required locking to be implemented at the disk device via 
extensions to the SCSI protocol. Newer versions allow the use of an external 
distributed lock manager, but still lock individual disk blocks of 4kB or 
8kB size. Therefore, accessing large files in GFS entails significantly more 
locking overhead than the byte-range locks used in GPFS.?
    http://www.broadcastpapers.com/asset/IBMGPFS07.htm

But maybe this is outdated?

Other doc:

    ?GFS has a couple pf locks for each file. (one for data, one for meta
data, one for iopen counts. maybe others, don't recall off the top of my
head.)  Directories get a lock, as well as most of the interal
structures.  So more-or-less gfs locks at the file level.  (note that
this is not the same or similar to fcntl locking, nor is it compatible.)?
    http://www.redhat.com/archives/linux-cluster/2005-June/msg00016.html

-- 
Alban


From Alain.Moulle at bull.net  Mon Mar 13 10:38:36 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Mon, 13 Mar 2006 11:38:36 +0100
Subject: [Linux-cluster] CS4 behavior on killall -9
Message-ID: <44154BAC.1070007@bull.net>

Hi
On a HA pair in mutual takeover, it seems that if we do a "killall -9" on one
node, there is no failover, the CS4 seems to be stalled .
Any reason ? idea ?

Thanks
Alain
-- 


mailto:Alain.Moulle at bull.net
+------------------------------+--------------------------------+
|	Alain Moull?	       	| from France :	04 76 29 75 99  |
|                              	| FAX number  : 04 76 29 72 49  |
| Bull SA		       	|				|
| 1, Rue de Provence  		| Adr  : FREC B1-041            |
| B.P. 208			|				|
| 38432 Echirolles - CEDEX     	| Email: Alain.Moulle at bull.net  |
| France                       	| BCOM : 229 7599               |
+-------------------------------+-------------------------------+


From magobin at gmail.com  Mon Mar 13 12:12:18 2006
From: magobin at gmail.com (Alessandro Binarelli)
Date: Mon, 13 Mar 2006 13:12:18 +0100
Subject: [Linux-cluster] Where to study Cluster suite ??
Message-ID: <108b923c0603130412q610e2908q@mail.gmail.com>

Hi,
I 've some problem (basic problem) with cluster suite that is not cover from
documentation...so I would to know if there are some site that explain step
by step a  basic installation and configuration .

As I said in my previous message I configured dns and http service but once
installed I'm not able to move services from serverA to serverB...only if I
disabled it and restart on other server....plus....I try to disable ethernet
card on serverB and I thought that service moves to other server
automatically when it see server died...but doesnt' work for me..

So I think that there is some steps that I have to know before trying to
configure cluster in HA....is there some site that explain Redhat Cluster
Suite?

Thanks in advance
Alex
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060313/445d7fef/attachment.htm>

From updatemyself at gmail.com  Mon Mar 13 13:00:23 2006
From: updatemyself at gmail.com (updatemyself .)
Date: Mon, 13 Mar 2006 18:30:23 +0530
Subject: [Linux-cluster] Few Doubts About "GFS + ISCSI with Multipathing And
	NIC Bonding"
Message-ID: <ab5b05b20603130500y7b04db71qe4ed1a11b32f35ab@mail.gmail.com>

Hai All,

I have few questions to ask..
i already have a setup of GPFS Cluster on SAN with mulipathing (total 12 TB
Volumes)
And planning to go for a Another One With GFS + ISCSI with Multipathing And
NIC Bonding

So my doubt are about
1, Multipathing
2, NIC Bonding
3, Whats the Option for ISCSI Multipath same as RDAC in SAN
4, Comparison of GPFS on SAN and GFS with ISCSI (which is better )
5, Is it needed to go for Redhat AS or Fedora Core 4 is enough?


Thank You In Advance,
jerrynikky
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060313/052b2260/attachment.htm>

From s.bridgwater at sinergy.it  Mon Mar 13 13:27:48 2006
From: s.bridgwater at sinergy.it (Simon Bridgwater)
Date: Mon, 13 Mar 2006 14:27:48 +0100
Subject: [Linux-cluster] problem with vip addresses
Message-ID: <faca485121e93e458e87c186ff0e3084@sinergy.it>

I seem to have a problem with the vip's in a bonding interface
disappearing. At first I thought it was a problem with the vsftpd
service but when I closely monitored the vip with the command "ip addr
show bond0" I sometimes see that the virtual ip's disappear and then
reappear after a few seconds. What could be causing this problem ? Is it
an operating system problem or a cluster problem. Could it be caused by
a malconfigured service ? (I have to put two scripts which depend on
each other into a single service). 
I have tried upgrading the cluster suite (with corresponding kernel-smp-
2.6.9-22.0.2) to the most updated version but it is still giving me this
error. I have two e1000 NICS configured in bonding in active-backup with
miimon=100. Could it be a problem with bonding or the miimon parameter ?

Simon Bridgwater
Sinergy Srl 


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060313/a4e5dbf3/attachment.htm>

From brilong at cisco.com  Mon Mar 13 13:36:33 2006
From: brilong at cisco.com (Brian Long)
Date: Mon, 13 Mar 2006 08:36:33 -0500
Subject: [Linux-cluster] Few Doubts About "GFS + ISCSI with
	Multipathing And NIC Bonding"
In-Reply-To: <ab5b05b20603130500y7b04db71qe4ed1a11b32f35ab@mail.gmail.com>
References: <ab5b05b20603130500y7b04db71qe4ed1a11b32f35ab@mail.gmail.com>
Message-ID: <1142256993.4566.6.camel@brilong-lnx>

On Mon, 2006-03-13 at 18:30 +0530, updatemyself . wrote:
> Hai All,
> 
> I have few questions to ask..
> i already have a setup of GPFS Cluster on SAN with mulipathing (total
> 12 TB Volumes)
> And planning to go for a Another One With GFS + ISCSI with
> Multipathing And NIC Bonding 
> 
> So my doubt are about
> 1, Multipathing
> 2, NIC Bonding
> 3, Whats the Option for ISCSI Multipath same as RDAC in SAN
> 4, Comparison of GPFS on SAN and GFS with ISCSI (which is better )
> 5, Is it needed to go for Redhat AS or Fedora Core 4 is enough? 

Jerrynikky,

I can answer #5 easily.  Do you require Enterprise-level support for
your implementation or are you just setting this up to play around?  If
you require Enterprise support with a vendor's throat to choke when
something dies, you absolutely need to pursue Red Hat AS or ES.

/Brian/
-- 
       Brian Long                      |         |           |
       IT Data Center Systems          |       .|||.       .|||.
       Cisco Linux Developer           |   ..:|||||||:...:|||||||:..
       Phone: (919) 392-7363           |   C i s c o   S y s t e m s


From updatemyself at gmail.com  Mon Mar 13 14:05:05 2006
From: updatemyself at gmail.com (updatemyself .)
Date: Mon, 13 Mar 2006 19:35:05 +0530
Subject: [Linux-cluster] Few Doubts About "GFS + ISCSI with Multipathing
	And NIC Bonding"
In-Reply-To: <1142256993.4566.6.camel@brilong-lnx>
References: <ab5b05b20603130500y7b04db71qe4ed1a11b32f35ab@mail.gmail.com>
	<1142256993.4566.6.camel@brilong-lnx>
Message-ID: <ab5b05b20603130605s63454411ld919c4fbd09fff5d@mail.gmail.com>

Thank You Brian,

That i know i already having 5 Enterprise Licence
i mean only about modules.. and stability...

who can help me.. to get all other information...

Yhanks a lot..
Jerrynikky.

On 3/13/06, Brian Long <brilong at cisco.com> wrote:
>
> On Mon, 2006-03-13 at 18:30 +0530, updatemyself . wrote:
> > Hai All,
> >
> > I have few questions to ask..
> > i already have a setup of GPFS Cluster on SAN with mulipathing (total
> > 12 TB Volumes)
> > And planning to go for a Another One With GFS + ISCSI with
> > Multipathing And NIC Bonding
> >
> > So my doubt are about
> > 1, Multipathing
> > 2, NIC Bonding
> > 3, Whats the Option for ISCSI Multipath same as RDAC in SAN
> > 4, Comparison of GPFS on SAN and GFS with ISCSI (which is better )
> > 5, Is it needed to go for Redhat AS or Fedora Core 4 is enough?
>
> Jerrynikky,
>
> I can answer #5 easily.  Do you require Enterprise-level support for
> your implementation or are you just setting this up to play around?  If
> you require Enterprise support with a vendor's throat to choke when
> something dies, you absolutely need to pursue Red Hat AS or ES.
>
> /Brian/
> --
>        Brian Long                      |         |           |
>        IT Data Center Systems          |       .|||.       .|||.
>        Cisco Linux Developer           |   ..:|||||||:...:|||||||:..
>        Phone: (919) 392-7363           |   C i s c o   S y s t e m s
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060313/7de7e741/attachment.htm>

From lhh at redhat.com  Mon Mar 13 16:38:07 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 13 Mar 2006 11:38:07 -0500
Subject: [Linux-cluster] Resource Shared
In-Reply-To: <6C5FB8EA05488B44B62544668AB4EE9C1BD870@PCM-EXCH-VIRT2.pcm.it>
References: <6C5FB8EA05488B44B62544668AB4EE9C1BD870@PCM-EXCH-VIRT2.pcm.it>
Message-ID: <1142267887.15119.28.camel@ayanami.boston.redhat.com>

On Fri, 2006-03-10 at 20:12 +0100, Tano Enzo wrote:
> Hi,
> 
> I have a cluster with 2 nodes RHEL 4 U3, my cluster not use GFS, I have
> a ECM2 shared storage. I have some shared resource: ip address and file
> system. Can I use the shared resource in my service more then one times?
> For example I have a file system /mnt/san/web where more then one
> services access to it. I have created 2 service Test and Test2 which use
> the only file system as resource, but only Test use it, Test2 ignore it.

You can not reference an IP or a regular "fs" (ext3, ext2, reiserfs,
etc) multiple times, because mounting those types of file systems on two
systems is an invitation for a corrupt file system.  Similarly, bringing
up a single IP on two separate systems ... well, generally does not work
well ;)

Currently, you can reuse:

- clusterfs (i.e. GFS; could be extended for other cluster file systems
though)

- netfs (mounting a file system from an NFS server - locks + file system
consistency is handled server-side)

- nfsexport (Meta-resource which is a child of fs or clusterfs to help
with creation of an NFS failover service)

- nfsclient (Resource which describes a target of a failover NFS
service... hostname, wildcard, etc.)

- script (though this should be done with caution!)

Reusing non-shareable resources causes some of the references to be
ignored.  If you want to see what rgmanager thinks about the resource
tree, run:

   rg_test test /etc/cluster/cluster.conf 2>&1 | less

It will tell you of problems it finds in the resource tree (like
exceeding max reference counts for resources, etc.)

-- Lon


From dex.chen at crosswalkinc.com  Mon Mar 13 17:31:18 2006
From: dex.chen at crosswalkinc.com (Dex Chen)
Date: Mon, 13 Mar 2006 10:31:18 -0700
Subject: [Linux-cluster] lost quorum,
	but the cluster services and GFS are still up
Message-ID: <2E02749DAF5338479606A056219BE109E0DB42@smail.crosswalkinc.com>

Hi,
 
I believe that I saw something unusual here.
 
I have a 3 node cluster (with GFS) using CMAN. After I shutdown 2 nodes
in short time span, the cluster shows it lost quorum, but I run the
clustat on the third node, and clustat shows the cluster has 3 nodes (2
are offline) and the other services are up. I was able to access/read
the share storage. CMAN_TOOL shows cluster lost quorum and the activity
is blocked. What I expected is that I should not allow accessing the
shared storage and other services at all when the cluster lost the
quorum. Anyone has seen the similar things? What/where should I look
into?
 
Thanks,
 
Dex
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060313/561aecdf/attachment.htm>

From lhh at redhat.com  Mon Mar 13 18:35:59 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 13 Mar 2006 13:35:59 -0500
Subject: [Linux-cluster] lost quorum, but the cluster services and GFS
	are still up
In-Reply-To: <2E02749DAF5338479606A056219BE109E0DB42@smail.crosswalkinc.com>
References: <2E02749DAF5338479606A056219BE109E0DB42@smail.crosswalkinc.com>
Message-ID: <1142274959.15119.114.camel@ayanami.boston.redhat.com>

On Mon, 2006-03-13 at 10:31 -0700, Dex Chen wrote:
> Hi,
> 
>  
> 
> I believe that I saw something unusual here.
> 
>  
> 
> I have a 3 node cluster (with GFS) using CMAN. After I shutdown 2
> nodes in short time span, the cluster shows it lost quorum, but I run
> the clustat on the third node, and clustat shows the cluster has 3
> nodes (2 are offline) and the other services are up. I was able to
> access/read the share storage. CMAN_TOOL shows cluster lost quorum and
> the activity is blocked. What I expected is that I should not allow
> accessing the shared storage and other services at all when the
> cluster lost the quorum. Anyone has seen the similar things?
> What/where should I look into?

CMAN is supposed to deliver (more or less) a STATECHANGE event to
clients.  At that point, quorum is checked by rgmanager, and if the
cluster is no longer quorate, it halts all services immediately.

Are there anything in the logs which would indicate this?  It would look
like:

   <emerg> #1 Quorum Dissolved

Given that you can still access service data (e.g. clustat reports
something), that means that rgmanager can still acquire locks for some
reason (it takes DLM locks before giving out service data...).

Does clustat report that the cluster is quorate or not?

-- Lon


From lhh at redhat.com  Mon Mar 13 18:38:18 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 13 Mar 2006 13:38:18 -0500
Subject: [Linux-cluster] problem with vip addresses
In-Reply-To: <faca485121e93e458e87c186ff0e3084@sinergy.it>
References: <faca485121e93e458e87c186ff0e3084@sinergy.it>
Message-ID: <1142275098.15119.117.camel@ayanami.boston.redhat.com>

On Mon, 2006-03-13 at 14:27 +0100, Simon Bridgwater wrote:
> I seem to have a problem with the vip's in a bonding interface
> disappearing. At first I thought it was a problem with the vsftpd
> service but when I closely monitored the vip with the command "ip addr
> show bond0" I sometimes see that the virtual ip's disappear and then
> reappear after a few seconds. What could be causing this problem ? Is
> it an operating system problem or a cluster problem. Could it be
> caused by a malconfigured service ? (I have to put two scripts which
> depend on each other into a single service). 

How often does this happen, and is rgmanager the thing tearing down /
restarting the IPs?


> I have tried upgrading the cluster suite (with corresponding
> kernel-smp- 2.6.9-22.0.2) to the most updated version but it is still
> giving me this error. I have two e1000 NICS configured in bonding in
> active-backup with miimon=100. Could it be a problem with bonding or
> the miimon parameter ?

There are some odd problems with e1000 in bonding configuration and the
SIOCGIFCONF ioctls where sometimes, the ioctl() just returns nothing,
but I was under the impression this kind of problem did not occur with
the newer Netlink interfaces (e.g. what /sbin/ip uses).

-- Lon


From dex.chen at crosswalkinc.com  Mon Mar 13 18:46:34 2006
From: dex.chen at crosswalkinc.com (Dex Chen)
Date: Mon, 13 Mar 2006 11:46:34 -0700
Subject: [Linux-cluster] lost quorum,
	but the cluster services and GFSare still up
Message-ID: <2E02749DAF5338479606A056219BE109E0DBA7@smail.crosswalkinc.com>

Odd enough! Clustat still reports "Inquorate".
See the screen capture:

Member Status: Inquorate

  Member Name                              Status
  ------ ----                              ------
  c01                                      Offline
  c02                                      Offline
  c03                                      Online, Local, rgmanager

  Service Name         Owner (Last)                   State
  ------- ----         ----- ------                   -----
  c-mgmt               c03                            started
  snapshot             c03                            started
  email_notifier       c03                            started

Thanks,

Dex
-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lon Hohberger
Sent: Monday, March 13, 2006 11:36 AM
To: linux clustering
Subject: Re: [Linux-cluster] lost quorum, but the cluster services and
GFSare still up

On Mon, 2006-03-13 at 10:31 -0700, Dex Chen wrote:
> Hi,
> 
>  
> 
> I believe that I saw something unusual here.
> 
>  
> 
> I have a 3 node cluster (with GFS) using CMAN. After I shutdown 2
> nodes in short time span, the cluster shows it lost quorum, but I run
> the clustat on the third node, and clustat shows the cluster has 3
> nodes (2 are offline) and the other services are up. I was able to
> access/read the share storage. CMAN_TOOL shows cluster lost quorum and
> the activity is blocked. What I expected is that I should not allow
> accessing the shared storage and other services at all when the
> cluster lost the quorum. Anyone has seen the similar things?
> What/where should I look into?

CMAN is supposed to deliver (more or less) a STATECHANGE event to
clients.  At that point, quorum is checked by rgmanager, and if the
cluster is no longer quorate, it halts all services immediately.

Are there anything in the logs which would indicate this?  It would look
like:

   <emerg> #1 Quorum Dissolved

Given that you can still access service data (e.g. clustat reports
something), that means that rgmanager can still acquire locks for some
reason (it takes DLM locks before giving out service data...).

Does clustat report that the cluster is quorate or not?

-- Lon

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From lhh at redhat.com  Mon Mar 13 19:44:40 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 13 Mar 2006 14:44:40 -0500
Subject: [Linux-cluster] CS4 behavior on killall -9
In-Reply-To: <44154BAC.1070007@bull.net>
References: <44154BAC.1070007@bull.net>
Message-ID: <1142279080.15119.119.camel@ayanami.boston.redhat.com>

On Mon, 2006-03-13 at 11:38 +0100, Alain Moulle wrote:
> Hi
> On a HA pair in mutual takeover, it seems that if we do a "killall -9" on one
> node, there is no failover, the CS4 seems to be stalled .
> Any reason ? idea ?

Killall -9 on what specifically...?

It sounds like a bug.

-- Lon


From teigland at redhat.com  Mon Mar 13 22:24:03 2006
From: teigland at redhat.com (David Teigland)
Date: Mon, 13 Mar 2006 16:24:03 -0600
Subject: [Linux-cluster] lost quorum,
	but the cluster services and GFS are still up
In-Reply-To: <2E02749DAF5338479606A056219BE109E0DB42@smail.crosswalkinc.com>
References: <2E02749DAF5338479606A056219BE109E0DB42@smail.crosswalkinc.com>
Message-ID: <20060313222403.GA17640@redhat.com>

On Mon, Mar 13, 2006 at 10:31:18AM -0700, Dex Chen wrote:
> Hi,
>  
> I believe that I saw something unusual here.
>  
> I have a 3 node cluster (with GFS) using CMAN. After I shutdown 2 nodes
> in short time span, the cluster shows it lost quorum, but I run the
> clustat on the third node, and clustat shows the cluster has 3 nodes (2
> are offline) and the other services are up. I was able to access/read
> the share storage. CMAN_TOOL shows cluster lost quorum and the activity
> is blocked. What I expected is that I should not allow accessing the
> shared storage and other services at all when the cluster lost the
> quorum. Anyone has seen the similar things? What/where should I look
> into?

Quorum is the normal method of preventing an instance of some cluster
subsystem or application (a gfs mount-group, a dlm lock-space, an
rgmanager service/app/resource, etc) from being enabled on both sides of a
partitioned cluster.  It does this by preventing the creation of new
instances in inquorate clusters and by preventing recovery (re-enabling)
of existing instances in inquorate clusters.

There's one special case where we also rely on fencing to prevent an
instance from being enabled on both sides of a split at once.  It's where
all the nodes using the instance before the failure/partition, also exist
on the inquorate side of the split afterward.  If a quorate partition then
forms, the first thing it does is fence all nodes it can't talk with,
which are the nodes on the inquorate side.  The quorate side then enables
instances of dlm/gfs/etc, the fencing having guaranteed there are none
elsewhere.

Apart from this, each service/instance/system responds internally to the
loss of quorum in its own way.  In the special case I described where all
the nodes using the instance remain after the event, dlm and gfs both
continue to run normally on the inquorate nodes; there's been no reason to
do otherwise.

I suspect what you saw is that nodes A and B failed/shutdown but weren't
using any of the dlm/gfs instances that C was.  C was then this special
case and dlm/gfs continued to run normally.  If A and B had come back and
formed a partitioned, quorate cluster, they would have fenced C before
enabling any dlm or gfs instances.

Dave


From michaelc at cs.wisc.edu  Tue Mar 14 00:11:54 2006
From: michaelc at cs.wisc.edu (Mike Christie)
Date: Mon, 13 Mar 2006 18:11:54 -0600
Subject: [Linux-cluster] Few Doubts About "GFS + ISCSI with Multipathing
	And	NIC Bonding"
In-Reply-To: <ab5b05b20603130500y7b04db71qe4ed1a11b32f35ab@mail.gmail.com>
References: <ab5b05b20603130500y7b04db71qe4ed1a11b32f35ab@mail.gmail.com>
Message-ID: <44160A4A.6060609@cs.wisc.edu>

updatemyself . wrote:
> Hai All,
> 
> I have few questions to ask..
> i already have a setup of GPFS Cluster on SAN with mulipathing (total 12 
> TB Volumes)
> And planning to go for a Another One With GFS + ISCSI with Multipathing 
> And NIC Bonding
> 
> So my doubt are about
> 1, Multipathing
> 2, NIC Bonding


For iscsi in linux you can use network bonding or dm-multipath or maybe 
even both :) For example, you can use bonding on the initiator over 
multiple host NICs, and use dm-multipath to multpath over multiple 
target portals.


> 3, Whats the Option for ISCSI Multipath same as RDAC in SAN

Not exactly sure what you mean by this. Are you thinking about RDAC as 
in Engenio's RDAC where we might need to do some sort of manual failover?


From afletdinov at mail.dc.baikal.ru  Tue Mar 14 00:20:04 2006
From: afletdinov at mail.dc.baikal.ru (Afletdinov A.R.)
Date: Tue, 14 Mar 2006 08:20:04 +0800
Subject: [Linux-cluster] GFS and extend attribute ...
In-Reply-To: <ee9c961f0603091240i774aac8ex35a17fdd19765948@mail.gmail.com>
References: <ee9c961f0603091240i774aac8ex35a17fdd19765948@mail.gmail.com>
Message-ID: <44160C34.2060808@mail.dc.baikal.ru>

Screaming Eagle wrote:

> Hi,
> I am running GFS with Coraid. I tried using extended attribute on GFS, 
> but it err out with (using setfacl )message: "Operation not 
> supported". Does anyone know for sure that GFS does not support 
> extended attribute options? Thanks.
>
>------------------------------------------------------------------------
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster
>

https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=182066


From saju8 at rediffmail.com  Tue Mar 14 03:58:41 2006
From: saju8 at rediffmail.com (saju  john)
Date: 14 Mar 2006 03:58:41 -0000
Subject: [Linux-cluster] Cluster service restarting Locally
Message-ID: <20060314035841.21034.qmail@webmail8.rediffmail.com>

Dear Mr. Hohberger  

Thanks for the replay.

When running only one node the frequency of restart is very less, but it happens with the same symtoms

The machines are HP DL380G3 (2 node) with MSA SAN 1000 storage.
The load average is around 4.
The cluster is primarly for running postgresql database of around 116 GB size

Saju John

On Mon, 13 Mar 2006 Lon Hohberger wrote :
>On Sat, 2006-03-11 at 10:50 +0000, saju john wrote:
> >
> > Dear Mr. Hohberger,
> >
> > Thanx for the replay.
> >
> > I saw your comments for the problem I reported. ie lock traffic is
> > getting network-starved.
>
>It could be getting I/O starved too, which might explain more given that
>this seems to happen on one node.  When running just one node and the
>service restarts, are the symptoms the same?  Does it report these kinds
>of errors, or are they different?
>
>[quote from your previous mail]
>clusvcmgrd[1388]: <err> Unable to obtain cluster lock: Connection
>timed out
>clulockd[1378]: <warning> Denied A.B.C.D: Broken pipe
>clulockd[1378]: <err> select error: Broken pipe
>[/quote]
>
>If they're different in the one-node case, what are the errors?  Also,
>are there any other errors in the logs?
>
>
> > My assumption is that, the problem is due to some curruption of meta
> > data information writing to the quroum partition ,as both nodes
> > writing to quroum cuncurrently.
>
>I really doubt that.  In the case of lock information, only one node
>writes at a time anyway...
>
> >  May be due to bug in the rawdeivce driver.I am not sure.Then
> > interesting question is ,how the cluster worked all these days(for me
> > around one year with out any major problem).
>
>The odds of random, block-level corruption going undetected when reading
> from the raw partitions is low - between (2^32):1 and (2^96):1 against
>per block, based on internal consistency checks that clumanager
>performs.  My math might be a little off, but it requires two randomly
>correct 32-bit magic numbers and one randomly valid 32-bit CRC, with
>other data incorrect to cause a problem.
>
>Specifically in the lock case, a lock block which passed all of the
>consistency checks but was *actually* corrupt would almost always cause
>clulockd to crash.
>
>Timeout errors mean that clulockd didn't respond to a request in a given
>amount of time, and can be caused by either network saturation or poor
>raw I/O performance to shared storage.  It looks like it's getting to an
>incoming request too late...
>
>
> > Could you pelase consider this also when releasing the RHCS3U7.
>
>If this is a critical issue for you, then you should file a ticket with
>Red Hat Support if you have not already done so:
>
>    http://www.redhat.com/apps/support/
>
>If you think this is a bug, you can also file a Bugzilla, and we will
>get to it when we can:
>
>    http://bugzilla.redhat.com/bugzilla/
>
>-- Lon
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060314/b3691fd5/attachment.htm>

From Birger.Wathne at ift.uib.no  Tue Mar 14 08:31:42 2006
From: Birger.Wathne at ift.uib.no (Birger Wathne)
Date: Tue, 14 Mar 2006 09:31:42 +0100
Subject: [Linux-cluster] samba on gfs
Message-ID: <44167F6E.4000401@ift.uib.no>

What is the problem with running samba on GFS, and when will it be resolved?
I have seen a hint from lon here that running samba on GFS isn't 
possible right now.
I have a 2-node cluster running NFS services from GFS, and would like to 
dedicate one node for NFS, the other for samba (running from the same 
filesystems).

I guess I could do it by NFS mounting from the NFS node, but that kind 
of defeats the purpose of moving the samba services into the cluster...

Btw: These nodes currently have Gb interfaces for the public networks, 
but only a 10Mb private network. Is that enough, or should I upgrade the 
private network when I start using both nodes actively?

-- 
birger


From zeebala at yahoo.com  Tue Mar 14 09:49:47 2006
From: zeebala at yahoo.com (bala)
Date: Tue, 14 Mar 2006 01:49:47 -0800 (PST)
Subject: [Linux-cluster] samba on gfs
In-Reply-To: <44167F6E.4000401@ift.uib.no>
Message-ID: <20060314094947.64549.qmail@web36510.mail.mud.yahoo.com>

hi guys
             iam new to linux cluster suite and gfs iam very eager to learn at present iam having celeron 400 mhz processor and i think within month i will get new system with good configuration how can i implement rhcs and gfs 

Birger Wathne <Birger.Wathne at ift.uib.no> wrote:
  What is the problem with running samba on GFS, and when will it be resolved?
I have seen a hint from lon here that running samba on GFS isn't 
possible right now.
I have a 2-node cluster running NFS services from GFS, and would like to 
dedicate one node for NFS, the other for samba (running from the same 
filesystems).

I guess I could do it by NFS mounting from the NFS node, but that kind 
of defeats the purpose of moving the samba services into the cluster...

Btw: These nodes currently have Gb interfaces for the public networks, 
but only a 10Mb private network. Is that enough, or should I upgrade the 
private network when I start using both nodes actively?

-- 
birger

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


---------------------------------
Relax. Yahoo! Mail virus scanning helps detect nasty viruses!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060314/dcf7f307/attachment.htm>

From erling.nygaard at gmail.com  Tue Mar 14 10:05:52 2006
From: erling.nygaard at gmail.com (Erling Nygaard)
Date: Tue, 14 Mar 2006 11:05:52 +0100
Subject: [Linux-cluster] samba on gfs
In-Reply-To: <44167F6E.4000401@ift.uib.no>
References: <44167F6E.4000401@ift.uib.no>
Message-ID: <adb721b40603140205g3278ca7ckced710703d175441@mail.gmail.com>

Birger

The short story is that Samba keeps some state information internally.
So there are issues with keeping multiple Samba serves in sync.
The information in question is not synced to the underlying
filesystem, so GFS can't really do the job of keeping this info in
sync between the nodes.

I am sure other people on the list can provide more details of the
problem and status of any progress :-)

Erling

On 3/14/06, Birger Wathne <Birger.Wathne at ift.uib.no> wrote:
> What is the problem with running samba on GFS, and when will it be resolved?
> I have seen a hint from lon here that running samba on GFS isn't
> possible right now.
> I have a 2-node cluster running NFS services from GFS, and would like to
> dedicate one node for NFS, the other for samba (running from the same
> filesystems).
>
> I guess I could do it by NFS mounting from the NFS node, but that kind
> of defeats the purpose of moving the samba services into the cluster...
>
> Btw: These nodes currently have Gb interfaces for the public networks,
> but only a 10Mb private network. Is that enough, or should I upgrade the
> private network when I start using both nodes actively?
>
> --
> birger
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


--
-
Mac OS X. Because making Unix user-friendly is easier than debugging Windows


From l.dardini at comune.prato.it  Tue Mar 14 12:53:06 2006
From: l.dardini at comune.prato.it (Leandro Dardini)
Date: Tue, 14 Mar 2006 13:53:06 +0100
Subject: [Linux-cluster] Cluster @hostname link utilities
Message-ID: <404AA6666D14D14CA0D410C1BC6CC4C53FC49C@exchange3.comune.prato.local>

Hi,
We all know the use of programs like "cat" or "tail" to show file
contents. I have a cluster of apache and each httpd logs to
/var/log/httpd/access.log where /var/log is a link to a gfs filesystem

[root at apache1]# ls -la /var/log/httpd
lrwxrwxrwx  1 root root 20 27 feb 15:44 /var/log/httpd ->
/gfsvolume/log/httpd_local/@hostname

For each apache I have a directory, like 

/gfsvolume/log/httpd_local/apache1
/gfsvolume/log/httpd_local/apache2

I'd like to show on screen, like with a cat program, all access.log
files reading them from each directory in cronological order. Is there
an already made program to do this? I need something as follow:

If /gfsvolume/log/httpd_local/apache1/access.log contains something like
the following:

127.0.0.1 - - [14/Mar/2006:13:27:26 +0100] "GET
/ocsinventory/deploy/label HTTP/1.0" 500 616 "-" "NSISDL/1.2" 
192.168.26.163 - - [14/Mar/2006:13:27:41 +0100] "POST /ocsinventory
HTTP/1.1" 200 83 "-" "OCS-NG_windows_client_v4014"

And /gfsvolume/log/httpd_local/apache2.comune.prato.it/access.log
contains something like the following:

192.168.26.163 - - [14/Mar/2006:13:27:23 +0100] "GET
/ocsinventory/deploy/label HTTP/1.0" 500 616 "-" "NSISDL/1.2"
192.168.1.110 - - [14/Mar/2006:13:27:42 +0100] "GET
/ocsinventory/deploy/ocsagent.exe HTTP/1.0" 500 616 "-" "NSISDL/1.2"

With "multicat" (the program I am looking for, just invented a name to
refer to it) multicat /var/log/httpd/acces.log I obtain the following:

192.168.26.163 - - [14/Mar/2006:13:27:23 +0100] "GET
/ocsinventory/deploy/label HTTP/1.0" 500 616 "-" "NSISDL/1.2"
127.0.0.1 - - [14/Mar/2006:13:27:26 +0100] "GET
/ocsinventory/deploy/label HTTP/1.0" 500 616 "-" "NSISDL/1.2" 
192.168.26.163 - - [14/Mar/2006:13:27:41 +0100] "POST /ocsinventory
HTTP/1.1" 200 83 "-" "OCS-NG_windows_client_v4014"
192.168.1.110 - - [14/Mar/2006:13:27:42 +0100] "GET
/ocsinventory/deploy/ocsagent.exe HTTP/1.0" 500 616 "-" "NSISDL/1.2"

I understand there be some configuration file that map
/var/log/httpd/access.log to 

/gfsvolume/log/httpd_local/apache1/access.log
/gfsvolume/log/httpd_local/apache2/access.log

And inform the program about the format and the position of the sorting
key.

Leandro


From sgray at bluestarinc.com  Tue Mar 14 14:31:05 2006
From: sgray at bluestarinc.com (Sean Gray)
Date: Tue, 14 Mar 2006 09:31:05 -0500
Subject: [Linux-cluster] RHEL+RAC+GFS
Message-ID: <4416D3A9.5080908@bluestarinc.com>

All:

Does anyone have a working combination of Oracle RAC (9i for E-Business 
Suite) on RHEL (4 preferably) using GFS for the DB nodes? RedHat's 
whitepaper suggests this combination as optimal. However, in my 
experience I have not yet found a magic combination that works well. I 
keep have hi load issues with dlmsendd.

Sean

-- 
Sean N. Gray
Director of Information Technology
United Radio Incorporated, DBA BlueStar
24 Spiral Drive
Florence, Kentucky 41042
office: 859.371.4423 x3263
toll free: 800.371.4423 x3263
fax: 859.371.4425
mobile: 513.616.3379


From orcl.listas at gmail.com  Tue Mar 14 18:27:17 2006
From: orcl.listas at gmail.com (Allyson - Listas)
Date: Tue, 14 Mar 2006 15:27:17 -0300
Subject: [Linux-cluster] rhcs doubts.
Message-ID: <44170B05.20805@gmail.com>

Hi guys,

I'm new at redhat cluster suite.  Could Anybody help me in some questions?

1st) I installed rhcs on 2 virtual machines and create a new cluster, 
setup a manual fence, a failvoer domain, create a IP resource and a 
service that uses just that IP for tests. Well, I'd like to know how can 
I force a failover of the service between nodes. This option is not 
available at system-config-cluster that allow just disable and enable 
the service.   I noticed that the ip service created is not a virtual 
interface like eth0:1, but it was working because I could ping it, Is it 
Normal?

2nd) What is the real fuction of a fence device?

3rd) How can I setup a quorum device, and isn't necessary for a failover 
service?  I read that it was needed at rhel3 but at rhel4 is not 
anymore, could you explain me that.


Any help is welcome :)

tks,

-- 
Allyson A. Brito


From gforte at leopard.us.udel.edu  Tue Mar 14 19:23:18 2006
From: gforte at leopard.us.udel.edu (Greg Forte)
Date: Tue, 14 Mar 2006 14:23:18 -0500
Subject: [Linux-cluster] dependencies between services
In-Reply-To: <1138981187.5992.62.camel@ayanami.boston.redhat.com>
References: <43E2948D.3060108@leopard.us.udel.edu>
	<1138981187.5992.62.camel@ayanami.boston.redhat.com>
Message-ID: <44171826.2040202@leopard.us.udel.edu>

heh, you asked me to file a bug about this a month-and-a-half ago and I 
got sidetracked fighting with Oracle and various other components, but 
now I'm back on this.  What heading should I file it under?  Cluster 
Suite v4, obviously, but what component?

-g

Lon Hohberger wrote:
> On Thu, 2006-02-02 at 18:23 -0500, Greg Forte wrote:
>> Is it possible to set up dependencies between cluster services?  That 
>> is, I have services A, B, C, and D.  B, C, D can't run unless A is 
>> running, but B, C, and D are all independent of each other and I want to 
>> be able to control them individually, i.e. be able to start/stop (or 
>> rather, enable/disable) each without affecting the others.  I know I 
>> could define them as dependent resources all in the same service, but 
>> then I can't have that independence between B, C, and D ... unless I'm 
>> missing something.
> 
> Not at the moment, but it should not be a difficult thing to add.
> 
> Could you file a bugzilla about it?
> 
> -- Lon
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


-- 
Greg Forte
gforte at udel.edu
IT - User Services
University of Delaware
302-831-1982
Newark, DE


From treed-cluster at ultraviolet.org  Tue Mar 14 19:55:02 2006
From: treed-cluster at ultraviolet.org (treed-cluster at ultraviolet.org)
Date: Tue, 14 Mar 2006 11:55:02 -0800
Subject: [Linux-cluster] Cluster aware RAID/LVM
Message-ID: <20060314195502.GA25640@ultraviolet.org>

I would like to build a highly reliable SAN by having 3 storage nodes
exporting their disk as a block device using AoE and then have the 3
compute nodes each RAID 5 the 3 block devices exported by the storage
nodes so that we end up with one block device, the same block device, seen
by all 3 compute nodes. Then I would like to initialize this as a physical
volume and then create different logical volumes within it with each
compute node mounting a different set of logical volumes.

But my understanding is that this will not currently work because LVM is
not cluster aware. It seems that Linux software RAID is not cluster
aware either.

Perhaps I could use EVMS (which does seem to be cluster aware) on each
of the compute nodes to manage the disk on the storage nodes and then
export a specific volume from each storage node to just one compute node
which would then do RAID 5. This way we have a cluster aware volume
manager exporting volumes to be RAID'd which would only be mounted by
one host each.

Does this sound reasonable?

I would like to avoid the use of GFS anywhere in this particular system
but I might have occasion to use GFS on a different project in the
future. It's been a few years since I have seriously looked into GFS but
it seems to have come a long way towards being usable in a production
environment. I remember it used to have its own volume management. Are
most people doing GFS volume management with EVMS also?

Thanks!


-- 
Tracy Reed
http://ultraviolet.org


From filipe.miranda at gmail.com  Tue Mar 14 20:14:28 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Tue, 14 Mar 2006 17:14:28 -0300
Subject: [Linux-cluster] RHCS/RHEL3 power switches options
Message-ID: <a6d13c780603141214m743df803nff97fe3566dd71e4@mail.gmail.com>

Hello,

I'm having a really hard time trying to figure out what power switches do
work with RHCS/RHEL3.
Did anyone implement RHCS/RHEL3 with power switches ?
What options do I have when using power switches with this solution?
I appreciate any help


Att.
---
Filipe Miranda
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060314/227a7fad/attachment.htm>

From adingman at cookgroup.com  Tue Mar 14 20:58:00 2006
From: adingman at cookgroup.com (Andrew C. Dingman)
Date: Tue, 14 Mar 2006 15:58:00 -0500
Subject: [Linux-cluster] RHCS/RHEL3 power switches options
In-Reply-To: <a6d13c780603141214m743df803nff97fe3566dd71e4@mail.gmail.com>
References: <a6d13c780603141214m743df803nff97fe3566dd71e4@mail.gmail.com>
Message-ID: <1142369880.2489.39.camel@adingman.cin.cook>

I'm using APC AP7901 power switches in two different REL3 clusters. If
you don't alter the default configuration beyond setting your own
password, I believe the provided fencing agent works quite well. If you
try to use a more restricted user, it may or may not work. (Probably
not, in my experience.) The problem is that the APC telnet menus change
depending on the privileges of the connected user, so a restricted user
on the switch will not get the menus that the fencing agent expects.

I am also using GFS on those clusters, so I set up the fencing in GFS,
and then used the gulm bridge fencing agent in RHCS, which causes it to
pass the fencing work off to GFS. I have no experience using cluster
suite without GFS, on either RHEL3 or RHEL4.

Hope that helps.

On Tue, 2006-03-14 at 17:14 -0300, Filipe Miranda wrote:
> Hello,
> 
> I'm having a really hard time trying to figure out what power switches
> do work with RHCS/RHEL3.
> Did anyone implement RHCS/RHEL3 with power switches ? 
> What options do I have when using power switches with this solution?
> I appreciate any help
> 
>  
> Att.
> ---
> Filipe Miranda 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
-- 
Andrew C. Dingman
Unix Administrator
Cook Incorporated
(812)339-2235 x2131
adingman at cookgroup.com


From filipe.miranda at gmail.com  Tue Mar 14 21:11:31 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Tue, 14 Mar 2006 18:11:31 -0300
Subject: [Linux-cluster] RHCS/RHEL3 power switches options
In-Reply-To: <1142369880.2489.39.camel@adingman.cin.cook>
References: <a6d13c780603141214m743df803nff97fe3566dd71e4@mail.gmail.com>
	<1142369880.2489.39.camel@adingman.cin.cook>
Message-ID: <a6d13c780603141311g63ace521tba2ea8ee82779981@mail.gmail.com>

Andrew,

It definatelly helps, I was researching and I found this type of fence
device that would work with RHEL3/RHCS:
http://www.wti.com/rps10-ec.htm

That's why I posted this message so I could contact people that actually use
fence devices/power switches on RHEL/RHCS solutions


Thanks a lot

Att.
Filipe Miranda

On 3/14/06, Andrew C. Dingman <adingman at cookgroup.com> wrote:
>
> I'm using APC AP7901 power switches in two different REL3 clusters. If
> you don't alter the default configuration beyond setting your own
> password, I believe the provided fencing agent works quite well. If you
> try to use a more restricted user, it may or may not work. (Probably
> not, in my experience.) The problem is that the APC telnet menus change
> depending on the privileges of the connected user, so a restricted user
> on the switch will not get the menus that the fencing agent expects.
>
> I am also using GFS on those clusters, so I set up the fencing in GFS,
> and then used the gulm bridge fencing agent in RHCS, which causes it to
> pass the fencing work off to GFS. I have no experience using cluster
> suite without GFS, on either RHEL3 or RHEL4.
>
> Hope that helps.
>
> On Tue, 2006-03-14 at 17:14 -0300, Filipe Miranda wrote:
> > Hello,
> >
> > I'm having a really hard time trying to figure out what power switches
> > do work with RHCS/RHEL3.
> > Did anyone implement RHCS/RHEL3 with power switches ?
> > What options do I have when using power switches with this solution?
> > I appreciate any help
> >
> >
> > Att.
> > ---
> > Filipe Miranda
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> --
> Andrew C. Dingman
> Unix Administrator
> Cook Incorporated
> (812)339-2235 x2131
> adingman at cookgroup.com
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


--
Att.
---
Filipe T Miranda
RHCE - Red Hat Certified Engineer
OCP8i - Oracle Certified Professional
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060314/096abb1b/attachment.htm>

From Birger.Wathne at ift.uib.no  Wed Mar 15 06:17:01 2006
From: Birger.Wathne at ift.uib.no (Birger Wathne)
Date: Wed, 15 Mar 2006 07:17:01 +0100
Subject: [Linux-cluster] samba on gfs
In-Reply-To: <adb721b40603140205g3278ca7ckced710703d175441@mail.gmail.com>
References: <44167F6E.4000401@ift.uib.no>
	<adb721b40603140205g3278ca7ckced710703d175441@mail.gmail.com>
Message-ID: <4417B15D.7050807@uib.no>

Erling Nygaard wrote:

> Birger
> 
> The short story is that Samba keeps some state information internally.
> So there are issues with keeping multiple Samba serves in sync.
> The information in question is not synced to the underlying
> filesystem, so GFS can't really do the job of keeping this info in
> sync between the nodes.
> 
> I am sure other people on the list can provide more details of the
> problem and status of any progress :-)

So... This means there is a problem only when you want to run multiple samba 
servers in a cluster? There should be no problem sharing the same GFS disk 
for one samba instance and one NFS instance running on separate nodes (or 
even on the same node during maintenance)?

-- 
birger


From robert at deakin.edu.au  Wed Mar 15 06:30:16 2006
From: robert at deakin.edu.au (Robert Ruge)
Date: Wed, 15 Mar 2006 17:30:16 +1100
Subject: [Linux-cluster] samba on gfs
In-Reply-To: <4417B15D.7050807@uib.no>
Message-ID: <000f01c647f9$ec572240$0132a8c0@eit.deakin.edu.au>

I have had problems with samba running on just one node. It works for
a while and then samba just starts locking up.

Not a reccomended path if you ask me.

Robert

> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Birger Wathne
> Sent: Wednesday, 15 March 2006 5:17
> To: linux clustering
> Subject: Re: [Linux-cluster] samba on gfs
> 
> Erling Nygaard wrote:
> 
> > Birger
> > 
> > The short story is that Samba keeps some state information 
> internally.
> > So there are issues with keeping multiple Samba serves in sync.
> > The information in question is not synced to the underlying
> > filesystem, so GFS can't really do the job of keeping this info in
> > sync between the nodes.
> > 
> > I am sure other people on the list can provide more details of the
> > problem and status of any progress :-)
> 
> So... This means there is a problem only when you want to run 
> multiple samba 
> servers in a cluster? There should be no problem sharing the 
> same GFS disk 
> for one samba instance and one NFS instance running on 
> separate nodes (or 
> even on the same node during maintenance)?
> 
> -- 
> birger
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From Alain.Moulle at bull.net  Wed Mar 15 07:33:24 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Wed, 15 Mar 2006 08:33:24 +0100
Subject: [Linux-cluster] Re: CS4 behavior on killall -9 (Lon Hohberger)
Message-ID: <4417C344.4040907@bull.net>

On Mon, 2006-03-13 at 11:38 +0100, Alain Moulle wrote:

>>>> Hi
>>>> On a HA pair in mutual takeover, it seems that if we do a "killall -9" on >one
>>>> node, there is no failover, the CS4 seems to be stalled .
>>>> Any reason ? idea ?

>>Killall -9 on what specifically...?
That's a killall so ... nothing specifically, but all ...
Just to simulate sort of system hang ...
Did someone has give it a try ?
Alain
>>It sounds like a bug.
>>-- Lon


mailto:Alain.Moulle at bull.net
+------------------------------+--------------------------------+
|	Alain Moull?	       	| from France :	04 76 29 75 99  |
|                              	| FAX number  : 04 76 29 72 49  |
| Bull SA		       	|				|
| 1, Rue de Provence  		| Adr  : FREC B1-041            |
| B.P. 208			|				|
| 38432 Echirolles - CEDEX     	| Email: Alain.Moulle at bull.net  |
| France                       	| BCOM : 229 7599               |
+-------------------------------+-------------------------------+


From grimme at atix.de  Wed Mar 15 07:44:38 2006
From: grimme at atix.de (Marc Grimme)
Date: Wed, 15 Mar 2006 08:44:38 +0100
Subject: [Linux-cluster] samba on gfs
In-Reply-To: <000f01c647f9$ec572240$0132a8c0@eit.deakin.edu.au>
References: <000f01c647f9$ec572240$0132a8c0@eit.deakin.edu.au>
Message-ID: <200603150844.38490.grimme@atix.de>

Hello,
we have some customers that have samba clusters with GFS running for a long 
time without problems. There are some things you need to take into account 
but nevertheless samba on GFS runs very well even when you export the same 
data via NFS. 
E.G. one customer runs a "active/active" Samba/NFS Cluster on GFS as ADS 
Member for about a year (up to 600 Users) without problems.
I would say samba and GFS is a very nice combination.
Regards Marc.

On Wednesday 15 March 2006 07:30, Robert Ruge wrote:
> I have had problems with samba running on just one node. It works for
> a while and then samba just starts locking up.
>
> Not a reccomended path if you ask me.
>
> Robert
>
> > -----Original Message-----
> > From: linux-cluster-bounces at redhat.com
> > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Birger Wathne
> > Sent: Wednesday, 15 March 2006 5:17
> > To: linux clustering
> > Subject: Re: [Linux-cluster] samba on gfs
> >
> > Erling Nygaard wrote:
> > > Birger
> > >
> > > The short story is that Samba keeps some state information
> >
> > internally.
> >
> > > So there are issues with keeping multiple Samba serves in sync.
> > > The information in question is not synced to the underlying
> > > filesystem, so GFS can't really do the job of keeping this info in
> > > sync between the nodes.
> > >
> > > I am sure other people on the list can provide more details of the
> > > problem and status of any progress :-)
> >
> > So... This means there is a problem only when you want to run
> > multiple samba
> > servers in a cluster? There should be no problem sharing the
> > same GFS disk
> > for one samba instance and one NFS instance running on
> > separate nodes (or
> > even on the same node during maintenance)?
> >
> > --
> > birger
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
Gruss / Regards,

Marc Grimme
Phone: +49-89 121 409-54
http://www.atix.de/               http://www.open-sharedroot.org/

**
ATIX - Ges. fuer Informationstechnologie und Consulting mbH
Einsteinstr. 10 - 85716 Unterschleissheim - Germany


From l.dardini at comune.prato.it  Wed Mar 15 08:39:48 2006
From: l.dardini at comune.prato.it (Leandro Dardini)
Date: Wed, 15 Mar 2006 09:39:48 +0100
Subject: R: [Linux-cluster] samba on gfs
Message-ID: <404AA6666D14D14CA0D410C1BC6CC4C53FC4F1@exchange3.comune.prato.local>

 
> -----Messaggio originale-----
> Da: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] Per conto di Marc Grimme
> Inviato: mercoled? 15 marzo 2006 8.45
> A: linux clustering
> Oggetto: Re: [Linux-cluster] samba on gfs
> 
> Hello,
> we have some customers that have samba clusters with GFS 
> running for a long time without problems. There are some 
> things you need to take into account but nevertheless samba 
> on GFS runs very well even when you export the same data via NFS. 
> E.G. one customer runs a "active/active" Samba/NFS Cluster on 
> GFS as ADS Member for about a year (up to 600 Users) without problems.
> I would say samba and GFS is a very nice combination.
> Regards Marc.

Maybe it can be interesting if you can post the smb.conf used.

Leandro

> 
> On Wednesday 15 March 2006 07:30, Robert Ruge wrote:
> > I have had problems with samba running on just one node. It 
> works for 
> > a while and then samba just starts locking up.
> >
> > Not a reccomended path if you ask me.
> >
> > Robert
> >
> > > -----Original Message-----
> > > From: linux-cluster-bounces at redhat.com 
> > > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of 
> Birger Wathne
> > > Sent: Wednesday, 15 March 2006 5:17
> > > To: linux clustering
> > > Subject: Re: [Linux-cluster] samba on gfs
> > >
> > > Erling Nygaard wrote:
> > > > Birger
> > > >
> > > > The short story is that Samba keeps some state information
> > >
> > > internally.
> > >
> > > > So there are issues with keeping multiple Samba serves in sync.
> > > > The information in question is not synced to the underlying 
> > > > filesystem, so GFS can't really do the job of keeping 
> this info in 
> > > > sync between the nodes.
> > > >
> > > > I am sure other people on the list can provide more 
> details of the 
> > > > problem and status of any progress :-)
> > >
> > > So... This means there is a problem only when you want to run 
> > > multiple samba servers in a cluster? There should be no problem 
> > > sharing the same GFS disk for one samba instance and one NFS 
> > > instance running on separate nodes (or even on the same 
> node during 
> > > maintenance)?
> > >
> > > --
> > > birger
> > >
> > > --
> > > Linux-cluster mailing list
> > > Linux-cluster at redhat.com
> > > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Gruss / Regards,
> 
> Marc Grimme
> Phone: +49-89 121 409-54
> http://www.atix.de/               http://www.open-sharedroot.org/
> 
> **
> ATIX - Ges. fuer Informationstechnologie und Consulting mbH 
> Einsteinstr. 10 - 85716 Unterschleissheim - Germany
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From Matthew.Patton.ctr at osd.mil  Wed Mar 15 13:40:52 2006
From: Matthew.Patton.ctr at osd.mil (Patton, Matthew F, CTR, OSD-PA&E)
Date: Wed, 15 Mar 2006 08:40:52 -0500
Subject: [Linux-cluster] samba on gfs
Message-ID: <D8063DF686D10247B0A49D01271285690CE91D02@osdn06.osd.mil>

Classification: UNCLASSIFIED

I'm setting up 40 blade servers each with direct access to a shared FC SAN
volume and mounted GFS and serving it up via both SAMBA and NFS to ~10
virtual machines (XEN and VMWARE) local to that blade. I don't expect the
unix VM's to use SAMBA (exception might be HOMEDIRs) nor the Windows VM's to
use NFS, but otherwise it's active/active. NFS will also be used to rootless
boot the XEN/VMWARE virtual machines that run Linux.

So that's 40 independent SAMBA servers all potentially serving the same
files located on the common GFS volume. Think I'm headed for trouble? That
config would come in handy.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060315/b256713e/attachment.htm>

From erwan at seanodes.com  Wed Mar 15 13:45:11 2006
From: erwan at seanodes.com (Velu Erwan)
Date: Wed, 15 Mar 2006 14:45:11 +0100
Subject: [Linux-cluster] samba on gfs
In-Reply-To: <200603150844.38490.grimme@atix.de>
References: <000f01c647f9$ec572240$0132a8c0@eit.deakin.edu.au>
	<200603150844.38490.grimme@atix.de>
Message-ID: <44181A67.6070504@seanodes.com>

Marc Grimme a ?crit :

>Hello,
>we have some customers that have samba clusters with GFS running for a long 
>time without problems. There are some things you need to take into account 
>but nevertheless samba on GFS runs very well even when you export the same 
>data via NFS. 
>E.G. one customer runs a "active/active" Samba/NFS Cluster on GFS as ADS 
>Member for about a year (up to 600 Users) without problems.
>I would say samba and GFS is a very nice combination.
>Regards Marc.
>  
>
This sounds to be done in a homedir approach but this doesn't sounds to 
work when you share a directory on several samba servers.
I mean, there will be troubles if two users want to access to the same 
file by using 2 different samba servers.
The samba team sounds to address this issue in the future samba 4.0.


From filipe.miranda at gmail.com  Wed Mar 15 13:41:39 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Wed, 15 Mar 2006 10:41:39 -0300
Subject: [Linux-cluster] samba on gfs
In-Reply-To: <200603150844.38490.grimme@atix.de>
References: <000f01c647f9$ec572240$0132a8c0@eit.deakin.edu.au>
	<200603150844.38490.grimme@atix.de>
Message-ID: <a6d13c780603150541s1b5a8c85pb09bf90bb76197fc@mail.gmail.com>

Just one question about running multiple samba servers using GFS....
You can have the same dada to be shared on multiple servers, but each samba
server will need a differente NETBIOS name on the network right?

So if I have a WindowsXP client machine, it will have to map 2(or more)
network paths to reach each samba server.

Am I correct? If so, how to address this issue so the user can have just one
network mapped drive accessing multiple samba servers(if necessary, if one
machine fails for example?)

Att.
Filipe Miranda

On 3/15/06, Marc Grimme <grimme at atix.de> wrote:
>
> Hello,
> we have some customers that have samba clusters with GFS running for a
> long
> time without problems. There are some things you need to take into account
> but nevertheless samba on GFS runs very well even when you export the same
> data via NFS.
> E.G. one customer runs a "active/active" Samba/NFS Cluster on GFS as ADS
> Member for about a year (up to 600 Users) without problems.
> I would say samba and GFS is a very nice combination.
> Regards Marc.
>
> On Wednesday 15 March 2006 07:30, Robert Ruge wrote:
> > I have had problems with samba running on just one node. It works for
> > a while and then samba just starts locking up.
> >
> > Not a reccomended path if you ask me.
> >
> > Robert
> >
> > > -----Original Message-----
> > > From: linux-cluster-bounces at redhat.com
> > > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Birger Wathne
> > > Sent: Wednesday, 15 March 2006 5:17
> > > To: linux clustering
> > > Subject: Re: [Linux-cluster] samba on gfs
> > >
> > > Erling Nygaard wrote:
> > > > Birger
> > > >
> > > > The short story is that Samba keeps some state information
> > >
> > > internally.
> > >
> > > > So there are issues with keeping multiple Samba serves in sync.
> > > > The information in question is not synced to the underlying
> > > > filesystem, so GFS can't really do the job of keeping this info in
> > > > sync between the nodes.
> > > >
> > > > I am sure other people on the list can provide more details of the
> > > > problem and status of any progress :-)
> > >
> > > So... This means there is a problem only when you want to run
> > > multiple samba
> > > servers in a cluster? There should be no problem sharing the
> > > same GFS disk
> > > for one samba instance and one NFS instance running on
> > > separate nodes (or
> > > even on the same node during maintenance)?
> > >
> > > --
> > > birger
> > >
> > > --
> > > Linux-cluster mailing list
> > > Linux-cluster at redhat.com
> > > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Gruss / Regards,
>
> Marc Grimme
> Phone: +49-89 121 409-54
> http://www.atix.de/               http://www.open-sharedroot.org/
>
> **
> ATIX - Ges. fuer Informationstechnologie und Consulting mbH
> Einsteinstr. 10 - 85716 Unterschleissheim - Germany
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


--
Att.
---
Filipe T Miranda
RHCE - Red Hat Certified Engineer
OCP8i - Oracle Certified Professional
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060315/13cb1532/attachment.htm>

From grimme at atix.de  Wed Mar 15 13:52:38 2006
From: grimme at atix.de (Marc Grimme)
Date: Wed, 15 Mar 2006 14:52:38 +0100
Subject: [Linux-cluster] samba on gfs
In-Reply-To: <44181A67.6070504@seanodes.com>
References: <000f01c647f9$ec572240$0132a8c0@eit.deakin.edu.au>
	<200603150844.38490.grimme@atix.de> <44181A67.6070504@seanodes.com>
Message-ID: <200603151452.38992.grimme@atix.de>

On Wednesday 15 March 2006 14:45, Velu Erwan wrote:
> Marc Grimme a ?crit :
> >Hello,
> >we have some customers that have samba clusters with GFS running for a
> > long time without problems. There are some things you need to take into
> > account but nevertheless samba on GFS runs very well even when you export
> > the same data via NFS.
> >E.G. one customer runs a "active/active" Samba/NFS Cluster on GFS as ADS
> >Member for about a year (up to 600 Users) without problems.
> >I would say samba and GFS is a very nice combination.
> >Regards Marc.
>
> This sounds to be done in a homedir approach but this doesn't sounds to
> work when you share a directory on several samba servers.
> I mean, there will be troubles if two users want to access to the same
> file by using 2 different samba servers.
> The samba team sounds to address this issue in the future samba 4.0.
Right, because of that i wrote "active/active". It might work but I would 
accept the same problems to happen in special cases.
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
Gruss / Regards,

Marc Grimme
Phone: +49-89 121 409-54
http://www.atix.de/               http://www.open-sharedroot.org/

**
ATIX - Ges. fuer Informationstechnologie und Consulting mbH
Einsteinstr. 10 - 85716 Unterschleissheim - Germany


From orcl.listas at gmail.com  Wed Mar 15 13:56:58 2006
From: orcl.listas at gmail.com (Allyson - Listas)
Date: Wed, 15 Mar 2006 10:56:58 -0300
Subject: [Linux-cluster] samba on gfs
In-Reply-To: <a6d13c780603150541s1b5a8c85pb09bf90bb76197fc@mail.gmail.com>
References: <000f01c647f9$ec572240$0132a8c0@eit.deakin.edu.au>	<200603150844.38490.grimme@atix.de>
	<a6d13c780603150541s1b5a8c85pb09bf90bb76197fc@mail.gmail.com>
Message-ID: <44181D2A.6010000@gmail.com>

Filipe,

I think, you can use LVS to load balance requests into servers and map 
just one virtual ip at windows.

tks,

-- 
Allyson A. Brito
MSN: allysonbrito78 at hotmail.com


Filipe Miranda wrote:

> Just one question about running multiple samba servers using GFS....
> You can have the same dada to be shared on multiple servers, but each 
> samba server will need a differente NETBIOS name on the network right?
>
> So if I have a WindowsXP client machine, it will have to map 2(or 
> more) network paths to reach each samba server.
>
> Am I correct? If so, how to address this issue so the user can have 
> just one network mapped drive accessing multiple samba servers(if 
> necessary, if one machine fails for example?)
>
> Att.
> Filipe Miranda
>
> On 3/15/06, *Marc Grimme* <grimme at atix.de <mailto:grimme at atix.de>> wrote:
>
>     Hello,
>     we have some customers that have samba clusters with GFS running
>     for a long
>     time without problems. There are some things you need to take into
>     account
>     but nevertheless samba on GFS runs very well even when you export
>     the same
>     data via NFS.
>     E.G. one customer runs a "active/active" Samba/NFS Cluster on GFS
>     as ADS
>     Member for about a year (up to 600 Users) without problems.
>     I would say samba and GFS is a very nice combination.
>     Regards Marc.
>
>     On Wednesday 15 March 2006 07:30, Robert Ruge wrote:
>     > I have had problems with samba running on just one node. It
>     works for
>     > a while and then samba just starts locking up.
>     >
>     > Not a reccomended path if you ask me.
>     >
>     > Robert
>     >
>     > > -----Original Message-----
>     > > From: linux-cluster-bounces at redhat.com
>     <mailto:linux-cluster-bounces at redhat.com>
>     > > [mailto:linux-cluster-bounces at redhat.com
>     <mailto:linux-cluster-bounces at redhat.com>] On Behalf Of Birger Wathne
>     > > Sent: Wednesday, 15 March 2006 5:17
>     > > To: linux clustering
>     > > Subject: Re: [Linux-cluster] samba on gfs
>     > >
>     > > Erling Nygaard wrote:
>     > > > Birger
>     > > >
>     > > > The short story is that Samba keeps some state information
>     > >
>     > > internally.
>     > >
>     > > > So there are issues with keeping multiple Samba serves in sync.
>     > > > The information in question is not synced to the underlying
>     > > > filesystem, so GFS can't really do the job of keeping this
>     info in
>     > > > sync between the nodes.
>     > > >
>     > > > I am sure other people on the list can provide more details
>     of the
>     > > > problem and status of any progress :-)
>     > >
>     > > So... This means there is a problem only when you want to run
>     > > multiple samba
>     > > servers in a cluster? There should be no problem sharing the
>     > > same GFS disk
>     > > for one samba instance and one NFS instance running on
>     > > separate nodes (or
>     > > even on the same node during maintenance)?
>     > >
>     > > --
>     > > birger
>     > >
>     > > --
>     > > Linux-cluster mailing list
>     > > Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>     > > https://www.redhat.com/mailman/listinfo/linux-cluster
>     >
>     > --
>     > Linux-cluster mailing list
>     > Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>     > https://www.redhat.com/mailman/listinfo/linux-cluster
>
>     --
>     Gruss / Regards,
>
>     Marc Grimme
>     Phone: +49-89 121 409-54
>     http://www.atix.de/               http://www.open-sharedroot.org/
>
>     **
>     ATIX - Ges. fuer Informationstechnologie und Consulting mbH
>     Einsteinstr. 10 - 85716 Unterschleissheim - Germany
>
>     --
>     Linux-cluster mailing list
>     Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>     https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
>
>
> -- 
> Att.
> ---
> Filipe T Miranda
> RHCE - Red Hat Certified Engineer
> OCP8i - Oracle Certified Professional
>
>------------------------------------------------------------------------
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster
>


From grimme at atix.de  Wed Mar 15 13:58:06 2006
From: grimme at atix.de (Marc Grimme)
Date: Wed, 15 Mar 2006 14:58:06 +0100
Subject: [Linux-cluster] samba on gfs
In-Reply-To: <a6d13c780603150541s1b5a8c85pb09bf90bb76197fc@mail.gmail.com>
References: <000f01c647f9$ec572240$0132a8c0@eit.deakin.edu.au>
	<200603150844.38490.grimme@atix.de>
	<a6d13c780603150541s1b5a8c85pb09bf90bb76197fc@mail.gmail.com>
Message-ID: <200603151458.07066.grimme@atix.de>

On Wednesday 15 March 2006 14:41, Filipe Miranda wrote:
> Just one question about running multiple samba servers using GFS....
> You can have the same dada to be shared on multiple servers, but each samba
> server will need a differente NETBIOS name on the network right?
Yes. Every server also needs two IPs ond VIP and one IP and every Server needs 
to be registered in the Windows domain.
Just like the "active/active" Microsoft Cluster Services work.
>
> So if I have a WindowsXP client machine, it will have to map 2(or more)
> network paths to reach each samba server.
No. Basically every server serves exactly its shares exclusively. HA-services 
take over the availability. But the active/active granularity is the share. 
If you would like to have real loadblancing you would need a samba which is 
capable of that (I also heard samba4 will solve that issue).
>
> Am I correct? If so, how to address this issue so the user can have just
> one network mapped drive accessing multiple samba servers(if necessary, if
> one machine fails for example?)
That is not possible.
Sorry for the missunderstanding
Marc.
>
> Att.
> Filipe Miranda
>
> On 3/15/06, Marc Grimme <grimme at atix.de> wrote:
> > Hello,
> > we have some customers that have samba clusters with GFS running for a
> > long
> > time without problems. There are some things you need to take into
> > account but nevertheless samba on GFS runs very well even when you export
> > the same data via NFS.
> > E.G. one customer runs a "active/active" Samba/NFS Cluster on GFS as ADS
> > Member for about a year (up to 600 Users) without problems.
> > I would say samba and GFS is a very nice combination.
> > Regards Marc.
> >
> > On Wednesday 15 March 2006 07:30, Robert Ruge wrote:
> > > I have had problems with samba running on just one node. It works for
> > > a while and then samba just starts locking up.
> > >
> > > Not a reccomended path if you ask me.
> > >
> > > Robert
> > >
> > > > -----Original Message-----
> > > > From: linux-cluster-bounces at redhat.com
> > > > [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Birger Wathne
> > > > Sent: Wednesday, 15 March 2006 5:17
> > > > To: linux clustering
> > > > Subject: Re: [Linux-cluster] samba on gfs
> > > >
> > > > Erling Nygaard wrote:
> > > > > Birger
> > > > >
> > > > > The short story is that Samba keeps some state information
> > > >
> > > > internally.
> > > >
> > > > > So there are issues with keeping multiple Samba serves in sync.
> > > > > The information in question is not synced to the underlying
> > > > > filesystem, so GFS can't really do the job of keeping this info in
> > > > > sync between the nodes.
> > > > >
> > > > > I am sure other people on the list can provide more details of the
> > > > > problem and status of any progress :-)
> > > >
> > > > So... This means there is a problem only when you want to run
> > > > multiple samba
> > > > servers in a cluster? There should be no problem sharing the
> > > > same GFS disk
> > > > for one samba instance and one NFS instance running on
> > > > separate nodes (or
> > > > even on the same node during maintenance)?
> > > >
> > > > --
> > > > birger
> > > >
> > > > --
> > > > Linux-cluster mailing list
> > > > Linux-cluster at redhat.com
> > > > https://www.redhat.com/mailman/listinfo/linux-cluster
> > >
> > > --
> > > Linux-cluster mailing list
> > > Linux-cluster at redhat.com
> > > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
> > --
> > Gruss / Regards,
> >
> > Marc Grimme
> > Phone: +49-89 121 409-54
> > http://www.atix.de/               http://www.open-sharedroot.org/
> >
> > **
> > ATIX - Ges. fuer Informationstechnologie und Consulting mbH
> > Einsteinstr. 10 - 85716 Unterschleissheim - Germany
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Att.
> ---
> Filipe T Miranda
> RHCE - Red Hat Certified Engineer
> OCP8i - Oracle Certified Professional

-- 
Gruss / Regards,

Marc Grimme
Phone: +49-89 121 409-54
http://www.atix.de/               http://www.open-sharedroot.org/

**
ATIX - Ges. fuer Informationstechnologie und Consulting mbH
Einsteinstr. 10 - 85716 Unterschleissheim - Germany


From lhh at redhat.com  Wed Mar 15 16:04:58 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 15 Mar 2006 11:04:58 -0500
Subject: [Linux-cluster] RHCS/RHEL3 power switches options
In-Reply-To: <a6d13c780603141214m743df803nff97fe3566dd71e4@mail.gmail.com>
References: <a6d13c780603141214m743df803nff97fe3566dd71e4@mail.gmail.com>
Message-ID: <1142438698.19535.17.camel@ayanami.boston.redhat.com>

On Tue, 2006-03-14 at 17:14 -0300, Filipe Miranda wrote:
> Hello,
> 
> I'm having a really hard time trying to figure out what power switches
> do work with RHCS/RHEL3.
> Did anyone implement RHCS/RHEL3 with power switches ? 
> What options do I have when using power switches with this solution?
> I appreciate any help

These have been known to work:

* WTI NPS, IPS, or TPS series.
* WTI RPS10 (serial; two node only)
* APC 9211, 9212.
* APC 9225 with 9606 management card.
* APC 7900 and 7921 have been tried by some people with 1.2.22 and
later.

-- Lon


From lhh at redhat.com  Wed Mar 15 16:05:21 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 15 Mar 2006 11:05:21 -0500
Subject: [Linux-cluster] RHCS/RHEL3 power switches options
In-Reply-To: <1142369880.2489.39.camel@adingman.cin.cook>
References: <a6d13c780603141214m743df803nff97fe3566dd71e4@mail.gmail.com>
	<1142369880.2489.39.camel@adingman.cin.cook>
Message-ID: <1142438721.19535.19.camel@ayanami.boston.redhat.com>

On Tue, 2006-03-14 at 15:58 -0500, Andrew C. Dingman wrote:
> I'm using APC AP7901 power switches in two different REL3 clusters.

Woot.  The 7901 works too ;)

-- Lon


From lhh at redhat.com  Wed Mar 15 16:09:49 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 15 Mar 2006 11:09:49 -0500
Subject: [Linux-cluster] RHCS/RHEL3 power switches options
In-Reply-To: <a6d13c780603141311g63ace521tba2ea8ee82779981@mail.gmail.com>
References: <a6d13c780603141214m743df803nff97fe3566dd71e4@mail.gmail.com>
	<1142369880.2489.39.camel@adingman.cin.cook>
	<a6d13c780603141311g63ace521tba2ea8ee82779981@mail.gmail.com>
Message-ID: <1142438989.19535.25.camel@ayanami.boston.redhat.com>

On Tue, 2006-03-14 at 18:11 -0300, Filipe Miranda wrote:
> Andrew, 
> 
> It definatelly helps, I was researching and I found this type of fence
> device that would work with RHEL3/RHCS:
> http://www.wti.com/rps10-ec.htm

Personally, I prefer IPS800 over dual RPS10 - they provide two power
sources, and the ability to turn off machines from remote for testing,
or other maintenance conditions.

The IPS800 has two power rails, allowing NSPF configurations (if done
intelligently...), and can control more devices for only a little bit
more money than two RPS-10s.

-- Lon


From lhh at redhat.com  Wed Mar 15 16:15:25 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 15 Mar 2006 11:15:25 -0500
Subject: [Linux-cluster] rhcs doubts.
In-Reply-To: <44170B05.20805@gmail.com>
References: <44170B05.20805@gmail.com>
Message-ID: <1142439325.19535.31.camel@ayanami.boston.redhat.com>

On Tue, 2006-03-14 at 15:27 -0300, Allyson - Listas wrote:
> Hi guys,
> 
> I'm new at redhat cluster suite.  Could Anybody help me in some questions?
> 
> 1st) I installed rhcs on 2 virtual machines and create a new cluster, 
> setup a manual fence, a failvoer domain, create a IP resource and a 
> service that uses just that IP for tests. Well, I'd like to know how can 
> I force a failover of the service between nodes. This option is not 
> available at system-config-cluster that allow just disable and enable 
> the service.   

Drag it to the other node in the gui.


> I noticed that the ip service created is not a virtual 
> interface like eth0:1, but it was working because I could ping it, Is it 
> Normal?

Yes, try "/sbin/ip addr list", which is noted in the documentation.


> 2nd) What is the real fuction of a fence device?

Prevent data corruption in the event of a live-hang of a node with
outstanding dirty buffers.


> 3rd) How can I setup a quorum device, and isn't necessary for a failover 
> service?  I read that it was needed at rhel3 but at rhel4 is not 
> anymore, could you explain me that.

What do you need to know?  It's not needed because of the way CMAN
recovers - see

http://people.redhat.com/teigland/sca.pdf

-- Lon


From filipe.miranda at gmail.com  Wed Mar 15 17:41:18 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Wed, 15 Mar 2006 14:41:18 -0300
Subject: [Linux-cluster] RHCS/RHEL3 power switches options
In-Reply-To: <1142438989.19535.25.camel@ayanami.boston.redhat.com>
References: <a6d13c780603141214m743df803nff97fe3566dd71e4@mail.gmail.com>
	<1142369880.2489.39.camel@adingman.cin.cook>
	<a6d13c780603141311g63ace521tba2ea8ee82779981@mail.gmail.com>
	<1142438989.19535.25.camel@ayanami.boston.redhat.com>
Message-ID: <a6d13c780603150941k6327a6e5v892f288640c13f5a@mail.gmail.com>

But will all those models work with RHEL3/RHCS ?

Or RHEL4/RHCS?

Thanks a lot in advance

Att.
Filipe Miranda

On 3/15/06, Lon Hohberger <lhh at redhat.com> wrote:
>
> On Tue, 2006-03-14 at 18:11 -0300, Filipe Miranda wrote:
> > Andrew,
> >
> > It definatelly helps, I was researching and I found this type of fence
> > device that would work with RHEL3/RHCS:
> > http://www.wti.com/rps10-ec.htm
>
> Personally, I prefer IPS800 over dual RPS10 - they provide two power
> sources, and the ability to turn off machines from remote for testing,
> or other maintenance conditions.
>
> The IPS800 has two power rails, allowing NSPF configurations (if done
> intelligently...), and can control more devices for only a little bit
> more money than two RPS-10s.
>
> -- Lon
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


--
Att.
---
Filipe T Miranda
RHCE - Red Hat Certified Engineer
OCP8i - Oracle Certified Professional
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060315/514471f2/attachment.htm>

From lhh at redhat.com  Wed Mar 15 18:20:25 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 15 Mar 2006 13:20:25 -0500
Subject: [Linux-cluster] RHCS/RHEL3 power switches options
In-Reply-To: <a6d13c780603150941k6327a6e5v892f288640c13f5a@mail.gmail.com>
References: <a6d13c780603141214m743df803nff97fe3566dd71e4@mail.gmail.com>
	<1142369880.2489.39.camel@adingman.cin.cook>
	<a6d13c780603141311g63ace521tba2ea8ee82779981@mail.gmail.com>
	<1142438989.19535.25.camel@ayanami.boston.redhat.com>
	<a6d13c780603150941k6327a6e5v892f288640c13f5a@mail.gmail.com>
Message-ID: <1142446826.19535.40.camel@ayanami.boston.redhat.com>

On Wed, 2006-03-15 at 14:41 -0300, Filipe Miranda wrote:
> But will all those models work with RHEL3/RHCS ?

> Or RHEL4/RHCS?

With the exception of the APC 9211, 9212, and 9225 (which only work on
RHCS3, and I think are out of their support lifetime anyway), all of the
models (or series) I noted should work on both RHCS3 and RHCS4.

If you have a specific question about a specific switch, ask away.

-- Lon


From lhh at redhat.com  Wed Mar 15 18:23:23 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 15 Mar 2006 13:23:23 -0500
Subject: [Linux-cluster] Re: CS4 behavior on killall -9 (Lon Hohberger)
In-Reply-To: <4417C344.4040907@bull.net>
References: <4417C344.4040907@bull.net>
Message-ID: <1142447003.19535.44.camel@ayanami.boston.redhat.com>

On Wed, 2006-03-15 at 08:33 +0100, Alain Moulle wrote:
> On Mon, 2006-03-13 at 11:38 +0100, Alain Moulle wrote:
> 
> >>>> Hi
> >>>> On a HA pair in mutual takeover, it seems that if we do a "killall -9" on >one
> >>>> node, there is no failover, the CS4 seems to be stalled .
> >>>> Any reason ? idea ?
> 
> >>Killall -9 on what specifically...?
> That's a killall so ... nothing specifically, but all ...
> Just to simulate sort of system hang ...
> Did someone has give it a try ?

On RHCS4, you probably won't get much of the desired result here.

CMAN (which manages membership transitions, among other things) runs in
the kernel.  I'm not sure (off the top of my head) that sending one of
the kernel threads a SIGKILL actually has any guaranteed effect...

-- Lon


From jparsons at redhat.com  Wed Mar 15 19:23:43 2006
From: jparsons at redhat.com (James Parsons)
Date: Wed, 15 Mar 2006 14:23:43 -0500
Subject: [Linux-cluster] RHCS/RHEL3 power switches options
In-Reply-To: <1142446826.19535.40.camel@ayanami.boston.redhat.com>
References: <a6d13c780603141214m743df803nff97fe3566dd71e4@mail.gmail.com>	<1142369880.2489.39.camel@adingman.cin.cook>	<a6d13c780603141311g63ace521tba2ea8ee82779981@mail.gmail.com>	<1142438989.19535.25.camel@ayanami.boston.redhat.com>	<a6d13c780603150941k6327a6e5v892f288640c13f5a@mail.gmail.com>
	<1142446826.19535.40.camel@ayanami.boston.redhat.com>
Message-ID: <441869BF.30008@redhat.com>

Lon Hohberger wrote:

>On Wed, 2006-03-15 at 14:41 -0300, Filipe Miranda wrote:
>  
>
>>But will all those models work with RHEL3/RHCS ?
>>    
>>
>
>  
>
>>Or RHEL4/RHCS?
>>    
>>
>
>With the exception of the APC 9211, 9212, and 9225 (which only work on
>RHCS3, and I think are out of their support lifetime anyway), all of the
>models (or series) I noted should work on both RHCS3 and RHCS4.
>
>If you have a specific question about a specific switch, ask away.
>

Yes! Now is the time to ask about APC switches...I will be down in their 
facility in less than two weeks testing our scripts against all of their 
latest products and firmware. If you have a particular switch in mind 
for a project, I will try and test that exact model for you.

-J


From brilong at cisco.com  Wed Mar 15 19:51:25 2006
From: brilong at cisco.com (Brian Long)
Date: Wed, 15 Mar 2006 14:51:25 -0500
Subject: [Linux-cluster] RHCS/RHEL3 power switches options
In-Reply-To: <1142438698.19535.17.camel@ayanami.boston.redhat.com>
References: <a6d13c780603141214m743df803nff97fe3566dd71e4@mail.gmail.com>
	<1142438698.19535.17.camel@ayanami.boston.redhat.com>
Message-ID: <1142452285.4416.46.camel@brilong-lnx>

On Wed, 2006-03-15 at 11:04 -0500, Lon Hohberger wrote:
> On Tue, 2006-03-14 at 17:14 -0300, Filipe Miranda wrote:
> > Hello,
> > 
> > I'm having a really hard time trying to figure out what power switches
> > do work with RHCS/RHEL3.
> > Did anyone implement RHCS/RHEL3 with power switches ? 
> > What options do I have when using power switches with this solution?
> > I appreciate any help
> 
> These have been known to work:
> 
> * WTI NPS, IPS, or TPS series.
> * WTI RPS10 (serial; two node only)
> * APC 9211, 9212.
> * APC 9225 with 9606 management card.
> * APC 7900 and 7921 have been tried by some people with 1.2.22 and
> later.

Given all the talk about remotely-managed power switches, are they still
needed when you can fence at the ILO level on an HP Proliant, for
example?  You can also fence most tier 1 servers using ipmitool, right?

/Brian/

-- 
       Brian Long                      |         |           |
       IT Data Center Systems          |       .|||.       .|||.
       Cisco Linux Developer           |   ..:|||||||:...:|||||||:..
       Phone: (919) 392-7363           |   C i s c o   S y s t e m s


From Birger.Wathne at ift.uib.no  Wed Mar 15 21:48:56 2006
From: Birger.Wathne at ift.uib.no (Birger Wathne)
Date: Wed, 15 Mar 2006 22:48:56 +0100
Subject: [Linux-cluster] stress-testing GFS ?
Message-ID: <44188BC8.2070800@ift.uib.no>

I would like to put my cluster through a little controlled hell before 
declaring it ready for production.

Is there any kind of stress-test/verification procedure to 'certify' 
shared storage with GFS?
Ideally there would be some distributed software that could be run in a 
cluster to check that the shared storage behaves as expected under all 
kinds of load. Throughput, concurrent writing, GFS locking, file system 
locking, etc...
Something that could interface with GFS internals to see that everything 
was 'right' at every step.

Since I have seen nothing about the issue, I assume something like that 
doesn't exist, so... Any ideas on how to stress test GFS?
Homegrown scripts? Known problems with hardware that a test should look for?


-- 
birger


From mwill at penguincomputing.com  Wed Mar 15 22:20:28 2006
From: mwill at penguincomputing.com (Michael Will)
Date: Wed, 15 Mar 2006 14:20:28 -0800
Subject: [Linux-cluster] stress-testing GFS ?
In-Reply-To: <44188BC8.2070800@ift.uib.no>
References: <44188BC8.2070800@ift.uib.no>
Message-ID: <4418932C.9080001@jellyfish.highlyscyld.com>

iozone does test for a lot of different access patterns, and can create 
nice spreadsheets including graphs
from the point of view of a single node. It also has a multiple node 
flag for running it across a cluster. See -+m and -t
options. It knows how to use 'rsh' and can also be configured for any 
other remote execution command by setting the
enviroment variable RSH to say ssh or bpsh.

Don't forget to post your benchmark results to this mailinglist ;-)

Michael

Birger Wathne wrote:

> I would like to put my cluster through a little controlled hell before 
> declaring it ready for production.
>
> Is there any kind of stress-test/verification procedure to 'certify' 
> shared storage with GFS?
> Ideally there would be some distributed software that could be run in 
> a cluster to check that the shared storage behaves as expected under 
> all kinds of load. Throughput, concurrent writing, GFS locking, file 
> system locking, etc...
> Something that could interface with GFS internals to see that 
> everything was 'right' at every step.
>
> Since I have seen nothing about the issue, I assume something like 
> that doesn't exist, so... Any ideas on how to stress test GFS?
> Homegrown scripts? Known problems with hardware that a test should 
> look for?
>
>


From toxictux at gmail.com  Wed Mar 15 22:44:14 2006
From: toxictux at gmail.com (toxictux)
Date: Wed, 15 Mar 2006 16:44:14 -0600
Subject: [Linux-cluster] Cluster Newbie Questions........
Message-ID: <17df45710603151444s51d1d34dpbdcad003f421a625@mail.gmail.com>

hi all,
    i am setting up a two node cluster with San based shared storage.
i have following questions regarding my setup.

1. i am planning to use this cluster to setup a web based application.
i saw the example in redhat documentation on how to setup http service
in cluster. is it possible to have http and mysql services served by
same cluster??

2. can i set them up on the same LUN after i created 2 separate partitions??

3. which lock manager is recommended DLM or GuLM?

Thanks,
-F


From Matthew.Patton.ctr at osd.mil  Wed Mar 15 23:14:05 2006
From: Matthew.Patton.ctr at osd.mil (Patton, Matthew F, CTR, OSD-PA&E)
Date: Wed, 15 Mar 2006 18:14:05 -0500
Subject: [Linux-cluster] stress-testing GFS ?
Message-ID: <D8063DF686D10247B0A49D01271285690CE91D0D@osdn06.osd.mil>

Classification: UNCLASSIFIED

on a related note, should I anticipate an major gotcha's with respect to a
30 nodes on a GFS volume? I intend to run GFS 6.1 with DLM. Using RHEL4u2 at
the moment. 

(I posted a query earlier today but received no responses so I'm wondering
if this is getting thru.)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060315/c87197f5/attachment.htm>

From Birger.Wathne at ift.uib.no  Thu Mar 16 00:13:29 2006
From: Birger.Wathne at ift.uib.no (Birger Wathne)
Date: Thu, 16 Mar 2006 01:13:29 +0100
Subject: [Linux-cluster] stress-testing GFS ?
In-Reply-To: <4418932C.9080001@jellyfish.highlyscyld.com>
References: <44188BC8.2070800@ift.uib.no>
	<4418932C.9080001@jellyfish.highlyscyld.com>
Message-ID: <4418ADA9.8060609@ift.uib.no>

Michael Will wrote:

> iozone does test for a lot of different access patterns, and can 
> create nice spreadsheets including graphs
> from the point of view of a single node. It also has a multiple node 
> flag for running it across a cluster. See -+m and -t
> options. It knows how to use 'rsh' and can also be configured for any 
> other remote execution command by setting the
> enviroment variable RSH to say ssh or bpsh.
>
> Don't forget to post your benchmark results to this mailinglist ;-)
>
I used iozone and some homegrown scripts some years ago to test 
performance of various raid controllers as well as software raid on Sun 
systems. Always in single-node configurations.

The easiest way to communicate the performance of a raid controller to 
other people was a series of 3d surface plots. Sadly, OpenOffice doesn't 
have those, so I had to switch to that commercial office package. I 
tried gnuplot, but frankly.... compare the readability of the final plot 
with excel and there was no comparison :-/ Perhaps Matlab...

What I hoped for was something that also verified that the internal 
states of glm and the locking subsystem were as they should at every 
step of the test. Something that could certify that the hardware behaved 
as GFS expected it to when pushed more than test performance.

-- 
birger


From orcl.listas at gmail.com  Thu Mar 16 00:20:45 2006
From: orcl.listas at gmail.com (Allyson - Listas)
Date: Wed, 15 Mar 2006 21:20:45 -0300
Subject: [Linux-cluster] rhcs doubts.
In-Reply-To: <1142439325.19535.31.camel@ayanami.boston.redhat.com>
References: <44170B05.20805@gmail.com>
	<1142439325.19535.31.camel@ayanami.boston.redhat.com>
Message-ID: <4418AF5D.5040009@gmail.com>

Lon Hohberger wrote:

>On Tue, 2006-03-14 at 15:27 -0300, Allyson - Listas wrote:
>  
>
>>Hi guys,
>>
>>I'm new at redhat cluster suite.  Could Anybody help me in some questions?
>>
>>1st) I installed rhcs on 2 virtual machines and create a new cluster, 
>>setup a manual fence, a failvoer domain, create a IP resource and a 
>>service that uses just that IP for tests. Well, I'd like to know how can 
>>I force a failover of the service between nodes. This option is not 
>>available at system-config-cluster that allow just disable and enable 
>>the service.   
>>    
>>
>
>Drag it to the other node in the gui.
>  
>
    -->  I couldn't find this option at gui, but i find how to do it at 
command line..
[root at cs02 /]# clustat
Member Status: Quorate

  Member Name                              Status
  ------ ----                              ------
  cs01.example.com                         Online, rgmanager
  cs02.example.com                         Online, Local, rgmanager

  Service Name         Owner (Last)                   State
  ------- ----         ----- ------                   -----
  vip50                cs02.example.com               started
  oracle-ha-fs         cs02.example.com               started
[root at cs02 /]# clusvcadm -r oracle-ha-fs -m cs01.example.com
Trying to relocate oracle-ha-fs to cs01.example.com...success
[root at cs02 /]# clusvcadm -r vip50 -m cs01.example.com
Trying to relocate vip50 to cs01.example.com...success
[root at cs02 /]# clustat
Member Status: Quorate

  Member Name                              Status
  ------ ----                              ------
  cs01.example.com                         Online, rgmanager
  cs02.example.com                         Online, Local, rgmanager

  Service Name         Owner (Last)                   State
  ------- ----         ----- ------                   -----
  vip50                cs01.example.com               started
  oracle-ha-fs         cs01.example.com               started

>
>  
>
>>I noticed that the ip service created is not a virtual 
>>interface like eth0:1, but it was working because I could ping it, Is it 
>>Normal?
>>    
>>
>
>Yes, try "/sbin/ip addr list", which is noted in the documentation.
>
>
>  
>
tks

>>2nd) What is the real fuction of a fence device?
>>    
>>
>
>Prevent data corruption in the event of a live-hang of a node with
>outstanding dirty buffers.
>
>
>  
>
>>3rd) How can I setup a quorum device, and isn't necessary for a failover 
>>service?  I read that it was needed at rhel3 but at rhel4 is not 
>>anymore, could you explain me that.
>>    
>>
>
>What do you need to know?  It's not needed because of the way CMAN
>recovers - see
>
>http://people.redhat.com/teigland/sca.pdf
>
>-- Lon
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster
>
>  
>


-- 
Allyson A. Brito
MSN: allysonbrito78 at hotmail.com
SKYPE: allysonbrito
RHCE / LPI-1 / SCSA
OCP DBA 9i / OCA PL/SQL 9i


From jparsons at redhat.com  Thu Mar 16 00:28:10 2006
From: jparsons at redhat.com (James Parsons)
Date: Wed, 15 Mar 2006 19:28:10 -0500
Subject: [Linux-cluster] rhcs doubts.
In-Reply-To: <4418AF5D.5040009@gmail.com>
References: <44170B05.20805@gmail.com>	<1142439325.19535.31.camel@ayanami.boston.redhat.com>
	<4418AF5D.5040009@gmail.com>
Message-ID: <4418B11A.5010704@redhat.com>

Allyson - Listas wrote:

> Lon Hohberger wrote:
>
>> On Tue, 2006-03-14 at 15:27 -0300, Allyson - Listas wrote:
>>  
>>
>>> Hi guys,
>>>
>>> I'm new at redhat cluster suite.  Could Anybody help me in some 
>>> questions?
>>>
>>> 1st) I installed rhcs on 2 virtual machines and create a new 
>>> cluster, setup a manual fence, a failvoer domain, create a IP 
>>> resource and a service that uses just that IP for tests. Well, I'd 
>>> like to know how can I force a failover of the service between 
>>> nodes. This option is not available at system-config-cluster that 
>>> allow just disable and enable the service.     
>>
>>
>> Drag it to the other node in the gui.
>>  
>>
>    -->  I couldn't find this option at gui, but i find how to do it at 
> command line.. 

Just grab the service you want to move in the management view, and drag 
it to the node you want it to run on in the upper half of the GUI. :-)

-j

>
> [root at cs02 /]# clustat
> Member Status: Quorate
>
>  Member Name                              Status
>  ------ ----                              ------
>  cs01.example.com                         Online, rgmanager
>  cs02.example.com                         Online, Local, rgmanager
>
>  Service Name         Owner (Last)                   State
>  ------- ----         ----- ------                   -----
>  vip50                cs02.example.com               started
>  oracle-ha-fs         cs02.example.com               started
> [root at cs02 /]# clusvcadm -r oracle-ha-fs -m cs01.example.com
> Trying to relocate oracle-ha-fs to cs01.example.com...success
> [root at cs02 /]# clusvcadm -r vip50 -m cs01.example.com
> Trying to relocate vip50 to cs01.example.com...success
> [root at cs02 /]# clustat
> Member Status: Quorate
>
>  Member Name                              Status
>  ------ ----                              ------
>  cs01.example.com                         Online, rgmanager
>  cs02.example.com                         Online, Local, rgmanager
>
>  Service Name         Owner (Last)                   State
>  ------- ----         ----- ------                   -----
>  vip50                cs01.example.com               started
>  oracle-ha-fs         cs01.example.com               started
>
>>
>>  
>>
>>> I noticed that the ip service created is not a virtual interface 
>>> like eth0:1, but it was working because I could ping it, Is it Normal?
>>>   
>>
>>
>> Yes, try "/sbin/ip addr list", which is noted in the documentation.
>>
>>
>>  
>>
> tks
>
>>> 2nd) What is the real fuction of a fence device?
>>>   
>>
>>
>> Prevent data corruption in the event of a live-hang of a node with
>> outstanding dirty buffers.
>>
>>
>>  
>>
>>> 3rd) How can I setup a quorum device, and isn't necessary for a 
>>> failover service?  I read that it was needed at rhel3 but at rhel4 
>>> is not anymore, could you explain me that.
>>>   
>>
>>
>> What do you need to know?  It's not needed because of the way CMAN
>> recovers - see
>>
>> http://people.redhat.com/teigland/sca.pdf
>>
>> -- Lon
>>
>> -- 
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>>  
>>
>
>


From orcl.listas at gmail.com  Thu Mar 16 00:33:26 2006
From: orcl.listas at gmail.com (Allyson - Listas)
Date: Wed, 15 Mar 2006 21:33:26 -0300
Subject: [Linux-cluster] Cluster Newbie Questions........
In-Reply-To: <17df45710603151444s51d1d34dpbdcad003f421a625@mail.gmail.com>
References: <17df45710603151444s51d1d34dpbdcad003f421a625@mail.gmail.com>
Message-ID: <4418B256.4050505@gmail.com>

toxictux wrote:

>hi all,
>    i am setting up a two node cluster with San based shared storage.
>i have following questions regarding my setup.
>
>1. i am planning to use this cluster to setup a web based application.
>i saw the example in redhat documentation on how to setup http service
>in cluster. is it possible to have http and mysql services served by
>same cluster??
>
>  
>
Yes, just make 2 diferent scripts to a better management.

>2. can i set them up on the same LUN after i created 2 separate partitions??
>
>  
>
Yes, no problems. Just remember that devices on Linux can change 
depending on your scsi id, target, lun, etc... to avoid problems I mount 
filesystems using LABEL and not the device /dev/sdX.

>3. which lock manager is recommended DLM or GuLM?
>Thanks,
>-F
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>https://www.redhat.com/mailman/listinfo/linux-cluster
>
>  
>


-- 
Allyson A. Brito
MSN: allysonbrito78 at hotmail.com
SKYPE: allysonbrito
RHCE / LPI-1 / SCSA
OCP DBA 9i / OCA PL/SQL 9i


From toxictux at gmail.com  Thu Mar 16 02:01:40 2006
From: toxictux at gmail.com (toxictux)
Date: Wed, 15 Mar 2006 20:01:40 -0600
Subject: [Linux-cluster] Cluster Newbie Questions........
In-Reply-To: <4418B256.4050505@gmail.com>
References: <17df45710603151444s51d1d34dpbdcad003f421a625@mail.gmail.com>
	<4418B256.4050505@gmail.com>
Message-ID: <17df45710603151801r16a35c9ejd539fded8ddb711f@mail.gmail.com>

thanks allyson....


On 3/15/06, Allyson - Listas <orcl.listas at gmail.com> wrote:
> toxictux wrote:
>
> >hi all,
> >    i am setting up a two node cluster with San based shared storage.
> >i have following questions regarding my setup.
> >
> >1. i am planning to use this cluster to setup a web based application.
> >i saw the example in redhat documentation on how to setup http service
> >in cluster. is it possible to have http and mysql services served by
> >same cluster??
> >
> >
> >
> Yes, just make 2 diferent scripts to a better management.
>
> >2. can i set them up on the same LUN after i created 2 separate partitions??
> >
> >
> >
> Yes, no problems. Just remember that devices on Linux can change
> depending on your scsi id, target, lun, etc... to avoid problems I mount
> filesystems using LABEL and not the device /dev/sdX.
>
> >3. which lock manager is recommended DLM or GuLM?
> >Thanks,
> >-F
> >
> >--
> >Linux-cluster mailing list
> >Linux-cluster at redhat.com
> >https://www.redhat.com/mailman/listinfo/linux-cluster
> >
> >
> >
>
>
> --
> Allyson A. Brito
> MSN: allysonbrito78 at hotmail.com
> SKYPE: allysonbrito
> RHCE / LPI-1 / SCSA
> OCP DBA 9i / OCA PL/SQL 9i
>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From rohara at redhat.com  Thu Mar 16 03:10:34 2006
From: rohara at redhat.com (Ryan O'Hara)
Date: Wed, 15 Mar 2006 21:10:34 -0600
Subject: [Linux-cluster] Cluster Newbie Questions........
In-Reply-To: <17df45710603151444s51d1d34dpbdcad003f421a625@mail.gmail.com>
References: <17df45710603151444s51d1d34dpbdcad003f421a625@mail.gmail.com>
Message-ID: <4418D72A.2060603@redhat.com>

toxictux wrote:
> 
> 3. which lock manager is recommended DLM or GuLM?

DLM is recommended.


From suvankar_moitra at yahoo.com  Thu Mar 16 05:07:09 2006
From: suvankar_moitra at yahoo.com (SUVANKAR MOITRA)
Date: Wed, 15 Mar 2006 21:07:09 -0800 (PST)
Subject: [Linux-cluster] rhcs doubts.
In-Reply-To: <4418B11A.5010704@redhat.com>
Message-ID: <20060316050709.70034.qmail@web52309.mail.yahoo.com>

dear all,

thanks for the document , its helps a lot.

regards

Suvankar

--- James Parsons <jparsons at redhat.com> wrote:

> Allyson - Listas wrote:
> 
> > Lon Hohberger wrote:
> >
> >> On Tue, 2006-03-14 at 15:27 -0300, Allyson -
> Listas wrote:
> >>  
> >>
> >>> Hi guys,
> >>>
> >>> I'm new at redhat cluster suite.  Could Anybody
> help me in some 
> >>> questions?
> >>>
> >>> 1st) I installed rhcs on 2 virtual machines and
> create a new 
> >>> cluster, setup a manual fence, a failvoer
> domain, create a IP 
> >>> resource and a service that uses just that IP
> for tests. Well, I'd 
> >>> like to know how can I force a failover of the
> service between 
> >>> nodes. This option is not available at
> system-config-cluster that 
> >>> allow just disable and enable the service.     
> >>
> >>
> >> Drag it to the other node in the gui.
> >>  
> >>
> >    -->  I couldn't find this option at gui, but i
> find how to do it at 
> > command line.. 
> 
> Just grab the service you want to move in the
> management view, and drag 
> it to the node you want it to run on in the upper
> half of the GUI. :-)
> 
> -j
> 
> >
> > [root at cs02 /]# clustat
> > Member Status: Quorate
> >
> >  Member Name                              Status
> >  ------ ----                              ------
> >  cs01.example.com                         Online,
> rgmanager
> >  cs02.example.com                         Online,
> Local, rgmanager
> >
> >  Service Name         Owner (Last)                
>   State
> >  ------- ----         ----- ------                
>   -----
> >  vip50                cs02.example.com            
>   started
> >  oracle-ha-fs         cs02.example.com            
>   started
> > [root at cs02 /]# clusvcadm -r oracle-ha-fs -m
> cs01.example.com
> > Trying to relocate oracle-ha-fs to
> cs01.example.com...success
> > [root at cs02 /]# clusvcadm -r vip50 -m
> cs01.example.com
> > Trying to relocate vip50 to
> cs01.example.com...success
> > [root at cs02 /]# clustat
> > Member Status: Quorate
> >
> >  Member Name                              Status
> >  ------ ----                              ------
> >  cs01.example.com                         Online,
> rgmanager
> >  cs02.example.com                         Online,
> Local, rgmanager
> >
> >  Service Name         Owner (Last)                
>   State
> >  ------- ----         ----- ------                
>   -----
> >  vip50                cs01.example.com            
>   started
> >  oracle-ha-fs         cs01.example.com            
>   started
> >
> >>
> >>  
> >>
> >>> I noticed that the ip service created is not a
> virtual interface 
> >>> like eth0:1, but it was working because I could
> ping it, Is it Normal?
> >>>   
> >>
> >>
> >> Yes, try "/sbin/ip addr list", which is noted in
> the documentation.
> >>
> >>
> >>  
> >>
> > tks
> >
> >>> 2nd) What is the real fuction of a fence device?
> >>>   
> >>
> >>
> >> Prevent data corruption in the event of a
> live-hang of a node with
> >> outstanding dirty buffers.
> >>
> >>
> >>  
> >>
> >>> 3rd) How can I setup a quorum device, and isn't
> necessary for a 
> >>> failover service?  I read that it was needed at
> rhel3 but at rhel4 
> >>> is not anymore, could you explain me that.
> >>>   
> >>
> >>
> >> What do you need to know?  It's not needed
> because of the way CMAN
> >> recovers - see
> >>
> >> http://people.redhat.com/teigland/sca.pdf
> >>
> >> -- Lon
> >>
> >> -- 
> >> Linux-cluster mailing list
> >> Linux-cluster at redhat.com
> >>
>
https://www.redhat.com/mailman/listinfo/linux-cluster
> >>
> >>  
> >>
> >
> >
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
>
https://www.redhat.com/mailman/listinfo/linux-cluster
> 


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From Anthony.Assi at irisa.fr  Thu Mar 16 08:58:21 2006
From: Anthony.Assi at irisa.fr (Anthony Assi)
Date: Thu, 16 Mar 2006 09:58:21 +0100
Subject: [Linux-cluster] Cluster Newbie Questions........
In-Reply-To: <4418D72A.2060603@redhat.com>
References: <17df45710603151444s51d1d34dpbdcad003f421a625@mail.gmail.com>
	<4418D72A.2060603@redhat.com>
Message-ID: <441928AD.4010209@irisa.fr>

Absolutely go for DLM;
and not for GULM, we are facing small problems with the lock servers of 
GULM,
and apperentley, it might not be supported with RHE5.

Ryan O'Hara wrote:

> toxictux wrote:
>
>>
>> 3. which lock manager is recommended DLM or GuLM?
>
>
> DLM is recommended.
>


From lhh at redhat.com  Thu Mar 16 14:56:14 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 16 Mar 2006 09:56:14 -0500
Subject: [Linux-cluster] Cluster Newbie Questions........
In-Reply-To: <4418B256.4050505@gmail.com>
References: <17df45710603151444s51d1d34dpbdcad003f421a625@mail.gmail.com>
	<4418B256.4050505@gmail.com>
Message-ID: <1142520974.19535.142.camel@ayanami.boston.redhat.com>

On Wed, 2006-03-15 at 21:33 -0300, Allyson - Listas wrote:

> >1. i am planning to use this cluster to setup a web based application.
> >i saw the example in redhat documentation on how to setup http service
> >in cluster. is it possible to have http and mysql services served by
> >same cluster??
> >
> Yes, just make 2 diferent scripts to a better management.

Correct.

> >2. can i set them up on the same LUN after i created 2 separate partitions??
> >
> Yes, no problems. Just remember that devices on Linux can change 
> depending on your scsi id, target, lun, etc... to avoid problems I mount 
> filesystems using LABEL and not the device /dev/sdX.

You can also look in to using CLVM for the device names, which will be
consistent cluster-wide.

If using the file system label, just type:

   LABEL=mylabel

...in the UI instead of "/dev/sda1".

> >3. which lock manager is recommended DLM or GuLM?

DLM for small node counts.

-- Lon


From theo at tkd.co.id  Thu Mar 16 15:18:58 2006
From: theo at tkd.co.id (Theodorus)
Date: Thu, 16 Mar 2006 22:18:58 +0700
Subject: [Linux-cluster] cluster suit 4.2
Message-ID: <TKDNETwspDKicUbvBDx0000019a@tkdnet.tkd.co.id>

Hi all, 

We need your help.

We have cluster suit 4.2 installed on RedHat AS 4.2.

The cluster system has been run well. The resource group can be relocated
when the one of the nodes is down.

But, if we disconnect all network cables of one node on purpose, the cluster
system stalled, why  ? 

Thanks for your help.

 
Rgds,

Theo

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060316/1b927591/attachment.htm>

From lhh at redhat.com  Thu Mar 16 15:18:14 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 16 Mar 2006 10:18:14 -0500
Subject: [Linux-cluster] stress-testing GFS ?
In-Reply-To: <D8063DF686D10247B0A49D01271285690CE91D0D@osdn06.osd.mil>
References: <D8063DF686D10247B0A49D01271285690CE91D0D@osdn06.osd.mil>
Message-ID: <1142522294.19535.146.camel@ayanami.boston.redhat.com>

On Wed, 2006-03-15 at 18:14 -0500, Patton, Matthew F, CTR, OSD-PA&E
wrote:
> Classification: UNCLASSIFIED
> 
> on a related note, should I anticipate an major gotcha's with respect
> to a 30 nodes on a GFS volume? I intend to run GFS 6.1 with DLM. Using
> RHEL4u2 at the moment. 

If there are any problems, it is probably a bug.

If you're using rgmanager, you might not want to run it on more than 16
of the 30 (uncharted waters).  Rgmanager is not needed for GFS at all,
though.

-- Lon


From lhh at redhat.com  Thu Mar 16 15:18:41 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 16 Mar 2006 10:18:41 -0500
Subject: [Linux-cluster] cluster suit 4.2
In-Reply-To: <TKDNETwspDKicUbvBDx0000019a@tkdnet.tkd.co.id>
References: <TKDNETwspDKicUbvBDx0000019a@tkdnet.tkd.co.id>
Message-ID: <1142522321.19535.148.camel@ayanami.boston.redhat.com>

On Thu, 2006-03-16 at 22:18 +0700, Theodorus wrote:
> Hi all, 
> 
> 
> We need your help.
> 
> 
> We have cluster suit 4.2 installed on RedHat AS 4.2.
> 
> 
> The cluster system has been run well. The resource group can be
> relocated when the one of the nodes is down.
> 
> 
> But, if we disconnect all network cables of one node on purpose, the
> cluster system stalled, why  ? 

What kind of fencing are you using?

-- Lon


From philip.r.dana at nwp01.usace.army.mil  Thu Mar 16 15:22:17 2006
From: philip.r.dana at nwp01.usace.army.mil (Philip R. Dana)
Date: Thu, 16 Mar 2006 07:22:17 -0800
Subject: [Linux-cluster] RHCS4 rgmanager/clurmgrd problem
Message-ID: <1142522537.12774.47.camel@nwp-wk-79033-l>

We have a two node active/passive cluster running bind as our master DNS
server. Shared storage is iSCSI on a NetApp Filer. The OS is CentOS 4.2.
Whenever the rgmanager service on the passive node is started/restarted,
the service resource on the active node fails in that named itself is
shut down. The only way to recover, as near as I can tell, is to set
autostart=0 in cluster.conf, reboot both nodes, then manually start the
service on one of the nodes. Is this by design, or an "undocumented
feature"?
Any help will be greatly appreciated. TIA.


From Matthew.Patton.ctr at osd.mil  Thu Mar 16 15:27:15 2006
From: Matthew.Patton.ctr at osd.mil (Patton, Matthew F, CTR, OSD-PA&E)
Date: Thu, 16 Mar 2006 10:27:15 -0500
Subject: [Linux-cluster] Cluster Newbie Questions........
Message-ID: <D8063DF686D10247B0A49D01271285690CE91D11@osdn06.osd.mil>

Classification: UNCLASSIFIED

> Lon Hohberger wrote:

> > >3. which lock manager is recommended DLM or GuLM?
> 
> DLM for small node counts.

can you define "small" for us? less than a dozen? up to 50?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060316/31cc59ea/attachment.htm>

From ben.yarwood at juno.co.uk  Thu Mar 16 15:52:16 2006
From: ben.yarwood at juno.co.uk (Ben Yarwood)
Date: Thu, 16 Mar 2006 15:52:16 -0000
Subject: [Linux-cluster] Fedora Updates
Message-ID: <037201c64911$99b95710$3964a8c0@WS076>

Our clusters currently run fedora 4 and we only update them using yum and
the updates-released repository.  It seems that apart from new kernel
modules, the cluster components have not been updated since June last year.
Other components seem to be updated more regularly.

Can anyone shed any light on why this is the case?

Cheers
Ben


From lhh at redhat.com  Thu Mar 16 16:27:42 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 16 Mar 2006 11:27:42 -0500
Subject: [Linux-cluster] cluster suit 4.2
In-Reply-To: <007d01c64914$9404bd70$4ee17bcb@golie>
References: <TKDNETwspDKicUbvBDx0000019a@tkdnet.tkd.co.id>
	<1142522321.19535.148.camel@ayanami.boston.redhat.com>
	<007d01c64914$9404bd70$4ee17bcb@golie>
Message-ID: <1142526462.19535.156.camel@ayanami.boston.redhat.com>

On Thu, 2006-03-16 at 23:13 +0700, Paul wrote:
> manual fence, because we have redundance PS, thx
> 

You need to run fence_ack_manual on the surviving node.  Note that
running manual fencing in production environments is not supported.

There is plenty of adequate remote power fencing hardware available
which will handle multiple power supplies.

-- Lon


From mwill at penguincomputing.com  Thu Mar 16 16:52:37 2006
From: mwill at penguincomputing.com (Michael Will)
Date: Thu, 16 Mar 2006 08:52:37 -0800
Subject: [Linux-cluster] stress-testing GFS ?
Message-ID: <433093DF7AD7444DA65EFAFE3987879C02A975@jellyfish.highlyscyld.com>

openoffice 2.0 does support the graphs. I could not move them around
like in excel
but I could definitely see the default view of them.

Michael 

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Birger Wathne
Sent: Wednesday, March 15, 2006 4:13 PM
To: linux clustering
Subject: Re: [Linux-cluster] stress-testing GFS ?

Michael Will wrote:

> iozone does test for a lot of different access patterns, and can 
> create nice spreadsheets including graphs from the point of view of a 
> single node. It also has a multiple node flag for running it across a 
> cluster. See -+m and -t options. It knows how to use 'rsh' and can 
> also be configured for any other remote execution command by setting 
> the enviroment variable RSH to say ssh or bpsh.
>
> Don't forget to post your benchmark results to this mailinglist ;-)
>
I used iozone and some homegrown scripts some years ago to test
performance of various raid controllers as well as software raid on Sun
systems. Always in single-node configurations.

The easiest way to communicate the performance of a raid controller to
other people was a series of 3d surface plots. Sadly, OpenOffice doesn't
have those, so I had to switch to that commercial office package. I
tried gnuplot, but frankly.... compare the readability of the final plot
with excel and there was no comparison :-/ Perhaps Matlab...

What I hoped for was something that also verified that the internal
states of glm and the locking subsystem were as they should at every
step of the test. Something that could certify that the hardware behaved
as GFS expected it to when pushed more than test performance.

--
birger

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From baostr at gmail.com  Thu Mar 16 18:39:17 2006
From: baostr at gmail.com (Boris Ostrovsky)
Date: Thu, 16 Mar 2006 13:39:17 -0500
Subject: [Linux-cluster] stress-testing GFS ?
Message-ID: <fd8dc0d20603161039m4b4a2001lf81c278474b2abdb@mail.gmail.com>

I have recently ran some very simple iozone tests on GFS (and OCFS2) and got
somewhat disappointing results. I am attaching the spreadsheet.

The first test was to measure single-node performance with ext3, GFS and
OCFS2 partition that I mounted in a single node. The second was to use two
nodes and run iozone in parallel (by hand, i.e. without -m/-t options).

Single node performances were comparable in terms of wallclock time,
although the benchmark values for ext3 were clearly better (so I am not sure
I understand why wallclock times are so close). 2-node numbers show
substantial performance degradation.

Note, I didn't do any tuning, mostly because I didn't find much
documentation on the subject (except that for OCFS2 I set cluster size to
1MB, which helped). The nodes were running FC4 with the disk connected to
nodes via Emulex HBA. and cluster tools 1.01

I'd be very interested to hear comments on the numbers and hopefully some
tuning suggestions.


Thanks.
-boris


Date: Wed, 15 Mar 2006 14:20:28 -0800
> From: Michael Will <mwill at penguincomputing.com>
> Subject: Re: [Linux-cluster] stress-testing GFS ?
> To: linux clustering <linux-cluster at redhat.com>
> Message-ID: <4418932C.9080001 at jellyfish.highlyscyld.com>
> Content-Type: text/plain; charset=ISO-8859-1; format=flowed
>
> iozone does test for a lot of different access patterns, and can create
> nice spreadsheets including graphs
> from the point of view of a single node. It also has a multiple node
> flag for running it across a cluster. See -+m and -t
> options. It knows how to use 'rsh' and can also be configured for any
> other remote execution command by setting the
> enviroment variable RSH to say ssh or bpsh.
>
> Don't forget to post your benchmark results to this mailinglist ;-)
>
> Michael
>
> Birger Wathne wrote:
>
> > I would like to put my cluster through a little controlled hell before
> > declaring it ready for production.
> >
> > Is there any kind of stress-test/verification procedure to 'certify'
> > shared storage with GFS?
> > Ideally there would be some distributed software that could be run in
> > a cluster to check that the shared storage behaves as expected under
> > all kinds of load. Throughput, concurrent writing, GFS locking, file
> > system locking, etc...
> > Something that could interface with GFS internals to see that
> > everything was 'right' at every step.
> >
> > Since I have seen nothing about the issue, I assume something like
> > that doesn't exist, so... Any ideas on how to stress test GFS?
> > Homegrown scripts? Known problems with hardware that a test should
> > look for?
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060316/d61c1e86/attachment.htm>
-------------- next part --------------
						random	random	bkwd	record	stride
		write	rewrite	read	reread	read	write	read	rewrite	read	fwrite	frewrite	fread	freread


ext3		12.5min	113718	8335	91962	186143	4345	515	9612	258904	6002	112859	7230	76225	139576
gfs		13.5min	27217	8337	50117	62312	1611	604	8233	81180	5749	33633	7958	53301	40331
ocfs2		14.5min	42102	9345	65887	92481	1210	566	8136	155370	5605	41571	8699	78925	71724

gfs(n1)		46min	21467	5159	29705	35512	348	172	808	81188	4970	32680	8039	35667	58961
gfs(n2)		48min	40046	3493	29565	25093	504	327	906	81390	456	30035	4085	24953	22493

ocfs2 (n1)	38min	26813	4375	27406	27408	367	251	892	156194	5038	49998	8882	80288	111914
ocfs2 (n2)	35.5min	22756	5330	36728	29607	673	400	907	153949	953	45964	5117	34055	40158

From Matthew.Patton.ctr at osd.mil  Thu Mar 16 19:16:31 2006
From: Matthew.Patton.ctr at osd.mil (Patton, Matthew F, CTR, OSD-PA&E)
Date: Thu, 16 Mar 2006 14:16:31 -0500
Subject: [Linux-cluster] stress-testing GFS ?
Message-ID: <D8063DF686D10247B0A49D01271285690CE91D14@osdn06.osd.mil>

Classification: UNCLASSIFIED
 
just idly wondering what the IO would be if NFS exporting the ext3 or gfs to
the other node and running iozone on 2 such clients. There was no file
contention was there? By that I mean each instance of iozone was writing to
a different directory (both on GFS) so file-level read/write locking wasn't
a factor. Presumably GFS locking is all about keeping the filesystem
meta-data intact. BTW, has anyone applied the idea behind SoftUpdates to
GFS? Say part of the heartbeat is a broadcast of the meta-data changes so
while data blocks might by written syncronously, not every meta-change has
to wait for the FC/array to commit it to disk before continuing? I'm
thinking of what we did for firewall-farm syncronization which was
1xActive/NxPassive and they all could handle each other's network traffic at
any time should the current master drop off with the only streams affected
being those initiated since the last status update message was sent out to
the passive nodes.
 
Would it work such that the nodes vote on a meta-master and all meta-data is
kept in memory and then periodically flushed? Because if each meta-change is
broadcast and each node spools it to local storage, then when it's time to
elect a new master the nodes can consult their transaction histories. Is
there a good paper that describes the detailed inner-workings of GFS aside
from having to read all the code? So far I've found this:
https://open.datacore.ch/DCwiki.open/Wiki.jsp?page=GFS
<https://open.datacore.ch/DCwiki.open/Wiki.jsp?page=GFS> 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060316/619de631/attachment.htm>

From philip.r.dana at nwp01.usace.army.mil  Thu Mar 16 20:16:41 2006
From: philip.r.dana at nwp01.usace.army.mil (Philip R. Dana)
Date: Thu, 16 Mar 2006 12:16:41 -0800
Subject: [Linux-cluster] RHCS4 rgmanager/clurmgrd problem
In-Reply-To: <1142522537.12774.47.camel@nwp-wk-79033-l>
References: <1142522537.12774.47.camel@nwp-wk-79033-l>
Message-ID: <1142540201.4163.8.camel@nwp-wk-79033-l>

I found a work around. Like the gentleman with the mysql service problem
a while back, I edited /etc/init.d/named on both nodes such than named
stop returns 0, even though named is already stopped. I'm not smart
enough, yet, to figure out why that works, but it does.

On Thu, 2006-03-16 at 07:22 -0800, Philip R. Dana wrote:
> We have a two node active/passive cluster running bind as our master DNS
> server. Shared storage is iSCSI on a NetApp Filer. The OS is CentOS 4.2.
> Whenever the rgmanager service on the passive node is started/restarted,
> the service resource on the active node fails in that named itself is
> shut down. The only way to recover, as near as I can tell, is to set
> autostart=0 in cluster.conf, reboot both nodes, then manually start the
> service on one of the nodes. Is this by design, or an "undocumented
> feature"?
> Any help will be greatly appreciated. TIA.
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From gforte at leopard.us.udel.edu  Thu Mar 16 20:26:43 2006
From: gforte at leopard.us.udel.edu (Greg Forte)
Date: Thu, 16 Mar 2006 15:26:43 -0500
Subject: [Linux-cluster] RHCS4 rgmanager/clurmgrd problem
In-Reply-To: <1142540201.4163.8.camel@nwp-wk-79033-l>
References: <1142522537.12774.47.camel@nwp-wk-79033-l>
	<1142540201.4163.8.camel@nwp-wk-79033-l>
Message-ID: <4419CA03.9090805@leopard.us.udel.edu>

this has been covered, previously, but in brief:

a) the cluster services try to stop a service before starting it when 
you enable it
b) it expects the "/etc/init.d/service stop" command to return 0, 
indicating that there was no problem
c) many of the stock service scripts return non-zero if you try to stop 
them when they're not running

depending on your point of view, (c) is the "correct" behavior or not; 
in the case of cluster services, it's obviously not.  For the purposes 
of cluster services, the script should only return non-zero on the 
'stop' command if the service was, in fact, running, and the script 
failed to stop it.  A better solution than simply returning 0 
braindeadly would be to check the output of the script's 'status' 
command, and only attempt the stop if it's actually running, then return 
non-zero if the stop fails, 0 (success) if the stop succeeds OR it 
wasn't running in the first place.  But that's a lot of work.  ;-)

-g

Philip R. Dana wrote:
> I found a work around. Like the gentleman with the mysql service problem
> a while back, I edited /etc/init.d/named on both nodes such than named
> stop returns 0, even though named is already stopped. I'm not smart
> enough, yet, to figure out why that works, but it does.
> 
> On Thu, 2006-03-16 at 07:22 -0800, Philip R. Dana wrote:
>> We have a two node active/passive cluster running bind as our master DNS
>> server. Shared storage is iSCSI on a NetApp Filer. The OS is CentOS 4.2.
>> Whenever the rgmanager service on the passive node is started/restarted,
>> the service resource on the active node fails in that named itself is
>> shut down. The only way to recover, as near as I can tell, is to set
>> autostart=0 in cluster.conf, reboot both nodes, then manually start the
>> service on one of the nodes. Is this by design, or an "undocumented
>> feature"?
>> Any help will be greatly appreciated. TIA.
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


-- 
Greg Forte
gforte at udel.edu
IT - User Services
University of Delaware
302-831-1982
Newark, DE


From philip.r.dana at nwp01.usace.army.mil  Thu Mar 16 21:48:01 2006
From: philip.r.dana at nwp01.usace.army.mil (Philip R. Dana)
Date: Thu, 16 Mar 2006 13:48:01 -0800
Subject: [Linux-cluster] RHCS4 rgmanager/clurmgrd problem
In-Reply-To: <4419CA03.9090805@leopard.us.udel.edu>
References: <1142522537.12774.47.camel@nwp-wk-79033-l>
	<1142540201.4163.8.camel@nwp-wk-79033-l>
	<4419CA03.9090805@leopard.us.udel.edu>
Message-ID: <1142545681.4163.11.camel@nwp-wk-79033-l>

Greg: Your explanation clarified for me what's happening and what needs
to be done. Thanks much.

On Thu, 2006-03-16 at 15:26 -0500, Greg Forte wrote:
> this has been covered, previously, but in brief:
> 
> a) the cluster services try to stop a service before starting it when 
> you enable it
> b) it expects the "/etc/init.d/service stop" command to return 0, 
> indicating that there was no problem
> c) many of the stock service scripts return non-zero if you try to stop 
> them when they're not running
> 
> depending on your point of view, (c) is the "correct" behavior or not; 
> in the case of cluster services, it's obviously not.  For the purposes 
> of cluster services, the script should only return non-zero on the 
> 'stop' command if the service was, in fact, running, and the script 
> failed to stop it.  A better solution than simply returning 0 
> braindeadly would be to check the output of the script's 'status' 
> command, and only attempt the stop if it's actually running, then return 
> non-zero if the stop fails, 0 (success) if the stop succeeds OR it 
> wasn't running in the first place.  But that's a lot of work.  ;-)
> 
> -g
> 
> Philip R. Dana wrote:
> > I found a work around. Like the gentleman with the mysql service problem
> > a while back, I edited /etc/init.d/named on both nodes such than named
> > stop returns 0, even though named is already stopped. I'm not smart
> > enough, yet, to figure out why that works, but it does.
> > 
> > On Thu, 2006-03-16 at 07:22 -0800, Philip R. Dana wrote:
> >> We have a two node active/passive cluster running bind as our master DNS
> >> server. Shared storage is iSCSI on a NetApp Filer. The OS is CentOS 4.2.
> >> Whenever the rgmanager service on the passive node is started/restarted,
> >> the service resource on the active node fails in that named itself is
> >> shut down. The only way to recover, as near as I can tell, is to set
> >> autostart=0 in cluster.conf, reboot both nodes, then manually start the
> >> service on one of the nodes. Is this by design, or an "undocumented
> >> feature"?
> >> Any help will be greatly appreciated. TIA.
> >>
> >> --
> >> Linux-cluster mailing list
> >> Linux-cluster at redhat.com
> >> https://www.redhat.com/mailman/listinfo/linux-cluster
> > 
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> > 
> 
> 


From zhendershot at cranel.com  Thu Mar 16 21:51:21 2006
From: zhendershot at cranel.com (Hendershot, Zach)
Date: Thu, 16 Mar 2006 16:51:21 -0500
Subject: [Linux-cluster] (no subject)
Message-ID: <BD37DA8FEF7D8949A5731245FF66CB2BD47869@POSTOFFICE.cranel.local>

All,
    I'm playing around with Red Hat Cluster Suite and I had a quick
question. I was solely reading the manual for CS4 up until a few hours
ago. Then I stumbled upon the CS3 manual. WOW, there is so much more
detail (for example: how do you use the clusvcadm command). Why are
people who only read the CS4 manual left out from all the good
information. Additionally, of course some of the CS3 stuff is out of
date so its additional information is a mixed bag. 
    Also, I'm having trouble with service failover. I have a script that
accepts {start, stop, status} for Apache. I manually fail Apache, and CS
basically puts the service in the failed state because it can't stop the
service (the stop command returns a 1 status, because it cant stop an
already stopped service). The service never gets failed over to another
node. And back to the documentation, why is something basic like script
creation and API left out? Am I looking in the wrong place or
overlooking some of this information? Maybe I'm just too used to VCS?
Thank you all very much for your help. Have a great day.
 
--------------

Zach Hendershot
Software Engineer
Cranel, Incorporated.
Phone: 614.318.4288
Fax: 614.431.8388
Email: zhendershot at cranel.com

Technology. Integrity. Focus.


From zhendershot at cranel.com  Thu Mar 16 22:03:25 2006
From: zhendershot at cranel.com (Hendershot, Zach)
Date: Thu, 16 Mar 2006 17:03:25 -0500
Subject: [Linux-cluster] RE: 
Message-ID: <BD37DA8FEF7D8949A5731245FF66CB2BD47873@POSTOFFICE.cranel.local>

All,
	Sorry about the missing subject, I was hasty. But, I just caught
a recent conversation dealing with the service script information, so
I've got that now. But I do still have questions about the
inconsistencies with the documentation, and I would appreciate any
enlightening anybody could shed. Thanks.

Zach

-----Original Message-----
From: Hendershot, Zach 
Sent: Thursday, March 16, 2006 4:51 PM
To: 'linux-cluster at redhat.com'
Subject: 

All,
    I'm playing around with Red Hat Cluster Suite and I had a quick
question. I was solely reading the manual for CS4 up until a few hours
ago. Then I stumbled upon the CS3 manual. WOW, there is so much more
detail (for example: how do you use the clusvcadm command). Why are
people who only read the CS4 manual left out from all the good
information. Additionally, of course some of the CS3 stuff is out of
date so its additional information is a mixed bag. 
    Also, I'm having trouble with service failover. I have a script that
accepts {start, stop, status} for Apache. I manually fail Apache, and CS
basically puts the service in the failed state because it can't stop the
service (the stop command returns a 1 status, because it cant stop an
already stopped service). The service never gets failed over to another
node. And back to the documentation, why is something basic like script
creation and API left out? Am I looking in the wrong place or
overlooking some of this information? Maybe I'm just too used to VCS?
Thank you all very much for your help. Have a great day.
 
--------------

Zach Hendershot
Software Engineer
Cranel, Incorporated.
Phone: 614.318.4288
Fax: 614.431.8388
Email: zhendershot at cranel.com

Technology. Integrity. Focus.


From lhh at redhat.com  Thu Mar 16 23:59:06 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 16 Mar 2006 18:59:06 -0500
Subject: [Linux-cluster] Cluster Newbie Questions........
In-Reply-To: <D8063DF686D10247B0A49D01271285690CE91D11@osdn06.osd.mil>
References: <D8063DF686D10247B0A49D01271285690CE91D11@osdn06.osd.mil>
Message-ID: <1142553546.19535.199.camel@ayanami.boston.redhat.com>

On Thu, 2006-03-16 at 10:27 -0500, Patton, Matthew F, CTR, OSD-PA&E
wrote:
> Classification: UNCLASSIFIED
> 
> > Lon Hohberger wrote:
> 
> > > >3. which lock manager is recommended DLM or GuLM? 
> >  
> > DLM for small node counts.
> 
> can you define "small" for us? less than a dozen? up to 50?

DLM should theoretically be very scalable, but I do not know what the
largest tested node count is at this point.

I am pretty sure at least 32 should work fine.

-- Lon


From lhh at redhat.com  Thu Mar 16 23:59:57 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 16 Mar 2006 18:59:57 -0500
Subject: [Linux-cluster] RHCS4 rgmanager/clurmgrd problem
In-Reply-To: <1142540201.4163.8.camel@nwp-wk-79033-l>
References: <1142522537.12774.47.camel@nwp-wk-79033-l>
	<1142540201.4163.8.camel@nwp-wk-79033-l>
Message-ID: <1142553597.19535.201.camel@ayanami.boston.redhat.com>

On Thu, 2006-03-16 at 12:16 -0800, Philip R. Dana wrote:
> I found a work around. Like the gentleman with the mysql service problem
> a while back, I edited /etc/init.d/named on both nodes such than named
> stop returns 0, even though named is already stopped. I'm not smart
> enough, yet, to figure out why that works, but it does.

*all* init scripts should do this.

-- Lon


From gforte at leopard.us.udel.edu  Fri Mar 17 14:37:42 2006
From: gforte at leopard.us.udel.edu (Greg Forte)
Date: Fri, 17 Mar 2006 09:37:42 -0500
Subject: [Linux-cluster] shared cluster.conf?
Message-ID: <441AC9B6.2050403@leopard.us.udel.edu>

Is there any reason not to put cluster.conf on a shared filesystem 
that's mounted in fstab, and symlink /etc/cluster/cluster.conf to that 
shared location?  Then one would only have to run cman_tool version -r 
<new_version_#> after updating the conf ... or am I missing some reason 
why this is a bad idea?

-g

Greg Forte
gforte at udel.edu
IT - User Services
University of Delaware
302-831-1982
Newark, DE


From zhendershot at cranel.com  Fri Mar 17 14:41:43 2006
From: zhendershot at cranel.com (Hendershot, Zach)
Date: Fri, 17 Mar 2006 09:41:43 -0500
Subject: [Linux-cluster] shared cluster.conf?
Message-ID: <BD37DA8FEF7D8949A5731245FF66CB2BD478D9@POSTOFFICE.cranel.local>

If you are running CS4 at least, all you have to do is make your changes
and then run "ccs_tool update /etc/cluster/cluster.conf" and it goes out
to all the nodes automatically. With this functionality I wouldn't want
to mess with a shared filesystem any potential mess you could get into.

Zach Hendershot

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Greg Forte
Sent: Friday, March 17, 2006 9:38 AM
To: linux clustering
Subject: [Linux-cluster] shared cluster.conf?

Is there any reason not to put cluster.conf on a shared filesystem
that's mounted in fstab, and symlink /etc/cluster/cluster.conf to that
shared location?  Then one would only have to run cman_tool version -r
<new_version_#> after updating the conf ... or am I missing some reason
why this is a bad idea?

-g

Greg Forte
gforte at udel.edu
IT - User Services
University of Delaware
302-831-1982
Newark, DE

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From carlopmart at gmail.com  Fri Mar 17 17:14:52 2006
From: carlopmart at gmail.com (carlopmart)
Date: Fri, 17 Mar 2006 18:14:52 +0100
Subject: [Linux-cluster] GFS at startup
Message-ID: <441AEE8C.20404@gmail.com>

Hi all,

  I have a simple doubt: where I can put gfs shared file system imported 
from gnbd's server: on fstab, on rc.local or on cluster.conf? i need 
this filesystem up before configured services on cluster.conf starts up.

Thanks.

-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From Matthew.Patton.ctr at osd.mil  Fri Mar 17 17:17:13 2006
From: Matthew.Patton.ctr at osd.mil (Patton, Matthew F, CTR, OSD-PA&E)
Date: Fri, 17 Mar 2006 12:17:13 -0500
Subject: [Linux-cluster] shared cluster.conf?
Message-ID: <D8063DF686D10247B0A49D01271285690CE91D19@osdn06.osd.mil>

Classification: UNCLASSIFIED

> Is there any reason not to put cluster.conf on a shared filesystem 
> that's mounted in fstab, and symlink /etc/cluster/cluster.conf to that 

wondered that myself. you can put the file on a shared volume (say /etc was
actually on GFS for each node). "update" should ideally have a flag or
better yet, a counterpart that says to "just reread the file and don't try
to write a new one."
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060317/f41fed82/attachment.htm>

From grimme at atix.de  Fri Mar 17 18:59:57 2006
From: grimme at atix.de (Marc Grimme)
Date: Fri, 17 Mar 2006 19:59:57 +0100
Subject: [Linux-cluster] shared cluster.conf?
In-Reply-To: <D8063DF686D10247B0A49D01271285690CE91D19@osdn06.osd.mil>
References: <D8063DF686D10247B0A49D01271285690CE91D19@osdn06.osd.mil>
Message-ID: <200603171959.57644.grimme@atix.de>

On Friday 17 March 2006 18:17, Patton, Matthew F, CTR, OSD-PA&E wrote:
> Classification: UNCLASSIFIED
>
> > Is there any reason not to put cluster.conf on a shared filesystem
> > that's mounted in fstab, and symlink /etc/cluster/cluster.conf to that
>
> wondered that myself. you can put the file on a shared volume (say /etc was
> actually on GFS for each node). "update" should ideally have a flag or
> better yet, a counterpart that says to "just reread the file and don't try
> to write a new one."
until now we didn't have any problems with cluster.conf on GFS. We are using 
it for sharedroots and did not encounter any problems with it. Even the 
ccs_tool update and cman_tool version works.
Regards Marc.
-- 
Gruss / Regards,

Marc Grimme
Phone: +49-89 121 409-54
http://www.atix.de/               http://www.open-sharedroot.org/

**
ATIX - Ges. fuer Informationstechnologie und Consulting mbH
Einsteinstr. 10 - 85716 Unterschleissheim - Germany


From lhh at redhat.com  Fri Mar 17 19:24:46 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 17 Mar 2006 14:24:46 -0500
Subject: [Linux-cluster] shared cluster.conf?
In-Reply-To: <441AC9B6.2050403@leopard.us.udel.edu>
References: <441AC9B6.2050403@leopard.us.udel.edu>
Message-ID: <1142623486.8266.54.camel@ayanami.boston.redhat.com>

On Fri, 2006-03-17 at 09:37 -0500, Greg Forte wrote:
> Is there any reason not to put cluster.conf on a shared filesystem 
> that's mounted in fstab, and symlink /etc/cluster/cluster.conf to that 
> shared location?  Then one would only have to run cman_tool version -r 
> <new_version_#> after updating the conf ... or am I missing some reason 
> why this is a bad idea?

Circular dependency:

+-> configuration
|    v  
|   cluster infrastructure
|    v
+-- gfs

-- Lon


From lhh at redhat.com  Fri Mar 17 19:27:24 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 17 Mar 2006 14:27:24 -0500
Subject: [Linux-cluster] GFS at startup
In-Reply-To: <441AEE8C.20404@gmail.com>
References: <441AEE8C.20404@gmail.com>
Message-ID: <1142623644.8266.56.camel@ayanami.boston.redhat.com>

On Fri, 2006-03-17 at 18:14 +0100, carlopmart wrote:
> Hi all,
> 
>   I have a simple doubt: where I can put gfs shared file system imported 
> from gnbd's server: on fstab, on rc.local or on cluster.conf? i need 
> this filesystem up before configured services on cluster.conf starts up.

If you put them in fstab, the GFS volumes should be mounted before
rgmanager starts.

-- Lon


From ptader at fnal.gov  Fri Mar 17 21:38:59 2006
From: ptader at fnal.gov (Paul Tader)
Date: Fri, 17 Mar 2006 15:38:59 -0600
Subject: [Linux-cluster] lock_dlm kernel panics
Message-ID: <441B2C73.3010202@fnal.gov>

We're experiencing random kernel panics that all seem to be attributed 
to the lock_dlm module.


(panic text from 3 different systems):

/var/log/messages.1:Mar  9 10:01:45 node1 kernel: EIP is at 
do_dlm_lock+0x134/0x14e [lock_dlm]

/var/log/messages.1:Mar  6 16:33:41 node1 kernel: EIP is at 
do_dlm_unlock+0x8b/0xa0 [lock_dlm]

/var/log/messages.1:Mar  7 22:28:53 node1 kernel: EIP is at 
do_dlm_lock+0x134/0x14e [lock_dlm]

/var/log/messages.3:Feb 22 13:35:07 node2 kernel: EIP is at 
do_dlm_lock+0x134/0x14e [lock_dlm]

/var/log/messages.4:Feb 18 12:17:04 node2 kernel: EIP is at 
do_dlm_unlock+0x8b/0xa0 [lock_dlm]

/var/log/messages.3:Feb 23 04:46:01 node3 kernel: EIP is at 
do_dlm_lock+0x134/0x14e [lock_dlm]


On average, nodes stay up for about a week.  The work load is steady and 
is mostly disk I/O.  These nodes were running RHES3 with GFS 6.0. 
During that setup, we experienced much more frequent panics, even when 
the nodes weren't being used.

My thought is that this is a hardware problem.  Disk array, fibre switch 
or HBA?  But in the hopes that there is some addition GFS turning or 
diagnostics I can perform that will either lead me to a hardware problem 
or GFS configuration change, I'm posting this message.

Software:
- RHES4
- GFS-6.1.2-0
- GFS-kernel-2.6.9-49.1
- One, 1Tb GFS partition

Hardware:
- 5 nodes total
- Dual Xeon CPU's 2.66GHz
- 2 Gb ram
- 1 Gb eth0
- QLogic QLA2200


Latest complete panic message:
---------------------------

Mar 17 11:38:02 nodename kernel:
Mar 17 11:38:02 nodename kernel: d0 purged 0 requests
Mar 17 11:38:02 nodename kernel: d0 mark waiting requests
Mar 17 11:38:02 nodename kernel: d0 marked 0 requests
Mar 17 11:38:02 nodename kernel: d0 recover event 17 done
Mar 17 11:38:02 nodename kernel: d0 move flags 0,0,1 ids 14,17,17
Mar 17 11:38:02 nodename kernel: d0 process held requests
Mar 17 11:38:02 nodename kernel: d0 processed 0 requests
Mar 17 11:38:02 nodename kernel: d0 resend marked requests
Mar 17 11:38:02 nodename kernel: d0 resent 0 requests
Mar 17 11:38:02 nodename kernel: d0 recover event 17 finished
Mar 17 11:38:02 nodename kernel: d0 send einval to 5
Mar 17 11:38:02 nodename kernel: d0 send einval to 5
Mar 17 11:38:02 nodename kernel: d0 (1983) req reply einval 2da2006d fr 
2 r 2
        5  9
Mar 17 11:38:02 nodename kernel: d0 send einval to 5
Mar 17 11:38:02 nodename kernel: d0 send einval to 3
Mar 17 11:38:02 nodename kernel: d0 (1983) req reply einval 410803b0 fr 
5 r 5
        5  a
Mar 17 11:38:02 nodename kernel: d0 (1983) req reply einval 456f03d1 fr 
2 r 2
        5  1
Mar 17 11:38:02 nodename kernel: d0 send einval to 5
Mar 17 11:38:02 nodename kernel: d0 send einval to 5
Mar 17 11:38:02 nodename kernel: d0 send einval to 3
Mar 17 11:38:02 nodename kernel: d0 send einval to 3
Mar 17 11:38:02 nodename kernel: d0 (1983) req reply einval aca103f2 fr 
5 r 5
        5  2
Mar 17 11:38:02 nodename kernel: d0 grant lock on lockqueue 3
Mar 17 11:38:02 nodename kernel: d0 process_lockqueue_reply id bbfe0396 
state 0
Mar 17 11:38:02 nodename kernel: d0 (1983) req reply einval d2d20215 fr 
2 r 2
        5  9
Mar 17 11:38:02 nodename kernel: d0 (1983) req reply einval d5a60059 fr 
5 r 5
        5  d
Mar 17 11:38:02 nodename kernel: d0 (1983) req reply einval d886008f fr 
3 r 3
        5  e
Mar 17 11:38:02 nodename kernel: d0 (1983) req reply einval 3130220 fr 2 r 2
        5 c3
Mar 17 11:38:02 nodename kernel: d0 unlock fe20017a no id
Mar 17 11:38:02 nodename kernel: 1976 pr_start last_stop 0 last_start 4
last_finish 0
Mar 17 11:38:02 nodename kernel: 1976 pr_start count 4 type 2 event 4 
flags 250
Mar 17 11:38:02 nodename kernel: 1976 claim_jid 2
Mar 17 11:38:02 nodename kernel: 1976 pr_start 4 done 1
Mar 17 11:38:02 nodename kernel: 1976 pr_finish flags 5a
Mar 17 11:38:02 nodename kernel: 1968 recovery_done jid 2 msg 309 a
Mar 17 11:38:02 nodename kernel: 1968 recovery_done nodeid 4 flg 18
Mar 17 11:38:02 nodename kernel: 1976 pr_start last_stop 4 last_start 8
last_finish 4
Mar 17 11:38:02 nodename kernel: 1976 pr_start count 5 type 2 event 8 
flags 21a
Mar 17 11:38:02 nodename kernel: 1976 pr_start 8 done 1
Mar 17 11:38:02 nodename kernel: 1976 pr_finish flags 1a
Mar 17 11:38:02 nodename kernel: 1976 rereq 3,624b610 id 7f1d022e 5,0
Mar 17 11:38:02 nodename kernel: 1976 pr_start last_stop 8 last_start 9
last_finish 8
Mar 17 11:38:02 nodename kernel: 1976 pr_start count 4 type 1 event 9 
flags 21a
Mar 17 11:38:02 nodename kernel: 1976 pr_start cb jid 0 id 2
Mar 17 11:38:02 nodename kernel: 1976 pr_start 9 done 0
Mar 17 11:38:02 nodename kernel: 1980 recovery_done jid 0 msg 308 11a
Mar 17 11:38:02 nodename kernel: 1980 recovery_done nodeid 2 flg 1b
Mar 17 11:38:02 nodename kernel: 1980 recovery_done start_done 9
Mar 17 11:38:02 nodename kernel: 1976 rereq 3,263e6dd id 7e2d01b9 3,0
Mar 17 11:38:02 nodename kernel: 1977 pr_finish flags 1a
Mar 17 11:38:02 nodename kernel: 1976 pr_start last_stop 9 last_start 13
last_finish 9
Mar 17 11:38:02 nodename kernel: 1976 pr_start count 5 type 2 event 13 
flags 21a
Mar 17 11:38:02 nodename kernel: 1976 pr_start 13 done 1
Mar 17 11:38:02 nodename kernel: 1976 pr_finish flags 1a
Mar 17 11:38:02 nodename kernel: 1976 pr_start last_stop 13 last_start 14
last_finish 13
Mar 17 11:38:02 nodename kernel: 1976 pr_start count 4 type 1 event 14 
flags 21a
Mar 17 11:38:02 nodename kernel: 1976 pr_start cb jid 4 id 5
Mar 17 11:38:02 nodename kernel: 1976 pr_start 14 done 0
Mar 17 11:38:02 nodename kernel: 1980 recovery_done jid 4 msg 308 11a
Mar 17 11:38:02 nodename kernel: 1980 recovery_done nodeid 5 flg 1b
Mar 17 11:38:02 nodename kernel: 1980 recovery_done start_done 14
Mar 17 11:38:02 nodename kernel: 1977 pr_finish flags 1a
Mar 17 11:38:02 nodename kernel: 1976 pr_start last_stop 14 last_start 18
last_finish 14
Mar 17 11:38:02 nodename kernel: 1976 pr_start count 5 type 2 event 18 
flags 21a
Mar 17 11:38:02 nodename kernel: 1976 pr_start 18 done 1
Mar 17 11:38:02 nodename kernel: 1976 pr_finish flags 1a
Mar 17 11:38:02 nodename kernel:
Mar 17 11:38:02 nodename kernel: lock_dlm:  Assertion failed on line 357 of
file /mnt/src/4/BUILD/gfs-kernel-2.6.9-45/smp/src/dlm/lock.c
Mar 17 11:38:02 nodename kernel: lock_dlm:  assertion:  "!error"
Mar 17 11:38:02 nodename kernel: lock_dlm:  time = 783572508
Mar 17 11:38:03 nodename kernel: d0: error=-22 num=3,a458688 lkf=9 flags=84
Mar 17 11:38:03 nodename kernel:
Mar 17 11:38:03 nodename kernel: ------------[ cut here ]------------
Mar 17 11:38:03 nodename kernel: kernel BUG at
/mnt/src/4/BUILD/gfs-kernel-2.6.9-45/smp/src/dlm/lock.c:357!
Mar 17 11:38:03 nodename kernel: invalid operand: 0000 [#1]
Mar 17 11:38:03 nodename kernel: SMP
Mar 17 11:38:03 nodename kernel: Modules linked in: parport_pc lp parport
autofs4 lock_dlm(U) gfs(U) lock_harness(U) nfs lockd dlm(U) cman(U) md5 ipv6
sunrpc dm_mirror button battery ac uhci_hcd ehci_hcd e100 mii e1000 floppy
ext3 jbd dm_mod qla2200 qla2xxx scsi_transport_fc sd_mod scsi_mod
Mar 17 11:38:03 nodename kernel: CPU:    1
Mar 17 11:38:03 nodename kernel: EIP:    0060:[<f8bbb5f3>]    Not 
tainted VLI
Mar 17 11:38:03 nodename kernel: EFLAGS: 00010246   (2.6.9-22.0.2.ELsmp)
Mar 17 11:38:03 nodename kernel: EIP is at do_dlm_unlock+0x8b/0xa0 
[lock_dlm]
Mar 17 11:38:03 nodename kernel: eax: 00000001   ebx: f518d380   ecx:
f5857f2c   edx: f8bc0155Mar 17 11:38:03 nodename kernel: esi: ffffffea 
  edi:
f518d380   ebp: f8c3f000   esp: f5857f28Mar 17 11:38:03 nodename kernel: ds:
007b   es: 007b   ss: 0068
Mar 17 11:38:03 nodename kernel: Process gfs_glockd (pid: 1979,
threadinfo=f5857000 task=f5b588b0)
Mar 17 11:38:03 nodename kernel: Stack: f8bc0155 f8c3f000 00000003 f8bbb893
f8d19612 00000001 f514c268 f514c24c
Mar 17 11:38:03 nodename kernel:        f8d0f89e f8d44440 f4bf0cc0 f514c24c
f8d44440 f514c24c f8d0ed97 f514c24c
Mar 17 11:38:03 nodename kernel:        00000001 f514c2e0 f8d0ee4e f514c24c
f514c268 f8d0ef71 00000001 f514c268
Mar 17 11:38:03 nodename kernel: Call Trace:
Mar 17 11:38:03 nodename kernel:  [<f8bbb893>] lm_dlm_unlock+0x14/0x1c 
[lock_dlm]
Mar 17 11:38:03 nodename kernel:  [<f8d19612>] gfs_lm_unlock+0x2c/0x42 [gfs]
Mar 17 11:38:03 nodename kernel:  [<f8d0f89e>] 
gfs_glock_drop_th+0xf3/0x12d [gfs]
Mar 17 11:38:03 nodename kernel:  [<f8d0ed97>] rq_demote+0x7f/0x98 [gfs]
Mar 17 11:38:03 nodename kernel:  [<f8d0ee4e>] run_queue+0x5a/0xc1 [gfs]
Mar 17 11:38:03 nodename kernel:  [<f8d0ef71>] unlock_on_glock+0x1f/0x28 
[gfs]
Mar 17 11:38:03 nodename kernel:  [<f8d10ed0>] 
gfs_reclaim_glock+0xc3/0x13c [gfs]
Mar 17 11:38:03 nodename kernel:  [<f8d03e01>] gfs_glockd+0x39/0xde [gfs]
Mar 17 11:38:03 nodename kernel:  [<c011e481>] default_wake_function+0x0/0xc
Mar 17 11:38:03 nodename kernel:  [<c02d13b2>] ret_from_fork+0x6/0x14
Mar 17 11:38:03 nodename kernel:  [<c011e481>] default_wake_function+0x0/0xc
Mar 17 11:38:03 nodename kernel:  [<f8d03dc8>] gfs_glockd+0x0/0xde [gfs]
Mar 17 11:38:03 nodename kernel:  [<c01041f1>] kernel_thread_helper+0x5/0xb
Mar 17 11:38:03 nodename kernel: Code: 73 34 8b 03 ff 73 2c ff 73 08 ff 
73 04
ff 73 0c 56 ff 70 18 68 4d 02 bc f8 e8 84 6c 56 c7 83 c4 34 68 55 01 bc f8
e8 77 6c 56 c7 <0f> 0b 65 01 a2 00 bc f8 68 57 01 bc f8 e8 32 64 56 c7 
5b 5e c3
Mar 17 11:38:03 nodename kernel:  <0>Fatal exception: panic in 5 seconds
Mar 17 13:08:01 nodename syslogd 1.4.1: restart.


Thanks,
Paul


-- 
===========================================================================
Paul Tader  <ptader at fnal.gov>  Computing Div/CSS Dept
Fermi National Accelerator Lab; PO Box 500 Batavia, IL 60510-0500


From teigland at redhat.com  Fri Mar 17 22:40:43 2006
From: teigland at redhat.com (David Teigland)
Date: Fri, 17 Mar 2006 16:40:43 -0600
Subject: [Linux-cluster] lock_dlm kernel panics
In-Reply-To: <441B2C73.3010202@fnal.gov>
References: <441B2C73.3010202@fnal.gov>
Message-ID: <20060317224043.GC29244@redhat.com>

On Fri, Mar 17, 2006 at 03:38:59PM -0600, Paul Tader wrote:
> Mar 17 11:38:02 nodename kernel: d0 unlock fe20017a no id

GFS is trying to unlock a lock that doesn't exist which causes the panic.
We know this happens if cman shuts down the dlm while it's in use (cman
does this if it's lost connection with the cluster.)  There's some new
output in the RHEL4U3 dlm that should tell us if that's in fact what's
happening or if there's some other cause that we need to uncover.

So, you should look on all nodes for any cman messages in
/var/log/messages or the console.  And when you're using the latest
version look for the new dlm message "WARNING: dlm_emergency_shutdown".  

Dave


From mag.andersen at gmail.com  Fri Mar 17 22:53:44 2006
From: mag.andersen at gmail.com (Magnus Andersen)
Date: Fri, 17 Mar 2006 17:53:44 -0500
Subject: [Linux-cluster] Unable to mount GFS on RHEL 3 U6
Message-ID: <5ea165840603171453v2e8ba0f6m3630c2bd3eec7f7c@mail.gmail.com>

Hi All,

I've successfully installed and configured GFS on my three nodes, but when I
try to mount the filesystem the prompt hangs until I kill the mount
command.  All servers are running RHEL 3 AS/ES U6 with the
2.4.21-37.0.1.ELsmp kernel and are connected to a MSA1500 SAN via FC.  I've
installed the following GFS rpms:

[root at oradw root]# rpm -qa | grep -i gfs
GFS-modules-6.0.2.27-0.1
GFS-modules-smp-6.0.2.27-0.1
GFS-6.0.2.27-0.1


Here is my pool configuration files and the output from pool_tool -s

[root at backup gfs]# cat cluster_cca.cfg
poolname cluster_cca
subpools 1
subpool 0 0 1
pooldevice 0 0 /dev/sda1
[root at backup gfs]# cat pool0.cfg
poolname pool_gfs1
subpools 1
subpool 0 0 1
pooldevice 0 0 /dev/sda2
[root at backup gfs]# cat pool1.cfg
poolname pool_gfs2
subpools 1
subpool 0 0 1
pooldevice 0 0 /dev/sdb
[root at backup gfs]# pool_tool -s
  Device                                            Pool Label
  ======                                            ==========
  /dev/pool/cluster_cca                       <- CCA device ->
  /dev/pool/pool_gfs1                     <- GFS filesystem ->
  /dev/pool/pool_gfs2                     <- GFS filesystem ->
  /dev/cciss/c0d0                  <- partition information ->
  /dev/cciss/c0d0p1                    <- EXT2/3 filesystem ->
  /dev/cciss/c0d0p2                          <- swap device ->
  /dev/cciss/c0d0p3                       <- lvm1 subdevice ->
  /dev/sda                         <- partition information ->
  /dev/sda1                                        cluster_cca
  /dev/sda2                                          pool_gfs1
  /dev/sdb                                           pool_gfs2


Here are my ccs files.

[root at backup cluster_cca]# cat cluster.ccs
cluster {
        name = "cluster_cca"
        lock_gulm {
                servers = ["backup", "oradw", "gistest2"]
        }
}
[root at backup cluster_cca]# cat fence.ccs
fence_devices {
        manual {
                agent = "fence_manual"
        }
}
[root at backup cluster_cca]# cat nodes.ccs
nodes {
        backup {
                ip_interfaces {
                        eth1 = "10.0.0.1"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = "10.0.0.1"
                                }
                        }
                }
        }
        oradw {
                ip_interfaces {
                        eth4 = "10.0.0.2"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = "10.0.0.2"
                                }
                        }
                }
        }
        gistest2 {
                ip_interfaces {
                        eth0 = "10.0.0.3"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = "10.0.0.3"
                                }
                        }
                }
        }
}


Here is the command I used to create the filesystem:

gfs_mkfs -p lock_gulm -t cluster_cca:pool_gfs2 -j 10 /dev/pool/pool_gfs2


Mount command that hangs:

mount -t gfs /dev/pool/pool_gfs2 /gfs2

Here is the output I see in my messages log file.  I see the last 5 lines
repeated for each time I tried to mount the filesystem.

Mar 17 15:47:05 backup ccsd[2645]: Starting ccsd 6.0.2.27:
Mar 17 15:47:05 backup ccsd[2645]:  Built: Jan 30 2006 15:28:33
Mar 17 15:47:05 backup ccsd[2645]:  Copyright (C) Red Hat, Inc.  2004  All
rights reserved.
Mar 17 15:48:10 backup lock_gulmd[2652]: Starting lock_gulmd 6.0.2.27.
(built Jan 30 2006 15:28:54) Copyright (C) 2004 Red Hat, Inc.  All rights
reserved.
Mar 17 15:48:10 backup lock_gulmd[2652]: You are running in Fail-over mode.
Mar 17 15:48:10 backup lock_gulmd[2652]: I am (backup) with ip (127.0.0.1)
Mar 17 15:48:10 backup lock_gulmd[2652]: Forked core [2653].
Mar 17 15:48:11 backup lock_gulmd[2652]: Forked locktable [2654].
Mar 17 15:48:12 backup lock_gulmd[2652]: Forked ltpx [2655].
Mar 17 15:48:12 backup lock_gulmd_core[2653]: I see no Masters, So I am
Arbitrating until enough Slaves talk to me.
Mar 17 15:48:12 backup lock_gulmd_core[2653]: Could not send quorum update
to slave backup
Mar 17 15:48:12 backup lock_gulmd_core[2653]: New generation of server
state. (1142628492484630)
Mar 17 15:48:12 backup lock_gulmd_LTPX[2655]: New Master at backup:127.0.0.1
Mar 17 15:52:14 backup kernel: Lock_Harness 6.0.2.27 (built Jan 30 2006
15:32:58) installed
Mar 17 15:52:14 backup kernel: GFS 6.0.2.27 (built Jan 30 2006 15:32:20)
installed
Mar 17 15:52:15 backup kernel: Gulm 6.0.2.27 (built Jan 30 2006 15:32:54)
installed
Mar 17 15:54:51 backup kernel: lock_gulm: ERROR cm_login failed. -512
Mar 17 15:54:51 backup kernel: lock_gulm: ERROR Got a -512 trying to start
the threads.
Mar 17 15:54:51 backup lock_gulmd_core[2653]: Error on xdr (GFS Kernel
Interface:127.0.0.1 idx:3 fd:8): (-104:104:Connection reset by peer)
Mar 17 15:54:51 backup kernel: lock_gulm: fsid=cluster_cca:gfs1: Exiting
gulm_mount with errors -512
Mar 17 15:54:51 backup kernel: GFS: can't mount proto = lock_gulm, table =
cluster_cca:gfs1, hostdata =


Result from gulm_tool:

[root at backup gfs]# gulm_tool nodelist backup
 Name: backup
  ip    = 127.0.0.1
  state = Logged in
  mode = Arbitrating
  missed beats = 0
  last beat = 1142632189718986
  delay avg = 10019686
  max delay = 10019735


I'm a newbie to clusters and I have no clue where to look next.  If any
other information is needed let me know.

Thanks,


--
Magnus Andersen
Systems Administrator / Oracle DBA
Walker & Associates, Inc.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060317/e8d2c2fd/attachment.htm>

From Britt.Treece at savvis.net  Fri Mar 17 23:09:12 2006
From: Britt.Treece at savvis.net (Treece, Britt)
Date: Fri, 17 Mar 2006 17:09:12 -0600
Subject: [Linux-cluster] Unable to mount GFS on RHEL 3 U6
Message-ID: <9A6FE0FCC2B29846824C5CD81C6647B90152A1D4@s228130hz1ew08.apptix-01.savvis.net>

Magnus,

 
Try starting ccsd and lock_gulmd on all three servers.  Once these start
you should be able to see all three in gulm_tool nodelist localhost.  At
that point you should be able to mount your GFS pool vol's.  

 
Your lock cluster has to have a quorum of greater than half the servers
configured in cluster.ccs, so at least 2 in your case before it will
allow a GFS vol to be mounted.

 
Regards,

 
Britt 

 
________________________________

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Magnus Andersen
Sent: Friday, March 17, 2006 4:54 PM
To: linux-cluster at redhat.com
Subject: [Linux-cluster] Unable to mount GFS on RHEL 3 U6

 
Hi All,

I've successfully installed and configured GFS on my three nodes, but
when I try to mount the filesystem the prompt hangs until I kill the
mount command.  All servers are running RHEL 3 AS/ES U6 with the
2.4.21-37.0.1.ELsmp kernel and are connected to a MSA1500 SAN via FC.
I've installed the following GFS rpms:

[root at oradw root]# rpm -qa | grep -i gfs
GFS-modules-6.0.2.27-0.1
GFS-modules-smp-6.0.2.27-0.1
GFS-6.0.2.27-0.1


Here is my pool configuration files and the output from pool_tool -s

[root at backup gfs]# cat cluster_cca.cfg
poolname cluster_cca
subpools 1
subpool 0 0 1
pooldevice 0 0 /dev/sda1
[root at backup gfs]# cat pool0.cfg
poolname pool_gfs1
subpools 1
subpool 0 0 1
pooldevice 0 0 /dev/sda2
[root at backup gfs]# cat pool1.cfg
poolname pool_gfs2
subpools 1
subpool 0 0 1
pooldevice 0 0 /dev/sdb
[root at backup gfs]# pool_tool -s
  Device                                            Pool Label
  ======                                            ==========
  /dev/pool/cluster_cca                       <- CCA device -> 
  /dev/pool/pool_gfs1                     <- GFS filesystem ->
  /dev/pool/pool_gfs2                     <- GFS filesystem ->
  /dev/cciss/c0d0                  <- partition information ->
  /dev/cciss/c0d0p1                    <- EXT2/3 filesystem -> 
  /dev/cciss/c0d0p2                          <- swap device ->
  /dev/cciss/c0d0p3                       <- lvm1 subdevice ->
  /dev/sda                         <- partition information ->
  /dev/sda1                                        cluster_cca 
  /dev/sda2                                          pool_gfs1
  /dev/sdb                                           pool_gfs2


Here are my ccs files.

[root at backup cluster_cca]# cat cluster.ccs
cluster {
        name = "cluster_cca"
        lock_gulm {
                servers = ["backup", "oradw", "gistest2"]
        }
}
[root at backup cluster_cca]# cat fence.ccs 
fence_devices {
        manual {
                agent = "fence_manual"
        }
}
[root at backup cluster_cca]# cat nodes.ccs
nodes {
        backup {
                ip_interfaces {
                        eth1 = "10.0.0.1"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = " 10.0.0.1"
                                }
                        }
                }
        }
        oradw {
                ip_interfaces {
                        eth4 = " 10.0.0.2"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = " 10.0.0.2"
                                }
                        }
                }
        }
        gistest2 {
                ip_interfaces {
                        eth0 = " 10.0.0.3"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = " 10.0.0.3"
                                }
                        }
                }
        }
}


Here is the command I used to create the filesystem: 

gfs_mkfs -p lock_gulm -t cluster_cca:pool_gfs2 -j 10 /dev/pool/pool_gfs2


Mount command that hangs:

mount -t gfs /dev/pool/pool_gfs2 /gfs2

Here is the output I see in my messages log file.  I see the last 5
lines repeated for each time I tried to mount the filesystem.

Mar 17 15:47:05 backup ccsd[2645]: Starting ccsd 6.0.2.27
<http://6.0.2.27/> :
Mar 17 15:47:05 backup ccsd[2645]:  Built: Jan 30 2006 15:28:33 
Mar 17 15:47:05 backup ccsd[2645]:  Copyright (C) Red Hat, Inc.  2004
All rights reserved. 
Mar 17 15:48:10 backup lock_gulmd[2652]: Starting lock_gulmd 6.0.2.27
<http://6.0.2.27/> . (built Jan 30 2006 15:28:54) Copyright (C) 2004 Red
Hat, Inc.  All rights reserved. 
Mar 17 15:48:10 backup lock_gulmd[2652]: You are running in Fail-over
mode. 
Mar 17 15:48:10 backup lock_gulmd[2652]: I am (backup) with ip
(127.0.0.1 <http://127.0.0.1/> )
Mar 17 15:48:10 backup lock_gulmd[2652]: Forked core [2653]. 
Mar 17 15:48:11 backup lock_gulmd[2652]: Forked locktable [2654]. 
Mar 17 15:48:12 backup lock_gulmd[2652]: Forked ltpx [2655].
Mar 17 15:48:12 backup lock_gulmd_core[2653]: I see no Masters, So I am
Arbitrating until enough Slaves talk to me.
Mar 17 15:48:12 backup lock_gulmd_core[2653]: Could not send quorum
update to slave backup 
Mar 17 15:48:12 backup lock_gulmd_core[2653]: New generation of server
state. (1142628492484630)
Mar 17 15:48:12 backup lock_gulmd_LTPX[2655]: New Master at backup:
127.0.0.1 <http://127.0.0.1/> 
Mar 17 15:52:14 backup kernel: Lock_Harness 6.0.2.27 <http://6.0.2.27/>
(built Jan 30 2006 15:32:58) installed
Mar 17 15:52:14 backup kernel: GFS 6.0.2.27 <http://6.0.2.27/>  (built
Jan 30 2006 15:32:20) installed
Mar 17 15:52:15 backup kernel: Gulm 6.0.2.27 <http://6.0.2.27/>  (built
Jan 30 2006 15:32:54) installed
Mar 17 15:54:51 backup kernel: lock_gulm: ERROR cm_login failed. -512 
Mar 17 15:54:51 backup kernel: lock_gulm: ERROR Got a -512 trying to
start the threads. 
Mar 17 15:54:51 backup lock_gulmd_core[2653]: Error on xdr (GFS Kernel
Interface:127.0.0.1 <http://127.0.0.1/>  idx:3 fd:8):
(-104:104:Connection reset by peer) 
Mar 17 15:54:51 backup kernel: lock_gulm: fsid=cluster_cca:gfs1: Exiting
gulm_mount with errors -512 
Mar 17 15:54:51 backup kernel: GFS: can't mount proto = lock_gulm, table
= cluster_cca:gfs1, hostdata =


Result from gulm_tool:

[root at backup gfs]# gulm_tool nodelist backup
 Name: backup
  ip    = 127.0.0.1 <http://127.0.0.1/> 
  state = Logged in
  mode = Arbitrating
  missed beats = 0
  last beat = 1142632189718986
  delay avg = 10019686 
  max delay = 10019735


I'm a newbie to clusters and I have no clue where to look next.  If any
other information is needed let me know. 

Thanks,


-- 
Magnus Andersen
Systems Administrator / Oracle DBA
Walker & Associates, Inc. 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060317/64322de7/attachment.htm>

From Britt.Treece at savvis.net  Fri Mar 17 23:21:09 2006
From: Britt.Treece at savvis.net (Treece, Britt)
Date: Fri, 17 Mar 2006 17:21:09 -0600
Subject: [Linux-cluster] Unable to mount GFS on RHEL 3 U6
Message-ID: <9A6FE0FCC2B29846824C5CD81C6647B90152A1D5@s228130hz1ew08.apptix-01.savvis.net>

Also, make sure your servers /etc/hosts file on all three servers looks
similar to...

 
127.0.0.1           localhost.localdomain localhost

10.0.0.1           backup

10.0.0.2           oradw

10.0.0.3           gistest2

 
Britt

 
________________________________

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Treece, Britt
Sent: Friday, March 17, 2006 5:09 PM
To: linux clustering
Subject: RE: [Linux-cluster] Unable to mount GFS on RHEL 3 U6

 
Magnus,

 
Try starting ccsd and lock_gulmd on all three servers.  Once these start
you should be able to see all three in gulm_tool nodelist localhost.  At
that point you should be able to mount your GFS pool vol's.  

 
Your lock cluster has to have a quorum of greater than half the servers
configured in cluster.ccs, so at least 2 in your case before it will
allow a GFS vol to be mounted.

 
Regards,

 
Britt 

 
________________________________

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Magnus Andersen
Sent: Friday, March 17, 2006 4:54 PM
To: linux-cluster at redhat.com
Subject: [Linux-cluster] Unable to mount GFS on RHEL 3 U6

 
Hi All,

I've successfully installed and configured GFS on my three nodes, but
when I try to mount the filesystem the prompt hangs until I kill the
mount command.  All servers are running RHEL 3 AS/ES U6 with the
2.4.21-37.0.1.ELsmp kernel and are connected to a MSA1500 SAN via FC.
I've installed the following GFS rpms:

[root at oradw root]# rpm -qa | grep -i gfs
GFS-modules-6.0.2.27-0.1
GFS-modules-smp-6.0.2.27-0.1
GFS-6.0.2.27-0.1


Here is my pool configuration files and the output from pool_tool -s

[root at backup gfs]# cat cluster_cca.cfg
poolname cluster_cca
subpools 1
subpool 0 0 1
pooldevice 0 0 /dev/sda1
[root at backup gfs]# cat pool0.cfg
poolname pool_gfs1
subpools 1
subpool 0 0 1
pooldevice 0 0 /dev/sda2
[root at backup gfs]# cat pool1.cfg
poolname pool_gfs2
subpools 1
subpool 0 0 1
pooldevice 0 0 /dev/sdb
[root at backup gfs]# pool_tool -s
  Device                                            Pool Label
  ======                                            ==========
  /dev/pool/cluster_cca                       <- CCA device -> 
  /dev/pool/pool_gfs1                     <- GFS filesystem ->
  /dev/pool/pool_gfs2                     <- GFS filesystem ->
  /dev/cciss/c0d0                  <- partition information ->
  /dev/cciss/c0d0p1                    <- EXT2/3 filesystem -> 
  /dev/cciss/c0d0p2                          <- swap device ->
  /dev/cciss/c0d0p3                       <- lvm1 subdevice ->
  /dev/sda                         <- partition information ->
  /dev/sda1                                        cluster_cca 
  /dev/sda2                                          pool_gfs1
  /dev/sdb                                           pool_gfs2


Here are my ccs files.

[root at backup cluster_cca]# cat cluster.ccs
cluster {
        name = "cluster_cca"
        lock_gulm {
                servers = ["backup", "oradw", "gistest2"]
        }
}
[root at backup cluster_cca]# cat fence.ccs 
fence_devices {
        manual {
                agent = "fence_manual"
        }
}
[root at backup cluster_cca]# cat nodes.ccs
nodes {
        backup {
                ip_interfaces {
                        eth1 = "10.0.0.1"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = " 10.0.0.1"
                                }
                        }
                }
        }
        oradw {
                ip_interfaces {
                        eth4 = " 10.0.0.2"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = " 10.0.0.2"
                                }
                        }
                }
        }
        gistest2 {
                ip_interfaces {
                        eth0 = " 10.0.0.3"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = " 10.0.0.3"
                                }
                        }
                }
        }
}


Here is the command I used to create the filesystem: 

gfs_mkfs -p lock_gulm -t cluster_cca:pool_gfs2 -j 10 /dev/pool/pool_gfs2


Mount command that hangs:

mount -t gfs /dev/pool/pool_gfs2 /gfs2

Here is the output I see in my messages log file.  I see the last 5
lines repeated for each time I tried to mount the filesystem.

Mar 17 15:47:05 backup ccsd[2645]: Starting ccsd 6.0.2.27
<http://6.0.2.27/> :
Mar 17 15:47:05 backup ccsd[2645]:  Built: Jan 30 2006 15:28:33 
Mar 17 15:47:05 backup ccsd[2645]:  Copyright (C) Red Hat, Inc.  2004
All rights reserved. 
Mar 17 15:48:10 backup lock_gulmd[2652]: Starting lock_gulmd 6.0.2.27
<http://6.0.2.27/> . (built Jan 30 2006 15:28:54) Copyright (C) 2004 Red
Hat, Inc.  All rights reserved. 
Mar 17 15:48:10 backup lock_gulmd[2652]: You are running in Fail-over
mode. 
Mar 17 15:48:10 backup lock_gulmd[2652]: I am (backup) with ip
(127.0.0.1 <http://127.0.0.1/> )
Mar 17 15:48:10 backup lock_gulmd[2652]: Forked core [2653]. 
Mar 17 15:48:11 backup lock_gulmd[2652]: Forked locktable [2654]. 
Mar 17 15:48:12 backup lock_gulmd[2652]: Forked ltpx [2655].
Mar 17 15:48:12 backup lock_gulmd_core[2653]: I see no Masters, So I am
Arbitrating until enough Slaves talk to me.
Mar 17 15:48:12 backup lock_gulmd_core[2653]: Could not send quorum
update to slave backup 
Mar 17 15:48:12 backup lock_gulmd_core[2653]: New generation of server
state. (1142628492484630)
Mar 17 15:48:12 backup lock_gulmd_LTPX[2655]: New Master at backup:
127.0.0.1 <http://127.0.0.1/> 
Mar 17 15:52:14 backup kernel: Lock_Harness 6.0.2.27 <http://6.0.2.27/>
(built Jan 30 2006 15:32:58) installed
Mar 17 15:52:14 backup kernel: GFS 6.0.2.27 <http://6.0.2.27/>  (built
Jan 30 2006 15:32:20) installed
Mar 17 15:52:15 backup kernel: Gulm 6.0.2.27 <http://6.0.2.27/>  (built
Jan 30 2006 15:32:54) installed
Mar 17 15:54:51 backup kernel: lock_gulm: ERROR cm_login failed. -512 
Mar 17 15:54:51 backup kernel: lock_gulm: ERROR Got a -512 trying to
start the threads. 
Mar 17 15:54:51 backup lock_gulmd_core[2653]: Error on xdr (GFS Kernel
Interface:127.0.0.1 <http://127.0.0.1/>  idx:3 fd:8):
(-104:104:Connection reset by peer) 
Mar 17 15:54:51 backup kernel: lock_gulm: fsid=cluster_cca:gfs1: Exiting
gulm_mount with errors -512 
Mar 17 15:54:51 backup kernel: GFS: can't mount proto = lock_gulm, table
cluster_cca:gfs1, hostdata =


Result from gulm_tool:

[root at backup gfs]# gulm_tool nodelist backup
 Name: backup
  ip    = 127.0.0.1 <http://127.0.0.1/> 
  state = Logged in
  mode = Arbitrating
  missed beats = 0
  last beat = 1142632189718986
  delay avg = 10019686 
  max delay = 10019735


I'm a newbie to clusters and I have no clue where to look next.  If any
other information is needed let me know. 

Thanks,


-- 
Magnus Andersen
Systems Administrator / Oracle DBA
Walker & Associates, Inc. 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060317/3434e51d/attachment.htm>

From mag.andersen at gmail.com  Fri Mar 17 23:35:09 2006
From: mag.andersen at gmail.com (Magnus Andersen)
Date: Fri, 17 Mar 2006 18:35:09 -0500
Subject: [Linux-cluster] Unable to mount GFS on RHEL 3 U6
In-Reply-To: <9A6FE0FCC2B29846824C5CD81C6647B90152A1D5@s228130hz1ew08.apptix-01.savvis.net>
References: <9A6FE0FCC2B29846824C5CD81C6647B90152A1D5@s228130hz1ew08.apptix-01.savvis.net>
Message-ID: <5ea165840603171535h399993fdmdb70847ecd5cfecb@mail.gmail.com>

I ran the following on all the servers before I started to try and mount the
share.

ccsd -d /dev/pool/cluster_cca
lock_gulmd

They ran without errors.

I did setup the hosts file, but when I looked at it agin I see that I called
them backuphb, oradwhb, and gistest2hb.  Do I need to set the ccs files
backup with these names?  Or, should I change the command switches?

Thanks for your help,


--
Magnus Andersen
Systems Administrator / Oracle DBA
Walker & Associates, Inc.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060317/ec376999/attachment.htm>

From Britt.Treece at savvis.net  Sat Mar 18 00:10:52 2006
From: Britt.Treece at savvis.net (Treece, Britt)
Date: Fri, 17 Mar 2006 18:10:52 -0600
Subject: [Linux-cluster] Unable to mount GFS on RHEL 3 U6
Message-ID: <9A6FE0FCC2B29846824C5CD81C6647B90152A1D6@s228130hz1ew08.apptix-01.savvis.net>

Your cluster.ccs config needs to match what is in /etc/hosts or vice
versa.

 
If lock_gulmd is started are you seeing all three servers in gulm_tool
nodelist?

 
________________________________

From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Magnus Andersen
Sent: Friday, March 17, 2006 5:35 PM
To: linux clustering
Subject: Re: [Linux-cluster] Unable to mount GFS on RHEL 3 U6

 
I ran the following on all the servers before I started to try and mount
the share.

ccsd -d /dev/pool/cluster_cca
lock_gulmd

They ran without errors.  

I did setup the hosts file, but when I looked at it agin I see that I
called them backuphb, oradwhb, and gistest2hb.  Do I need to set the ccs
files backup with these names?  Or, should I change the command
switches? 

Thanks for your help,


-- 
Magnus Andersen
Systems Administrator / Oracle DBA
Walker & Associates, Inc. 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060317/3100a035/attachment.htm>

From mag.andersen at gmail.com  Sat Mar 18 01:37:12 2006
From: mag.andersen at gmail.com (Magnus Andersen)
Date: Fri, 17 Mar 2006 20:37:12 -0500
Subject: [Linux-cluster] Unable to mount GFS on RHEL 3 U6
In-Reply-To: <9A6FE0FCC2B29846824C5CD81C6647B90152A1D6@s228130hz1ew08.apptix-01.savvis.net>
References: <9A6FE0FCC2B29846824C5CD81C6647B90152A1D6@s228130hz1ew08.apptix-01.savvis.net>
Message-ID: <5ea165840603171737t34ff9798g8708de727ff01391@mail.gmail.com>

This is what I have now.

cluster.css
[root at backup root]# cat cluster_cca/cluster.ccs
cluster {
        name = "cluster_cca"
        lock_gulm {
                servers = ["backuphb", "oradwhb", "gistest2hb"]
        }
}

nodes.css
[root at backup root]# cat cluster_cca/nodes.ccs
nodes {
        backuphb {
                ip_interfaces {
                        eth1 = "10.0.0.1"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = "10.0.0.1"
                                }
                        }
                }
        }
        oradwhb {
                ip_interfaces {
                        eth4 = "10.0.0.2"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = "10.0.0.2"
                                }
                        }
                }
        }
        gistest2hb {
                ip_interfaces {
                        eth0 = "10.0.0.3"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = "10.0.0.3"
                                }
                        }
                }
        }
}

/etc/hosts
[root at backup root]# cat /etc/hosts
# Do not remove the following line, or various programs
# that require network functionality will fail.
127.0.0.1               backup localhost.localdomain localhost
10.0.0.1        backuphb backuphb.walkerassoc.com backuphb
10.0.0.2        oradwhb oradwhb.walkerassoc.com oradwhb
10.0.0.3        gistest2hb gistest2hb.walkerassoc.com gistest2hb

I ran this to update the cluster_cca pool after I modified the ccs files
ccs_tool -O create /root/cluster_cca /dev/pool/cluster_cca

Result out of my messages log ( this looks the same on all servers
once I start lock_gulmd )
Mar 17 20:24:17 gistest2 lock_gulmd[2383]: Starting lock_gulmd 6.0.2.27. (built
Jan 30 2006 15:28:54) Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
Mar 17 20:24:17 gistest2 lock_gulmd[2383]: You are running in Fail-over mode.
Mar 17 20:24:17 gistest2 lock_gulmd[2383]: I am (gistest2) with ip (127.0.0.1)
Mar 17 20:24:17 gistest2 lock_gulmd[2383]: Forked core [2384].
Mar 17 20:24:18 gistest2 lock_gulmd_LT000[2385]: Not serving locks from this nod
e.
Mar 17 20:24:18 gistest2 lock_gulmd[2383]: Forked locktable [2385].
Mar 17 20:24:19 gistest2 lock_gulmd[2383]: Forked ltpx [2386].


Result from gulm_tool nodelist localhost
[root at backup root]# gulm_tool nodelist localhost
 Name: backup
  ip    = 127.0.0.1
  state = Logged in
  mode = Pending
  missed beats = 0
  last beat = 0
  delay avg = 0
  max delay = 0


Thanks,

--
Magnus Andersen
Systems Administrator / Oracle DBA
Walker & Associates, Inc.


From Britt.Treece at savvis.net  Sat Mar 18 04:51:08 2006
From: Britt.Treece at savvis.net (Treece, Britt)
Date: Fri, 17 Mar 2006 22:51:08 -0600
Subject: [Linux-cluster] Unable to mount GFS on RHEL 3 U6
References: <9A6FE0FCC2B29846824C5CD81C6647B90152A1D6@s228130hz1ew08.apptix-01.savvis.net>
	<5ea165840603171737t34ff9798g8708de727ff01391@mail.gmail.com>
Message-ID: <9A6FE0FCC2B29846824C5CD81C6647B92D00FF@s228130hz1ew08.apptix-01.savvis.net>

The nodename in nodes.ccs and cluster.ccs needs to match the hostname of each server.  I'm getting the impression from the output below that it does not.
 
>From the GFS 6.0 Admin guide...
http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-sf-nodes.html
 
Make sure that you specify Nodename as the Linux hostname and that the primary IP address of the node is associated with the hostname. Specifying NodeName other than the Linux hostname (for example the interface name) can cause unpredictable results - especially if the node is connected to multiple networks. To determine the hostname of a node, use the uname -n command on the node. To verify the IP address associated with the hostname, issue a ping command to the hostname.

________________________________

From: linux-cluster-bounces at redhat.com on behalf of Magnus Andersen
Sent: Fri 3/17/2006 7:37 PM
To: linux clustering
Subject: Re: [Linux-cluster] Unable to mount GFS on RHEL 3 U6


This is what I have now.

cluster.css
[root at backup root]# cat cluster_cca/cluster.ccs
cluster {
        name = "cluster_cca"
        lock_gulm {
                servers = ["backuphb", "oradwhb", "gistest2hb"]
        }
}

nodes.css
[root at backup root]# cat cluster_cca/nodes.ccs
nodes {
        backuphb {
                ip_interfaces {
                        eth1 = "10.0.0.1"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = "10.0.0.1"
                                }
                        }
                }
        }
        oradwhb {
                ip_interfaces {
                        eth4 = "10.0.0.2"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = "10.0.0.2"
                                }
                        }
                }
        }
        gistest2hb {
                ip_interfaces {
                        eth0 = "10.0.0.3"
                }
                fence {
                        man {
                                manual {
                                        ipaddr = "10.0.0.3"
                                }
                        }
                }
        }
}

/etc/hosts
[root at backup root]# cat /etc/hosts
# Do not remove the following line, or various programs
# that require network functionality will fail.
127.0.0.1               backup localhost.localdomain localhost
10.0.0.1        backuphb backuphb.walkerassoc.com backuphb
10.0.0.2        oradwhb oradwhb.walkerassoc.com oradwhb
10.0.0.3        gistest2hb gistest2hb.walkerassoc.com gistest2hb

I ran this to update the cluster_cca pool after I modified the ccs files
ccs_tool -O create /root/cluster_cca /dev/pool/cluster_cca

Result out of my messages log ( this looks the same on all servers
once I start lock_gulmd )
Mar 17 20:24:17 gistest2 lock_gulmd[2383]: Starting lock_gulmd 6.0.2.27. (built
Jan 30 2006 15:28:54) Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
Mar 17 20:24:17 gistest2 lock_gulmd[2383]: You are running in Fail-over mode.
Mar 17 20:24:17 gistest2 lock_gulmd[2383]: I am (gistest2) with ip (127.0.0.1)
Mar 17 20:24:17 gistest2 lock_gulmd[2383]: Forked core [2384].
Mar 17 20:24:18 gistest2 lock_gulmd_LT000[2385]: Not serving locks from this nod
e.
Mar 17 20:24:18 gistest2 lock_gulmd[2383]: Forked locktable [2385].
Mar 17 20:24:19 gistest2 lock_gulmd[2383]: Forked ltpx [2386].


Result from gulm_tool nodelist localhost
[root at backup root]# gulm_tool nodelist localhost
 Name: backup
  ip    = 127.0.0.1
  state = Logged in
  mode = Pending
  missed beats = 0
  last beat = 0
  delay avg = 0
  max delay = 0


Thanks,

--
Magnus Andersen
Systems Administrator / Oracle DBA
Walker & Associates, Inc.

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 11493 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060317/9db3e3f4/attachment.bin>

From orcl.listas at gmail.com  Sat Mar 18 04:55:00 2006
From: orcl.listas at gmail.com (Allyson - Listas)
Date: Sat, 18 Mar 2006 01:55:00 -0300
Subject: [Linux-cluster] agent scripts
Message-ID: <441B92A4.9040602@gmail.com>

Hi Guys,

I'd like your help in one question on rhcs 4 up 3.

I'm working on my scripts to start/stop/monitoring a failover oracle 
9.2.0.6 database.
I created a new service and added a filesystem, ip and script resources.

There is a cookbook or something that shows requirements of development 
of the cluster scripts?
 
I'm having problems to start my service ...
When I try to enable...

[root at cs02 ora9i]# clustat
Member Status: Quorate

  Member Name                              Status
  ------ ----                              ------
  cs02.example.com                         Online, Local, rgmanager
  cs01.example.com                         Online, rgmanager

  Service Name         Owner (Last)                   State
  ------- ----         ----- ------                   -----
  vip50                cs02.example.com               started
  oracle-ha-fs         cs02.example.com               started
  ora9i-ha             (cs02.example.com)             failed
  clu9i                (none)                         stopped
[root at cs02 ora9i]# clusvcadm -e clu9i
Member cs02.example.com trying to enable clu9i...failed

I receive these messages...

Mar 18 01:59:12 cs02 clurgmgrd[2315]: <notice> Starting stopped service 
clu9i
Mar 18 01:59:12 cs02 clurgmgrd[2315]: <notice> start on script "clu9i" 
returned 5 (program not installed)
Mar 18 01:59:12 cs02 clurgmgrd[2315]: <warning> #68: Failed to start 
clu9i; return value: 1
Mar 18 01:59:12 cs02 clurgmgrd[2315]: <notice> Stopping service clu9i
Mar 18 01:59:12 cs02 clurgmgrd[2315]: <notice> stop on script "clu9i" 
returned 5 (program not installed)
Mar 18 01:59:12 cs02 clurgmgrd[2315]: <notice> Service clu9i is recovering
Mar 18 01:59:12 cs02 clurgmgrd[2315]: <warning> #71: Relocating failed 
service clu9i
Mar 18 01:59:13 cs02 clurgmgrd[2315]: <notice> Stopping service clu9i
Mar 18 01:59:13 cs02 clurgmgrd[2315]: <notice> stop on script "clu9i" 
returned 5 (program not installed)
Mar 18 01:59:13 cs02 clurgmgrd[2315]: <notice> Service clu9i is stopped

I think strange the return code 5 logged on /var/log/messages, because 
my script works well manually...

[root at cs02 ora9i]# ps -ef |grep pmon | grep -v grep
[root at cs02 ora9i]#
[root at cs02 ora9i]#
[root at cs02 ora9i]# ./ora_clu9i start
starting ora_clu9i...
[root at cs02 ora9i]# ps -ef |grep pmon | grep -v grep
ora9i     7934     1  0 02:06 ?        00:00:00 ora_pmon_clu9i
[root at cs02 ora9i]# ./ora_clu9i status
clu9i is running
[root at cs02 ora9i]# ./ora_clu9i stop
stopping ora_clu9i...
[root at cs02 ora9i]# ps -ef |grep pmon | grep -v grep
[root at cs02 ora9i]# ./ora_clu9i status
clu9i is stopped


Here's my script...

#### Oracle Environment ####
export LD_ASSUME_KERNEL=2.4.19
export ORACLE_BASE=/u01/ora9i
export ORACLE_HOME=$ORACLE_BASE/product/9.2.0
export ORACLE_SID=clu9i
export ORACLE_TERM=xterm
export NLS_LANG=AMERICAN;
export ORA_NLS33=$ORACLE_HOME/ocommon/nls/admin/data
LD_LIBRARY_PATH=$ORACLE_HOME/lib:/lib:/usr/lib
LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/lib
export LD_LIBRARY_PATH
export PATH=$PATH:$ORACLE_HOME/bin
####

prog="ora_clu9i"

start () {
echo "starting $prog..."

su - ora9i -c "$ORACLE_HOME/bin/sqlplus '/ as sysdba' > /dev/null  <<eof
startup
quit
eof
"
su - ora9i -c "lsnrctl start > /dev/null"

return 0

}

stop () {
echo "stopping $prog..."

su - ora9i -c "$ORACLE_HOME/bin/sqlplus '/ as sysdba' > /dev/null  <<eof
shutdown immediate
quit
eof
"
su - ora9i -c "lsnrctl stop > /dev/null"

return 0
}

status() {
 if [ -r /tmp/orastat ]; then
      rm /tmp/orastat ;
 fi

sqlplus /nolog <<eof > /tmp/orastat
conn hr/hr
quit
eof

DOWN=`grep -i error /tmp/orastat | grep -v grep | wc -l`

if [ $DOWN -gt 0 ]; then
    echo $ORACLE_SID is stopped;
    return 0
else
    echo $ORACLE_SID is running;
    return 1
fi
}

case "$1" in
start)
        start
        ;;
stop)
        stop
        ;;
status)
        status
        ;;
*)
        echo $"Usage: $0 {start|stop|status}"
        exit 1
esac


From orcl.listas at gmail.com  Sat Mar 18 14:51:18 2006
From: orcl.listas at gmail.com (Allyson - Listas)
Date: Sat, 18 Mar 2006 11:51:18 -0300
Subject: [Linux-cluster] agent scripts
In-Reply-To: <441B92A4.9040602@gmail.com>
References: <441B92A4.9040602@gmail.com>
Message-ID: <441C1E66.80809@gmail.com>

Hi Guys,

I Solved my problem. I was having problems because when cluster start / 
stop services it follow the order of the resources are in the service.
In my case i put  IP, FILESYSTEM and SCRIPT, when cluster start the 
service it starts the ip , after filesystem and then script. And on 
*stop* it follows the same order, that was my problem, because my script 
checks the health of database doing a connection using a binary 
(sqlplus) that was on filesystgem that clusters umount!!! So, I change 
my status method and the script and it works.

Change on Status made...

status() {
UP=`ps -ef |grep ora_pmon_clu9i |grep -v grep |wc -l`

if [ $UP -gt 0 ]; then
    echo $ORACLE_SID is running;
    return 0
else
    echo $ORACLE_SID is stoped;
    return 1
fi
}

Regards,


Allyson - Listas wrote:

> Hi Guys,
>
> I'd like your help in one question on rhcs 4 up 3.
>
> I'm working on my scripts to start/stop/monitoring a failover oracle 
> 9.2.0.6 database.
> I created a new service and added a filesystem, ip and script resources.
>
> There is a cookbook or something that shows requirements of 
> development of the cluster scripts?
>
> I'm having problems to start my service ...
> When I try to enable...
>
> [root at cs02 ora9i]# clustat
> Member Status: Quorate
>
>  Member Name                              Status
>  ------ ----                              ------
>  cs02.example.com                         Online, Local, rgmanager
>  cs01.example.com                         Online, rgmanager
>
>  Service Name         Owner (Last)                   State
>  ------- ----         ----- ------                   -----
>  vip50                cs02.example.com               started
>  oracle-ha-fs         cs02.example.com               started
>  ora9i-ha             (cs02.example.com)             failed
>  clu9i                (none)                         stopped
> [root at cs02 ora9i]# clusvcadm -e clu9i
> Member cs02.example.com trying to enable clu9i...failed
>
> I receive these messages...
>
> Mar 18 01:59:12 cs02 clurgmgrd[2315]: <notice> Starting stopped 
> service clu9i
> Mar 18 01:59:12 cs02 clurgmgrd[2315]: <notice> start on script "clu9i" 
> returned 5 (program not installed)
> Mar 18 01:59:12 cs02 clurgmgrd[2315]: <warning> #68: Failed to start 
> clu9i; return value: 1
> Mar 18 01:59:12 cs02 clurgmgrd[2315]: <notice> Stopping service clu9i
> Mar 18 01:59:12 cs02 clurgmgrd[2315]: <notice> stop on script "clu9i" 
> returned 5 (program not installed)
> Mar 18 01:59:12 cs02 clurgmgrd[2315]: <notice> Service clu9i is 
> recovering
> Mar 18 01:59:12 cs02 clurgmgrd[2315]: <warning> #71: Relocating failed 
> service clu9i
> Mar 18 01:59:13 cs02 clurgmgrd[2315]: <notice> Stopping service clu9i
> Mar 18 01:59:13 cs02 clurgmgrd[2315]: <notice> stop on script "clu9i" 
> returned 5 (program not installed)
> Mar 18 01:59:13 cs02 clurgmgrd[2315]: <notice> Service clu9i is stopped
>
> I think strange the return code 5 logged on /var/log/messages, because 
> my script works well manually...
>
> [root at cs02 ora9i]# ps -ef |grep pmon | grep -v grep
> [root at cs02 ora9i]#
> [root at cs02 ora9i]#
> [root at cs02 ora9i]# ./ora_clu9i start
> starting ora_clu9i...
> [root at cs02 ora9i]# ps -ef |grep pmon | grep -v grep
> ora9i     7934     1  0 02:06 ?        00:00:00 ora_pmon_clu9i
> [root at cs02 ora9i]# ./ora_clu9i status
> clu9i is running
> [root at cs02 ora9i]# ./ora_clu9i stop
> stopping ora_clu9i...
> [root at cs02 ora9i]# ps -ef |grep pmon | grep -v grep
> [root at cs02 ora9i]# ./ora_clu9i status
> clu9i is stopped
>
>
>
> Here's my script...
>
> #### Oracle Environment ####
> export LD_ASSUME_KERNEL=2.4.19
> export ORACLE_BASE=/u01/ora9i
> export ORACLE_HOME=$ORACLE_BASE/product/9.2.0
> export ORACLE_SID=clu9i
> export ORACLE_TERM=xterm
> export NLS_LANG=AMERICAN;
> export ORA_NLS33=$ORACLE_HOME/ocommon/nls/admin/data
> LD_LIBRARY_PATH=$ORACLE_HOME/lib:/lib:/usr/lib
> LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/lib
> export LD_LIBRARY_PATH
> export PATH=$PATH:$ORACLE_HOME/bin
> ####
>
> prog="ora_clu9i"
>
> start () {
> echo "starting $prog..."
>
> su - ora9i -c "$ORACLE_HOME/bin/sqlplus '/ as sysdba' > /dev/null  <<eof
> startup
> quit
> eof
> "
> su - ora9i -c "lsnrctl start > /dev/null"
>
> return 0
>
> }
>
> stop () {
> echo "stopping $prog..."
>
> su - ora9i -c "$ORACLE_HOME/bin/sqlplus '/ as sysdba' > /dev/null  <<eof
> shutdown immediate
> quit
> eof
> "
> su - ora9i -c "lsnrctl stop > /dev/null"
>
> return 0
> }
>
> status() {
> if [ -r /tmp/orastat ]; then
>      rm /tmp/orastat ;
> fi
>
> sqlplus /nolog <<eof > /tmp/orastat
> conn hr/hr
> quit
> eof
>
> DOWN=`grep -i error /tmp/orastat | grep -v grep | wc -l`
>
> if [ $DOWN -gt 0 ]; then
>    echo $ORACLE_SID is stopped;
>    return 0
> else
>    echo $ORACLE_SID is running;
>    return 1
> fi
> }
>
> case "$1" in
> start)
>        start
>        ;;
> stop)
>        stop
>        ;;
> status)
>        status
>        ;;
> *)
>        echo $"Usage: $0 {start|stop|status}"
>        exit 1
> esac
>
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


-- 
Allyson A. Brito
MSN: allysonbrito78 at hotmail.com
SKYPE: allysonbrito
RHCE / LPI-1 / SCSA
OCP DBA 9i / OCA PL/SQL 9i


From mag.andersen at gmail.com  Sat Mar 18 18:19:06 2006
From: mag.andersen at gmail.com (Magnus Andersen)
Date: Sat, 18 Mar 2006 13:19:06 -0500
Subject: [Linux-cluster] Unable to mount GFS on RHEL 3 U6
In-Reply-To: <9A6FE0FCC2B29846824C5CD81C6647B92D00FF@s228130hz1ew08.apptix-01.savvis.net>
References: <9A6FE0FCC2B29846824C5CD81C6647B90152A1D6@s228130hz1ew08.apptix-01.savvis.net>
	<5ea165840603171737t34ff9798g8708de727ff01391@mail.gmail.com>
	<9A6FE0FCC2B29846824C5CD81C6647B92D00FF@s228130hz1ew08.apptix-01.savvis.net>
Message-ID: <5ea165840603181019w45d92a06ncf873596ba08a880@mail.gmail.com>

Britt,

Thanks a million for your help.  You got me to look at what was wrong.
 I changed my hosts file to look the same as the cluster.css and
nodes.css file and I still didn't see all three servers on the
gulm_tool nodelist.  I went back and looked at my hosts file and saw
that I also called localhost the same as the hostname.  I removed the
entry and left localhost.localdomain localhost and checked again.

IT WORKS!!! ... :)

I've sucessfully mounted the shares on all servers and created object
on the shares it all works.

Again, thanks for the help.

Love these groups... :)

Sincerely,

--
Magnus Andersen
Systems Administrator / Oracle DBA
Walker & Associates, Inc.


From carlopmart at gmail.com  Sat Mar 18 19:58:01 2006
From: carlopmart at gmail.com (carlopmart)
Date: Sat, 18 Mar 2006 20:58:01 +0100
Subject: [Linux-cluster] Load balancing on CS4?
Message-ID: <441C6649.2070803@gmail.com>

Hi all,

  I have configured two nodes on a vmware host. I would do load 
balancing   for apache services (50-50). How can I do this? I didn't 
find anything about this on RedHat's documentation.

Thanks.
-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From basv at sara.nl  Sun Mar 19 10:41:40 2006
From: basv at sara.nl (Bas van der Vlies)
Date: Sun, 19 Mar 2006 11:41:40 +0100
Subject: [Linux-cluster] lock_dlm kernel panics
In-Reply-To: <20060317224043.GC29244@redhat.com>
References: <441B2C73.3010202@fnal.gov> <20060317224043.GC29244@redhat.com>
Message-ID: <52FFC5E9-FAE4-4414-B0B9-65103591CE59@sara.nl>


On Mar 17, 2006, at 11:40 PM, David Teigland wrote:

> On Fri, Mar 17, 2006 at 03:38:59PM -0600, Paul Tader wrote:
>> Mar 17 11:38:02 nodename kernel: d0 unlock fe20017a no id
>
> GFS is trying to unlock a lock that doesn't exist which causes the  
> panic.
> We know this happens if cman shuts down the dlm while it's in use  
> (cman
> does this if it's lost connection with the cluster.)  There's some new
> output in the RHEL4U3 dlm that should tell us if that's in fact what's
> happening or if there's some other cause that we need to uncover.
>
> So, you should look on all nodes for any cman messages in
> /var/log/messages or the console.  And when you're using the latest
> version look for the new dlm message "WARNING:  
> dlm_emergency_shutdown".
>

We had a similiar problem on our 4 node GFS cluster. I have send the  
crash reports
to the list as attachment for all 4 nodes. One cman crash and 3 dlm  
crashes.

Can the list handle attachments or must i send it inline?

nodes:
2.6.16-rc5 kernel
GFS cvs STABLE


--
Bas van der Vlies
basv at sara.nl


From filipe.miranda at gmail.com  Sun Mar 19 15:00:38 2006
From: filipe.miranda at gmail.com (Filipe Miranda)
Date: Sun, 19 Mar 2006 12:00:38 -0300
Subject: [Linux-cluster] Load balancing on CS4?
In-Reply-To: <441C6649.2070803@gmail.com>
References: <441C6649.2070803@gmail.com>
Message-ID: <a6d13c780603190700y1c6ac155t8678ffbfa107736d@mail.gmail.com>

Hello there,

Here you can find the documentation on how to setup the LVS:
http://www.redhat.com/docs/manuals/csgfs/browse/rh-cs-en/pt-lvs.html

To try to setup the 50-50 ratio I guess the best shot is taking a look in
this link bellow:
http://www.redhat.com/docs/manuals/csgfs/browse/rh-cs-en/s1-lvs-scheduling.html#S2-LVS-SCHED

I hope this helps you,

Since you are testing, if it works please give us a feedback.

Att.
Filipe Miranda


On 3/18/06, carlopmart <carlopmart at gmail.com> wrote:
>
> Hi all,
>
>   I have configured two nodes on a vmware host. I would do load
> balancing   for apache services (50-50). How can I do this? I didn't
> find anything about this on RedHat's documentation.
>
> Thanks.
> --
> CL Martinez
> carlopmart {at} gmail {d0t} com
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


--
Att.
---
Filipe T Miranda
RHCE - Red Hat Certified Engineer
OCP8i - Oracle Certified Professional
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060319/c2e71a75/attachment.htm>

From forums at daltonfirth.co.uk  Sun Mar 19 21:07:03 2006
From: forums at daltonfirth.co.uk (James Firth)
Date: Sun, 19 Mar 2006 21:07:03 +0000
Subject: [Linux-cluster] Node Failure Detection Problems
Message-ID: <441DC7F7.2@daltonfirth.co.uk>

Hi,

I have some questions on configuring and tuning heartbeats and 
node-failure detection.

I have a 2-node cluster.  Whenever a node fails it seems to take a while 
to detect node failure.

First question: I have reduced heartbeat hello_timer to 1 second, and 
deadnode_timeout to 5 seconds.  Is there an elegant way to do this with 
cluster.conf?  Currently I'm setting 
/proc/cluster/config/cman/hello_timer with an init script hack.

Failure is detected by cman within 5 seconds, no problem, but clustat 
hangs during this time.

Second question: clustat continues to hang for around 10 more seconds - 
15 in total, before clurgmgrd does a state change.

Does anyone know where this additional 10 seconds comes from?  Is it 
configurable?

Here is the system log for the transition:
 >>>
Mar 19 21:01:33 firthy kernel: CMAN: removing node emsy from the cluster 
: Missed too many heartbeats
Mar 19 21:01:33 firthy fenced[1878]: emsy not a cluster member after 0 
sec post_fail_delay
Mar 19 21:01:33 firthy fenced[1878]: fencing node "emsy"
Mar 19 21:01:35 firthy fenced[1878]: fence "emsy" success
Mar 19 21:01:44 firthy clurgmgrd[3347]: <info> Magma Event: Membership 
Change
Mar 19 21:01:44 firthy clurgmgrd[3347]: <info> State change: emsy DOWN
<<<

Many thanks,
James Firth


From forums at daltonfirth.co.uk  Sun Mar 19 21:38:12 2006
From: forums at daltonfirth.co.uk (James Firth)
Date: Sun, 19 Mar 2006 21:38:12 +0000
Subject: [Linux-cluster] Node Failure Detection Problems
In-Reply-To: <441DC7F7.2@daltonfirth.co.uk>
References: <441DC7F7.2@daltonfirth.co.uk>
Message-ID: <441DCF44.5020402@daltonfirth.co.uk>

James Firth wrote:
> Hi,
> 
> I have some questions on configuring and tuning heartbeats and 
> node-failure detection.

Further to my earlier mail - am also having problems with exported gnbd 
devices on node failure.  I want to get gnbd to give up trying to 
reconnect on node failure, but it insists on retrying ad infinitum, 
causing services that are using imported gnbd volumes to lock.

Regards,
James Firth


From toxictux at gmail.com  Mon Mar 20 02:35:00 2006
From: toxictux at gmail.com (toxictux)
Date: Sun, 19 Mar 2006 20:35:00 -0600
Subject: [Linux-cluster] fencing trouble with fence_wti
Message-ID: <17df45710603191835p1fa8cfd5j59b574ccd18f12d@mail.gmail.com>

hi all,
     i setup a 2 node cluster with http service. everything seems ok
except for the fencing. i am using WTI ips 800. my problem is,
whenever one of the nodes goes down. it does not get fenced. i get
messages in my syslog

""fencing node "node2"""
""fence "node2" failed""

when i do
$fence_node node2
it doesnt work either.

i get following error message in my syslog
Fence of "node2" was unsuccessful

however, when i manually do it with fence_wti, it works,

$fence_wti -a 216.xxx.xxx.xxx -p passwd -n 2

i am unable to see any other messages anywhere else.

can anyone give any pointers?? or any suggestions on getting detailed
debug message?

Thanks
-F


From Alain.Moulle at bull.net  Mon Mar 20 07:34:47 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Mon, 20 Mar 2006 08:34:47 +0100
Subject: [Linux-cluster] CS4 Update 2/ Copy cluster.conf from one node on
	the peer one ?
Message-ID: <441E5B17.90309@bull.net>

Hi

For a HA pair nodes with CS4 active on both nodes,
is there any case where the CS4 decides by itself
to copy the cluster.conf from one node on the peer one ?

And if so, which cases ?

Thanks
Alain


From thorsten.henrici at gfd.de  Mon Mar 20 08:07:51 2006
From: thorsten.henrici at gfd.de (thorsten.henrici at gfd.de)
Date: Mon, 20 Mar 2006 09:07:51 +0100
Subject: [Linux-cluster] Netmask of IP Address resource
	system-config-cluster 1.0.25
Message-ID: <OF17D62D08.D41AB32F-ONC1257137.002C7584-C1257137.002CAA62@obi.de>


Hi,
I'm a bit baffled, that I can't enter a netmask when configuring an IP
Adress as a ressorce with the the system-config-cluster 1.0.25 tool.
As a result, when starting rgmanager, the Service IP Adress gets a /32
netmask, which is not correct of cause.

Since there won't be too many changes to the cluster.conf file in the long
run, it would be alright to just edit it by hand. Unfortuneatly I wasn't
able to figure out what the correct syntax is

eg.

<ip address="10.112.24.17/24" monitor_link="1"/>

or

<netmask = "255.255.255.0"/>

Is the DTD of cluster.conf generally available?

My cluster.conf is attached below. Please feel free to suggest
improvements, since I'm totally new to the RH ClusterSuite.
For example, I don't want to use a fencing device, because I don't need
any shared storage for the MySQL Cluster, which works with two seperate
data nodes, that is the data itself will exist twice physically on
seperate storage.

Do I have to configure some kind of fencing anyways. If yes, how would
this kind of dummy fencing look like in the cluster.conf?
(Or is the <fencedevices/> entry all I need?)

Many thanks in advance !

<?xml version="1.0"?>
<cluster config_version="6" name="mysql_server">
        <fence_daemon post_fail_delay="0" post_join_delay="3"/>
        <clusternodes>
                <clusternode name="sgfddewer132a" votes="1">
                        <fence/>
                </clusternode>
                <clusternode name="sgfddewer132b" votes="1">
                        <fence/>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices/>
        <rm>
                <failoverdomains>
                        <failoverdomain name="fd_mysql-cluster_001"
ordered="1" restricted="1">
                                <failoverdomainnode name="sgfddewer132a"
priority="1"/>
                                <failoverdomainnode name="sgfddewer132b"
priority="2"/>
                        </failoverdomain>
                </failoverdomains>
                <resources>
                        <ip address="10.112.24.17" monitor_link="1"/>
                        <script file="/etc/init.d/mysql.server"
name="mysql_server"/>
                </resources>
                <service autostart="1" domain="fd_mysql-cluster_001"
name="s_mysql_server_001">
                        <ip ref="10.112.24.17"/>
                        <script ref="mysql_server"/>
                </service>
        </rm>
</cluster>

Mit freundlichen Gr??en / Kind Regards

Thorsten Henrici

Abteilung IT-Kommunikation
GfD Gesellschaft f?r Datenverarbeitung mbH
------------------------------------------------------------------------
e-mail thorsten.henrici at gfd.de
fon: +49 21 9676-1857
fax: +49 21 9676-1932

Industriestrasse 10
D-42929 Wermelskirchen

--
IMPORTANT NOTICE:
This email is confidential, may be legally privileged, and is for the
intended recipient only. Access, disclosure, copying, distribution, or
reliance on any of it by anyone else is prohibited and may be a criminal
offence. Please delete if obtained in error and email confirmation to the sender.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060320/cf1916fd/attachment.htm>

From pcaulfie at redhat.com  Mon Mar 20 08:21:43 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Mon, 20 Mar 2006 08:21:43 +0000
Subject: [Linux-cluster] Node Failure Detection Problems
In-Reply-To: <441DC7F7.2@daltonfirth.co.uk>
References: <441DC7F7.2@daltonfirth.co.uk>
Message-ID: <441E6617.3080300@redhat.com>

James Firth wrote:
> Hi,
> 
> I have some questions on configuring and tuning heartbeats and
> node-failure detection.
> 
> I have a 2-node cluster.  Whenever a node fails it seems to take a while
> to detect node failure.
> 
> First question: I have reduced heartbeat hello_timer to 1 second, and
> deadnode_timeout to 5 seconds.  Is there an elegant way to do this with
> cluster.conf?  Currently I'm setting
> /proc/cluster/config/cman/hello_timer with an init script hack.
> 

The latest code (RHEL4 U3, or CVS STABLE/RHEL4) allows you to put these into
cluster.conf as

<cluster>
<cman deadnode_timeout="5" hello_timer="1" />

...

</cluster/
-- 

patrick


From carlopmart at gmail.com  Mon Mar 20 09:13:13 2006
From: carlopmart at gmail.com (carlopmart)
Date: Mon, 20 Mar 2006 10:13:13 +0100
Subject: [Linux-cluster] Load balancing on CS4?
In-Reply-To: <a6d13c780603190700y1c6ac155t8678ffbfa107736d@mail.gmail.com>
References: <441C6649.2070803@gmail.com>
	<a6d13c780603190700y1c6ac155t8678ffbfa107736d@mail.gmail.com>
Message-ID: <441E7229.3020608@gmail.com>

Thanks filipe, but I have another question: Can LVS coexists with CS4 on 
the same server?? I do not have another server to accomplish this task.

Thanks.

Filipe Miranda wrote:
> Hello there,
> 
> Here you can find the documentation on how to setup the LVS:
> http://www.redhat.com/docs/manuals/csgfs/browse/rh-cs-en/pt-lvs.html 
> <http://www.redhat.com/docs/manuals/csgfs/browse/rh-cs-en/pt-lvs.html>
> 
> To try to setup the 50-50 ratio I guess the best shot is taking a look 
> in this link bellow:
> http://www.redhat.com/docs/manuals/csgfs/browse/rh-cs-en/s1-lvs-scheduling.html#S2-LVS-SCHED 
> <http://www.redhat.com/docs/manuals/csgfs/browse/rh-cs-en/s1-lvs-scheduling.html#S2-LVS-SCHED>
> 
> I hope this helps you,
> 
> Since you are testing, if it works please give us a feedback.
> 
> Att.
> Filipe Miranda
> 
> 
> 
> 
> On 3/18/06, * carlopmart* <carlopmart at gmail.com 
> <mailto:carlopmart at gmail.com>> wrote:
> 
>     Hi all,
> 
>       I have configured two nodes on a vmware host. I would do load
>     balancing   for apache services (50-50). How can I do this? I didn't
>     find anything about this on RedHat's documentation.
> 
>     Thanks.
>     --
>     CL Martinez
>     carlopmart {at} gmail {d0t} com
> 
>     --
>     Linux-cluster mailing list
>     Linux-cluster at redhat.com <mailto:Linux-cluster at redhat.com>
>     https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> 
> 
> -- 
> Att.
> ---
> Filipe T Miranda
> RHCE - Red Hat Certified Engineer
> OCP8i - Oracle Certified Professional
> 
> 
> ------------------------------------------------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From colman at codagenomics.com  Mon Mar 20 15:27:30 2006
From: colman at codagenomics.com (Richard Colman)
Date: Mon, 20 Mar 2006 07:27:30 -0800
Subject: [Linux-cluster] Seeking Backup Linux Cluster
In-Reply-To: <441E5B17.90309@bull.net>
Message-ID: <003201c64c32$ce0ea5a0$56d0e744@D98D2G81>

I need to provide a "back-up" cluster to our main cluster, say 4 nodes with
head and NFS DATA. We primarily run lisp, perl and shell in a parallel,
asynchronous, parallel processing mode - no hosting, serving or anything
like that.

Can anyone recommend any cost-effective approaches, from buying time on
another cluster, to renting a dedicated cluster on a hosting service, to
purchasing low-cost but reliable hardware, etc.

Your experiences much appreciated.

Rick Colman


From bmarzins at redhat.com  Mon Mar 20 17:51:45 2006
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Mon, 20 Mar 2006 11:51:45 -0600
Subject: [Linux-cluster] Node Failure Detection Problems
In-Reply-To: <441DCF44.5020402@daltonfirth.co.uk>
References: <441DC7F7.2@daltonfirth.co.uk> <441DCF44.5020402@daltonfirth.co.uk>
Message-ID: <20060320175144.GC5223@ether.msp.redhat.com>

On Sun, Mar 19, 2006 at 09:38:12PM +0000, James Firth wrote:
> James Firth wrote:
> >Hi,
> >
> >I have some questions on configuring and tuning heartbeats and 
> >node-failure detection.
> 
> Further to my earlier mail - am also having problems with exported gnbd 
> devices on node failure.  I want to get gnbd to give up trying to 
> reconnect on node failure, but it insists on retrying ad infinitum, 
> causing services that are using imported gnbd volumes to lock.

Are you exporting the gnbds in clustered or unclustered mode (with the -c
option or not)? In uncached, you should be able to run "gnbd_import -Or <gnbd>"
It wont actually remove the device if it is opened, but it should cause
all the pending IOs to fail.  In uncached mode, after your timeout, all the
IOs should get flushed assuming that gnbd can fence the server. If this
isn't happening, can you please send me a more complete description of your
gnbd setup and problem, including the result of following set of commands, run
after the server node fails.

# gnbd_import -l
# gnbd_import -Or <dead_gnbd>
(You might get a message saying "gnbd_import: waiting for all users to close
device <dead_gnbd>". This is fine. Just Ctrl-C to stop waiting. In this case,
the gnbd device will not be removed because something still has it open, but
all the pending IOs should have returned as failed, and all future IOs will
fail)
# gnbd_import -l

-Ben

> Regards,
> James Firth
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From zhendershot at cranel.com  Mon Mar 20 18:16:37 2006
From: zhendershot at cranel.com (Hendershot, Zach)
Date: Mon, 20 Mar 2006 13:16:37 -0500
Subject: [Linux-cluster] Fencing Methods
Message-ID: <BD37DA8FEF7D8949A5731245FF66CB2BD47B48@POSTOFFICE.cranel.local>

Hello,
    I was wondering about various fencing methods. We don't have any
"supported" hardware available to do proper fencing via the Red Hat
fencing agents. Other clustered filesystems like the Veritas CFS and
Oracle's ocfs2 solve the fencing problem by simply panic'ing the machine
to keep the IO from hitting the disk. My question is this: can I
configure GFS to do the same thing? Has anybody done this before? Is
this even suggested or is there a "Better Way". Thanks for your help. 
 
--------------

Zach Hendershot
Software Engineer
Cranel, Incorporated.
Phone: 614.318.4288
Fax: 614.431.8388
Email: zhendershot at cranel.com

Technology. Integrity. Focus.

 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060320/f51e268d/attachment.htm>

From renapte at vsnl.net  Mon Mar 20 18:23:57 2006
From: renapte at vsnl.net (renapte at vsnl.net)
Date: Mon, 20 Mar 2006 23:23:57 +0500
Subject: [Linux-cluster] Deadlock handling in GFS?
Message-ID: <7e07df7decbc.7decbc7e07df@vsnl.net>

Hey
Can anybody tell me where I can get details about how GFS handles deadlocks?
Thanks
Renuka


From l.dardini at comune.prato.it  Mon Mar 20 18:37:26 2006
From: l.dardini at comune.prato.it (Leandro Dardini)
Date: Mon, 20 Mar 2006 19:37:26 +0100
Subject: R: [Linux-cluster] Fencing Methods
Message-ID: <404AA6666D14D14CA0D410C1BC6CC4C5465214@exchange3.comune.prato.local>

 
________________________________

	Da: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] Per conto di Hendershot, Zach
	Inviato: luned? 20 marzo 2006 19.17
	A: linux-cluster at redhat.com
	Oggetto: [Linux-cluster] Fencing Methods
	
	
	Hello,
	    I was wondering about various fencing methods. We don't have any "supported" hardware available to do proper fencing via the Red Hat fencing agents. Other clustered filesystems like the Veritas CFS and Oracle's ocfs2 solve the fencing problem by simply panic'ing the machine to keep the IO from hitting the disk. My question is this: can I configure GFS to do the same thing? Has anybody done this before? Is this even suggested or is there a "Better Way". Thanks for your help. 
	 

Writing a fencing agent for your hardware is not so difficult. Take a look for example to /sbin/fence_brocade.
 
Leandro
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060320/64d1e069/attachment.htm>

From teigland at redhat.com  Mon Mar 20 20:14:08 2006
From: teigland at redhat.com (David Teigland)
Date: Mon, 20 Mar 2006 14:14:08 -0600
Subject: [Linux-cluster] Fencing Methods
In-Reply-To: <BD37DA8FEF7D8949A5731245FF66CB2BD47B48@POSTOFFICE.cranel.local>
References: <BD37DA8FEF7D8949A5731245FF66CB2BD47B48@POSTOFFICE.cranel.local>
Message-ID: <20060320201407.GC546@redhat.com>

On Mon, Mar 20, 2006 at 01:16:37PM -0500, Hendershot, Zach wrote:
> Hello,
>     I was wondering about various fencing methods. We don't have any
> "supported" hardware available to do proper fencing via the Red Hat
> fencing agents. Other clustered filesystems like the Veritas CFS and
> Oracle's ocfs2 solve the fencing problem by simply panic'ing the machine
> to keep the IO from hitting the disk. 

This assumes the machine knows it should fence itself, which isn't always
the case.  If the machine is hung somewhere and comes back to life after
it's been recovered, it could write and corrupt the fs, a panic doesn't
solve this.

Dave


From zhendershot at cranel.com  Mon Mar 20 20:32:17 2006
From: zhendershot at cranel.com (Hendershot, Zach)
Date: Mon, 20 Mar 2006 15:32:17 -0500
Subject: [Linux-cluster] Fencing Methods
Message-ID: <BD37DA8FEF7D8949A5731245FF66CB2BD47BCD@POSTOFFICE.cranel.local>


> On Mon, Mar 20, 2006 at 01:16:37PM -0500, Hendershot, Zach wrote:
> > Hello,
> >     I was wondering about various fencing methods. We don't have any

> > "supported" hardware available to do proper fencing via the Red Hat 
> > fencing agents. Other clustered filesystems like the Veritas CFS and

> > Oracle's ocfs2 solve the fencing problem by simply panic'ing the 
> > machine to keep the IO from hitting the disk.

> This assumes the machine knows it should fence itself, which isn't
always the case.  If the machine is hung somewhere and comes
> back to life after it's been recovered, it could write and corrupt the
fs, a panic doesn't solve this.
> 
> Dave

That's a good point, I wasn't thinking. Oracle and (I assume) Veritas do
this by relying on a kernel thread that writes out timestamps and if it
doesn't write an expected timestamp (and other nodes see it as dead) it
panics itself to self-fence. How does RHCS decide if a node is dead? I
was under the understanding that if the other nodes don't receive a
heartbeat from the node for a timeout period they execute the fence
command on the node. I'm interested why that choice was made, was it a
technical problem with the above method or a design decision? Have a
good one.

Zach


From teigland at redhat.com  Mon Mar 20 20:51:51 2006
From: teigland at redhat.com (David Teigland)
Date: Mon, 20 Mar 2006 14:51:51 -0600
Subject: [Linux-cluster] Fencing Methods
In-Reply-To: <BD37DA8FEF7D8949A5731245FF66CB2BD47BCD@POSTOFFICE.cranel.local>
References: <BD37DA8FEF7D8949A5731245FF66CB2BD47BCD@POSTOFFICE.cranel.local>
Message-ID: <20060320205151.GD546@redhat.com>

On Mon, Mar 20, 2006 at 03:32:17PM -0500, Hendershot, Zach wrote:
> That's a good point, I wasn't thinking. Oracle and (I assume) Veritas do
> this by relying on a kernel thread that writes out timestamps and if it
> doesn't write an expected timestamp (and other nodes see it as dead) it
> panics itself to self-fence.

That still doesn't work.  The node can easily wake up and write before it
panics.

> How does RHCS decide if a node is dead? I was under the understanding
> that if the other nodes don't receive a heartbeat from the node for a
> timeout period they execute the fence command on the node. 

One of the remaining nodes in the cluster fences the node who has been
declared dead by the cluster manager.  Fencing the dead node does not
involve running anything on the dead node; it just amounts to turning its
power off or something.

> I'm interested why that choice was made, was it a technical problem with
> the above method or a design decision? Have a good one.

Self-fencing is simply not correct and will lead to fs corruption.
That's not been acceptable to us.

Dave


From Matthew.Patton.ctr at osd.mil  Mon Mar 20 22:37:09 2006
From: Matthew.Patton.ctr at osd.mil (Patton, Matthew F, CTR, OSD-PA&E)
Date: Mon, 20 Mar 2006 17:37:09 -0500
Subject: [Linux-cluster] other cluster FS
Message-ID: <D8063DF686D10247B0A49D01271285690CE91D29@osdn06.osd.mil>

Classification: UNCLASSIFIED

with the latest community kernel release I was looking at oracle's FS and
then SGI's CXFS. I liked the CXFS paper and wondered if GFS was working in
similar fashion where meta-data is stored by a 'master' and then pieces of
it farmed out to nodes as required. So essentially all meta-data resides on
the first one to mount GFS I guess and it handles the periodic flushing of
changes? Is there a machanism to designate "backup" masters or does this
arise implicitly by virtue of a neighboring node(s)recovering a peer's
journals after an election?

CXFS is/was available on seemingly a number of different OS's. Anyone
actually using it? Is GFS ported to anything besides Linux? I want and
intend to standardize on Linux as the bare-metal OS, but being able to play
nice with winblows would be really cool. Does CXFS use DLM and other pieces
of the Linux Clustering package set? I was really not impressed by the claim
that OCFS2 panics the machine on error. Is that for real?

Anyone try IBM's GPFS?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060320/2be04833/attachment.htm>

From forums at daltonfirth.co.uk  Mon Mar 20 22:33:48 2006
From: forums at daltonfirth.co.uk (James Firth)
Date: Mon, 20 Mar 2006 22:33:48 +0000
Subject: [Linux-cluster] Node Failure Detection Problems
In-Reply-To: <20060320175144.GC5223@ether.msp.redhat.com>
References: <441DC7F7.2@daltonfirth.co.uk> <441DCF44.5020402@daltonfirth.co.uk>
	<20060320175144.GC5223@ether.msp.redhat.com>
Message-ID: <441F2DCC.5020203@daltonfirth.co.uk>

Benjamin Marzinski wrote:
> Are you exporting the gnbds in clustered or unclustered mode (with the -c
> option or not)? In uncached, you should be able to run "gnbd_import -Or <gnbd>"
> It wont actually remove the device if it is opened, but it should cause
> all the pending IOs to fail.  

Hi Ben - I am running uncached (NOT using -c)

> In uncached mode, after your timeout, all the
> IOs should get flushed assuming that gnbd can fence the server. 
> If this
> isn't happening, can you please send me a more complete description of your
> gnbd setup and problem, including the result of following set of commands, run
> after the server node fails.
> 

We have a high-availability server pair, and need a common storage pool 
and require no single point of failure, so we don't like any of the SAN 
approaches.

Instead we have created an md device (raid level 1), with one local and 
one gndb imported device.  We are not using multipath, instead each 
server has two bound network devices, connected to different hubs, with 
one

The gnbd import and raid mount is managed as a cluster service, is 
failover between either node, and is only ever running on one node at 
any given moment in time.

We are using DLM locking, and fenced with proprietary fence (although 
fencing is not a vital part of data integrity in our schema)

Failure of the active node is handled well, the services migrate, the 
remaining node mounts a (degraded) md device from it's local disk, and 
cluster operation is maintained.

When the dead node returns, a custom script hot-adds the returning 
gnbd_imported disk, and the md device recovers.

The problem comes when the STANDBY node fails.  The md device does not 
take well to failure of the imported gnbd device.  On failure of the 
standby node, the md mount on the active node just hangs.

So at the moment we have had to write a custom script that checks for 
failure of the node from which gnbd_services are imported.

On detection of failure, our script has to manually fail the md device 
(mdadm --fail), at which point the md devices unfreezes.  Our script 
then hot-removes the device from the md array, for completeness.

I had thought that the gnbd_recvd should have hooked up with CMAN/Magma 
and that the device imported from the failed node should automatically 
fail.

Regards,
James


From carlopmart at gmail.com  Mon Mar 20 23:31:56 2006
From: carlopmart at gmail.com (carlopmart)
Date: Tue, 21 Mar 2006 00:31:56 +0100
Subject: [Linux-cluster] GNBD startup script
Message-ID: <441F3B6C.6060800@gmail.com>

Hi all,

  Somebody knows where I can find startup script for gnbd client and
server for RHEL4 U3 and CS4??

Thanks.

-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From theo at tkd.co.id  Tue Mar 21 08:02:31 2006
From: theo at tkd.co.id (Theodorus)
Date: Tue, 21 Mar 2006 15:02:31 +0700
Subject: [Linux-cluster] services not starting... 
In-Reply-To: <441F3B6C.6060800@gmail.com>
Message-ID: <TKDNET9mK6PYzDabigP000000e6@tkdnet.tkd.co.id>

Hi, I'm theo. I'm newbie on this cluster thing.
I using Redhat 4 update whith cman_kernel-2.6.9-36.0 for kernel 2.6.11
Fences are set to ilo HP. Installed with cman and dlm.

The problem is when I plug off on node where the services are run on that
node, the services doesn't failover to other node... the node just stopping 
And can't be restart or disable. The services are just stall on stopping
status.

Member Status: Quorate

  Member Name                              Status
  ------ ----                              ------
  node1                                    Online, Local, rgmanager
  node2                                    Online, rgmanager

  Service Name         Owner (Last)                   State         
  ------- ----         ----- ------                   -----         
  sql_serv             node1                          stopping        
  nfs_serv             node1                          stopping  

those status is when I plug on again the node.

Is any one can tell me why this can be resolve? Thanx

Regards,

theo


From Fabrizio.Lippolis at AurigaInformatica.it  Tue Mar 21 11:35:05 2006
From: Fabrizio.Lippolis at AurigaInformatica.it (Fabrizio Lippolis)
Date: Tue, 21 Mar 2006 12:35:05 +0100
Subject: [Linux-cluster] dlm kernel bug (node crashed)
Message-ID: <441FE4E9.50704@aurigainformatica.it>

I have set up a linux cluster consisting of two nodes and a disk array 
formatted with GFS. I have configured the cluster to run a mysql process 
where the data files are on the disk array. Though it isn't yet in a 
production environment seems that everything works fine. When I kill the 
mysql process running on one machine, the other mounts the shared 
filesystem and starts the mysql process itself.

Yesterday I experienced a crash of the node running the mysql process 
appareantly with no reason. I had to brutally switch off the node and 
restart since it was completely locked. Looking in the /var/log/messages 
file I found the following:

Mar 20 17:30:00 AICLSRV02 kernel: CMAN: Being told to leave the cluster 
by node 2
Mar 20 17:30:00 AICLSRV02 kernel: CMAN: we are leaving the cluster.
Mar 20 17:30:00 AICLSRV02 kernel: SM: 0000002c sm_stop: SG still joined
Mar 20 17:30:00 AICLSRV02 kernel: SM: 0100002d sm_stop: SG still joined
Mar 20 17:30:00 AICLSRV02 kernel: SM: 0200004f sm_stop: SG still joined
Mar 20 17:30:00 AICLSRV02 kernel: SM: 03000044 sm_stop: SG still joined
Mar 20 17:30:00 AICLSRV02 clurgmgrd[24620]: <warning> #67: Shutting down 
uncleanly
Mar 20 17:30:00 AICLSRV02 ccsd[4847]: Cluster manager shutdown. 
Attemping to reconnect...
Mar 20 17:30:01 AICLSRV02 kernel: t 179
Mar 20 17:30:01 AICLSRV02 kernel: clvmd add node 2
Mar 20 17:30:01 AICLSRV02 kernel: clvmd total nodes 2
Mar 20 17:30:01 AICLSRV02 kernel: clvmd rebuild resource directory
Mar 20 17:30:01 AICLSRV02 kernel: clvmd rebuilt 0 resources
Mar 20 17:30:01 AICLSRV02 kernel: clvmd purge requests
Mar 20 17:30:01 AICLSRV02 kernel: clvmd purged 0 requests
Mar 20 17:30:01 AICLSRV02 kernel: clvmd mark waiting requests
Mar 20 17:30:01 AICLSRV02 kernel: clvmd marked 0 requests
Mar 20 17:30:01 AICLSRV02 kernel: clvmd recover event 179 done
Mar 20 17:30:01 AICLSRV02 kernel: clvmd move flags 0,0,1 ids 176,179,179
Mar 20 17:30:01 AICLSRV02 kernel: clvmd process held requests
Mar 20 17:30:01 AICLSRV02 kernel: clvmd processed 0 requests
Mar 20 17:30:01 AICLSRV02 kernel: clvmd resend marked requests
Mar 20 17:30:01 AICLSRV02 kernel: clvmd resent 0 requests
Mar 20 17:30:01 AICLSRV02 kernel: clvmd recover event 179 finished
Mar 20 17:30:01 AICLSRV02 kernel: Magma move flags 1,0,0 ids 173,173,173
Mar 20 17:30:01 AICLSRV02 kernel: Magma move flags 0,1,0 ids 173,181,173
Mar 20 17:30:01 AICLSRV02 kernel: Magma move use event 181
Mar 20 17:30:01 AICLSRV02 kernel: Magma recover event 181
Mar 20 17:30:01 AICLSRV02 kernel: Magma add node 2
Mar 20 17:30:01 AICLSRV02 kernel: Magma total nodes 2
Mar 20 17:30:01 AICLSRV02 kernel: Magma rebuild resource directory
Mar 20 17:30:01 AICLSRV02 kernel: Magma rebuilt 0 resources
Mar 20 17:30:01 AICLSRV02 kernel: Magma purge requests
Mar 20 17:30:01 AICLSRV02 kernel: Magma purged 0 requests
Mar 20 17:30:01 AICLSRV02 kernel: Magma mark waiting requests
Mar 20 17:30:01 AICLSRV02 kernel: Magma marked 0 requests
Mar 20 17:30:01 AICLSRV02 kernel: Magma recover event 181 done
Mar 20 17:30:01 AICLSRV02 kernel: Magma move flags 0,0,1 ids 173,181,181
Mar 20 17:30:01 AICLSRV02 kernel: Magma process held requests
Mar 20 17:30:01 AICLSRV02 kernel: Magma processed 0 requests
Mar 20 17:30:01 AICLSRV02 kernel: Magma resend marked requests
Mar 20 17:30:01 AICLSRV02 kernel: Magma resent 0 requests
Mar 20 17:30:01 AICLSRV02 kernel: Magma recover event 181 finished
Mar 20 17:30:01 AICLSRV02 kernel: Magma send einval to 2
Mar 20 17:30:01 AICLSRV02 kernel: Magma (24620) req reply einval 14b026a 
fr 2 r 2 usrm::rg="mysql
Mar 20 17:30:01 AICLSRV02 kernel: lk 2,66e728 id 0 -1,3 10001
Mar 20 17:30:01 AICLSRV02 kernel: 18109 qc 2,66e728 -1,3 id 1c00ec sts 0 0
Mar 20 17:30:01 AICLSRV02 kernel: 7078 lk 2,66e725 id 0 -1,3 10001
Mar 20 17:30:01 AICLSRV02 kernel: 18109 qc 2,66e725 -1,3 id 2202ce sts 0 0
Mar 20 17:30:01 AICLSRV02 kernel: 7078 lk 2,66e284 id 0 -1,3 10001
Mar 20 17:30:01 AICLSRV02 kernel: 18109 qc 2,66e284 -1,3 id 1801d9 sts 0 0

[...]

Mar 20 17:30:04 AICLSRV02 kernel: 7544 lk 2,14c8 id 0 -1,3 10000
Mar 20 17:30:04 AICLSRV02 kernel:
Mar 20 17:30:04 AICLSRV02 kernel: lock_dlm:  Assertion failed on line 
411 of file /usr/src/build/573241-i686/BUILD/smp/src/dlm/lock.c
Mar 20 17:30:04 AICLSRV02 kernel: lock_dlm:  assertion:  "!error"
Mar 20 17:30:04 AICLSRV02 kernel: lock_dlm:  time = 1565317077
Mar 20 17:30:04 AICLSRV02 kernel: AICLSRV01: num=2,14c8 err=-22 cur=-1 
req=3 lkf=10000
Mar 20 17:30:04 AICLSRV02 kernel:
Mar 20 17:30:04 AICLSRV02 kernel: ------------[ cut here ]------------
Mar 20 17:30:04 AICLSRV02 kernel: kernel BUG at 
/usr/src/build/573241-i686/BUILD/smp/src/dlm/lock.c:411!
Mar 20 17:30:04 AICLSRV02 kernel: invalid operand: 0000 [#1]
Mar 20 17:30:04 AICLSRV02 kernel: SMP
Mar 20 17:30:04 AICLSRV02 kernel: Modules linked in: lock_dlm(U) gfs(U) 
lock_harness(U) dlm(U) cman(U) parport_pc lp parport autofs4 rfcomm 
l2cap bluetooth sunrpc dm_mod video button battery ac md5 ipv6 uhci_hcd 
ehci_hcd hw_random tg3 floppy ext3 jbd cciss sd_mod scsi_mod
Mar 20 17:30:04 AICLSRV02 kernel: CPU:    1
Mar 20 17:30:04 AICLSRV02 kernel: EIP:    0060:[<f8964bf5>]    Not 
tainted VLI
Mar 20 17:30:04 AICLSRV02 kernel: EFLAGS: 00010296   (2.6.11-1.1369_FC4smp)
Mar 20 17:30:04 AICLSRV02 kernel: EIP is at do_dlm_lock+0x1b7/0x21d 
[lock_dlm]
Mar 20 17:30:04 AICLSRV02 kernel: eax: 00000004   ebx: 00000000   ecx: 
c035ea4c   edx: 00000296
Mar 20 17:30:04 AICLSRV02 kernel: esi: f27f4d80   edi: ffffffea   ebp: 
00000002   esp: f4bb2d88
Mar 20 17:30:04 AICLSRV02 kernel: ds: 007b   es: 007b   ss: 0068
Mar 20 17:30:04 AICLSRV02 kernel: Process lsof (pid: 7544, 
threadinfo=f4bb2000 task=c36cea80)
Mar 20 17:30:04 AICLSRV02 kernel: Stack: f8969e75 f27f4d80 00000002 
000014c8 00000000 ffffffea ffffffff 00000003
Mar 20 17:30:04 AICLSRV02 kernel:        00010000 00000003 00000000 
f5ee4000 00000001 00010000 202000d0 20202020
Mar 20 17:30:04 AICLSRV02 kernel:        20203220 20202020 20202020 
34312020 00183863 f5736200 00000003 00000008
Mar 20 17:30:04 AICLSRV02 kernel: Call Trace:
Mar 20 17:30:04 AICLSRV02 kernel:  [<f8964cff>] lm_dlm_lock+0x52/0x5e 
[lock_dlm]
Mar 20 17:30:04 AICLSRV02 kernel:  [<f8964cad>] lm_dlm_lock+0x0/0x5e 
[lock_dlm]
Mar 20 17:30:04 AICLSRV02 kernel:  [<f8adcffc>] gfs_lm_lock+0x3d/0x5c [gfs]
Mar 20 17:30:04 AICLSRV02 kernel:  [<f8ad2039>] 
gfs_glock_xmote_th+0xae/0x1d3 [gfs]
Mar 20 17:30:04 AICLSRV02 kernel:  [<f8ad163c>] rq_promote+0x126/0x150 [gfs]
Mar 20 17:30:05 AICLSRV02 kernel:  [<f8ad1840>] run_queue+0xee/0x113 [gfs]
Mar 20 17:30:05 AICLSRV02 kernel:  [<f8ad2aea>] gfs_glock_nq+0x93/0x144 
[gfs]
Mar 20 17:30:05 AICLSRV02 kernel:  [<c016f43b>] link_path_walk+0x53/0xe0
Mar 20 17:30:05 AICLSRV02 kernel:  [<f8ad3196>] 
gfs_glock_nq_init+0x18/0x2d [gfs]
Mar 20 17:30:05 AICLSRV02 kernel:  [<f8aea0fe>] gfs_getattr+0x3b/0x5b [gfs]
Mar 20 17:30:05 AICLSRV02 kernel:  [<f8aea0c3>] gfs_getattr+0x0/0x5b [gfs]
Mar 20 17:30:05 AICLSRV02 kernel:  [<c016a42b>] vfs_getattr+0x40/0xa2
Mar 20 17:30:05 AICLSRV02 kernel:  [<c016a4b5>] vfs_stat+0x28/0x3a
Mar 20 17:30:05 AICLSRV02 kernel:  [<c0142c36>] 
audit_syscall_entry+0x132/0x160
Mar 20 17:30:05 AICLSRV02 kernel:  [<c016aa30>] sys_stat64+0xf/0x28
Mar 20 17:30:05 AICLSRV02 kernel:  [<c01086f9>] do_syscall_trace+0xef/0x123
Mar 20 17:30:05 AICLSRV02 kernel:  [<c0104025>] syscall_call+0x7/0xb
Mar 20 17:30:05 AICLSRV02 kernel: Code: 7c 24 14 89 4c 24 0c 89 5c 24 10 
89 6c 24 08 89 74 24 04 c7 04 24 28 a6 96 f8 e8 6e cf 7b c7 c7 04 24 75 
9e 96 f8 e8 62 cf 7b c7 <0f> 0b 9b 01 a0 a4 96 f8 c7 04 24 3c a5 96 f8 
e8 7a c5 7b c7 66
Mar 20 17:30:05 AICLSRV02 kernel:  <0>Fatal exception: panic in 5 seconds


On the other node, I see this log:

Mar 20 17:29:03 AICLSRV01 kernel: CMAN: removing node AICLSRV02 from the 
cluster : Missed too many heartbeats
Mar 20 17:29:03 AICLSRV01 fenced[2092]: AICLSRV02 not a cluster member 
after 0 sec post_fail_delay
Mar 20 17:29:03 AICLSRV01 fenced[2092]: fencing node "AICLSRV02"
Mar 20 17:29:03 AICLSRV01 fence_manual: Node AICLSRV02 needs to be reset 
before recovery can procede.  Waiting for AICLSRV02 to rejoin the 
cluster or for manual acknowledgement that it has been reset (i.e. 
fence_ack_manual -n AICLSRV02)


Any suggestion about what happened? Any way to avoid it? Thanks in advance.


Fabrizio Lippolis


From pcaulfie at redhat.com  Tue Mar 21 11:44:56 2006
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Tue, 21 Mar 2006 11:44:56 +0000
Subject: [Linux-cluster] dlm kernel bug (node crashed)
In-Reply-To: <441FE4E9.50704@aurigainformatica.it>
References: <441FE4E9.50704@aurigainformatica.it>
Message-ID: <441FE738.6080706@redhat.com>

Fabrizio Lippolis wrote:

> On the other node, I see this log:
> 
> Mar 20 17:29:03 AICLSRV01 kernel: CMAN: removing node AICLSRV02 from the
> cluster : Missed too many heartbeats

That's a clue :)

AICLSRV02 has either stopped responding for some reason, or the network
connection between them has been interrupted.

-- 

patrick


From cjk at techma.com  Tue Mar 21 12:47:03 2006
From: cjk at techma.com (Kovacs, Corey J.)
Date: Tue, 21 Mar 2006 07:47:03 -0500
Subject: [Linux-cluster] GFS 6u7?
Message-ID: <FF2CE0D593AEE34B955FEC77BD5AFBE0079E2F@tmaemail.techma.com>

With RHEL3u7 being out for about a week now, I expect GFS 6u7 to be available
soon as well. Does anyone have any information as to how much longer it will
be
until it's released? 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060321/7bdb5682/attachment.htm>

From forigato at gmail.com  Tue Mar 21 16:10:09 2006
From: forigato at gmail.com (ANDRE LUIS FORIGATO)
Date: Tue, 21 Mar 2006 13:10:09 -0300
Subject: [Linux-cluster] What is it in logs?
Message-ID: <9e7b71460603210810o1486c575g@mail.gmail.com>

Hello all,

What is it in logs? What it means?

Mar 10 21:46:51 websave1 clulockd[4428]: <warning> Denied 200.87.95.172:
Broken pipe

Mar 10 22:46:24 websave1 clumanager: [3713]: <notice> Starting Red Hat
Cluster Manager...
Mar 10 22:46:24 websave1 cluquorumd[3729]: <warning> STONITH: No drivers
configured for host '200.87.95.171'!
Mar 10 22:46:24 websave1 cluquorumd[3729]: <warning> STONITH: Data integrity
may be compromised!
Mar 10 22:46:24 websave1 cluquorumd[3729]: <warning> STONITH: No drivers
configured for host '200.87.95.172'!
Mar 10 22:46:24 websave1 cluquorumd[3729]: <warning> STONITH: Data integrity
may be compromised!
Mar 10 22:46:24 websave1 clumanager: cluquorumd startup succeeded

Mar 10 22:46:35 websave1 clumembd[3753]: <notice> Member 200.87.95.171 UP
Mar 10 22:46:37 websave1 cluquorumd[3730]: <warning> Membership reports #1
as down, but disk reports as up: State uncertain!
Mar 10 22:46:37 websave1 cluquorumd[3730]: <notice> Quorum Formed; Starting
Service Manager
Mar 10 22:46:37 websave1 clusvcmgrd: [4362]: <notice> service notice:
Stopping service oracle_cluster ...
Mar 10 22:46:37 websave1 clusvcmgrd: [4362]: <notice> service notice:
Running user script '/u01/cluster/scripts/oracluster.sh stop'

Mar 10 22:46:53 websave1 clusvcmgrd: [4362]: <notice> service notice:
Stopped service oracle_cluster ...
Mar 10 22:46:53 websave1 clusvcmgrd: [4699]: <notice> service notice:
Stopping service websave_cluster ...
Mar 10 22:46:53 websave1 clusvcmgrd: [4699]: <notice> service notice:
Running user script '/opt/websave/cluster/websave stop'
Mar 10 22:46:56 websave1 clusvcmgrd: [4699]: <notice> service notice:
Stopped service websave_cluster ...
Mar 10 22:46:56 websave1 clusvcmgrd: [4905]: <notice> service notice:
Stopping service tomcat_cluster ...
Mar 10 22:46:56 websave1 clusvcmgrd: [4905]: <notice> service notice:
Running user script '/opt/websave/cluster/tomcat stop'
Mar 10 22:46:56 websave1 su(pam_unix)[4935]: session opened for user websave
by (uid=0)
Mar 10 22:46:59 websave1 su(pam_unix)[4935]: session closed for user websave
Mar 10 22:47:01 websave1 clusvcmgrd: [4905]: <notice> service notice:
Stopped service tomcat_cluster ...

Mar 10 22:46:53 websave1 clusvcmgrd: [4362]: <notice> service notice:
Stopped service oracle_cluster ...
Mar 10 22:46:53 websave1 clusvcmgrd: [4699]: <notice> service notice:
Stopping service websave_cluster ...
Mar 10 22:46:53 websave1 clusvcmgrd: [4699]: <notice> service notice:
Running user script '/opt/websave/cluster/websave stop'
Mar 10 22:46:56 websave1 clusvcmgrd: [4699]: <notice> service notice:
Stopped service websave_cluster ...
Mar 10 22:46:56 websave1 clusvcmgrd: [4905]: <notice> service notice:
Stopping service tomcat_cluster ...
Mar 10 22:46:56 websave1 clusvcmgrd: [4905]: <notice> service notice:
Running user script '/opt/websave/cluster/tomcat stop'
Mar 10 22:46:56 websave1 su(pam_unix)[4935]: session opened for user websave
by (uid=0)
Mar 10 22:46:59 websave1 su(pam_unix)[4935]: session closed for user websave
Mar 10 22:47:01 websave1 clusvcmgrd: [4905]: <notice> service notice:
Stopped service tomcat_cluster ...


Mar 10 22:47:04 websave1 clusvcmgrd: [5079]: <notice> service notice:
Running user script '/u01/cluster/scripts/oracluster.sh start'
Mar 10 22:47:04 websave1 su(pam_unix)[5365]: session opened for user oracle
by (uid=0)

Mar 10 22:47:04 websave1 snmpd[2227]: Received SNMP packet(s) from
200.87.95.171
Mar 10 22:47:06 websave1 clumembd[3753]: <notice> Member 200.87.95.172 UP
Mar 10 22:47:07 websave1 cluquorumd[5391]: <err> VF: Abort: Invalid header
in reply from member #1
Mar 10 22:47:10 websave1 su(pam_unix)[5365]: session closed for user oracle
Mar 10 22:47:10 websave1 su(pam_unix)[5396]: session opened for user oracle
by (uid=0)
Mar 10 22:47:45 websave1 su(pam_unix)[5396]: session closed for user oracle
Mar 10 22:47:45 websave1 clusvcmgrd: [5079]: <notice> service notice:
Started service oracle_cluster ...
Mar 10 22:47:49 websave1 su(pam_unix)[6167]: session opened for user oracle
by (uid=0)
Mar 10 22:47:49 websave1 su(pam_unix)[6167]: session closed for user oracle
Mar 10 22:48:14 websave1 su(pam_unix)[6395]: session opened for user oracle
by (uid=0)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060321/0128e728/attachment.htm>

From Anthony.Assi at irisa.fr  Tue Mar 21 16:17:53 2006
From: Anthony.Assi at irisa.fr (Anthony Assi)
Date: Tue, 21 Mar 2006 17:17:53 +0100
Subject: [Linux-cluster] GFS Log Files.
Message-ID: <44202731.4000007@irisa.fr>

Hi,

How can i specify were to LOG GFS Infos?

like i.e: /var/log/gfs


From Alain.Moulle at bull.net  Tue Mar 21 17:02:22 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Tue, 21 Mar 2006 18:02:22 +0100
Subject: [Linux-cluster] CS4 Update 2 / problem with systems dump ?
Message-ID: <4420319E.3030307@bull.net>

Hi

How do you manage the problem of dumping machines ?

I mean, if in a HA pair, the 1st node is crashing
and proceeding to dump, the CS4 on peer node will
automatically power off the first one, and therefore
interrupt the dump process.

Does that mean that we never can have dump when
the nodes are "under" CS4 ?
Or is there a way to manage this point ?

Thanks
Alain


From teigland at redhat.com  Tue Mar 21 17:09:37 2006
From: teigland at redhat.com (David Teigland)
Date: Tue, 21 Mar 2006 11:09:37 -0600
Subject: [Linux-cluster] CS4 Update 2 / problem with systems dump ?
In-Reply-To: <4420319E.3030307@bull.net>
References: <4420319E.3030307@bull.net>
Message-ID: <20060321170937.GB20792@redhat.com>

On Tue, Mar 21, 2006 at 06:02:22PM +0100, Alain Moulle wrote:
> Hi
> 
> How do you manage the problem of dumping machines ?
> 
> I mean, if in a HA pair, the 1st node is crashing
> and proceeding to dump, the CS4 on peer node will
> automatically power off the first one, and therefore
> interrupt the dump process.
> 
> Does that mean that we never can have dump when
> the nodes are "under" CS4 ?
> Or is there a way to manage this point ?

You might set a fencing delay that would allow the dump to complete, e.g.
  <fence_daemon post_fail_delay="10">
  </fence_daemon>

Dave


From m_list at eshine.de  Tue Mar 21 17:46:11 2006
From: m_list at eshine.de (Arnd)
Date: Tue, 21 Mar 2006 18:46:11 +0100
Subject: [Linux-cluster] GFS hangs if one clusternode is powered off
Message-ID: <44203BE3.70806@eshine.de>

Hello list,

I'm just setting up an linux cluster with GFS and local shared discs
(SAN). All the clusternodes have the same LUN presented and I did all
the necessary steps to create an GFS on that disc. Its all just fine
working, but when powering one of the nodes off the whole GFS hangs. No
reads and no writes are possible to the filesystem. Even every process
accessing the device waits for the I/O to get completed.

The clusternodes are: adnux1, adnux2, adnux3, adnux4, adlade1
During the tests I'm only running the hosts: adnux2, adnux3, adnux4

While GFS is running fine, I can check the cluster with cman_tool:

adnux2 / # cman_tool status     
Protocol version: 5.0.1
Config version: 1
Cluster name: adnuxCluster1
Cluster ID: 41625
Cluster Member: Yes
Membership state: Cluster-Member
Nodes: 3
Expected_votes: 1
Total_votes: 3
Quorum: 2  
Active subsystems: 6
Node name: adnux2
Node addresses: 192.168.1.152 

&

adnux2 / # cat /proc/cluster/services
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           7   5 run       -
[1 2 3]

DLM Lock Space:  "clvmd"                             2   3 run       -
[3 1 2]

DLM Lock Space:  "adnux"                             8   6 run       -
[1 3 2]

GFS Mount Group: "adnux"                             9   7 run       -
[1 3 2]


When powering adnux4 off the other two hosts cannot access the GFS in
any way. The file /var/log/messages from one of the nodes says:
...
Mar 21 18:35:37 adnux2 kernel: CMAN: node adnux4 has been removed from
the cluster : Missed too many heartbeats
Mar 21 18:35:38 adnux2 fenced[5627]: adnux4 not a cluster member after 0
sec post_fail_delay
Mar 21 18:35:38 adnux2 fenced[5627]: fencing node "adnux4"
Mar 21 18:35:38 adnux2 fenced[5627]: fence "adnux4" failed

cman_tool tells that the cluster is still up:

adnux2 ~ # cman_tool status
Protocol version: 5.0.1
Config version: 1
Cluster name: adnuxCluster1
Cluster ID: 41625
Cluster Member: Yes
Membership state: Cluster-Member
Nodes: 2
Expected_votes: 1
Total_votes: 2
Quorum: 2  
Active subsystems: 6
Node name: adnux2
Node addresses: 192.168.1.152 

&

adnux2 ~ # cat /proc/cluster/services
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           7   5 recover 2 -
[1 2]

DLM Lock Space:  "clvmd"                             2   3 recover 0 -
[1 2]

DLM Lock Space:  "adnux"                             8   6 recover 0 -
[1 2]

GFS Mount Group: "adnux"                             9   7 recover 0 -
[1 2]


Even successfully running "fence_manual -n adnux4" and "fence_ack_manual
-n adnux4" remains without any affect. I'm wondering why the GFS is
blocking? It can not be that the failed node must be fenced in order to
gfs be able to function?!

I was searching for many ideas but only found some people pointing to an
maybe misconfigured fencing. Now I'm hoping to find here where my
mistake is.

Thank you in advance.


-- configuration file /etc/cluster/cluster.conf --

<?xml version="1.0"?>
<cluster name="adnuxCluster1" config_version="1">

<cman expected_votes="1" quorum="1">
</cman>

<clusternodes>
     <clusternode name="adnux1" votes="1">
       <fence>
           <method name="single">
             <device name="human" nodename="adnux1"/>
           </method>
        </fence>
     </clusternode>
     <clusternode name="adnux2" votes="1">
       <fence>
           <method name="single">
             <device name="human" nodename="adnux2"/>
           </method>
        </fence>
     </clusternode>
     <clusternode name="adnux3" votes="1">
       <fence>
           <method name="single">
             <device name="human" nodename="adnux3"/>
           </method>
        </fence>
     </clusternode>
     <clusternode name="adnux4" votes="1">
       <fence>
           <method name="single">
             <device name="human" nodename="adnux4"/>
           </method>
        </fence>
      </clusternode>
      <clusternode name="adlade1" votes="1">
       <fence>
           <method name="single">
             <device name="human" nodename="adlade1"/>
           </method>
        </fence>
      </clusternode>
</clusternodes>
<fence_devices>
  <device name="human" agent="fence_manual"/>
</fence_devices>

</cluster>


From m_list at eshine.de  Tue Mar 21 17:55:08 2006
From: m_list at eshine.de (Arnd)
Date: Tue, 21 Mar 2006 18:55:08 +0100
Subject: [Linux-cluster] GFS hangs if one clusternode is powered off
Message-ID: <44203DFC.9060008@eshine.de>

Hello list,

I'm just setting up an linux cluster with GFS and local shared discs
(SAN). All the clusternodes have the same LUN presented and I did all
the necessary steps to create an GFS on that disc. Its all just fine
working, but when powering one of the nodes off the whole GFS hangs. No
reads and no writes are possible to the filesystem. Even every process
accessing the device waits for the I/O to get completed.

The clusternodes are: adnux1, adnux2, adnux3, adnux4, adlade1
During the tests I'm only running the hosts: adnux2, adnux3, adnux4

While GFS is running fine, I can check the cluster with cman_tool:

adnux2 / # cman_tool status     
Protocol version: 5.0.1
Config version: 1
Cluster name: adnuxCluster1
Cluster ID: 41625
Cluster Member: Yes
Membership state: Cluster-Member
Nodes: 3
Expected_votes: 1
Total_votes: 3
Quorum: 2  
Active subsystems: 6
Node name: adnux2
Node addresses: 192.168.1.152

&

adnux2 / # cat /proc/cluster/services
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           7   5 run       -
[1 2 3]

DLM Lock Space:  "clvmd"                             2   3 run       -
[3 1 2]

DLM Lock Space:  "adnux"                             8   6 run       -
[1 3 2]

GFS Mount Group: "adnux"                             9   7 run       -
[1 3 2]


When powering adnux4 off the other two hosts cannot access the GFS in
any way. The file /var/log/messages from one of the nodes says:
...
Mar 21 18:35:37 adnux2 kernel: CMAN: node adnux4 has been removed from
the cluster : Missed too many heartbeats
Mar 21 18:35:38 adnux2 fenced[5627]: adnux4 not a cluster member after 0
sec post_fail_delay
Mar 21 18:35:38 adnux2 fenced[5627]: fencing node "adnux4"
Mar 21 18:35:38 adnux2 fenced[5627]: fence "adnux4" failed

cman_tool tells that the cluster is still up:

adnux2 ~ # cman_tool status
Protocol version: 5.0.1
Config version: 1
Cluster name: adnuxCluster1
Cluster ID: 41625
Cluster Member: Yes
Membership state: Cluster-Member
Nodes: 2
Expected_votes: 1
Total_votes: 2
Quorum: 2  
Active subsystems: 6
Node name: adnux2
Node addresses: 192.168.1.152

&

adnux2 ~ # cat /proc/cluster/services
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           7   5 recover 2 -
[1 2]

DLM Lock Space:  "clvmd"                             2   3 recover 0 -
[1 2]

DLM Lock Space:  "adnux"                             8   6 recover 0 -
[1 2]

GFS Mount Group: "adnux"                             9   7 recover 0 -
[1 2]


Even successfully running "fence_manual -n adnux4" and "fence_ack_manual
-n adnux4" remains without any affect. I'm wondering why the GFS is
blocking? It can not be that the failed node must be fenced in order to
gfs be able to function?!

I was searching for many ideas but only found some people pointing to an
maybe misconfigured fencing. Now I'm hoping to find here where my
mistake is.

Thank you in advance.


-- configuration file /etc/cluster/cluster.conf --

<?xml version="1.0"?>
<cluster name="adnuxCluster1" config_version="1">

<cman expected_votes="1" quorum="1">
</cman>

<clusternodes>
     <clusternode name="adnux1" votes="1">
       <fence>
           <method name="single">
             <device name="human" nodename="adnux1"/>
           </method>
        </fence>
     </clusternode>
     <clusternode name="adnux2" votes="1">
       <fence>
           <method name="single">
             <device name="human" nodename="adnux2"/>
           </method>
        </fence>
     </clusternode>
     <clusternode name="adnux3" votes="1">
       <fence>
           <method name="single">
             <device name="human" nodename="adnux3"/>
           </method>
        </fence>
     </clusternode>
     <clusternode name="adnux4" votes="1">
       <fence>
           <method name="single">
             <device name="human" nodename="adnux4"/>
           </method>
        </fence>
      </clusternode>
      <clusternode name="adlade1" votes="1">
       <fence>
           <method name="single">
             <device name="human" nodename="adlade1"/>
           </method>
        </fence>
      </clusternode>
</clusternodes>
<fence_devices>
  <device name="human" agent="fence_manual"/>
</fence_devices>

</cluster>

 
From teigland at redhat.com  Tue Mar 21 17:56:55 2006
From: teigland at redhat.com (David Teigland)
Date: Tue, 21 Mar 2006 11:56:55 -0600
Subject: [Linux-cluster] GFS hangs if one clusternode is powered off
In-Reply-To: <44203BE3.70806@eshine.de>
References: <44203BE3.70806@eshine.de>
Message-ID: <20060321175655.GD20792@redhat.com>

On Tue, Mar 21, 2006 at 06:46:11PM +0100, Arnd wrote:
> Mar 21 18:35:38 adnux2 fenced[5627]: fencing node "adnux4"
> Mar 21 18:35:38 adnux2 fenced[5627]: fence "adnux4" failed

> Fence Domain:    "default"                           7   5 recover 2 -
> [1 2]

The fencing failure is the problem.

> <fence_devices>
>   <device name="human" agent="fence_manual"/>
> </fence_devices>

The names used in cluster.conf changed a while back, it's now:

<fencedevices>
        <fencedevice name="human" agent="fence_manual"/>
</fencedevices>

Dave


From Matthew.Patton.ctr at osd.mil  Tue Mar 21 21:45:43 2006
From: Matthew.Patton.ctr at osd.mil (Patton, Matthew F, CTR, OSD-PA&E)
Date: Tue, 21 Mar 2006 16:45:43 -0500
Subject: [Linux-cluster] RAIDing a CLVM?
Message-ID: <D8063DF686D10247B0A49D01271285690CE91D33@osdn06.osd.mil>

Classification: UNCLASSIFIED

call this a screwy idea and I can't seem to find a relevant thread.

my cluster is made up of machines each with a SINGLE hard drive. I want to
use 1/2 of each disk and pool them all together and make a RAID 10 or 5 set.
And then be able to access said volume RW from any node and should a node
die, the filesystem keeps running.

I can't think of a way to combine (C)LVM, GFS, GNBD, and MD (software RAID)
and make it work unless just one of the nodes becomes the MD master and then
just exports it via NFS. Can it be done? Do commercial options exist to pull
off this trick?

My backup plan is to define a shared SAN volume intended for high WRITE
volume, partition it, and each node that needs to scribble manages it's own
partition+filesystem. And it's up to me to make sure no 2 nodes try to own
the same piece.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060321/be7acf7b/attachment.htm>

From ian at blenke.com  Tue Mar 21 22:15:58 2006
From: ian at blenke.com (Ian C. Blenke)
Date: Tue, 21 Mar 2006 17:15:58 -0500
Subject: [Linux-cluster] RAIDing a CLVM?
In-Reply-To: <D8063DF686D10247B0A49D01271285690CE91D33@osdn06.osd.mil>
References: <D8063DF686D10247B0A49D01271285690CE91D33@osdn06.osd.mil>
Message-ID: <44207B1E.4090008@blenke.com>

Patton, Matthew F, CTR, OSD-PA&E wrote:

> Classification: UNCLASSIFIED
>
> call this a screwy idea and I can't seem to find a relevant thread.
>
> my cluster is made up of machines each with a SINGLE hard drive. I 
> want to use 1/2 of each disk and pool them all together and make a 
> RAID 10 or 5 set. And then be able to access said volume RW from any 
> node and should a node die, the filesystem keeps running.
>
> I can't think of a way to combine (C)LVM, GFS, GNBD, and MD (software 
> RAID) and make it work unless just one of the nodes becomes the MD 
> master and then just exports it via NFS. Can it be done? Do commercial 
> options exist to pull off this trick?
>

You _can_ use lvcreate -m to create a CLVM'ed mirror and GFS across 
that. It's rather young still though, I'm not sure I'd do anything but 
play around with it at the moment. It requires at least 3 pv's, and you 
really don't get to specify where things are laid out.
   |
    # lvcreate -m 1 -n mirror1 --alloc anywhere -L 4G vg

|I blogged a bit on it:

    http://ian.blenke.com/projects/xen/cluster/buildingxenclusterstep9.html

Does anyone know what happened to dd-raid? Daniel Phillips at RedHat had 
a really neat raid3.5 thing going that seemed a _lot_ like ZFS in many 
respects, with the benefit of being cluster aware.

> My backup plan is to define a shared SAN volume intended for high 
> WRITE volume, partition it, and each node that needs to scribble 
> manages it's own partition+filesystem. And it's up to me to make sure 
> no 2 nodes try to own the same piece.
>

I too would like to see a commodity solution for network mirrored 
storage used in a cluster configuration without some form of shared 
media between the cluster nodes.

My experiments found that:
1) AoE doesn't like it when a target disappears. Restarting vblade isn't 
enough, you get to reload the aoe driver on the other node.
and
2) Xen 3.0 has some serious UDP checksum corruption issues.

Hope this helps..

- Ian C. Blenke <ian at blenke.com> http://ian.blenke.com/


From forums at daltonfirth.co.uk  Tue Mar 21 22:41:00 2006
From: forums at daltonfirth.co.uk (James Firth)
Date: Tue, 21 Mar 2006 22:41:00 +0000
Subject: [Linux-cluster] RAIDing a CLVM?
In-Reply-To: <D8063DF686D10247B0A49D01271285690CE91D33@osdn06.osd.mil>
References: <D8063DF686D10247B0A49D01271285690CE91D33@osdn06.osd.mil>
Message-ID: <442080FC.4050908@daltonfirth.co.uk>

Patton, Matthew F, CTR, OSD-PA&E wrote:
> I can't think of a way to combine (C)LVM, GFS, GNBD, and MD (software 
> RAID) and make it work unless just one of the nodes becomes the MD 
> master and then just exports it via NFS. Can it be done? Do commercial 
> options exist to pull off this trick?

Hi,

We're working on the same problem. We have tried two approaches, both 
with their own fairly serious drawbacks.

Our goal was a 2-node all-in-one HA mega server, providing all office 
services from one cluster, and with no single point of failure.

The first uses a raid master for each pair.  Each member of the pair 
exports a disk using GNBD.  The pair negotiate a master using CMAN, and 
that master assembles a RAID device using one GNBD import, plus one 
local disk, and then exports it using NFS, or in the case of GFS being 
used, exports the assembled raid device via a third GNBD export.

Our trick here was each node exported it's contributory disk, using 
GNDB, by default, so long as at east one other node was active (quorum > 
1), knowing only one master would ever be active. This significantly 
reduced complexity.

Problems are:
  - GNDB instabilities cause frequent locks and crashes, especially 
busying DLM (suspected).
  - NFS export scheme also causes locks and hangs to NFS clients on 
failover *IF* a member of the pair then subsequently imports and an NFS 
client, as needed in some of our mega-server ideas.
  - NFS export is not too useful when file locking is important, e.g. 
subversion, procmail etc (yes, procmail, if your mail server is also 
your Samba homes server).  You have to dell mailproc to use alternative 
mailbox locking else mailboxes get corrupted.
  - GFS on assembled device with GNDB export scheme works best, but 
still causes locks and hangs.  Note also an exporting client must NOT 
import it's own exported GNBD volume, so there is no symmetry between 
the pair, and it's quite difficult to manage.


Our second approach is something we've just embarked on, and so far is 
proving more successful, using DRBD.  DRBD is used to create a mirrored 
pair of volumes, a bit like GNBD+MD as above.

The result is a block device accessible from both machines, but the 
problem is that only one member of the pair is writable (master), and 
the other is a read-only mount.

If the master server dies, the remaining DRBD becomes the master, and 
becomes writable.  When the dead node recovers, the recovered node 
becomes a slave, read-only.

The problem is with the read-only aspect, so you still need to have an 
exporting mechanism for the assembled DRBD volume running on the DRBD 
master.  We plan to do this via GNBD export (GFS FS installed).

That's where the complexity comes in - as the DRBD negotiation appears 
to be totally independent of cluster control suite, and so we're having 
to use customizations to start the exporting daemon on the DRBD master.


Conclusions
---

 From all we've learned to date, it still seems a dedicated file server 
or SAN approach is necessary to maintain availability.

Either of the above schemes would work fairly well if we were just 
building a HA storage component, because most of the complexities we've 
encountered come about when the shared storage device is used by 
services on the same cluster nodes.

Most, if not all of what we've done so far is not suitable for a 
production environment, as it just increases the coupling between nodes, 
and therefore increases the chance of a cascade failure of the cluster. 
  In all seriousness I believe a single machine with RAID-1 pair has a 
higher MTBF than any of our experiments.

Many parts of the CCS/GFS suite so far released have serious issues when 
used in non-standard configurations.  For example, exception handling 
we've encountered usually defaults to "while (1) { retry(); sleep(1); }"

I've read last year about plans for GFS mirroring from RedHat, and 
haven't found much else since.  If anyone knows more I'd love to hear.

It also appears that the guys behind DRBD want to further develop their 
mirroring so that both volumes can be writable, in which case you can 
just stick GFS on the assembled device, and run whichever exporting 
method you like as a normal cluster service.


James

www.daltonfirth.co.uk


From Alain.Moulle at bull.net  Wed Mar 22 07:59:17 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Wed, 22 Mar 2006 08:59:17 +0100
Subject: [Linux-cluster] CS4 Update 2 / problem with systems dump ?
Message-ID: <442103D5.90806@bull.net>

On Tue, Mar 21, 2006 at 06:02:22PM +0100, Alain Moulle wrote:
>> >> How do you manage the problem of dumping machines ?
>> >>
>> >> I mean, if in a HA pair, the 1st node is crashing
>> >> and proceeding to dump, the CS4 on peer node will
>> >> automatically power off the first one, and therefore
>> >> interrupt the dump process.
>> >>
>> >> Does that mean that we never can have dump when
>> >> the nodes are "under" CS4 ?
>> >> Or is there a way to manage this point ?

>> You might set a fencing delay that would allow the dump to complete, e.g.
>>   <fence_daemon post_fail_delay="10">
>>   </fence_daemon>
>> Dave
OK but does that mean that one we have patched this, the peer node will
wait in all cases this delay before fencing the node with problem, even
if this node is not dumping , right ?
So, the workaround that you propose is to be used only this way :
1. a node has crashed and was about to dump but has been fenced.
2. patch the post_fail_delay
3. re-start CS4 on both nodes
4. wait for a new crash and dump, and in this case, the failover
   will take at least the post_fail_delay value.

Am I right ?

Thanks
Alain


From magobin at gmail.com  Wed Mar 22 08:00:09 2006
From: magobin at gmail.com (Alex aka Magobin)
Date: Wed, 22 Mar 2006 09:00:09 +0100
Subject: [Linux-cluster] Disconnecting eth cable, cluster hung-uP!
Message-ID: <1143014409.7656.14.camel@localhost.localdomain>

hi, 
Cluster that I'm testing in lab with two node works fine, but
unfortunately I don't have any fence device for my test. It can switch
services from one node to other without any problem and if I shut a node
all services go to other node...BUT
Today I tried to simply disconnect ethernet cable to one node and I saw
that both node hung-up....I can't use clustat anymore...

In log I can see that CMAN remove correctly node from cluster (missed
too many heartbeats) and at same time I have a "fenced: nodo1 not a
cluster member after 0 sec post_fail_delay"

After that..only a lot of "fence "nodo1" failed"

but in this case, simply removing ethernet cable....other node doesn't
start services...why?

Plus...I maked a script to shut correctly the services in case of
emergency but in this case It'hung-up during 
"Waiting for services to stop:"


How can I resolve this problem....

Thanks in advance!
Alex


From l.dardini at comune.prato.it  Wed Mar 22 08:46:30 2006
From: l.dardini at comune.prato.it (Leandro Dardini)
Date: Wed, 22 Mar 2006 09:46:30 +0100
Subject: R: [Linux-cluster] Disconnecting eth cable, cluster hung-uP!
Message-ID: <404AA6666D14D14CA0D410C1BC6CC4C5465315@exchange3.comune.prato.local>

 
> -----Messaggio originale-----
> Da: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] Per conto di Alex 
> aka Magobin
> Inviato: mercoled? 22 marzo 2006 9.00
> A: linux-cluster at redhat.com
> Oggetto: [Linux-cluster] Disconnecting eth cable, cluster hung-uP!
> 
> hi,
> Cluster that I'm testing in lab with two node works fine, but 
> unfortunately I don't have any fence device for my test. It 
> can switch services from one node to other without any 
> problem and if I shut a node all services go to other 
> node...BUT Today I tried to simply disconnect ethernet cable 
> to one node and I saw that both node hung-up....I can't use 
> clustat anymore...
> 
> In log I can see that CMAN remove correctly node from cluster 
> (missed too many heartbeats) and at same time I have a 
> "fenced: nodo1 not a cluster member after 0 sec post_fail_delay"
> 
> After that..only a lot of "fence "nodo1" failed"

This is you problem. Fencing is the most important feature of a cluster!

> 
> but in this case, simply removing ethernet cable....other 
> node doesn't start services...why?

When you made access to a shared media, to grant data integrity all writes must be coordinated. If a node is not responding and the cluster is not sure it is not writing on the shared media, it pause the access to device to avoid or minimize data corruption.

Fencing is the action the device take against a "not responding node" to be sure it hasn't still access to the shared media. Fencing can be against the power of the "not responding node", turning it off or against the shared media, like blocking the port where the FC card is connected.

> 
> Plus...I maked a script to shut correctly the services in 
> case of emergency but in this case It'hung-up during "Waiting 
> for services to stop:"
> 

This happens for a short period of time, but after few seconds the service stop correctly. Are you running this script without connection between nodes? You cannot shutdown a service when the cluster is not quorate

Leandro

PS
Are you italian? Scrivimi pure in privato se hai ancora problemi


> 
> How can I resolve this problem....
> 
> Thanks in advance!
> Alex
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From Alain.Moulle at bull.net  Wed Mar 22 09:45:48 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Wed, 22 Mar 2006 10:45:48 +0100
Subject: [Linux-cluster] CS4 Update2 / same question about sync cluster.conf
Message-ID: <44211CCC.5040003@bull.net>

Hi

Again, as I did not get any answer, I give more
precisions on question :
I've seen that ccsd is allowed to sync the cluster.conf
file between two nodes if one of them has been updated
while the CS4 is running.

But is there any other case where the cluster.conf is
sync, even without any update of the file on one side
or the other ? (meaning, even if the cluster.conf files
are always the same on both nodes)

Thanks
Alain


From mdl at veles.ru  Wed Mar 22 11:10:30 2006
From: mdl at veles.ru (Denis Medvedev)
Date: Wed, 22 Mar 2006 14:10:30 +0300
Subject: [Linux-cluster] RAIDing a CLVM?
In-Reply-To: <442080FC.4050908@daltonfirth.co.uk>
References: <D8063DF686D10247B0A49D01271285690CE91D33@osdn06.osd.mil>
	<442080FC.4050908@daltonfirth.co.uk>
Message-ID: <442130A6.3000106@veles.ru>


A better approach is to export not an GNBD but an iSCSI device from DRBD.


James Firth wrote:


> Patton, Matthew F, CTR, OSD-PA&E wrote:
>
>> I can't think of a way to combine (C)LVM, GFS, GNBD, and MD (software 
>> RAID) and make it work unless just one of the nodes becomes the MD 
>> master and then just exports it via NFS. Can it be done? Do 
>> commercial options exist to pull off this trick?
>
>
> Hi,
>
> We're working on the same problem. We have tried two approaches, both 
> with their own fairly serious drawbacks.
>
> Our goal was a 2-node all-in-one HA mega server, providing all office 
> services from one cluster, and with no single point of failure.
>
> The first uses a raid master for each pair.  Each member of the pair 
> exports a disk using GNBD.  The pair negotiate a master using CMAN, 
> and that master assembles a RAID device using one GNBD import, plus 
> one local disk, and then exports it using NFS, or in the case of GFS 
> being used, exports the assembled raid device via a third GNBD export.
>
> Our trick here was each node exported it's contributory disk, using 
> GNDB, by default, so long as at east one other node was active (quorum 
> > 1), knowing only one master would ever be active. This significantly 
> reduced complexity.
>
> Problems are:
>  - GNDB instabilities cause frequent locks and crashes, especially 
> busying DLM (suspected).
>  - NFS export scheme also causes locks and hangs to NFS clients on 
> failover *IF* a member of the pair then subsequently imports and an 
> NFS client, as needed in some of our mega-server ideas.
>  - NFS export is not too useful when file locking is important, e.g. 
> subversion, procmail etc (yes, procmail, if your mail server is also 
> your Samba homes server).  You have to dell mailproc to use 
> alternative mailbox locking else mailboxes get corrupted.
>  - GFS on assembled device with GNDB export scheme works best, but 
> still causes locks and hangs.  Note also an exporting client must NOT 
> import it's own exported GNBD volume, so there is no symmetry between 
> the pair, and it's quite difficult to manage.
>
>
>
> Our second approach is something we've just embarked on, and so far is 
> proving more successful, using DRBD.  DRBD is used to create a 
> mirrored pair of volumes, a bit like GNBD+MD as above.
>
> The result is a block device accessible from both machines, but the 
> problem is that only one member of the pair is writable (master), and 
> the other is a read-only mount.
>
> If the master server dies, the remaining DRBD becomes the master, and 
> becomes writable.  When the dead node recovers, the recovered node 
> becomes a slave, read-only.
>
> The problem is with the read-only aspect, so you still need to have an 
> exporting mechanism for the assembled DRBD volume running on the DRBD 
> master.  We plan to do this via GNBD export (GFS FS installed).
>
> That's where the complexity comes in - as the DRBD negotiation appears 
> to be totally independent of cluster control suite, and so we're 
> having to use customizations to start the exporting daemon on the DRBD 
> master.
>
>
> Conclusions
> ---
>
> From all we've learned to date, it still seems a dedicated file server 
> or SAN approach is necessary to maintain availability.
>
> Either of the above schemes would work fairly well if we were just 
> building a HA storage component, because most of the complexities 
> we've encountered come about when the shared storage device is used by 
> services on the same cluster nodes.
>
> Most, if not all of what we've done so far is not suitable for a 
> production environment, as it just increases the coupling between 
> nodes, and therefore increases the chance of a cascade failure of the 
> cluster.  In all seriousness I believe a single machine with RAID-1 
> pair has a higher MTBF than any of our experiments.
>
> Many parts of the CCS/GFS suite so far released have serious issues 
> when used in non-standard configurations.  For example, exception 
> handling we've encountered usually defaults to "while (1) { retry(); 
> sleep(1); }"
>
> I've read last year about plans for GFS mirroring from RedHat, and 
> haven't found much else since.  If anyone knows more I'd love to hear.
>
> It also appears that the guys behind DRBD want to further develop 
> their mirroring so that both volumes can be writable, in which case 
> you can just stick GFS on the assembled device, and run whichever 
> exporting method you like as a normal cluster service.
>
>
>
> James
>
> www.daltonfirth.co.uk
>
>
>
>
>
>
>
>
>
>
>
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>


From forums at daltonfirth.co.uk  Wed Mar 22 12:04:06 2006
From: forums at daltonfirth.co.uk (James Firth)
Date: Wed, 22 Mar 2006 12:04:06 +0000
Subject: [Linux-cluster] Port Fencing Dell AX100 SAN
In-Reply-To: <442103D5.90806@bull.net>
References: <442103D5.90806@bull.net>
Message-ID: <44213D36.1030303@daltonfirth.co.uk>

Has anyone managed to port-fence a Dell AX100 SAN?

Have a customer with 2 1850 PowerEdge and a SAN but no DRAC.  Was 
wondering if it was possible to port-fence failed nodes at the SAN?

Thanks,
James


From dgolden at cp.dias.ie  Wed Mar 22 13:47:43 2006
From: dgolden at cp.dias.ie (David Golden)
Date: Wed, 22 Mar 2006 13:47:43 +0000
Subject: [Linux-cluster] RAIDing a CLVM?
In-Reply-To: <442130A6.3000106@veles.ru>
References: <D8063DF686D10247B0A49D01271285690CE91D33@osdn06.osd.mil>
	<442080FC.4050908@daltonfirth.co.uk> <442130A6.3000106@veles.ru>
Message-ID: <20060322134742.GB8974@ariadne.cp.dias.ie>

On 2006-03-22 14:10:30 +0300, Denis Medvedev wrote:
> 
> A better approach is to export not an GNBD but an iSCSI device from DRBD.
>

I was idly considering such an approach, but hadn't quite
done anything yet: have you got it working satisfactorily 
across failover, i.e. without iscsi initiators pointing
at the target noticing anything much beyond a short lag, no 
creeping corruption from cache issues, etc?  


From Matthew.Patton.ctr at osd.mil  Wed Mar 22 14:05:25 2006
From: Matthew.Patton.ctr at osd.mil (Patton, Matthew F, CTR, OSD-PA&E)
Date: Wed, 22 Mar 2006 09:05:25 -0500
Subject: [Linux-cluster] RAIDing a CLVM?
Message-ID: <D8063DF686D10247B0A49D01271285690CE91D38@osdn06.osd.mil>

Classification: UNCLASSIFIED

> A better approach is to export not an GNBD but an iSCSI 
> device from DRBD.

how about exporting each single HD as iSCSI instead of GNBD in the first
place then? Is the iSCSI subsystem more robust in that sense? Then on the
softwareRaid master instead of exporting a filesystem, export a raid'ed
iSCSI device. If the RAID meta-data were spread out onto multiple disks
(which actually would mean multiple machines) then if the master went down
any of the others should be able to rebuild the RAID set or at least run in
degraded mode.

back to more reading and tinkering...
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060322/1f26754b/attachment.htm>

From forums at daltonfirth.co.uk  Wed Mar 22 14:15:56 2006
From: forums at daltonfirth.co.uk (James Firth)
Date: Wed, 22 Mar 2006 14:15:56 +0000
Subject: [Linux-cluster] RAIDing a CLVM?
In-Reply-To: <D8063DF686D10247B0A49D01271285690CE91D38@osdn06.osd.mil>
References: <D8063DF686D10247B0A49D01271285690CE91D38@osdn06.osd.mil>
Message-ID: <44215C1C.8050402@daltonfirth.co.uk>

Patton, Matthew F, CTR, OSD-PA&E wrote:
> Classification: UNCLASSIFIED
> 
>  > A better approach is to export not an GNBD but an iSCSI
>  > device from DRBD.
> 
> how about exporting each single HD as iSCSI instead of GNBD in the first 
> place then? Is the iSCSI subsystem more robust in that sense? Then on 
> the softwareRaid master instead of exporting a filesystem, export a 
> raid'ed iSCSI device. If the RAID meta-data were spread out onto 
> multiple disks (which actually would mean multiple machines) then if the 
> master went down any of the others should be able to rebuild the RAID 
> set or at least run in degraded mode.

In theory, yes.  This is what we have "working" in the lab using GNBD. 
Under "normal" fail-over conditions, the remaining node does re-assemble 
  a degraded RAID device.  The problems mainly stem from GNBD under 
certain fail-over conditions.  Both in importing/exporting the component 
disks, and then importing/exporting the assembled raid device.

I'm working out of the office today but will post our init scripts for 
the GNBD equivalent, on the understanding that you post back your iSCSI 
stuff and mods to our scripts.

Regards,
James


From teigland at redhat.com  Wed Mar 22 15:12:34 2006
From: teigland at redhat.com (David Teigland)
Date: Wed, 22 Mar 2006 09:12:34 -0600
Subject: [Linux-cluster] CS4 Update 2 / problem with systems dump ?
In-Reply-To: <442103D5.90806@bull.net>
References: <442103D5.90806@bull.net>
Message-ID: <20060322151234.GB12284@redhat.com>

On Wed, Mar 22, 2006 at 08:59:17AM +0100, Alain Moulle wrote:
> >> You might set a fencing delay that would allow the dump to complete, e.g.
> >>   <fence_daemon post_fail_delay="10">
> >>   </fence_daemon>

> OK but does that mean that one we have patched this, the peer node will
> wait in all cases this delay before fencing the node with problem, even
> if this node is not dumping , right ?

When fenced goes to fence a failed node, it waits 10s before actually
killing it.  That applies to all nodes that fail.

> So, the workaround that you propose is to be used only this way :
> 1. a node has crashed and was about to dump but has been fenced.
> 2. patch the post_fail_delay
> 3. re-start CS4 on both nodes
> 4. wait for a new crash and dump, and in this case, the failover
>    will take at least the post_fail_delay value.

I'm not sure what you mean by this, but it doesn't sound right.
post_fail_delay would be added permanently to cluster.conf which
is the same on all nodes... you don't change it.

Dave


From m_list at eshine.de  Wed Mar 22 15:12:16 2006
From: m_list at eshine.de (Arnd)
Date: Wed, 22 Mar 2006 16:12:16 +0100
Subject: [Linux-cluster] Running GFS without fencing and maybe locking ;-)
Message-ID: <44216950.2030909@eshine.de>

While setting up our cluster I was wondering why GFS blocks the
filesystem when one of the nodes fails (the cluster remains in state
"recover" and waits for the failed node to be fenced). Manual fence is
quite to slow (if I am issuing it) and while we are running more than
one services on a node we cannot shut it down over ILO or deactivate the
Brocade port:

 - The nodes have an mysql database running completely in memory (which
is lost if fenced is powering this system off)

 - The nodes have more filesystems mounted which may also fail if I'm
deactivating the port on the Brocade switch


Our cluster consists of 4 Webservers and one management server. This
management server is the only server which needs write access to the GFS
(for example changing the html-files):

	webserver1 - webserver4: mount GFS -o ro (readonly)
	mgm-server1: mount GFS -o rw (write access)

My idea:

If one of the webserver fails then the cluster will issue an
fence_script with an exitcode "0". The node is fenced by the cluster and
while the filesystem wasn't mounted rw it cannot be destroyed.

The only possible way the filesystem can get corrupted is when the
management-server fails.

So is it possible to run the GFS with 4 readonly nodes and only one node
 which should be taken care if it fails? How does locking (lock_dlm)
work  in this case? I suppose that it only needs to take care for any
writes to the filesystem but here I might be wrong?!

Can I use lock_nolock (when making the filesystem) if only one node is
writing to the GFS?

Arnd


From gevery at vcommerce.com  Wed Mar 22 15:23:19 2006
From: gevery at vcommerce.com (Gary Every)
Date: Wed, 22 Mar 2006 08:23:19 -0700
Subject: [Linux-cluster] Removing a node from a cluster
Message-ID: <1143040999.31944.34.camel@every.vcommerce.com>

Greetings, all!

Trying my first RHEL cluster, and in setting it up, I've gone through a
couple attempts, and had licensing issues. Those have been resolved, and
now I have two machines running, attached to an EMC for the shared
storage. Here's my issue:

Apparently when I was trying to get the cs-deploy-tool to authenticate
both my machines, I must  have successfully joined ONE of them to a
cluster. Now when I try to run the tool, it states that the one server
is already part of a cluster and can't be added.

I stopped ccsd and lock_gulmd, tried again, with still no success. I
don't have a cluster.conf file on the server in question, so how can it
be part of a cluster, and how can I "disengage" it from that unknown
cluster?

TIA,

--
Gary Every
Linux Administrator
VCommerce Corporation
gevery at vcommerce.com


From Alain.Moulle at bull.net  Wed Mar 22 15:24:41 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Wed, 22 Mar 2006 16:24:41 +0100
Subject: [Linux-cluster] CS4 Update 2 / problem with systems dump ?
In-Reply-To: <20060322151234.GB12284@redhat.com>
References: <442103D5.90806@bull.net> <20060322151234.GB12284@redhat.com>
Message-ID: <44216C39.4090906@bull.net>

David Teigland wrote:
> On Wed, Mar 22, 2006 at 08:59:17AM +0100, Alain Moulle wrote:
> 
>>>>You might set a fencing delay that would allow the dump to complete, e.g.
>>>>  <fence_daemon post_fail_delay="10">
>>>>  </fence_daemon>
> 
> 
>>OK but does that mean that one we have patched this, the peer node will
>>wait in all cases this delay before fencing the node with problem, even
>>if this node is not dumping , right ?
> 
> 
> When fenced goes to fence a failed node, it waits 10s before actually
> killing it.  That applies to all nodes that fail.
> 
> 
>>So, the workaround that you propose is to be used only this way :
>>1. a node has crashed and was about to dump but has been fenced.
>>2. patch the post_fail_delay
>>3. re-start CS4 on both nodes
>>4. wait for a new crash and dump, and in this case, the failover
>>   will take at least the post_fail_delay value.
> 
> 
> I'm not sure what you mean by this, but it doesn't sound right.
> post_fail_delay would be added permanently to cluster.conf which
> is the same on all nodes... you don't change it.
>
> Dave
>
>

Yes, that's what I have understood, and as dump can take let's
say 20mn, that means that I'll have to put <fence_daemon post_fail_delay="1200">
but only in case of real problem, to let the failed node ending its dump.
But we can envisage this only if we already have had a system crash,
because of the long time to failover, otherwise we must keep 10s for
fence delay, that's why I propose the list above.
Right ?
Alain


From m.catanese at kinetikon.com  Wed Mar 22 16:17:16 2006
From: m.catanese at kinetikon.com (Matteo Catanese)
Date: Wed, 22 Mar 2006 17:17:16 +0100
Subject: [Linux-cluster] Re: More CS4 fencing fun
Message-ID: <04F16056-F674-49C9-892D-D8A4851DB05B@kinetikon.com>

On Tue, 2006-03-07 at 17:04 +0100, Matteo Catanese wrote:


 >Fencing never completes because iLO does not have power.  This an
 >architectural limitation to using iLO (or IPMI, actually) in a cluster
 >environment as the sole fencing method.  Compare to RSA - which can  
have
 >its own external power supply - even though it is an integrated  
solution
 >like iLO.

To me this is a fence_ilo limitation


 >With redundant power supplies, the expectation is that different
 >circuits (or preferably - different power sources entirely) are used,
 >which should make the tested case significantly less likely to occur.

Yes but i want a NSPOF cluster, not a less_likely_SPOF one


 >iLO being unreachable means iLO is unreachable, and assumptions as to
 >why should probably not be limited to lack of power.  Routing  
problems,
 >bad network cable, disconnected cable, and the occasional infinite
 >iLO-DHCP loop will all make iLO unreachable, but in no way confirm  
that
 >the node is dead.

We are always talking about avoiding _single point of failure_, not  
multiple ones.

My ILO_IP_ADDRESSES are static so no infinite dhcp loop

I have bonded (mode 1) heartbeat channel on 2 separate bridged  
switches (powered by 2 different powersupply) so if  one node does  
not reach the other one _AND_ fence_ilo fails, this means (by a SPOF  
point of view) that the other node had power failure.

So please at least for fence_ilo allow some parameter to let fence  
spit out a warning and unlock the cluster service


Matteo


From Matthew.Patton.ctr at osd.mil  Wed Mar 22 17:34:03 2006
From: Matthew.Patton.ctr at osd.mil (Patton, Matthew F, CTR, OSD-PA&E)
Date: Wed, 22 Mar 2006 12:34:03 -0500
Subject: [Linux-cluster] Re: gfs + nfsd crash
Message-ID: <D8063DF686D10247B0A49D01271285690CE91D3A@osdn06.osd.mil>

Classification: UNCLASSIFIED

sorry, I'm going thru archives...

Upon reading the changelog, why shouldn't we fix NFSD which was acting
carelessly, instead of hacking GFS (and with a sketchy assumption to boot)
to handle NFSD's broken behavior? 

------
Changes by:	bmarzins sourceware org	2006-03-08 20:47:09
Modified files:
	gfs-kernel/src/gfs: ops_inode.c

Log message:
 Really gross hack!!!
 This is a workaround for one of the bugs the got lumped into 166701. It
 Breaks POSIX behavior in a corner case to avoid crashing... It's icky.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060322/6cc0c0fb/attachment.htm>

From Matthew.Patton.ctr at osd.mil  Wed Mar 22 18:22:21 2006
From: Matthew.Patton.ctr at osd.mil (Patton, Matthew F, CTR, OSD-PA&E)
Date: Wed, 22 Mar 2006 13:22:21 -0500
Subject: [Linux-cluster] One big GFS or a handful of smaller ones?
Message-ID: <D8063DF686D10247B0A49D01271285690CE91D3B@osdn06.osd.mil>

Classification: UNCLASSIFIED

With CLVM on top of a couple of SAN volumes, I can grow the underlying VG
and LVs. Say I want HOME, VMImage, APPS, and CONTENT all on GFS so every
node can share. Is there a locking benefit or a performance benefit by
having one big LV (and therefore one GFS filesystem) with subdirectories to
organize things versus having multiple LVs each formatted GFS? In other
words, each node can have 1 or 4 GFS mounts. Kicking a node over means 1 or
alternatively all 4 GFS filesystems have to go into recovery. At first blush
the latter doesn't sound like a very good idea with more things to go wrong
and time out etc. Or is having multiple GFS mounts really not so bad and the
ability to selectively grow/shrink and unmount a filesystem a quite useful
benefit?

thoughts?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060322/176246c9/attachment.htm>

From gevery at vcommerce.com  Wed Mar 22 19:25:08 2006
From: gevery at vcommerce.com (Gary Every)
Date: Wed, 22 Mar 2006 12:25:08 -0700
Subject: [Linux-cluster] Unable to find the EMC devices when implementing
Message-ID: <1143055508.31944.47.camel@every.vcommerce.com>

I have two RHEL test boxes with HBA's attached to an EMC. I've formatted
them ext2 and mounted them - successfully, and identically between the
two machines


When using the cs-deploy-tool, when I try to add a server, I get an
error on my console that says:
scsi device with scsi_id '360060160c6971000ee25cc8c25b9da11' is
multipathed, not implemented, so removing

When I add the second server I get the same thing, and when the Shared
Devices window comes up, there is nothing in them. Is there a way that I
can implement multipathed scsi devices?


TIA,
G.~


From zhendershot at cranel.com  Wed Mar 22 19:46:53 2006
From: zhendershot at cranel.com (Hendershot, Zach)
Date: Wed, 22 Mar 2006 14:46:53 -0500
Subject: [Linux-cluster] 2 Node Cluster - SAN Fencing
Message-ID: <BD37DA8FEF7D8949A5731245FF66CB2BDA78A2@POSTOFFICE.cranel.local>

Hi,
    I have a 2 node cluster that I'm setting up fencing on (with
fence_brocade). As a test I bring down the heartbeat network. A few
seconds pass and BOTH nodes fence each other off of the SAN (via the 2nd
network interface on the machines). Should I put the brocade switch on
the heartbeat network, or is there a better way? Thanks.
 
--------------

Zach Hendershot
Software Engineer
Cranel, Incorporated.
Phone: 614.318.4288
Fax: 614.431.8388
Email: zhendershot at cranel.com

Technology. Integrity. Focus.

 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060322/f09e9ea8/attachment.htm>

From bmarzins at redhat.com  Wed Mar 22 19:18:56 2006
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Wed, 22 Mar 2006 13:18:56 -0600
Subject: [Linux-cluster] RAIDing a CLVM?
In-Reply-To: <442130A6.3000106@veles.ru>
References: <D8063DF686D10247B0A49D01271285690CE91D33@osdn06.osd.mil>
	<442080FC.4050908@daltonfirth.co.uk> <442130A6.3000106@veles.ru>
Message-ID: <20060322191855.GA7326@ether.msp.redhat.com>

On Wed, Mar 22, 2006 at 02:10:30PM +0300, Denis Medvedev wrote:
> 
> A better approach is to export not an GNBD but an iSCSI device from DRBD.
> 

I would definitely go with DRBD for this setup. If I understand this setup
correctly, there is a data corruption possibility.

If you have two machines doing raid1 over a local device and a gnbd device,
you have the problem were if machine A dies after it has written to it's local
disk but not the disk on machine B. The mirror is out of sync. GNBD doesn't
do anything to help with that, and md on machine B doesn't know anything about
the state of machine A, so it can't correct the problem. So you are left with
an out of sync mirror, which is BAD. DRBD was made for exactly this setup,
and will (I believe) automagically handle this correctly.

-Ben
 
> James Firth wrote:
> 
> 
> >Patton, Matthew F, CTR, OSD-PA&E wrote:
> >
> >>I can't think of a way to combine (C)LVM, GFS, GNBD, and MD (software 
> >>RAID) and make it work unless just one of the nodes becomes the MD 
> >>master and then just exports it via NFS. Can it be done? Do 
> >>commercial options exist to pull off this trick?
> >
> >
> >Hi,
> >
> >We're working on the same problem. We have tried two approaches, both 
> >with their own fairly serious drawbacks.
> >
> >Our goal was a 2-node all-in-one HA mega server, providing all office 
> >services from one cluster, and with no single point of failure.
> >
> >The first uses a raid master for each pair.  Each member of the pair 
> >exports a disk using GNBD.  The pair negotiate a master using CMAN, 
> >and that master assembles a RAID device using one GNBD import, plus 
> >one local disk, and then exports it using NFS, or in the case of GFS 
> >being used, exports the assembled raid device via a third GNBD export.
> >
> >Our trick here was each node exported it's contributory disk, using 
> >GNDB, by default, so long as at east one other node was active (quorum 
> >> 1), knowing only one master would ever be active. This significantly 
> >reduced complexity.
> >
> >Problems are:
> > - GNDB instabilities cause frequent locks and crashes, especially 
> >busying DLM (suspected).
> > - NFS export scheme also causes locks and hangs to NFS clients on 
> >failover *IF* a member of the pair then subsequently imports and an 
> >NFS client, as needed in some of our mega-server ideas.
> > - NFS export is not too useful when file locking is important, e.g. 
> >subversion, procmail etc (yes, procmail, if your mail server is also 
> >your Samba homes server).  You have to dell mailproc to use 
> >alternative mailbox locking else mailboxes get corrupted.
> > - GFS on assembled device with GNDB export scheme works best, but 
> >still causes locks and hangs.  Note also an exporting client must NOT 
> >import it's own exported GNBD volume, so there is no symmetry between 
> >the pair, and it's quite difficult to manage.
> >
> >
> >
> >Our second approach is something we've just embarked on, and so far is 
> >proving more successful, using DRBD.  DRBD is used to create a 
> >mirrored pair of volumes, a bit like GNBD+MD as above.
> >
> >The result is a block device accessible from both machines, but the 
> >problem is that only one member of the pair is writable (master), and 
> >the other is a read-only mount.
> >
> >If the master server dies, the remaining DRBD becomes the master, and 
> >becomes writable.  When the dead node recovers, the recovered node 
> >becomes a slave, read-only.
> >
> >The problem is with the read-only aspect, so you still need to have an 
> >exporting mechanism for the assembled DRBD volume running on the DRBD 
> >master.  We plan to do this via GNBD export (GFS FS installed).
> >
> >That's where the complexity comes in - as the DRBD negotiation appears 
> >to be totally independent of cluster control suite, and so we're 
> >having to use customizations to start the exporting daemon on the DRBD 
> >master.
> >
> >
> >Conclusions
> >---
> >
> >From all we've learned to date, it still seems a dedicated file server 
> >or SAN approach is necessary to maintain availability.
> >
> >Either of the above schemes would work fairly well if we were just 
> >building a HA storage component, because most of the complexities 
> >we've encountered come about when the shared storage device is used by 
> >services on the same cluster nodes.
> >
> >Most, if not all of what we've done so far is not suitable for a 
> >production environment, as it just increases the coupling between 
> >nodes, and therefore increases the chance of a cascade failure of the 
> >cluster.  In all seriousness I believe a single machine with RAID-1 
> >pair has a higher MTBF than any of our experiments.
> >
> >Many parts of the CCS/GFS suite so far released have serious issues 
> >when used in non-standard configurations.  For example, exception 
> >handling we've encountered usually defaults to "while (1) { retry(); 
> >sleep(1); }"
> >
> >I've read last year about plans for GFS mirroring from RedHat, and 
> >haven't found much else since.  If anyone knows more I'd love to hear.
> >
> >It also appears that the guys behind DRBD want to further develop 
> >their mirroring so that both volumes can be writable, in which case 
> >you can just stick GFS on the assembled device, and run whichever 
> >exporting method you like as a normal cluster service.
> >
> >
> >
> >James
> >
> >www.daltonfirth.co.uk
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >-- 
> >Linux-cluster mailing list
> >Linux-cluster at redhat.com
> >https://www.redhat.com/mailman/listinfo/linux-cluster
> >
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From teigland at redhat.com  Wed Mar 22 20:09:09 2006
From: teigland at redhat.com (David Teigland)
Date: Wed, 22 Mar 2006 14:09:09 -0600
Subject: [Linux-cluster] Running GFS without fencing and maybe locking ; -)
In-Reply-To: <44216950.2030909@eshine.de>
References: <44216950.2030909@eshine.de>
Message-ID: <20060322200909.GC12284@redhat.com>

On Wed, Mar 22, 2006 at 04:12:16PM +0100, Arnd wrote:
> Our cluster consists of 4 Webservers and one management server. This
> management server is the only server which needs write access to the GFS
> (for example changing the html-files):
> 
> 	webserver1 - webserver4: mount GFS -o ro (readonly)
> 	mgm-server1: mount GFS -o rw (write access)
> 
> My idea:
> 
> If one of the webserver fails then the cluster will issue an
> fence_script with an exitcode "0". The node is fenced by the cluster and
> while the filesystem wasn't mounted rw it cannot be destroyed.
> 
> The only possible way the filesystem can get corrupted is when the
> management-server fails.
> 
> So is it possible to run the GFS with 4 readonly nodes and only one node
>  which should be taken care if it fails? How does locking (lock_dlm)
> work  in this case? I suppose that it only needs to take care for any
> writes to the filesystem but here I might be wrong?!

In the next version of GFS we're adding a "spectator" mount option which
is very similar to "ro".  Spectator mounts do not claim a journal when
they mount, cannot be converted to "rw", and don't need to be fenced if
they fail.

If you guarantee that your ro mounts will never be remounted to rw, then
it should be safe to not fence them when they fail (since they'll never
under any circumstance make any writes to the fs).

The problem is not related to fencing, but comes when the node mounting rw
fails.  There are only ro mounts left and none of them can recover the
journal of the failed node because they can't write.  What these ro mounts
do next is the important part, and there appears to be a shortcoming in
the current code that I just noticed.  It looks like the ro mounts will
continue reading the fs normally without the journal of the failed rw node
ever being replayed.  They'll likely come across some inconsistent part of
the fs and panic/withdraw.  It shouldn't be difficult to test this.

It's not clear yet how difficult this problem will be to fix in the
current stable code.

> Can I use lock_nolock (when making the filesystem) if only one node is
> writing to the GFS?

No, readonly nodes still need to do all the necessary locking for reading.

Dave


From teigland at redhat.com  Wed Mar 22 20:16:57 2006
From: teigland at redhat.com (David Teigland)
Date: Wed, 22 Mar 2006 14:16:57 -0600
Subject: [Linux-cluster] CS4 Update 2 / problem with systems dump ?
In-Reply-To: <44216C39.4090906@bull.net>
References: <442103D5.90806@bull.net> <20060322151234.GB12284@redhat.com>
	<44216C39.4090906@bull.net>
Message-ID: <20060322201657.GD12284@redhat.com>

On Wed, Mar 22, 2006 at 04:24:41PM +0100, Alain Moulle wrote:

> Yes, that's what I have understood, and as dump can take let's say 20mn,
> that means that I'll have to put <fence_daemon post_fail_delay="1200">
> but only in case of real problem, to let the failed node ending its dump.
> But we can envisage this only if we already have had a system crash,
> because of the long time to failover, otherwise we must keep 10s for
> fence delay, that's why I propose the list above.
> Right ?

It sounds like post_fail_delay won't work for you then; it's a static
value, you can't go changing it when you want a larger delay.

Dave


From forums at daltonfirth.co.uk  Wed Mar 22 21:34:31 2006
From: forums at daltonfirth.co.uk (James Firth)
Date: Wed, 22 Mar 2006 21:34:31 +0000
Subject: [Linux-cluster] RAIDing a CLVM?
In-Reply-To: <20060322191855.GA7326@ether.msp.redhat.com>
References: <D8063DF686D10247B0A49D01271285690CE91D33@osdn06.osd.mil>	<442080FC.4050908@daltonfirth.co.uk>
	<442130A6.3000106@veles.ru>
	<20060322191855.GA7326@ether.msp.redhat.com>
Message-ID: <4421C2E7.70001@daltonfirth.co.uk>

Benjamin Marzinski wrote:
> If you have two machines doing raid1 over a local device and a gnbd device,
> you have the problem were if machine A dies after it has written to it's local
> disk but not the disk on machine B. The mirror is out of sync. 

This is recoverable and MD can handle this.  Exactly the same can happen 
on a single machine.  If disk A dies during a write operation and causes 
a power spike that crashes or bus-locks the machine, on recovery disk B 
is in the same state as you describe.

You will lose the last file system transaction, but the journal on the 
surviving node saves the day.

Besides, the mirror is always out of sync when a node recovers.  The 
mirror operation is always towards the last disk to be "mdadm --add"ed

> DRBD was made for exactly this setup, and will (I believe) automagically handle this correctly.

Quite right, I agree.  The major problem with DRBD is that cluster 
manager has no control over the active/standby state of the disks.  So a 
custom exporting daemon is needed that moves with DRBD master.  We are 
finding it gets a bit complex on failover.

Regards,
James


From bmarzins at redhat.com  Thu Mar 23 03:34:27 2006
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Wed, 22 Mar 2006 21:34:27 -0600
Subject: [Linux-cluster] RAIDing a CLVM?
In-Reply-To: <20060322191855.GA7326@ether.msp.redhat.com>
References: <D8063DF686D10247B0A49D01271285690CE91D33@osdn06.osd.mil>
	<442080FC.4050908@daltonfirth.co.uk> <442130A6.3000106@veles.ru>
	<20060322191855.GA7326@ether.msp.redhat.com>
Message-ID: <20060323033426.GC7326@ether.msp.redhat.com>

On Wed, Mar 22, 2006 at 01:18:56PM -0600, Benjamin Marzinski wrote:
> On Wed, Mar 22, 2006 at 02:10:30PM +0300, Denis Medvedev wrote:
> > 
> > A better approach is to export not an GNBD but an iSCSI device from DRBD.
> > 
> 
> I would definitely go with DRBD for this setup. If I understand this setup
> correctly, there is a data corruption possibility.
> 
> If you have two machines doing raid1 over a local device and a gnbd device,
> you have the problem were if machine A dies after it has written to it's local
> disk but not the disk on machine B. The mirror is out of sync. GNBD doesn't
> do anything to help with that, and md on machine B doesn't know anything about
> the state of machine A, so it can't correct the problem. So you are left with
> an out of sync mirror, which is BAD. DRBD was made for exactly this setup,
> and will (I believe) automagically handle this correctly.

This is ignoring the obvious issue that after machine A is dead, B will
presumeably keep writing to it's device, so it will obviously be out of sync.
And you probably knew that. It's been a long week.  But still, this sounds
exactly like what DRBD was designed for.

-Ben
 
> -Ben
>  
> > James Firth wrote:
> > 
> > 
> > >Patton, Matthew F, CTR, OSD-PA&E wrote:
> > >
> > >>I can't think of a way to combine (C)LVM, GFS, GNBD, and MD (software 
> > >>RAID) and make it work unless just one of the nodes becomes the MD 
> > >>master and then just exports it via NFS. Can it be done? Do 
> > >>commercial options exist to pull off this trick?
> > >
> > >
> > >Hi,
> > >
> > >We're working on the same problem. We have tried two approaches, both 
> > >with their own fairly serious drawbacks.
> > >
> > >Our goal was a 2-node all-in-one HA mega server, providing all office 
> > >services from one cluster, and with no single point of failure.
> > >
> > >The first uses a raid master for each pair.  Each member of the pair 
> > >exports a disk using GNBD.  The pair negotiate a master using CMAN, 
> > >and that master assembles a RAID device using one GNBD import, plus 
> > >one local disk, and then exports it using NFS, or in the case of GFS 
> > >being used, exports the assembled raid device via a third GNBD export.
> > >
> > >Our trick here was each node exported it's contributory disk, using 
> > >GNDB, by default, so long as at east one other node was active (quorum 
> > >> 1), knowing only one master would ever be active. This significantly 
> > >reduced complexity.
> > >
> > >Problems are:
> > > - GNDB instabilities cause frequent locks and crashes, especially 
> > >busying DLM (suspected).
> > > - NFS export scheme also causes locks and hangs to NFS clients on 
> > >failover *IF* a member of the pair then subsequently imports and an 
> > >NFS client, as needed in some of our mega-server ideas.
> > > - NFS export is not too useful when file locking is important, e.g. 
> > >subversion, procmail etc (yes, procmail, if your mail server is also 
> > >your Samba homes server).  You have to dell mailproc to use 
> > >alternative mailbox locking else mailboxes get corrupted.
> > > - GFS on assembled device with GNDB export scheme works best, but 
> > >still causes locks and hangs.  Note also an exporting client must NOT 
> > >import it's own exported GNBD volume, so there is no symmetry between 
> > >the pair, and it's quite difficult to manage.
> > >
> > >
> > >
> > >Our second approach is something we've just embarked on, and so far is 
> > >proving more successful, using DRBD.  DRBD is used to create a 
> > >mirrored pair of volumes, a bit like GNBD+MD as above.
> > >
> > >The result is a block device accessible from both machines, but the 
> > >problem is that only one member of the pair is writable (master), and 
> > >the other is a read-only mount.
> > >
> > >If the master server dies, the remaining DRBD becomes the master, and 
> > >becomes writable.  When the dead node recovers, the recovered node 
> > >becomes a slave, read-only.
> > >
> > >The problem is with the read-only aspect, so you still need to have an 
> > >exporting mechanism for the assembled DRBD volume running on the DRBD 
> > >master.  We plan to do this via GNBD export (GFS FS installed).
> > >
> > >That's where the complexity comes in - as the DRBD negotiation appears 
> > >to be totally independent of cluster control suite, and so we're 
> > >having to use customizations to start the exporting daemon on the DRBD 
> > >master.
> > >
> > >
> > >Conclusions
> > >---
> > >
> > >From all we've learned to date, it still seems a dedicated file server 
> > >or SAN approach is necessary to maintain availability.
> > >
> > >Either of the above schemes would work fairly well if we were just 
> > >building a HA storage component, because most of the complexities 
> > >we've encountered come about when the shared storage device is used by 
> > >services on the same cluster nodes.
> > >
> > >Most, if not all of what we've done so far is not suitable for a 
> > >production environment, as it just increases the coupling between 
> > >nodes, and therefore increases the chance of a cascade failure of the 
> > >cluster.  In all seriousness I believe a single machine with RAID-1 
> > >pair has a higher MTBF than any of our experiments.
> > >
> > >Many parts of the CCS/GFS suite so far released have serious issues 
> > >when used in non-standard configurations.  For example, exception 
> > >handling we've encountered usually defaults to "while (1) { retry(); 
> > >sleep(1); }"
> > >
> > >I've read last year about plans for GFS mirroring from RedHat, and 
> > >haven't found much else since.  If anyone knows more I'd love to hear.
> > >
> > >It also appears that the guys behind DRBD want to further develop 
> > >their mirroring so that both volumes can be writable, in which case 
> > >you can just stick GFS on the assembled device, and run whichever 
> > >exporting method you like as a normal cluster service.
> > >
> > >
> > >
> > >James
> > >
> > >www.daltonfirth.co.uk
> > >
> > >
> > >
> > >
> > >
> > >
> > >
> > >
> > >
> > >
> > >
> > >-- 
> > >Linux-cluster mailing list
> > >Linux-cluster at redhat.com
> > >https://www.redhat.com/mailman/listinfo/linux-cluster
> > >
> > 
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From mbhabani at iitk.ac.in  Thu Mar 23 06:01:05 2006
From: mbhabani at iitk.ac.in (Bhabani Shankar Mallik)
Date: Thu, 23 Mar 2006 11:31:05 +0530
Subject: [Linux-cluster] HCL Opteron + RHEL clusters
Message-ID: <A29344AF7996434796ABA9C9BEFC99C00EE518@xchange.cc.iitk.ac.in>


Dear All,

I am going to setup a 48 node opteron(Dual CPUs) cluster 
with RHEL. Can anyone provide me some information how to set up ?
(some links, cookbooks ??)
I want to make use of Oracle Real Application support.
Step by step guide will be highly appreciated.

PS : In India HCL is supplying Linux cluster with such configuration.
Anyone from India has some experience, please inform me.

Thanks,

Bhabani 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 2612 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060323/cfcbf8b5/attachment.bin>

From m_list at eshine.de  Thu Mar 23 09:42:00 2006
From: m_list at eshine.de (Arnd)
Date: Thu, 23 Mar 2006 10:42:00 +0100
Subject: [Linux-cluster] Running GFS without fencing and maybe locking ;-)
In-Reply-To: <20060322200909.GC12284@redhat.com>
References: <44216950.2030909@eshine.de> <20060322200909.GC12284@redhat.com>
Message-ID: <44226D68.50008@eshine.de>


David Teigland wrote:
> If you guarantee that your ro mounts will never be remounted to rw, then
> it should be safe to not fence them when they fail (since they'll never
> under any circumstance make any writes to the fs).
> 
> The problem is not related to fencing, but comes when the node mounting rw
> fails.  There are only ro mounts left and none of them can recover the
> journal of the failed node because they can't write.  What these ro mounts
> do next is the important part, and there appears to be a shortcoming in
> the current code that I just noticed.  It looks like the ro mounts will
> continue reading the fs normally without the journal of the failed rw node
> ever being replayed.  They'll likely come across some inconsistent part of
> the fs and panic/withdraw.  It shouldn't be difficult to test this.

OK, understood.

For this test I set up a 3 nodes cluster:

adnux2 and adnux3 have the GFS mounted readonly
adnux4 has GFS mounted with write access.

For the test I was copying files from /usr to the GFS and during this
copy process I powered off this host (adnux4).

Both nodes noticed the failed node and adnux2 fenced it. But as you
said, it was not possible to write the journal back:

/var/log/messages (adnux2):
---
Mar 23 10:02:45 adnux2 kernel: CMAN: removing node adnux4 from the
cluster : Missed too many heartbeats
Mar 23 10:02:46 adnux2 fenced[9559]: adnux4 not a cluster member after 0
sec post_fail_delay
Mar 23 10:02:46 adnux2 fenced[9559]: fencing node "adnux4"
Mar 23 10:02:46 adnux2 fenced[9559]: fence "adnux4" success
...
Mar 23 10:02:53 adnux2 kernel: GFS: fsid=adnuxCluster1:adnux.0: jid=2:
Trying to acquire journal lock...
Mar 23 10:02:53 adnux2 kernel: GFS: fsid=adnuxCluster1:adnux.0: jid=2:
Looking at journal...
Mar 23 10:02:53 adnux2 kernel: GFS: fsid=adnuxCluster1:adnux.0: jid=2:
Can't replay: read-only FS
Mar 23 10:02:53 adnux2 kernel: GFS: fsid=adnuxCluster1:adnux.0: jid=2:
Failed
Mar 23 10:02:55 adnux2 kernel: GFS: fsid=adnuxCluster1:adnux.0: jid=1:
Trying to acquire journal lock...
Mar 23 10:02:55 adnux2 kernel: GFS: fsid=adnuxCluster1:adnux.0: jid=1:
Looking at journal...
Mar 23 10:02:55 adnux2 kernel: GFS: fsid=adnuxCluster1:adnux.0: jid=1: Done
...
---

So even if I'm adding two hosts to the cluster with write access then
how can I control, that if one of these nodes fails, the other node with
write access is replaying the journal (and not the node which is doing
the fencing and has the filesystem mounted readonly)?


Ok, back to the tests. Both nodes (adnux2 and adnux3) weren't able to
access the filesystem after adnux4 failed:

adnux3 data # ls -l /home/data/gfs
ls: /home/data/gfs: Input/output error

None of the following steps helped:
- adnux4 rejoins the cluster
- adnux4 mounts the GFS with write access
- adnux4: unmounting filesystem and repaired the GFS with gfs_fsck

Only after mounting and remounting the filesystem on one of the nodes,
the access was possible again.

But to finish this now, I think we need these spectators and at least
two nodes which can be fenced (only GFS on the HBA, no Database...).

Thanks again, Dave.

Arnd


From s.bridgwater at sinergy.it  Thu Mar 23 10:08:23 2006
From: s.bridgwater at sinergy.it (Simon Bridgwater)
Date: Thu, 23 Mar 2006 11:08:23 +0100
Subject: [Linux-cluster] cluster across different storage devices
Message-ID: <bbaf2333e4349940a2b2445c81fda29b@sinergy.it>

Hi,
    I would like to know if the following setup is supported by Red Hat
Cluster Suite 4.

I want three nodes in failover with two nodes connected to one storage
unit and another node connected to another secondary storage unit. Each
node can be connected either by fibre channel or iscsi. The secondary
storage unit is replicated syncronously from the primary storage unit. 

If one of the two nodes connected to the primary storage fails then the
service fails-over to the second node connected to the primary storage
unit.
If the primary storage unit fails the whole cluster should fail over to
the one node connected to the secondary storage unit.

Is this kind of setup possibile ? They tell me that with other linux
clusters this type of setup is possible.

Simon Bridgwater
Sinergy


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060323/4e108475/attachment.htm>

From lhh at redhat.com  Thu Mar 23 15:05:46 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 23 Mar 2006 10:05:46 -0500
Subject: [Linux-cluster] 2 Node Cluster - SAN Fencing
In-Reply-To: <BD37DA8FEF7D8949A5731245FF66CB2BDA78A2@POSTOFFICE.cranel.local>
References: <BD37DA8FEF7D8949A5731245FF66CB2BDA78A2@POSTOFFICE.cranel.local>
Message-ID: <1143126346.25515.55.camel@ayanami.boston.redhat.com>

On Wed, 2006-03-22 at 14:46 -0500, Hendershot, Zach wrote:
> Hi,
>     I have a 2 node cluster that I'm setting up fencing on (with
> fence_brocade). As a test I bring down the heartbeat network. A few
> seconds pass and BOTH nodes fence each other off of the SAN (via the
> 2nd network interface on the machines). Should I put the brocade
> switch on the heartbeat network, or is there a better way? Thanks.
>  
> 

2-node clusters need power fencing.

-- Lon


From ben.yarwood at juno.co.uk  Thu Mar 23 17:32:31 2006
From: ben.yarwood at juno.co.uk (Ben Yarwood)
Date: Thu, 23 Mar 2006 17:32:31 -0000
Subject: [Linux-cluster] lvextend gone wrong
Message-ID: <024001c64e9f$c397f440$3964a8c0@WS076>

Setup : GFS 6.1, 3 Node cluster, Fedora Core 4

I was running lvextend on a cluster file system today and it failed saying
that it could not lock the file system. I did not save this message sorry
and now can't find it.

After this none of the other cluster nodes can access the file system that I
was trying to expand.

Looking at /proc/cluster/services I get the following messages on the noeds,
can anyone help sort this out please?

GFS Mount Group: "64mp3"                            10  11 recover 2 -
[1]
GFS Mount Group: "64mp3"                             0  11 join
S-1,80,3
[]

Cheers
Ben Yarwood


From lhh at redhat.com  Thu Mar 23 17:52:10 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 23 Mar 2006 12:52:10 -0500
Subject: [Linux-cluster] Re: More CS4 fencing fun
In-Reply-To: <04F16056-F674-49C9-892D-D8A4851DB05B@kinetikon.com>
References: <04F16056-F674-49C9-892D-D8A4851DB05B@kinetikon.com>
Message-ID: <1143136330.25515.211.camel@ayanami.boston.redhat.com>

Hi Matteo,

First off, you are correct.  Strictly from a "SPF protection / all other
failure scenarios are irrelevant" point of view, losing power -> fencing
failure is bad.

However, I hope I can convince you that this particular view is not the
right one to take in this case, but I doubt I will be able to.


On Wed, 2006-03-22 at 17:17 +0100, Matteo Catanese wrote:

> We are always talking about avoiding _single point of failure_, not  
> multiple ones.

We recover from several multi-point failures if there is a deterministic
way to do so.  Ex, sustaining 5 nodes failing in a 16-node cluster.

More so than NSPF, the cluster is designed to minimize uncertainty in
any failure case if possible - especially where data integrity is
concerned (i.e. fencing).

Given the above design goal, one can still very easily build NSPF
two-node clusters, but there are limitations on the hardware you can
use.  For example - 

* With iLO, you need redundant power supplies.

* With IPMI, you need redundant power supplies and an extra NIC.

* With single power supplies, you should use a remote power switch with
redundant power rails (where the internal electronics can run off of
either for full NSPF protection).  As of this writing, I am unaware of
any such thing available from any of the major IHVs.

* If redundant power supplies are not "redundant enough" in your
opinion, then you should probably use a redundant remote power switch as
noted above.


> So please at least for fence_ilo allow some parameter to let fence  
> spit out a warning and unlock the cluster service

Fencing, put simply, is a deterministic set of steps to take to
guarantee that a dead or misbehaving node can not (not "might not" or
"probably will not") access shared resources/partitions/storage.  It is
designed to have exactly two possible outcomes given a correctly
configured environment:

  - The node has been cut off from shared resources, or

  - Fencing the node has failed

If fencing fails, we retry forever.  Fencing failures are otherwise
unrecoverable.  The only way to recover from a particular fencing
failure is to provide a different fencing mechanism as a backup...


Ok, on how one could change the behavior...

>From a design perspective, if we were to change the behavior of fencing,
I would recommend changing it in fenced, not fence_ilo (e.g. give a
fenced a max_retries count or something), because once we do it for iLO,
we will have to do it for many other agents.  For example, most or all
of the supported APC switches only have a single (non-redundant) power
rail, so fence_apc would have to be changed too.


Here are some things you can do for your configuration:

(a) Add a human layer.  Add a manual fencing agent as a cascade to
detect this particular problem.  This is, in my opinion, the least
likely to solve your problem in the way you want, but if you consider a
power failure of a node fairly unlikely.


(b) Make fencing not fail.  Edit /sbin/fence_ilo and make it do what you
need.


(c) Roll your own fencing agent and add it as a cascade which will do
specifically what you want it to if iLO fencing fails.  For
example, /sbin/fence_dontcare.

#!/bin/bash
logger -p "daemon.emerg" "WARNING - iLO failed; data integrity may be
compromised, but continuing anyway."
echo "Ruh roh!" | mail my at email.addr
exit 0

Don't forget to add fence references to your cluster.conf.


(d) Buy a redundant external power switch as a cascade (or primary
fencing method) in the case that iLO is unreachable.  Here is a WTI NPS
on eBay for $125:

http://cgi.ebay.com/WTI-NPS-115-Remote-Telnet-Power-Reboot-NIB-Switch_W0QQitemZ9701395350QQcategoryZ11175QQssPageNameZWDVWQQrdZ1QQcmdZViewItem

The NPS has two power rails, and the internal electronics can run off of
either.  I.E., you can actually build a NSPF configuration with nodes
w/o redundant power supplies - without having to weaken any guarantees
about data integrity.  (Note: the NPS 115 has is past its end of life;
WTI has a replacement, but it will cost more than $125.).

-- Lon


From teigland at redhat.com  Thu Mar 23 18:05:49 2006
From: teigland at redhat.com (David Teigland)
Date: Thu, 23 Mar 2006 12:05:49 -0600
Subject: [Linux-cluster] Deadlock handling in GFS?
In-Reply-To: <7e07df7decbc.7decbc7e07df@vsnl.net>
References: <7e07df7decbc.7decbc7e07df@vsnl.net>
Message-ID: <20060323180549.GF32080@redhat.com>

On Mon, Mar 20, 2006 at 11:23:57PM +0500, renapte at vsnl.net wrote:
> Can anybody tell me where I can get details about how GFS handles deadlocks?

GFS avoids deadlocks among its own internal glocks by always requesting
them in the same order.  GFS resolves conversion deadlocks with some help
from the lock manager (one of gfs's locks is released temporarily).

Dave


From teigland at redhat.com  Thu Mar 23 18:19:02 2006
From: teigland at redhat.com (David Teigland)
Date: Thu, 23 Mar 2006 12:19:02 -0600
Subject: [Linux-cluster] Running GFS without fencing and maybe locking ; -)
In-Reply-To: <44226D68.50008@eshine.de>
References: <44216950.2030909@eshine.de> <20060322200909.GC12284@redhat.com>
	<44226D68.50008@eshine.de>
Message-ID: <20060323181902.GG32080@redhat.com>

On Thu, Mar 23, 2006 at 10:42:00AM +0100, Arnd wrote:
> So even if I'm adding two hosts to the cluster with write access then
> how can I control, that if one of these nodes fails, the other node with
> write access is replaying the journal (and not the node which is doing
> the fencing and has the filesystem mounted readonly)?

This looks like another problem in gfs.  We need can't have a readonly
node potentially preventing a rw node from doing recovery.  (The ro node
shouldn't be taking the journal lock if it can't eventually do the
recovery.)

> Ok, back to the tests. Both nodes (adnux2 and adnux3) weren't able to
> access the filesystem after adnux4 failed:
> 
> adnux3 data # ls -l /home/data/gfs
> ls: /home/data/gfs: Input/output error

This looks like the fs might have been withdrawn (shutdown) after gfs
found some inconsistency.

Dave


From lhh at redhat.com  Thu Mar 23 19:30:36 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 23 Mar 2006 14:30:36 -0500
Subject: [Linux-cluster] Node Failure Detection Problems
In-Reply-To: <441DC7F7.2@daltonfirth.co.uk>
References: <441DC7F7.2@daltonfirth.co.uk>
Message-ID: <1143142236.25515.218.camel@ayanami.boston.redhat.com>

On Sun, 2006-03-19 at 21:07 +0000, James Firth wrote:

> 
> Second question: clustat continues to hang for around 10 more seconds - 
> 15 in total, before clurgmgrd does a state change.

Hrm, it might be *because* because you are running clustat at the time
of the failure.

Could you put this in bugzilla?

-- Lon


From ptader at fnal.gov  Thu Mar 23 21:12:29 2006
From: ptader at fnal.gov (Paul Tader)
Date: Thu, 23 Mar 2006 15:12:29 -0600
Subject: [Linux-cluster] Shutdown/startup
Message-ID: <44230F3D.2050409@fnal.gov>

So how does one properly shutdown and/or start up a GFS 6.1 cluster. 
Documentation on the procedures are surprisingly sparse.  For example, 
what I've experienced during shutdown are messages like:

CMAN: sendmsg failed: -101
CMAN: we are leaving the cluster.

from the node shutting down while the the other nodes report that this 
node needs to be fenced off.  What?!  Fenced during a controlled 
shutdown?  Is that correct?  During a complete cluster shutdown, of 
course quorum, at some point is lost, freezing the nodes that have not 
completely shutdown forcing a hard reset on those nodes.  I'm probably 
missing something trivial here, but this doesn't seem right.

Start up also suffers from the "quorum" problem, but it seems to 
recovery once enough nodes are up.


Paul
-- 
===========================================================================
Paul Tader  <ptader at fnal.gov> Computing Div/CSS Dept
Fermi National Accelerator Lab; PO Box 500 MS 369 Batavia, IL 60510-0500


From treddy at rallydev.com  Thu Mar 23 21:50:03 2006
From: treddy at rallydev.com (Tarun Reddy)
Date: Thu, 23 Mar 2006 14:50:03 -0700
Subject: [Linux-cluster] Using LABEL in "Device Special File" as a resource
Message-ID: <1FC9ADB0-0E74-4BDD-87FF-51E2FF1CE3D0@rallydev.com>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Under RHEL3 clustering, I'm trying to mount a partition using the  
LABEL since iscsi bindings have a tendency to switch around.

Using Google to search the archives, I found this message:
http://www.redhat.com/archives/linux-cluster/2005-October/msg00129.html

but this doesn't seem to work with Redhat Clustering on RHEL3. Is  
there another way of doing this?

mount LABEL=/www /www

does work from the command line so maybe this is just a redhat-config- 
cluster limitation.

Any help would be appreciated.

Thanks,
Tarun
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.2 (Darwin)

iD8DBQFEIxgLTaG0LVoRoBYRAvoTAJ459wnBqNL3sc5ZEFQmmJ/jl2kdDACeOxkd
r2y0DoSqdNQFoRbpYRt4+vc=
=lIZa
-----END PGP SIGNATURE-----


From ben.yarwood at juno.co.uk  Fri Mar 24 00:26:16 2006
From: ben.yarwood at juno.co.uk (Ben Yarwood)
Date: Fri, 24 Mar 2006 00:26:16 -0000
Subject: [Linux-cluster] Can't write to file systems
Message-ID: <028301c64ed9$90c49570$3964a8c0@WS076>


I am really stumped and could do with some help.

I have a 3 node gfs cluster running gfs 6.1 nad it has started to behave
very strangely after I had some problems earlier today expanding one of the
file systems.  

At the moment all the nodes are in the cluser and it is quorate, and all the
gfs file systems are mounted.  Reading from the gfs file systems works fine
but anything that tries to write to them causes the file system to hang.  

I see a lot of gfs_recoverd and dlm_recoverd processes running.  Do these
have something to do with it?

I'm getting a little desperate to sort this out.  Any help would be gretly
appericated


/cat/proc/cluster is:


Service          Name                              GID LID State     Code
Fence Domain:    "default"                           1   2 run       -
[3 1 2]

DLM Lock Space:  "clvmd"                             2   3 run       -
[3 1 2]

DLM Lock Space:  "wav"                              39   4 run       -
[3 1 2]

DLM Lock Space:  "320mp3"                           41   6 run       -
[3 1 2]

DLM Lock Space:  "192mp3"                           43   8 run       -
[3 1 2]

DLM Lock Space:  "64mp3"                            45  10 run       -
[3 1 2]

DLM Lock Space:  "images"                           47  12 run       -
[3 1 2]

DLM Lock Space:  "data"                             49  14 run       -
[3 1 2]

DLM Lock Space:  "ftp"                              51  16 run       -
[3 1 2]

DLM Lock Space:  "customers"                        53  18 run       -
[3 1 2]

DLM Lock Space:  "Magma"                            56  21 run       -
[2 1]

GFS Mount Group: "wav"                              40   5 run       -
[3 1 2]

GFS Mount Group: "192mp3"                           44   9 run       -
[3 1 2]

GFS Mount Group: "64mp3"                            46  11 run       -
[3 1 2]

GFS Mount Group: "images"                           48  13 run       -
[3 1 2]

GFS Mount Group: "data"                             50  15 run       -
[3 1 2]

GFS Mount Group: "ftp"                              52  17 run       -
[3 1 2]

GFS Mount Group: "customers"                        54  19 run       -
[3 1 2]

User:            "usrm::manager"                    55  20 run       -
[2 1]


Cheers

Ben Yarwood


From suvankar_moitra at yahoo.com  Fri Mar 24 04:46:41 2006
From: suvankar_moitra at yahoo.com (SUVANKAR MOITRA)
Date: Thu, 23 Mar 2006 20:46:41 -0800 (PST)
Subject: [Linux-cluster] HCL Opteron + RHEL clusters
In-Reply-To: <A29344AF7996434796ABA9C9BEFC99C00EE518@xchange.cc.iitk.ac.in>
Message-ID: <20060324044641.47618.qmail@web52305.mail.yahoo.com>

hi Bhabani ,

I did it RHEL 4 and cluster suite 4 and oracle 10g in
2 node cluster.

regards

suvankar

kolkata, india

--- Bhabani Shankar Mallik <mbhabani at iitk.ac.in>
wrote:

> 
> Dear All,
> 
> I am going to setup a 48 node opteron(Dual CPUs)
> cluster 
> with RHEL. Can anyone provide me some information
> how to set up ?
> (some links, cookbooks ??)
> I want to make use of Oracle Real Application
> support.
> Step by step guide will be highly appreciated.
> 
> PS : In India HCL is supplying Linux cluster with
> such configuration.
> Anyone from India has some experience, please inform
> me.
> 
> Thanks,
> 
> Bhabani 
> > --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
>
https://www.redhat.com/mailman/listinfo/linux-cluster


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From phung at cs.columbia.edu  Fri Mar 24 11:19:00 2006
From: phung at cs.columbia.edu (Dan B. Phung)
Date: Fri, 24 Mar 2006 06:19:00 -0500 (EST)
Subject: [Linux-cluster] Shutdown/startup
In-Reply-To: <44230F3D.2050409@fnal.gov>
Message-ID: <Pine.LNX.4.44.0603240617420.2898-100000@minsk.clic.cs.columbia.edu>

I don't know if it's correct....but I have init scripts that 
shut down the services in order:

blade07:phung:) cat /etc/init.d/gfs_all
#!/bin/sh

case "$1" in
        start|restart)
                for i in ccsd cman fenced clvmd_init gfs; do
                        sh /etc/init.d/$i start
                done
                ;;
        stop|reload|force-reload)
                for i in gfs clvmd_init fenced cman ccsd; do
                        sh /etc/init.d/$i stop
                done
                ;;
    *)
        echo $"Usage: $0 {start|stop}"
        ;;
esac


On 23, Mar, 2006, Paul Tader declared:

> So how does one properly shutdown and/or start up a GFS 6.1 cluster. 
> Documentation on the procedures are surprisingly sparse.  For example, 
> what I've experienced during shutdown are messages like:
> 
> CMAN: sendmsg failed: -101
> CMAN: we are leaving the cluster.
> 
> from the node shutting down while the the other nodes report that this 
> node needs to be fenced off.  What?!  Fenced during a controlled 
> shutdown?  Is that correct?  During a complete cluster shutdown, of 
> course quorum, at some point is lost, freezing the nodes that have not 
> completely shutdown forcing a hard reset on those nodes.  I'm probably 
> missing something trivial here, but this doesn't seem right.
> 
> Start up also suffers from the "quorum" problem, but it seems to 
> recovery once enough nodes are up.
> 
> 
> Paul
> 

-- 
email:  phung at cs.columbia.edu
phone:  646-775-6090
fax:    212-666-0140
office: CS Dept. 520, 1214 Amsterdam Ave., MC 0401, New York, NY 10027


From m.catanese at kinetikon.com  Fri Mar 24 10:06:22 2006
From: m.catanese at kinetikon.com (Matteo Catanese)
Date: Fri, 24 Mar 2006 11:06:22 +0100
Subject: [Linux-cluster] Re: More CS4 fencing fun
Message-ID: <6988786F-0794-4481-B6D1-95B51B056E73@kinetikon.com>

Hi Lon,
you mail is "music" for my ears :D

I will try your /sbin/fence_dontcare immediately.

I will, anyway, try to explain myself better because english is not  
my main language.

I understand that cluster suite is also about multiple fail  
protection and date integrity but our goal is having  a 100% NSPOF  
cluster, i dont want to be interrupted in weekends when i play my  
favourite video game (WOW) just because ONE component broke and all  
cluster hung :-)

Sure our hardware configuration can sustain also some multi-point  
failure, but NSPOF is our mail goal

We have almost everything redundant.

Every server have dual power supplies connected to independent power  
source, dual nic, internal HD are mirrored with an hot spare, 2 FC  
cards to connect to a MSA 1000, with redundant controllers and  
redundant power supply connected to independent power source too.

On msa1000 we have a raid 5 with hot spare.

We have all this things and it's really frustrating for us that if  
active node's  mainboard fails, for shout circuit or too high  
temperature or some vital component failure or whatever, then all hungs.

About  WTI :

In my case WTI should be useful only in case of multiple failure, for  
example both network switch fails so heartbeat fails and ilo fails  
too  and with /sbin/fence_dontcare i will have corruption. Is this  
correct ?

I will need a supplemental NIC for every server to connect to WTI,  
but since WTI have only one ethernet port i will need  a separate hub  
or switch to connect to it , or i can connect one server to the  
ethernet port and another one to the serial port? Can i manage both  
serial and ethernet port ?


Matteo


From jstoner at opsource.net  Fri Mar 24 16:22:58 2006
From: jstoner at opsource.net (Jeff Stoner)
Date: Fri, 24 Mar 2006 16:22:58 -0000
Subject: [Linux-cluster] Kernel versions
Message-ID: <38A48FA2F0103444906AD22E14F1B5A3030D3965@mailxchg01.corp.opsource.net>

Am I nuts?

Running RHEL 4 (update 2) with latest kernel (2.6.9-34.Elsmp) on x86_64.
I installed the cman-kernel-smp-2.6.9-36.0.x64_64.rpm (and the
corresponding dlm kernel module rpm).

The modules got put into my original kernel's modules directory
(/lib/modules/2.6.9-11.Elsmp/kernel/cluster) and updated the symbol
tables for that kernel.

Modprobe fails to load the cman and dlm modules...because they aren't in
the correct directory (and yes, I rebooted after installing the latest
kernel, so it was the running new kernel when I installed the clustering
RPMs.)

I hacked the cman init script by adding a "--set-version 2.6.9-11.Elsmp)
in the modprobe commands. Cluster starts fine. Going to be testing the
Dickens out of it today and next week.

So...am I nuts?

--Jeff
SME - UNIX
OpSource Inc.

PGP Key ID 0x6CB364CA 


From mwill at penguincomputing.com  Fri Mar 24 16:44:49 2006
From: mwill at penguincomputing.com (Michael Will)
Date: Fri, 24 Mar 2006 08:44:49 -0800
Subject: [Linux-cluster] Re: More CS4 fencing fun
Message-ID: <433093DF7AD7444DA65EFAFE3987879C02AE4D@jellyfish.highlyscyld.com>

 
Yes, and what if the NPS fails? Is it less likely than a single power
supply to fail?
Is it less likely even than two redundand power supplies to fail? Is it
somewhere inbetween?

I am vary of adding more pieces to an HA solution, the more complex it
gets the more chance
for it to fail in an unexpected and not foreseen way.

My recommendation for extreme HA is to always use controlled PDU's and
to not rely on IPMI to do
reliable fencing because I heard of machines not beeing reachable by
IPMI after a kernel panic
that confused the NIC/BMC configuration.

Michael

-----Original Message-----
From: linux-cluster-bounces at redhat.com
[mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lon Hohberger
Sent: Thursday, March 23, 2006 9:52 AM
To: linux clustering
Subject: Re: [Linux-cluster] Re: More CS4 fencing fun

Hi Matteo,

First off, you are correct.  Strictly from a "SPF protection / all other
failure scenarios are irrelevant" point of view, losing power -> fencing
failure is bad.

However, I hope I can convince you that this particular view is not the
right one to take in this case, but I doubt I will be able to.


On Wed, 2006-03-22 at 17:17 +0100, Matteo Catanese wrote:

> We are always talking about avoiding _single point of failure_, not 
> multiple ones.

We recover from several multi-point failures if there is a deterministic
way to do so.  Ex, sustaining 5 nodes failing in a 16-node cluster.

More so than NSPF, the cluster is designed to minimize uncertainty in
any failure case if possible - especially where data integrity is
concerned (i.e. fencing).

Given the above design goal, one can still very easily build NSPF
two-node clusters, but there are limitations on the hardware you can
use.  For example - 

* With iLO, you need redundant power supplies.

* With IPMI, you need redundant power supplies and an extra NIC.

* With single power supplies, you should use a remote power switch with
redundant power rails (where the internal electronics can run off of
either for full NSPF protection).  As of this writing, I am unaware of
any such thing available from any of the major IHVs.

* If redundant power supplies are not "redundant enough" in your
opinion, then you should probably use a redundant remote power switch as
noted above.


> So please at least for fence_ilo allow some parameter to let fence 
> spit out a warning and unlock the cluster service

Fencing, put simply, is a deterministic set of steps to take to
guarantee that a dead or misbehaving node can not (not "might not" or
"probably will not") access shared resources/partitions/storage.  It is
designed to have exactly two possible outcomes given a correctly
configured environment:

  - The node has been cut off from shared resources, or

  - Fencing the node has failed

If fencing fails, we retry forever.  Fencing failures are otherwise
unrecoverable.  The only way to recover from a particular fencing
failure is to provide a different fencing mechanism as a backup...


Ok, on how one could change the behavior...

>From a design perspective, if we were to change the behavior of 
>fencing,
I would recommend changing it in fenced, not fence_ilo (e.g. give a
fenced a max_retries count or something), because once we do it for iLO,
we will have to do it for many other agents.  For example, most or all
of the supported APC switches only have a single (non-redundant) power
rail, so fence_apc would have to be changed too.


Here are some things you can do for your configuration:

(a) Add a human layer.  Add a manual fencing agent as a cascade to
detect this particular problem.  This is, in my opinion, the least
likely to solve your problem in the way you want, but if you consider a
power failure of a node fairly unlikely.


(b) Make fencing not fail.  Edit /sbin/fence_ilo and make it do what you
need.


(c) Roll your own fencing agent and add it as a cascade which will do
specifically what you want it to if iLO fencing fails.  For example,
/sbin/fence_dontcare.

#!/bin/bash
logger -p "daemon.emerg" "WARNING - iLO failed; data integrity may be
compromised, but continuing anyway."
echo "Ruh roh!" | mail my at email.addr
exit 0

Don't forget to add fence references to your cluster.conf.


(d) Buy a redundant external power switch as a cascade (or primary
fencing method) in the case that iLO is unreachable.  Here is a WTI NPS
on eBay for $125:

http://cgi.ebay.com/WTI-NPS-115-Remote-Telnet-Power-Reboot-NIB-Switch_W0
QQitemZ9701395350QQcategoryZ11175QQssPageNameZWDVWQQrdZ1QQcmdZViewItem

The NPS has two power rails, and the internal electronics can run off of
either.  I.E., you can actually build a NSPF configuration with nodes
w/o redundant power supplies - without having to weaken any guarantees
about data integrity.  (Note: the NPS 115 has is past its end of life;
WTI has a replacement, but it will cost more than $125.).

-- Lon

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster


From lhh at redhat.com  Fri Mar 24 17:01:54 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 24 Mar 2006 12:01:54 -0500
Subject: [Linux-cluster] Using LABEL in "Device Special File" as a resource
In-Reply-To: <1FC9ADB0-0E74-4BDD-87FF-51E2FF1CE3D0@rallydev.com>
References: <1FC9ADB0-0E74-4BDD-87FF-51E2FF1CE3D0@rallydev.com>
Message-ID: <1143219714.25515.382.camel@ayanami.boston.redhat.com>

On Thu, 2006-03-23 at 14:50 -0700, Tarun Reddy wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> Under RHEL3 clustering, I'm trying to mount a partition using the  
> LABEL since iscsi bindings have a tendency to switch around.
> 
> Using Google to search the archives, I found this message:
> http://www.redhat.com/archives/linux-cluster/2005-October/msg00129.html
> 
> but this doesn't seem to work with Redhat Clustering on RHEL3. Is  
> there another way of doing this?

It really should work; it works on my RHEL3 system:

[root at magenta root]# e2label /dev/sdd3

[root at magenta root]# e2label /dev/sdd3 futon
[root at magenta root]# e2label /dev/sdd3
futon
[root at magenta root]# vi /etc/cluster.xml
...
    <service checkinterval="10" id="0" name="foo">
      <device id="0" name="LABEL=futon" sharename="">
               <!--   ^- name used to be /dev/sdd3 -->
        <mount forceunmount="yes" fstype="ext3" mountpoint="/mnt/1"
options=""/>
      </device>
      ...
    </service>
...

Starting the service and running the mount command yielded this on my
machine:

[root at magenta root]# mount
...
/dev/sdd3 on /mnt/1 type ext3 (rw)
...

Stopping the service causes the mount to go away correctly on my system.
The label->device resolution has not changed since before the release of
RHCS3, so there must be something missing that we don't know about.
Note that "LABEL" is case-sensitive, and must be all capital letters,
and that no spaces are allowed.

-- Lon


From teigland at redhat.com  Fri Mar 24 17:23:18 2006
From: teigland at redhat.com (David Teigland)
Date: Fri, 24 Mar 2006 11:23:18 -0600
Subject: [Linux-cluster] Can't write to file systems
In-Reply-To: <028301c64ed9$90c49570$3964a8c0@WS076>
References: <028301c64ed9$90c49570$3964a8c0@WS076>
Message-ID: <20060324172318.GB24180@redhat.com>

On Fri, Mar 24, 2006 at 12:26:16AM -0000, Ben Yarwood wrote:
> 
> I am really stumped and could do with some help.
> 
> I have a 3 node gfs cluster running gfs 6.1 nad it has started to behave
> very strangely after I had some problems earlier today expanding one of the
> file systems.  
> 
> At the moment all the nodes are in the cluser and it is quorate, and all the
> gfs file systems are mounted.  Reading from the gfs file systems works fine
> but anything that tries to write to them causes the file system to hang.  

I've never heard of that happening before.  Here are a couple things you
could try.

- Try mounting one fs at a time and seeing if there's problems with just
  one of them, or all, or only when there's more than one fs mounted...

- Shut down the cluster and have just a single node mount each fs
  using lock_nolock: mount -t gfs /dev/foo /gfs -o lockproto=lock_nolock
  This mounts the fs as a local file system, so make sure you don't have
  other nodes mounting the fs!  Once it's mounted like this, see if you
  can access the fs normally.

> I see a lot of gfs_recoverd and dlm_recoverd processes running.  Do these
> have something to do with it?

Those will be running all the time.

Dave


From teigland at redhat.com  Fri Mar 24 17:27:18 2006
From: teigland at redhat.com (David Teigland)
Date: Fri, 24 Mar 2006 11:27:18 -0600
Subject: [Linux-cluster] Shutdown/startup
In-Reply-To: <44230F3D.2050409@fnal.gov>
References: <44230F3D.2050409@fnal.gov>
Message-ID: <20060324172718.GC24180@redhat.com>

On Thu, Mar 23, 2006 at 03:12:29PM -0600, Paul Tader wrote:
> So how does one properly shutdown and/or start up a GFS 6.1 cluster. 
> Documentation on the procedures are surprisingly sparse.  For example, 
> what I've experienced during shutdown are messages like:
> 
> CMAN: sendmsg failed: -101
> CMAN: we are leaving the cluster.
> 
> from the node shutting down while the the other nodes report that this 
> node needs to be fenced off.  What?!  Fenced during a controlled 
> shutdown?  Is that correct?  During a complete cluster shutdown, of 
> course quorum, at some point is lost, freezing the nodes that have not 
> completely shutdown forcing a hard reset on those nodes.  I'm probably 
> missing something trivial here, but this doesn't seem right.
> 
> Start up also suffers from the "quorum" problem, but it seems to 
> recovery once enough nodes are up.

You read the notes here about startup/shutdown?
http://sources.redhat.com/cluster/doc/usage.txt

To shutdown, you umount all fs's on all nodes, fence_tool leave on
all nodes, then leave the cluster on all.  To leave the cluster without
quorum getting in the way use 'cman_tool leave remove'.

To start up, have all nodes join the cluster with cman_tool join,
then have all nodes join the fence domain with fence_tool join,
then start other things.

Dave


From jhedden at fnal.gov  Fri Mar 24 21:32:06 2006
From: jhedden at fnal.gov (Jason Hedden)
Date: Fri, 24 Mar 2006 15:32:06 -0600
Subject: [Linux-cluster] Shutdown/startup
In-Reply-To: <44230F3D.2050409@fnal.gov>
References: <44230F3D.2050409@fnal.gov>
Message-ID: <44246556.7090909@fnal.gov>

Hey Paul,

I didn't reply to the list with this.

I couldn't believe it either when I found out that RHCS 3.x worked this 
way... With RHCS 3.x the fencing deamon is evil, you practically have to 
chkconfig off the cluster service let him get fenced and bring the 
system back up.

I thought for sure this would have been fixed in the future version.

Jason Hedden

Paul Tader wrote:

> So how does one properly shutdown and/or start up a GFS 6.1 cluster. 
> Documentation on the procedures are surprisingly sparse.  For example, 
> what I've experienced during shutdown are messages like:
>
> CMAN: sendmsg failed: -101
> CMAN: we are leaving the cluster.
>
> from the node shutting down while the the other nodes report that this 
> node needs to be fenced off.  What?!  Fenced during a controlled 
> shutdown?  Is that correct?  During a complete cluster shutdown, of 
> course quorum, at some point is lost, freezing the nodes that have not 
> completely shutdown forcing a hard reset on those nodes.  I'm probably 
> missing something trivial here, but this doesn't seem right.
>
> Start up also suffers from the "quorum" problem, but it seems to 
> recovery once enough nodes are up.
>
>
> Paul


From mbhabani at iitk.ac.in  Sat Mar 25 06:02:47 2006
From: mbhabani at iitk.ac.in (Bhabani Shankar Mallik)
Date: Sat, 25 Mar 2006 11:32:47 +0530
Subject: [Linux-cluster] HCL Opteron + RHEL clusters
Message-ID: <A29344AF7996434796ABA9C9BEFC99C00EE51D@xchange.cc.iitk.ac.in>


Dear Suvankar,

Would you please help me, regarding the setup.I mean how to set the things.
What about your harware ?
I want to know more regarding cluster suite 4 and oracle 10g.

your help will be greatly appreciated.

Bhabani

-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of SUVANKAR MOITRA
Sent: Fri 24-Mar-06 10:16 AM
To: linux clustering
Subject: Re: [Linux-cluster] HCL Opteron + RHEL clusters
 
hi Bhabani ,

I did it RHEL 4 and cluster suite 4 and oracle 10g in
2 node cluster.

regards

suvankar

kolkata, india

--- Bhabani Shankar Mallik <mbhabani at iitk.ac.in>
wrote:

> 
> Dear All,
> 
> I am going to setup a 48 node opteron(Dual CPUs)
> cluster 
> with RHEL. Can anyone provide me some information
> how to set up ?
> (some links, cookbooks ??)
> I want to make use of Oracle Real Application
> support.
> Step by step guide will be highly appreciated.
> 
> PS : In India HCL is supplying Linux cluster with
> such configuration.
> Anyone from India has some experience, please inform
> me.
> 
> Thanks,
> 
> Bhabani 
> > --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
>
https://www.redhat.com/mailman/listinfo/linux-cluster


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 3296 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060325/ca7a419e/attachment.bin>

From ben.yarwood at juno.co.uk  Sat Mar 25 16:37:55 2006
From: ben.yarwood at juno.co.uk (Ben Yarwood)
Date: Sat, 25 Mar 2006 16:37:55 -0000
Subject: [Linux-cluster] Can't write to file systems
In-Reply-To: <20060324172318.GB24180@redhat.com>
Message-ID: <032d01c6502a$77a505f0$3964a8c0@WS076>

Unfortunately I had to resolve the problem before I got your reply so I
didn't have a chance to try your solutions.  The mailing list seems to have
been having slowness issues.

In a slightly desperate act I fenced and rejoined each node one after the
other but didn't start gfs.  When I rejoined the second of the three nodes,
everything magically started working again, the third node started accessing
the gfs file systems as expected.  Next I remounted the gfs file systems on
the other boxes and they have also been behaving themselves.  Finally I
restarted  rgmanager one one node and all my nfs mounts where restored.
Slightly irrationally I haven't started rgmanager on the other nodes as I
think for some reason it may be to blame.

In summary, it seemed that access to the file systems was being prevented by
the cluster for some reason.  Any more thoughts anyone may have would be
appreciated.

In addition is there a way to check:

1 If a file system is suspended?
2 Why?
3 Which node is causing the problem?
4 A guide to interpreting the structure/comments of  /proc/cluster/services?


Cheers
Ben
 

> On Fri, Mar 24, 2006 at 12:26:16AM -0000, Ben Yarwood wrote:
> > 
> > I am really stumped and could do with some help.
> > 
> > I have a 3 node gfs cluster running gfs 6.1 nad it has started to 
> > behave very strangely after I had some problems earlier today 
> > expanding one of the file systems.
> > 
> > At the moment all the nodes are in the cluser and it is 
> quorate, and 
> > all the gfs file systems are mounted.  Reading from the gfs file 
> > systems works fine but anything that tries to write to them 
> causes the file system to hang.
> 
> I've never heard of that happening before.  Here are a couple 
> things you could try.
> 
> - Try mounting one fs at a time and seeing if there's 
> problems with just
>   one of them, or all, or only when there's more than one fs 
> mounted...
> 
> - Shut down the cluster and have just a single node mount each fs
>   using lock_nolock: mount -t gfs /dev/foo /gfs -o 
> lockproto=lock_nolock
>   This mounts the fs as a local file system, so make sure you 
> don't have
>   other nodes mounting the fs!  Once it's mounted like this, 
> see if you
>   can access the fs normally.
> 
> > I see a lot of gfs_recoverd and dlm_recoverd processes running.  Do 
> > these have something to do with it?
> 
> Those will be running all the time.
> 
> Dave
> 
> 


From celso at webbertek.com.br  Sun Mar 26 15:41:17 2006
From: celso at webbertek.com.br (Celso K. Webber)
Date: Sun, 26 Mar 2006 12:41:17 -0300
Subject: [Linux-cluster] Cluster Suite v3 quorum devices over iSCSI
Message-ID: <20060326152411.M4423@webbertek.com.br>

Hello all,

I've configured a new cluster for a customer with the following details:
* using Red Hat Enterprise Linux ES v3 Update 6 (cannot update to U7 because
Adaptec's SAS controller driver officialy supports only RHEL3U6)
* using Cluster Suite v3U6 (latest packages, clumanager-1.2.28-1)
* 2-node cluster with IBM xSeries 366 servers
* storage is a NetApp FAS270c
* shared state (quorum) managed over an iSCSI LUN on the NetApp
* using Red Hat's latest release of iscsi-initiator-utils (v 3.6.3-3)
* storage volumes for services accessed through NFS on the NetApp
* access to the storage done on a separate VLAN with Linux 802.3ad channel
bonding configured, using Cisco switches with port configured for LACP link
aggregation
* NetApp also configured for access in the network using link aggregation

We had many problems with quorum formation under this environment. If we
booted both servers at the same time, quorum would be formed successfully. If
after that we rebooted one of the servers, it would not rejoin the cluster
until we stopped clumanager on both sides and restarted.

After a long interaction with an iSCSI and NetApp specialist, we could find
some references of a problem with iSCSI that required Cluser Suite v3 to use
broadcast heartbeating instead of multicast heartbeating:
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=136553

Although the above Bugzilla is apparently resolved, we followed the directions
of changing to broadcast heartbeating and it solved our problem:

"Comment #4 From Lon Hohberger  	 on 2005-01-10 13:00 EST  	[reply]  	   	 

I'm pretty sure this is a duplicate of the following bug:

https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=114653

Basically, clumanager 1.2.9 reboots if it fails to join a multicast
group after a few seconds.  You should be able to fix this by changing
clumanager to use 'broadcast' instead of multicast heartbeating in the
GUI."


Question: is there a limitation or need to use broadcast heartbeating under an
iSCSI environment? Why multicast heartbeating didn't work here?


Thank you very much,

Celso.

--
Celso K. Webber <celso at webbertek.com.br>
(41) 3284-3035
(41) 8813-1919


From celso at webbertek.com.br  Sun Mar 26 15:48:24 2006
From: celso at webbertek.com.br (Celso K. Webber)
Date: Sun, 26 Mar 2006 12:48:24 -0300
Subject: [Linux-cluster] Cluster Suite v3 IPMI fencing time
Message-ID: <20060326154228.M36405@webbertek.com.br>

Hello again,

I have another question about CSv3U6 under the following environment:
* using Red Hat Enterprise Linux ES v3 Update 6 (cannot update to U7 because
Adaptec's SAS controller driver officialy supports only RHEL3U6)
* using Cluster Suite v3U6 (latest packages, clumanager-1.2.28-1)
* 2-node cluster with IBM xSeries 366 servers
* storage is a NetApp FAS270c
* shared state (quorum) managed over an iSCSI LUN on the NetApp
* using Red Hat's latest release of iscsi-initiator-utils (v 3.6.3-3)
* storage volumes for services accessed through NFS on the NetApp
* access to the storage done on a separate VLAN with Linux 802.3ad channel
bonding configured, using Cisco switches with port configured for LACP link
aggregation
* NetApp also configured for access in the network using link aggregation 

We configured fencing using IPMI over LAN. Although it works ok, it takes
almot 5 minutes for the fencing process to complete, please see these logs:

Mar 25 22:52:48 member1 cluquorumd[4112]: <warning> --> Commencing STONITH <--
Mar 25 22:55:00 member1 cluquorumd[4112]: <notice> STONITH: member2 has been
fenced!
Mar 25 22:57:11 member1 cluquorumd[4112]: <notice> STONITH: member2 is no
longer fenced off.


Questions:
1. member1 does not take over the services until the fencing process
completes, is that correct?
2. if it is a correct behaviour that member1 waits for the fencing completion
to take over services, how can I reduce the total fencing time? I didn't find
any parameters under the cludb man page for this.


Again, thank you all for any help.

Regards,

Celso.
--
Celso K. Webber <celso at webbertek.com.br>
(41) 3284-3035
(41) 8813-1919


From celso at webbertek.com.br  Sun Mar 26 21:49:43 2006
From: celso at webbertek.com.br (Celso K. Webber)
Date: Sun, 26 Mar 2006 18:49:43 -0300
Subject: [Linux-cluster] Cluster Suite v3 mounting NFS volumes
Message-ID: <20060326214429.M85634@webbertek.com.br>

Hello everybody,

As stated on previous email messages, we are configuring a cluster where the
storage system is a NetApp device.

Because of that, all service volumes (for Oracle in our case) are NFS mounted
in the cluster.

I'd like to have the NFS filesystems mounted only on the active machine, thus
avoiding a user from mistakenly NFS mount the filesystems on the other machine
and start playing with the datafiles from there.

Although NFS will protect from filesystem corruption if I mount from both
machines, it is still possible that someone starts playing with files at the
"standby" machine while the "active" machine have them into production.

Question: is there a reason for CSv3 not supporting a configuration where NFS
devices mount/umount are controlled by the Cluster?

For now I've included mount/umount commands on the service script.

Thank you all.

Celso.
--
Celso K. Webber <celso at webbertek.com.br>
(41) 3284-3035
(41) 8813-1919


From treddy at rallydev.com  Mon Mar 27 01:02:46 2006
From: treddy at rallydev.com (Tarun Reddy)
Date: Sun, 26 Mar 2006 18:02:46 -0700
Subject: [Linux-cluster] Using LABEL in "Device Special File" as a resource
In-Reply-To: <1143219714.25515.382.camel@ayanami.boston.redhat.com>
References: <1FC9ADB0-0E74-4BDD-87FF-51E2FF1CE3D0@rallydev.com>
	<1143219714.25515.382.camel@ayanami.boston.redhat.com>
Message-ID: <6E3B700A-A8B7-4C1D-BE71-095E2A8546CF@rallydev.com>

Hmmm....

I guess I never tried to edit cluster.xml by hand. On my system at  
least, redhat-config-cluster does not allow the value. I'll try  
editing cluster.xml tomorrow and see if that works. Could  you let me  
know if you can get redhat-config-cluster to accept the LABEL  
designation? Could it possibly be that my LABEL is "/www" (no quotes)?

Thanks,
Tarun


On Mar 24, 2006, at 10:01 AM, Lon Hohberger wrote:

> On Thu, 2006-03-23 at 14:50 -0700, Tarun Reddy wrote:
>> -----BEGIN PGP SIGNED MESSAGE-----
>> Hash: SHA1
>>
>> Under RHEL3 clustering, I'm trying to mount a partition using the
>> LABEL since iscsi bindings have a tendency to switch around.
>>
>> Using Google to search the archives, I found this message:
>> http://www.redhat.com/archives/linux-cluster/2005-October/ 
>> msg00129.html
>>
>> but this doesn't seem to work with Redhat Clustering on RHEL3. Is
>> there another way of doing this?
>
> It really should work; it works on my RHEL3 system:
>
> [root at magenta root]# e2label /dev/sdd3
>
> [root at magenta root]# e2label /dev/sdd3 futon
> [root at magenta root]# e2label /dev/sdd3
> futon
> [root at magenta root]# vi /etc/cluster.xml
> ...
>     <service checkinterval="10" id="0" name="foo">
>       <device id="0" name="LABEL=futon" sharename="">
>                <!--   ^- name used to be /dev/sdd3 -->
>         <mount forceunmount="yes" fstype="ext3" mountpoint="/mnt/1"
> options=""/>
>       </device>
>       ...
>     </service>
> ...
>
> Starting the service and running the mount command yielded this on my
> machine:
>
> [root at magenta root]# mount
> ...
> /dev/sdd3 on /mnt/1 type ext3 (rw)
> ...
>
> Stopping the service causes the mount to go away correctly on my  
> system.
> The label->device resolution has not changed since before the  
> release of
> RHCS3, so there must be something missing that we don't know about.
> Note that "LABEL" is case-sensitive, and must be all capital letters,
> and that no spaces are allowed.
>
> -- Lon
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From celso at webbertek.com.br  Mon Mar 27 01:11:38 2006
From: celso at webbertek.com.br (Celso K. Webber)
Date: Sun, 26 Mar 2006 22:11:38 -0300
Subject: [Linux-cluster] Using LABEL in "Device Special File" as a resource
In-Reply-To: <6E3B700A-A8B7-4C1D-BE71-095E2A8546CF@rallydev.com>
References: <1FC9ADB0-0E74-4BDD-87FF-51E2FF1CE3D0@rallydev.com>
	<1143219714.25515.382.camel@ayanami.boston.redhat.com>
	<6E3B700A-A8B7-4C1D-BE71-095E2A8546CF@rallydev.com>
Message-ID: <20060327010355.M20339@webbertek.com.br>

Hello Tarun,

I had to solve this problem once under RHEL AS v2.1 (first version of Cluster
Manager), and it worked perfectly. By that time my problem was that the
machines had different internal disk configurations, so they had differente
"/dev/sdXX" names.

Under Cluster Suite v3 I solved this problem by pointing the real devices to
raw devices, and /dev/raw/XXX after that. I don't know if this has
implications on performance, but the systems worked apparently well.

I had once some problems with iSCSI on a IBM Blade Center where the iSCSI
volumes changed their /dev/XXX if the machine booted with the USB floppy/cdrom
connected (iSCSI dev was /dev/sdb with no USB floppy, but it was recognized as
/dev/sdc if the USB floppy was connected when the machine booted).
Fortunately, it was a regular filesystem (no cluster) and I could use
LABEL=XXX entries in /etc/fstab.

By this time I cannot figure out how could you solve your problem. I imagine
that it should not be difficult to write a Perl or Shell script that ran just
before the clumanager started, and issuing commands such as "iscsi-ls" it
could map a corresponding raw device, or even changing your /etc/cluster.xml
on the fly and writing it to the quorum using a "cludb" command.

Sorry for not pointing out a better solution, ok?

Regards,

Celso.

On Sun, 26 Mar 2006 18:02:46 -0700, Tarun Reddy wrote
> Hmmm....
> 
> I guess I never tried to edit cluster.xml by hand. On my system at  
> least, redhat-config-cluster does not allow the value. I'll try  
> editing cluster.xml tomorrow and see if that works. Could  you let 
> me  know if you can get redhat-config-cluster to accept the LABEL  
> designation? Could it possibly be that my LABEL is "/www" (no quotes)?
> 
> Thanks,
> Tarun
> 
> On Mar 24, 2006, at 10:01 AM, Lon Hohberger wrote:
> 
> > On Thu, 2006-03-23 at 14:50 -0700, Tarun Reddy wrote:
> >> -----BEGIN PGP SIGNED MESSAGE-----
> >> Hash: SHA1
> >>
> >> Under RHEL3 clustering, I'm trying to mount a partition using the
> >> LABEL since iscsi bindings have a tendency to switch around.
> >>
> >> Using Google to search the archives, I found this message:
> >> http://www.redhat.com/archives/linux-cluster/2005-October/ 
> >> msg00129.html
> >>
> >> but this doesn't seem to work with Redhat Clustering on RHEL3. Is
> >> there another way of doing this?
> >
> > It really should work; it works on my RHEL3 system:
> >
> > [root at magenta root]# e2label /dev/sdd3
> >
> > [root at magenta root]# e2label /dev/sdd3 futon
> > [root at magenta root]# e2label /dev/sdd3
> > futon
> > [root at magenta root]# vi /etc/cluster.xml
> > ...
> >     <service checkinterval="10" id="0" name="foo">
> >       <device id="0" name="LABEL=futon" sharename="">
> >                <!--   ^- name used to be /dev/sdd3 -->
> >         <mount forceunmount="yes" fstype="ext3" mountpoint="/mnt/1"
> > options=""/>
> >       </device>
> >       ...
> >     </service>
> > ...
> >
> > Starting the service and running the mount command yielded this on my
> > machine:
> >
> > [root at magenta root]# mount
> > ...
> > /dev/sdd3 on /mnt/1 type ext3 (rw)
> > ...
> >
> > Stopping the service causes the mount to go away correctly on my  
> > system.
> > The label->device resolution has not changed since before the  
> > release of
> > RHCS3, so there must be something missing that we don't know about.
> > Note that "LABEL" is case-sensitive, and must be all capital letters,
> > and that no spaces are allowed.
> >
> > -- Lon
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


--
Celso K. Webber <celso at webbertek.com.br>
(41) 3284-3035
(41) 8813-1919


From basv at sara.nl  Mon Mar 27 06:23:40 2006
From: basv at sara.nl (Bas van der Vlies)
Date: Mon, 27 Mar 2006 08:23:40 +0200
Subject: [Linux-cluster] DLM messages
Message-ID: <C2EC4062-0671-4594-B1D0-8AB6B0106CFE@sara.nl>

GFS: cvs STABLE

I just noticed that as we get these dlm messages the load will  
increase about 60 and stays there. what does this messages
mean?

--- kern.log ----
Mar 26 15:07:36 ifs2 kernel: dlm: lisa_vg3_lv1:  
process_lockqueue_reply id 237030a state 0
Mar 26 15:35:25 ifs2 kernel: dlm: lisa_vg3_lv1: cancel reply ret 0
Mar 26 15:35:25 ifs2 kernel: lock_dlm: unlock sb_status 0 2,15a940b8  
flags 0
Mar 26 15:35:25 ifs2 kernel: dlm: lisa_vg3_lv1:  
process_lockqueue_reply id 23e02f9 state 0

Regards

--
Bas van der Vlies
basv at sara.nl


From l.dardini at comune.prato.it  Mon Mar 27 07:23:00 2006
From: l.dardini at comune.prato.it (Leandro Dardini)
Date: Mon, 27 Mar 2006 09:23:00 +0200
Subject: R: [Linux-cluster] 2 Node Cluster - SAN Fencing
Message-ID: <404AA6666D14D14CA0D410C1BC6CC4C546557E@exchange3.comune.prato.local>

 
> -----Messaggio originale-----
> Da: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] Per conto di Lon Hohberger
> Inviato: gioved? 23 marzo 2006 16.06
> A: linux clustering
> Oggetto: Re: [Linux-cluster] 2 Node Cluster - SAN Fencing
> 
> On Wed, 2006-03-22 at 14:46 -0500, Hendershot, Zach wrote:
> > Hi,
> >     I have a 2 node cluster that I'm setting up fencing on (with 
> > fence_brocade). As a test I bring down the heartbeat network. A few 
> > seconds pass and BOTH nodes fence each other off of the SAN 
> (via the 
> > 2nd network interface on the machines). Should I put the brocade 
> > switch on the heartbeat network, or is there a better way? Thanks.
> >  
> > 
> 
> 2-node clusters need power fencing.

Do you mean that fence_brocade like fencing devices are not suitable to fence a two node cluster?

Leandro

> 
> -- Lon
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From teigland at redhat.com  Mon Mar 27 07:29:21 2006
From: teigland at redhat.com (David Teigland)
Date: Mon, 27 Mar 2006 01:29:21 -0600
Subject: R: [Linux-cluster] 2 Node Cluster - SAN Fencing
In-Reply-To: <404AA6666D14D14CA0D410C1BC6CC4C546557E@exchange3.comune.prato.local>
References: <404AA6666D14D14CA0D410C1BC6CC4C546557E@exchange3.comune.prato.local>
Message-ID: <20060327072921.GA27410@redhat.com>

On Mon, Mar 27, 2006 at 09:23:00AM +0200, Leandro Dardini wrote:
>  
> 
> > -----Messaggio originale-----
> > Da: linux-cluster-bounces at redhat.com 
> > [mailto:linux-cluster-bounces at redhat.com] Per conto di Lon Hohberger
> > Inviato: gioved? 23 marzo 2006 16.06
> > A: linux clustering
> > Oggetto: Re: [Linux-cluster] 2 Node Cluster - SAN Fencing
> > 
> > On Wed, 2006-03-22 at 14:46 -0500, Hendershot, Zach wrote:
> > > Hi,
> > >     I have a 2 node cluster that I'm setting up fencing on (with 
> > > fence_brocade). As a test I bring down the heartbeat network. A few 
> > > seconds pass and BOTH nodes fence each other off of the SAN 
> > (via the 
> > > 2nd network interface on the machines). Should I put the brocade 
> > > switch on the heartbeat network, or is there a better way? Thanks.
> > 
> > 2-node clusters need power fencing.
> 
> Do you mean that fence_brocade like fencing devices are not suitable to
> fence a two node cluster?

In situations where one of the two nodes didn't actually fail you can
get both nodes fencing each other.  With SAN fencing both will disable
each other and neither will be able to access storage, with power fencing
the first one reboots the second so the second doesn't get a chance to
reboot the first.

Dave


From teigland at redhat.com  Mon Mar 27 08:46:43 2006
From: teigland at redhat.com (David Teigland)
Date: Mon, 27 Mar 2006 02:46:43 -0600
Subject: [Linux-cluster] DLM messages
In-Reply-To: <C2EC4062-0671-4594-B1D0-8AB6B0106CFE@sara.nl>
References: <C2EC4062-0671-4594-B1D0-8AB6B0106CFE@sara.nl>
Message-ID: <20060327084643.GB27410@redhat.com>

On Mon, Mar 27, 2006 at 08:23:40AM +0200, Bas van der Vlies wrote:
> GFS: cvs STABLE
> 
> I just noticed that as we get these dlm messages the load will  
> increase about 60 and stays there. what does this messages
> mean?
> 
> process_lockqueue_reply id 237030a state 0

These are common and haven't been associated with any problems
before.  We should probably remove the printk.  It's caused
by a grant message arriving before the reply to the lock request.

> Mar 26 15:35:25 ifs2 kernel: dlm: lisa_vg3_lv1: cancel reply ret 0
> Mar 26 15:35:25 ifs2 kernel: lock_dlm: unlock sb_status 0 2,15a940b8  
> flags 0

I've never seen these before.  They're related to cancels which
GFS only does during recovery.  What applications are using GFS?

Dave


From Alain.Moulle at bull.net  Mon Mar 27 09:23:43 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Mon, 27 Mar 2006 11:23:43 +0200
Subject: [Linux-cluster] CS4 / devices naming persistency
Message-ID: <4427AF1F.5050004@bull.net>

Hi

Just for confirmation or/and suggestions:
I think that there is a problem of device naming unicity
and persistency (ie. after reboot) when we configure
some FS as "to be failovered" ressources in CS4.
In this case, which solution do you preconise :
use of LABELs on devices ?
or solution such as udev, etc. ?

This problem occurs only if we have FS as "to be failovered"
ressources. If not, I think that device naming unicity
and persistency does not matter at all for CS4 ...

Am I right ?

Thanks
Alain


From Alain.Moulle at bull.net  Mon Mar 27 09:25:55 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Mon, 27 Mar 2006 11:25:55 +0200
Subject: [Linux-cluster] CS4/ "Monitoring the health of remote power switch"
	?
Message-ID: <4427AFA3.7020604@bull.net>

Hi

As I did not get any answer to this
question some weeks ago, I try again :

In the CS4 Red Hat Documentation chapter
"Red Hat Cluster Manager Overview" , it
is written :
"To monitor the health of the other nodes,
each node monitors the health of remote
power switch if any ,etc."

Is it really possible ?
Where could we configure this ?
Or is it hidden in the CS4 ,I mean
is it automatically done, based on the
fence lines information in cluster.conf,
and if so what exactly is done ? (periodic
ping on the IP adr of remote power switch )

Thanks if you can clarify this point for me.

Alain


From basv at sara.nl  Mon Mar 27 09:02:55 2006
From: basv at sara.nl (Bas van der Vlies)
Date: Mon, 27 Mar 2006 11:02:55 +0200
Subject: [Linux-cluster] DLM messages
In-Reply-To: <20060327084643.GB27410@redhat.com>
References: <C2EC4062-0671-4594-B1D0-8AB6B0106CFE@sara.nl>
	<20060327084643.GB27410@redhat.com>
Message-ID: <4427AA3F.3040009@sara.nl>

David Teigland wrote:
> On Mon, Mar 27, 2006 at 08:23:40AM +0200, Bas van der Vlies wrote:
>> GFS: cvs STABLE
>>
>> I just noticed that as we get these dlm messages the load will  
>> increase about 60 and stays there. what does this messages
>> mean?
>>
>> process_lockqueue_reply id 237030a state 0
> 
> These are common and haven't been associated with any problems
> before.  We should probably remove the printk.  It's caused
> by a grant message arriving before the reply to the lock request.
> 
Thanks i also found that answer in the archive

>> Mar 26 15:35:25 ifs2 kernel: dlm: lisa_vg3_lv1: cancel reply ret 0
>> Mar 26 15:35:25 ifs2 kernel: lock_dlm: unlock sb_status 0 2,15a940b8  
>> flags 0
> 
> I've never seen these before.  They're related to cancels which
> GFS only does during recovery.  What applications are using GFS?
> 
Our setup is a 4 node GFS-server that act as Home NFS-server for our 
cluster. This cluster is used by different universities and research 
institutes. All applications are using NFS and therfore also GFS.

Regards


-- 
--
********************************************************************
*                                                                  *
*  Bas van der Vlies                     e-mail: basv at sara.nl      *
*  SARA - Academic Computing Services    phone:  +31 20 592 8012   *
*  Kruislaan 415                         fax:    +31 20 6683167    *
*  1098 SJ Amsterdam                                               *
*                                                                  *
********************************************************************


From Alain.Moulle at bull.net  Mon Mar 27 09:44:31 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Mon, 27 Mar 2006 11:44:31 +0200
Subject: [Linux-cluster] 2 Node Cluster - SAN Fencing
Message-ID: <4427B3FF.4050502@bull.net>

>Hi,
>>     I have a 2 node cluster that I'm setting up fencing on (with
>> fence_brocade). As a test I bring down the heartbeat network. A few
>> seconds pass and BOTH nodes fence each other off of the SAN (via the
>> 2nd network interface on the machines). Should I put the brocade
>> switch on the heartbeat network, or is there a better way? Thanks.

>2-node clusters need power fencing.
>-- Lon

Hi Lon

I'm not sure it solves Zach's problem ... in fact, I think it
is quite similar to the question I asked several weeks ago about
failure on the heart-beat eth :
whatever fencing solution, when we got a failure on heart-beat
eth, both nodes can decide to fence the other, so the first
one to detect the hb failure wins but the probability that
both nodes decide simultanesouly is not null ... and the problem
remains the same with fence_brocade as well as with a power fence
such as fence_ipmilan or any other ...

Right ?
Alain


From basv at sara.nl  Mon Mar 27 11:24:05 2006
From: basv at sara.nl (Bas van der Vlies)
Date: Mon, 27 Mar 2006 13:24:05 +0200
Subject: [Linux-cluster] DLM messages
In-Reply-To: <4427AA3F.3040009@sara.nl>
References: <C2EC4062-0671-4594-B1D0-8AB6B0106CFE@sara.nl>	<20060327084643.GB27410@redhat.com>
	<4427AA3F.3040009@sara.nl>
Message-ID: <4427CB55.2060203@sara.nl>

Bas van der Vlies wrote:
> David Teigland wrote:
>> On Mon, Mar 27, 2006 at 08:23:40AM +0200, Bas van der Vlies wrote:
>>> GFS: cvs STABLE
>>>
>>> I just noticed that as we get these dlm messages the load will  
>>> increase about 60 and stays there. what does this messages
>>> mean?
>>>
>>> process_lockqueue_reply id 237030a state 0
>>
>> These are common and haven't been associated with any problems
>> before.  We should probably remove the printk.  It's caused
>> by a grant message arriving before the reply to the lock request.
>>
> Thanks i also found that answer in the archive
> 
>>> Mar 26 15:35:25 ifs2 kernel: dlm: lisa_vg3_lv1: cancel reply ret 0
>>> Mar 26 15:35:25 ifs2 kernel: lock_dlm: unlock sb_status 0 2,15a940b8  
>>> flags 0
>>
>> I've never seen these before.  They're related to cancels which
>> GFS only does during recovery.  What applications are using GFS?
>>
> Our setup is a 4 node GFS-server that act as Home NFS-server for our 
> cluster. This cluster is used by different universities and research 
> institutes. All applications are using NFS and therfore also GFS.
> 

The load is even getting higher and the node does not respond to NFS
requests. When it try to fence the nodeby disabing the heartbeat-network
interface (It was the master). We get all kind od cman/dlm kernel 
crashes when the node wants to rejoin

Here are some kern.log outputs

==== FS4 ====
Mar 27 12:29:26 ifs4 kernel: ------------[ cut here ]------------
Mar 27 12:29:26 ifs4 kernel: kernel BUG at 
/usr/src/gfs/stable_1.0.2/stable/cluster/gfs-kernel/src/dlm/lock.c:357!
Mar 27 12:29:26 ifs4 kernel: invalid opcode: 0000 [#1]
Mar 27 12:29:26 ifs4 kernel: SMP
Mar 27 12:29:26 ifs4 kernel: Modules linked in: lock_dlm dlm cman 
dm_round_robin dm_multipath sg ide_floppy ide_cd cdrom qla2xxx siimage 
piix e1000 gfs lock_harness dm_mod
Mar 27 12:29:26 ifs4 kernel: CPU:    0
Mar 27 12:29:26 ifs4 kernel: EIP:    0060:[<f8aa5586>]    Tainted: GF 
   VLI
Mar 27 12:29:26 ifs4 kernel: EFLAGS: 00010246   (2.6.16-rc5-sara3 #1)
Mar 27 12:29:26 ifs4 kernel: EIP is at do_dlm_unlock+0x91/0xaa [lock_dlm]
Mar 27 12:29:26 ifs4 kernel: eax: 00000004   ebx: f08b35c0   ecx: 
0000b8f7   edx: 00000246
Mar 27 12:29:26 ifs4 kernel: esi: ffffffea   edi: f8c29000   ebp: 
eff65ee8   esp: eff65edc
Mar 27 12:29:26 ifs4 kernel: ds: 007b   es: 007b   ss: 0068
Mar 27 12:29:26 ifs4 kernel: Process gfs_glockd (pid: 6605, 
threadinfo=eff64000 task=eff46030)
Mar 27 12:29:26 ifs4 kernel: Stack: <0>f8aa9d89 f8c29000 f167e3c0 
eff65ef4 f8aa5824 f08b35c0 eff65f08 f899a7bc
Mar 27 12:29:26 ifs4 kernel: f08b35c0 00000003 f167e3e4 eff65f2c 
f8990ca4 f8c29000 f08b35c0 00000003
Mar 27 12:29:26 ifs4 kernel: f89c4ec0 f167e3c0 00000001 f167e3c0 
eff65f40 f8993680 f167e3c0 f167e3c0
Mar 27 12:29:26 ifs4 kernel: Call Trace:
Mar 27 12:29:26 ifs4 kernel: [<c0103599>] show_stack_log_lvl+0xad/0xb5
Mar 27 12:29:26 ifs4 kernel: [<c01036db>] show_registers+0x10d/0x176
Mar 27 12:29:26 ifs4 kernel: [<c01038ad>] die+0xf2/0x16d
Mar 27 12:29:26 ifs4 kernel: [<c0103996>] do_trap+0x6e/0x8a
Mar 27 12:29:26 ifs4 kernel: [<c0103bed>] do_invalid_op+0x90/0x97
Mar 27 12:29:26 ifs4 kernel: [<c010322f>] error_code+0x4f/0x54
Mar 27 12:29:26 ifs4 kernel: [<f8aa5824>] lm_dlm_unlock+0x1d/0x24 [lock_dlm]
Mar 27 12:29:26 ifs4 kernel: [<f899a7bc>] gfs_lm_unlock+0x2c/0x46 [gfs]
Mar 27 12:29:26 ifs4 kernel: [<f8990ca4>] gfs_glock_drop_th+0xf0/0x12d [gfs]
Mar 27 12:29:26 ifs4 kernel: [<f8993680>] inode_go_drop_th+0x13/0x18 [gfs]
Mar 27 12:29:26 ifs4 kernel: [<f89901f9>] rq_demote+0x79/0x95 [gfs]
Mar 27 12:29:26 ifs4 kernel: [<f89902b4>] run_queue+0x56/0xbb [gfs]
Mar 27 12:29:26 ifs4 kernel: [<f89903d6>] unlock_on_glock+0x1f/0x29 [gfs]
Mar 27 12:29:26 ifs4 kernel: [<f899232a>] gfs_reclaim_glock+0xbf/0x138 [gfs]
Mar 27 12:29:26 ifs4 kernel: [<f8986682>] gfs_glockd+0x3b/0xe3 [gfs]
Mar 27 12:29:26 ifs4 kernel: [<c0100ed9>] kernel_thread_helper+0x5/0xb
Mar 27 12:29:26 ifs4 kernel: Code: 73 34 ff 73 2c ff 73 08 ff 73 04 ff 
73 0c 56 8b 03 ff 70 18 68 a0 a6 aa f8 e8 80 19 67 c7 83 c4 34 68 89 9d 
aa f8 e8 73 19 67 c7 <0f> 0b 65 01 c0 a4 aa f8 68 a0 a5 aa f8 e8 27 12 
67 c7 8d 65 f8
Mar 27 12:29:36 ifs4 kernel: ------------[ cut here ]------------
Mar 27 12:29:36 ifs4 kernel: kernel BUG at 
/usr/src/gfs/stable_1.0.2/stable/cluster/gfs-kernel/src/dlm/lock.c:357!
Mar 27 12:29:36 ifs4 kernel: invalid opcode: 0000 [#2]
Mar 27 12:29:36 ifs4 kernel: SMP
Mar 27 12:29:36 ifs4 kernel: Modules linked in: lock_dlm dlm cman 
dm_round_robin dm_multipath sg ide_floppy ide_cd cdrom qla2xxx siimage 
piix e1000 gfs lock_harness dm_mod
Mar 27 12:29:36 ifs4 kernel: CPU:    1
Mar 27 12:29:36 ifs4 kernel: EIP:    0060:[<f8aa5586>]    Tainted: GF 
   VLI
Mar 27 12:29:36 ifs4 kernel: EFLAGS: 00010206   (2.6.16-rc5-sara3 #1)
Mar 27 12:29:36 ifs4 kernel: EIP is at do_dlm_unlock+0x91/0xaa [lock_dlm]
Mar 27 12:29:36 ifs4 kernel: eax: 00000004   ebx: e948f6c0   ecx: 
c034ab40   edx: 00000206
Mar 27 12:29:36 ifs4 kernel: esi: ffffffea   edi: f8b57000   ebp: 
f4fafee8   esp: f4fafedc
Mar 27 12:29:36 ifs4 kernel: ds: 007b   es: 007b   ss: 0068
Mar 27 12:29:36 ifs4 kernel: Process gfs_glockd (pid: 6472, 
threadinfo=f4fae000 task=f5d44550)
Mar 27 12:29:36 ifs4 kernel: Stack: <0>f8aa9d89 f8b57000 f33799a8 
f4fafef4 f8aa5824 e948f6c0 f4faff08 f899a7bc
Mar 27 12:29:36 ifs4 kernel: e948f6c0 00000003 f33799cc f4faff2c 
f8990ca4 f8b57000 e948f6c0 00000003
Mar 27 12:29:36 ifs4 kernel: f89c4ec0 f33799a8 00000001 f33799a8 
f4faff40 f8993680 f33799a8 f33799a8
Mar 27 12:29:36 ifs4 kernel: Call Trace:
Mar 27 12:29:36 ifs4 kernel: [<c0103599>] show_stack_log_lvl+0xad/0xb5
Mar 27 12:29:36 ifs4 kernel: [<c01036db>] show_registers+0x10d/0x176
Mar 27 12:29:36 ifs4 kernel: [<c01038ad>] die+0xf2/0x16d
Mar 27 12:29:36 ifs4 kernel: [<c0103996>] do_trap+0x6e/0x8a
Mar 27 12:29:36 ifs4 kernel: [<c0103bed>] do_invalid_op+0x90/0x97
Mar 27 12:29:36 ifs4 kernel: [<c010322f>] error_code+0x4f/0x54
Mar 27 12:29:36 ifs4 kernel: [<f8aa5824>] lm_dlm_unlock+0x1d/0x24 [lock_dlm]
Mar 27 12:29:36 ifs4 kernel: [<f899a7bc>] gfs_lm_unlock+0x2c/0x46 [gfs]
Mar 27 12:29:36 ifs4 kernel: [<f8990ca4>] gfs_glock_drop_th+0xf0/0x12d [gfs]
Mar 27 12:29:36 ifs4 kernel: [<f8993680>] inode_go_drop_th+0x13/0x18 [gfs]
Mar 27 12:29:36 ifs4 kernel: [<f89901f9>] rq_demote+0x79/0x95 [gfs]
Mar 27 12:29:36 ifs4 kernel: [<f89902b4>] run_queue+0x56/0xbb [gfs]
Mar 27 12:29:36 ifs4 kernel: [<f89903d6>] unlock_on_glock+0x1f/0x29 [gfs]
Mar 27 12:29:36 ifs4 kernel: [<f899232a>] gfs_reclaim_glock+0xbf/0x138 [gfs]
Mar 27 12:29:36 ifs4 kernel: [<f8986682>] gfs_glockd+0x3b/0xe3 [gfs]
Mar 27 12:29:36 ifs4 kernel: [<c0100ed9>] kernel_thread_helper+0x5/0xb
Mar 27 12:29:36 ifs4 kernel: Code: 73 34 ff 73 2c ff 73 08 ff 73 04 ff 
73 0c 56 8b 03 ff 70 18 68 a0 a6 aa f8 e8 80 19 67 c7 83 c4 34 68 89 9d 
aa f8 e8 73 19 67 c7 <0f> 0b 65 01 c0 a4 aa f8 68 a0 a5 aa f8 e8 27 12 
67 c7 8d 65 f8


=== FS2 ==
Mar 27 12:28:25 ifs2 kernel: ------------[ cut here ]------------
Mar 27 12:28:25 ifs2 kernel: kernel BUG at 
/usr/src/gfs/stable_1.0.2/stable/cluster/cman-kernel/src/membership.c:3151!
Mar 27 12:28:25 ifs2 kernel: invalid opcode: 0000 [#1]
Mar 27 12:28:25 ifs2 kernel: SMP
Mar 27 12:28:25 ifs2 kernel: Modules linked in: lock_dlm dlm cman 
dm_round_robin dm_multipath sg ide_floppy ide_cd cdrom qla2xxx siimage 
piix e1000 gfs lock_harness dm_mod
Mar 27 12:28:25 ifs2 kernel: CPU:    0
Mar 27 12:28:25 ifs2 kernel: EIP:    0060:[<f8ac7825>]    Tainted: GF 
   VLI
Mar 27 12:28:25 ifs2 kernel: EFLAGS: 00010246   (2.6.16-rc5-sara3 #1)
Mar 27 12:28:25 ifs2 kernel: EIP is at elect_master+0x34/0x41 [cman]
Mar 27 12:28:25 ifs2 kernel: eax: f8a3e000   ebx: 00000080   ecx: 
00000080   edx: 00000000
Mar 27 12:28:25 ifs2 kernel: esi: f8adb584   edi: f677bfcc   ebp: 
f677bf7c   esp: f677bf78
Mar 27 12:28:25 ifs2 kernel: ds: 007b   es: 007b   ss: 0068
Mar 27 12:28:25 ifs2 kernel: Process cman_memb (pid: 5892, 
threadinfo=f677a000 task=f6984030)
Mar 27 12:28:25 ifs2 kernel: Stack: <0>f63374a0 f677bf90 f8ac5430 
f677bf98 00000000 f63374a0 f677bfa4 f8ac3acc
Mar 27 12:28:25 ifs2 kernel: f68dba40 00000000 f6984030 f677bfe4 
f8ac3cb3 0000001f 00000000 c0102612
Mar 27 12:28:25 ifs2 kernel: 00000000 f6984030 c01124e1 00100100 
00200200 00000000 00000000 00000000
Mar 27 12:28:25 ifs2 kernel: Call Trace:
Mar 27 12:28:25 ifs2 kernel: [<c0103599>] show_stack_log_lvl+0xad/0xb5
Mar 27 12:28:25 ifs2 kernel: [<c01036db>] show_registers+0x10d/0x176
Mar 27 12:28:25 ifs2 kernel: [<c01038ad>] die+0xf2/0x16d
Mar 27 12:28:25 ifs2 kernel: [<c0103996>] do_trap+0x6e/0x8a
Mar 27 12:28:25 ifs2 kernel: [<c0103bed>] do_invalid_op+0x90/0x97
Mar 27 12:28:25 ifs2 kernel: [<c010322f>] error_code+0x4f/0x54
Mar 27 12:28:25 ifs2 kernel: [<f8ac5430>] a_node_just_died+0x118/0x178 
[cman]
Mar 27 12:28:25 ifs2 kernel: [<f8ac3acc>] process_dead_nodes+0x4e/0x7a 
[cman]
Mar 27 12:28:25 ifs2 kernel: [<f8ac3cb3>] membership_kthread+0x1bb/0x38d 
[cman]
Mar 27 12:28:25 ifs2 kernel: [<c0100ed9>] kernel_thread_helper+0x5/0xb
Mar 27 12:28:25 ifs2 kernel: Code: 8b 1d 44 c3 ad f8 39 d9 7d 21 a1 48 
c3 ad f8 8b 14 88 85 d2 74 10 83 7a 1c 02 75 0a 8b 45 08 89 10 8b 42 14 
eb 0f 41 39 d9 7c df <0f> 0b 4f 0c 60 f2 ac f8 31 c0 5b 5d c3 55 89 e5 
ff 35 48 c3 ad

== fs3 ==
ar 27 12:22:30 ifs3 kernel: kernel BUG at 
/usr/src/gfs/stable_1.0.2/stable/cluster/gfs-kernel/src/dlm/lock.c:428!
Mar 27 12:22:30 ifs3 kernel: invalid opcode: 0000 [#1]
Mar 27 12:22:30 ifs3 kernel: SMP
Mar 27 12:22:30 ifs3 kernel: Modules linked in: lock_dlm dlm cman 
dm_round_robin dm_multipath sg ide_floppy ide_cd cdrom qla2xxx siimage 
piix e1000 gfs lock_harness dm_mod
Mar 27 12:22:30 ifs3 kernel: CPU:    0
Mar 27 12:22:30 ifs3 kernel: EIP:    0060:[<f8aa5714>]    Tainted: GF 
   VLI
Mar 27 12:22:30 ifs3 kernel: EFLAGS: 00010246   (2.6.16-rc5-sara3 #1)
Mar 27 12:22:30 ifs3 kernel: EIP is at do_dlm_lock+0x138/0x152 [lock_dlm]
Mar 27 12:22:30 ifs3 kernel: eax: 00000004   ebx: ffffffea   ecx: 
0000b152   edx: 00000246
Mar 27 12:22:30 ifs3 kernel: esi: eb12b4c0   edi: f5f8f180   ebp: 
f62edb00   esp: f62edacc
Mar 27 12:22:30 ifs3 kernel: ds: 007b   es: 007b   ss: 0068
Mar 27 12:22:30 ifs3 kernel: Process nfsd (pid: 6278, 
threadinfo=f62ec000 task=f62c1030)
Mar 27 12:22:30 ifs3 kernel: Stack: <0>f8aa9d89 00000001 20202020 
32202020 20202020 20202020 32363820 33646565
Mar 27 12:22:30 ifs3 kernel: f62e0018 70baf030 eb12b4c0 00000008 
f8b57000 f62edb34 f8aa57b9 eb12b4c0
Mar 27 12:22:30 ifs3 kernel: 00000000 eb12b4c0 00000008 ffffffff 
00000003 00000003 eb12b4c0 00000000
Mar 27 12:22:30 ifs3 kernel: Call Trace:
Mar 27 12:22:30 ifs3 kernel: [<c0103599>] show_stack_log_lvl+0xad/0xb5
Mar 27 12:22:30 ifs3 kernel: [<c01036db>] show_registers+0x10d/0x176
Mar 27 12:22:30 ifs3 kernel: [<c01038ad>] die+0xf2/0x16d
Mar 27 12:22:30 ifs3 kernel: [<c0103996>] do_trap+0x6e/0x8a
Mar 27 12:22:30 ifs3 kernel: [<c0103bed>] do_invalid_op+0x90/0x97
Mar 27 12:22:30 ifs3 kernel: [<c010322f>] error_code+0x4f/0x54
Mar 27 12:22:30 ifs3 kernel: [<f8aa57b9>] lm_dlm_lock+0x4f/0x5b [lock_dlm]
Mar 27 12:22:30 ifs3 kernel: [<f899a776>] gfs_lm_lock+0x32/0x4c [gfs]
Mar 27 12:22:30 ifs3 kernel: [<f89909f8>] gfs_glock_xmote_th+0x125/0x161 
[gfs]
Mar 27 12:22:30 ifs3 kernel: [<f8993625>] inode_go_xmote_th+0x20/0x25 [gfs]
Mar 27 12:22:30 ifs3 kernel: [<f89900fd>] rq_promote+0xb3/0x136 [gfs]
Mar 27 12:22:30 ifs3 kernel: [<f89902e3>] run_queue+0x85/0xbb [gfs]
Mar 27 12:22:30 ifs3 kernel: [<f8991281>] gfs_glock_nq+0xce/0x119 [gfs]
Mar 27 12:22:30 ifs3 kernel: [<f89917f0>] gfs_glock_nq_init+0x1d/0x36 [gfs]
Mar 27 12:22:30 ifs3 kernel: [<f8991858>] gfs_glock_nq_num+0x37/0x7f [gfs]
Mar 27 12:22:30 ifs3 kernel: [<f89a17d1>] gfs_get_dentry+0xa9/0x29c [gfs]
Mar 27 12:22:30 ifs3 kernel: [<c01915bc>] find_exported_dentry+0x2f/0x4e1
Mar 27 12:22:30 ifs3 kernel: [<f89a1461>] gfs_decode_fh+0xc1/0xc9 [gfs]
Mar 27 12:22:30 ifs3 kernel: [<c0193c37>] fh_verify+0x35f/0x4db
Mar 27 12:22:30 ifs3 kernel: [<c0194d5c>] nfsd_access+0x27/0xe5
Mar 27 12:22:30 ifs3 kernel: [<c019ab4f>] nfsd3_proc_access+0x95/0xa2
Mar 27 12:22:30 ifs3 kernel: [<c019229f>] nfsd_dispatch+0xbe/0x17f
Mar 27 12:22:30 ifs3 kernel: [<c02e0a52>] svc_process+0x381/0x5c7
Mar 27 12:22:30 ifs3 kernel: [<c019208c>] nfsd+0x18d/0x2e2
Mar 27 12:22:30 ifs3 kernel: [<c0100ed9>] kernel_thread_helper+0x5/0xb
Mar 27 12:22:30 ifs3 kernel: Code: 26 50 0f bf 46 24 50 53 ff 76 08 ff 
76 04 ff 76 0c ff 77 18 68 e0 a6 aa f8 e8 f2 17 67 c7 83 c4 38 68 89 9d 
aa f8 e8 e5 17 67 c7 <0f> 0b ac 01 c0 a4 aa f8 68 a0 a5 aa f8 e8 99 10 
67 c7 8d 65 f4


-- 
--
********************************************************************
*                                                                  *
*  Bas van der Vlies                     e-mail: basv at sara.nl      *
*  SARA - Academic Computing Services    phone:  +31 20 592 8012   *
*  Kruislaan 415                         fax:    +31 20 6683167    *
*  1098 SJ Amsterdam                                               *
*                                                                  *
********************************************************************


From jan.kudjak at snt.sk  Mon Mar 27 13:37:02 2006
From: jan.kudjak at snt.sk (Kudjak Jan)
Date: Mon, 27 Mar 2006 15:37:02 +0200
Subject: [Linux-cluster] 4 node gfs cluster, quorum needs 3
Message-ID: <875696A5F2525C41B27CD923700BAFC93B1BDE@skmail01.central.snt.eu>


Hello,

I have at this time 4 node gfs cluster using RLM.
Two nodes (node1, node2) have mounted gfs filesystem and other two (node3, node4) are working as loadballancers and as redundant lock servers (no gfs fs mounted on node3 or node4).
(i am using GFS-6.0.2.20-2, GFS-modules-smp-6.0.2.20-2, kernel-smp-2.4.21-32.0.1.EL)

So when all nodes are up there is:

quorum_has = 4
quorum_needs = 3

I tried to stop lock_gulm on node3 and node4.
Although the cluster was in state

quorum_has = 2
quorum_needs = 3

the gfs filesystem on node1 or node2 still remained read/write accessible. 
Is this behaviour correct ?

----

nodes   quorum_needs	quorum_has	filesystem

3		>=2		2		r/w

4		>=3		2		r/w ?????

5		>=3		3		r/w


Can anybody help me out to correct or even extend the table above?
Where is the truth ? :) or have I misunderstood something ?

Thanks a lot for your answers.

--
J?n Kudj?k
UNIX/Linux Consultant


From Matthew.Patton.ctr at osd.mil  Mon Mar 27 15:13:17 2006
From: Matthew.Patton.ctr at osd.mil (Patton, Matthew F, CTR, OSD-PA&E)
Date: Mon, 27 Mar 2006 10:13:17 -0500
Subject: [Linux-cluster] Shutdown/startup
Message-ID: <D8063DF686D10247B0A49D01271285690CE91D3E@osdn06.osd.mil>

Classification: UNCLASSIFIED

> On Thu, Mar 23, 2006 at 03:12:29PM -0600, Paul Tader wrote:
...
> To shutdown, you umount all fs's on all nodes, fence_tool leave on
> all nodes, then leave the cluster on all.  To leave the
> cluster without quorum getting in the way use 'cman_tool leave remove'.

So I naively ask, why not make "cman_tool leave remove" part of the standard
system shutdown scripts? Having to manually futz with cluster nodes
shouldn't be necessary.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060327/d9cfd814/attachment.htm>

From lhh at redhat.com  Mon Mar 27 18:54:07 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 27 Mar 2006 13:54:07 -0500
Subject: [Linux-cluster] CS4/ "Monitoring the health of remote power
	switch" ?
In-Reply-To: <4427AFA3.7020604@bull.net>
References: <4427AFA3.7020604@bull.net>
Message-ID: <1143485647.4749.0.camel@ayanami.boston.redhat.com>

On Mon, 2006-03-27 at 11:25 +0200, Alain Moulle wrote:

> "To monitor the health of the other nodes,
> each node monitors the health of remote
> power switch if any ,etc."

It actually does not monitor the power switches.

-- Lon


From lhh at redhat.com  Mon Mar 27 18:05:04 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 27 Mar 2006 13:05:04 -0500
Subject: [Linux-cluster] Cluster Suite v3 IPMI fencing time
In-Reply-To: <20060326154228.M36405@webbertek.com.br>
References: <20060326154228.M36405@webbertek.com.br>
Message-ID: <1143482704.4749.8.camel@ayanami.boston.redhat.com>

On Sun, 2006-03-26 at 12:48 -0300, Celso K. Webber wrote:

> Mar 25 22:52:48 member1 cluquorumd[4112]: <warning> --> Commencing STONITH <--
> Mar 25 22:55:00 member1 cluquorumd[4112]: <notice> STONITH: member2 has been
> fenced!
> Mar 25 22:57:11 member1 cluquorumd[4112]: <notice> STONITH: member2 is no
> longer fenced off.
> 
> 
> Questions:
> 1. member1 does not take over the services until the fencing process
> completes, is that correct?

Yes.

> 2. if it is a correct behaviour that member1 waits for the fencing completion
> to take over services, how can I reduce the total fencing time? I didn't find
> any parameters under the cludb man page for this.

Turn off ACPI on both machines, or else IPMI will try a "graceful"
shutdown, which is exactly what you do *not* want.  If the machine is
dead or hung, it may not respond to the ACPI request at all.

Change the kernel command line to include "acpi=off" and chkconfig --del
acpid

Note that with IPMI, you should use the NIC with IPMI for only IPMI
traffic (or at least separate IPMI traffic from the cluster traffic), or
it can become a single point of failure.

-- Lon	


From lhh at redhat.com  Mon Mar 27 18:05:58 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 27 Mar 2006 13:05:58 -0500
Subject: [Linux-cluster] Cluster Suite v3 mounting NFS volumes
In-Reply-To: <20060326214429.M85634@webbertek.com.br>
References: <20060326214429.M85634@webbertek.com.br>
Message-ID: <1143482758.4749.10.camel@ayanami.boston.redhat.com>

On Sun, 2006-03-26 at 18:49 -0300, Celso K. Webber wrote:

> Question: is there a reason for CSv3 not supporting a configuration where NFS
> devices mount/umount are controlled by the Cluster?

Yes, that's the case.  RHCS4 has this functionality, though.

> For now I've included mount/umount commands on the service script.

That should work fine.

-- Lon


From lhh at redhat.com  Mon Mar 27 18:24:39 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 27 Mar 2006 13:24:39 -0500
Subject: [Linux-cluster] Cluster Suite v3 quorum devices over iSCSI
In-Reply-To: <20060326152411.M4423@webbertek.com.br>
References: <20060326152411.M4423@webbertek.com.br>
Message-ID: <1143483879.4749.28.camel@ayanami.boston.redhat.com>

On Sun, 2006-03-26 at 12:41 -0300, Celso K. Webber wrote:

> We had many problems with quorum formation under this environment. If we
> booted both servers at the same time, quorum would be formed successfully. If
> after that we rebooted one of the servers, it would not rejoin the cluster
> until we stopped clumanager on both sides and restarted.

Are you using disk-based or network-based tiebreaker?

> https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=136553

I was never convinced that this was an iSCSI / RHCS3 interaction
problem, for what it's worth ;)


> Question: is there a limitation or need to use broadcast heartbeating under an
> iSCSI environment? Why multicast heartbeating didn't work here?

There shouldn't be, but if there is, it's due to a network / iSCSI
interaction problem that I do not really understand ;)

I wrote some stuff here about RHCS3 + network configuration here, so I
won't retype it.  Take a look and see if anything rings a bell with your
particular hardware configuration:

http://people.redhat.com/lhh/network-stuff.html

-- Lon


From ptader at fnal.gov  Mon Mar 27 18:32:31 2006
From: ptader at fnal.gov (Paul Tader)
Date: Mon, 27 Mar 2006 12:32:31 -0600
Subject: [Linux-cluster] Shutdown/startup
In-Reply-To: <20060324172718.GC24180@redhat.com>
References: <44230F3D.2050409@fnal.gov> <20060324172718.GC24180@redhat.com>
Message-ID: <44282FBF.9000806@fnal.gov>

David Teigland wrote:
> On Thu, Mar 23, 2006 at 03:12:29PM -0600, Paul Tader wrote:
> 
>>So how does one properly shutdown and/or start up a GFS 6.1 cluster. 
>>Documentation on the procedures are surprisingly sparse.  For example, 
>>what I've experienced during shutdown are messages like:
>>
>>CMAN: sendmsg failed: -101
>>CMAN: we are leaving the cluster.
>>
>>from the node shutting down while the the other nodes report that this 
>>node needs to be fenced off.  What?!  Fenced during a controlled 
>>shutdown?  Is that correct?  During a complete cluster shutdown, of 
>>course quorum, at some point is lost, freezing the nodes that have not 
>>completely shutdown forcing a hard reset on those nodes.  I'm probably 
>>missing something trivial here, but this doesn't seem right.
>>
>>Start up also suffers from the "quorum" problem, but it seems to 
>>recovery once enough nodes are up.
> 
> 
> You read the notes here about startup/shutdown?
> http://sources.redhat.com/cluster/doc/usage.txt
> 
> To shutdown, you umount all fs's on all nodes, fence_tool leave on
> all nodes, then leave the cluster on all.  To leave the cluster without
> quorum getting in the way use 'cman_tool leave remove'.
> 
> To start up, have all nodes join the cluster with cman_tool join,
> then have all nodes join the fence domain with fence_tool join,
> then start other things.
> 
> Dave
> 
> 

Thanks Dave.  The process detailed in this document brought the nodes 
down gracefully.

[root at node]# umount /gfs_volume
[root at node]# vgchange -aln
[root at node]# killall clvmd
[root at node]# fence_tool leave
[root at node]# cman_tool leave -w
[root at node]# killall ccsd
[root at node]# (reboot/shutdown)


I'm going to incorporate this in to a revised shutdown init script and 
post what work.

Paul


-- 
===========================================================================
Paul Tader  <ptader at fnal.gov> Computing Div/CSS Dept
Fermi National Accelerator Lab; PO Box 500 MS 369 Batavia, IL 60510-0500


From treddy at rallydev.com  Mon Mar 27 18:53:40 2006
From: treddy at rallydev.com (Tarun Reddy)
Date: Mon, 27 Mar 2006 11:53:40 -0700
Subject: [Linux-cluster] Using LABEL in "Device Special File" as a resource
In-Reply-To: <20060327010355.M20339@webbertek.com.br>
References: <1FC9ADB0-0E74-4BDD-87FF-51E2FF1CE3D0@rallydev.com>
	<1143219714.25515.382.camel@ayanami.boston.redhat.com>
	<6E3B700A-A8B7-4C1D-BE71-095E2A8546CF@rallydev.com>
	<20060327010355.M20339@webbertek.com.br>
Message-ID: <3BE36D30-9767-429D-B352-BB6AF1608188@rallydev.com>

Just FYI... changing cluster.xml by hand works. redhat-config-cluster  
is simply not allowing the device in it's rules checking. That would  
be nice to change the code that checks this. I guess I'll file a  
bugzilla report for it.

And as long as you don't change that device entry, (anything on it),  
the rest of redhat-config-cluster works. Also in cluster.xml, use  
LABEL=<mountpoint> with no quotes. I assume if you have a space  
(IMHO, a bad practice), you could escape it with a \.

Thanks for everyone's input.
Tarun

On Mar 26, 2006, at 6:11 PM, Celso K. Webber wrote:

> Hello Tarun,
>
> I had to solve this problem once under RHEL AS v2.1 (first version  
> of Cluster
> Manager), and it worked perfectly. By that time my problem was that  
> the
> machines had different internal disk configurations, so they had  
> differente
> "/dev/sdXX" names.
>
> Under Cluster Suite v3 I solved this problem by pointing the real  
> devices to
> raw devices, and /dev/raw/XXX after that. I don't know if this has
> implications on performance, but the systems worked apparently well.
>
> I had once some problems with iSCSI on a IBM Blade Center where the  
> iSCSI
> volumes changed their /dev/XXX if the machine booted with the USB  
> floppy/cdrom
> connected (iSCSI dev was /dev/sdb with no USB floppy, but it was  
> recognized as
> /dev/sdc if the USB floppy was connected when the machine booted).
> Fortunately, it was a regular filesystem (no cluster) and I could use
> LABEL=XXX entries in /etc/fstab.
>
> By this time I cannot figure out how could you solve your problem.  
> I imagine
> that it should not be difficult to write a Perl or Shell script  
> that ran just
> before the clumanager started, and issuing commands such as "iscsi- 
> ls" it
> could map a corresponding raw device, or even changing your /etc/ 
> cluster.xml
> on the fly and writing it to the quorum using a "cludb" command.
>
> Sorry for not pointing out a better solution, ok?
>
> Regards,
>
> Celso.
>
> On Sun, 26 Mar 2006 18:02:46 -0700, Tarun Reddy wrote
>> Hmmm....
>>
>> I guess I never tried to edit cluster.xml by hand. On my system at
>> least, redhat-config-cluster does not allow the value. I'll try
>> editing cluster.xml tomorrow and see if that works. Could  you let
>> me  know if you can get redhat-config-cluster to accept the LABEL
>> designation? Could it possibly be that my LABEL is "/www" (no  
>> quotes)?
>>
>> Thanks,
>> Tarun
>>
>> On Mar 24, 2006, at 10:01 AM, Lon Hohberger wrote:
>>
>>> On Thu, 2006-03-23 at 14:50 -0700, Tarun Reddy wrote:
>>>> -----BEGIN PGP SIGNED MESSAGE-----
>>>> Hash: SHA1
>>>>
>>>> Under RHEL3 clustering, I'm trying to mount a partition using the
>>>> LABEL since iscsi bindings have a tendency to switch around.
>>>>
>>>> Using Google to search the archives, I found this message:
>>>> http://www.redhat.com/archives/linux-cluster/2005-October/
>>>> msg00129.html
>>>>
>>>> but this doesn't seem to work with Redhat Clustering on RHEL3. Is
>>>> there another way of doing this?
>>>
>>> It really should work; it works on my RHEL3 system:
>>>
>>> [root at magenta root]# e2label /dev/sdd3
>>>
>>> [root at magenta root]# e2label /dev/sdd3 futon
>>> [root at magenta root]# e2label /dev/sdd3
>>> futon
>>> [root at magenta root]# vi /etc/cluster.xml
>>> ...
>>>     <service checkinterval="10" id="0" name="foo">
>>>       <device id="0" name="LABEL=futon" sharename="">
>>>                <!--   ^- name used to be /dev/sdd3 -->
>>>         <mount forceunmount="yes" fstype="ext3" mountpoint="/mnt/1"
>>> options=""/>
>>>       </device>
>>>       ...
>>>     </service>
>>> ...
>>>
>>> Starting the service and running the mount command yielded this  
>>> on my
>>> machine:
>>>
>>> [root at magenta root]# mount
>>> ...
>>> /dev/sdd3 on /mnt/1 type ext3 (rw)
>>> ...
>>>
>>> Stopping the service causes the mount to go away correctly on my
>>> system.
>>> The label->device resolution has not changed since before the
>>> release of
>>> RHCS3, so there must be something missing that we don't know about.
>>> Note that "LABEL" is case-sensitive, and must be all capital  
>>> letters,
>>> and that no spaces are allowed.
>>>
>>> -- Lon
>>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
> --
> Celso K. Webber <celso at webbertek.com.br>
> (41) 3284-3035
> (41) 8813-1919
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From celso at webbertek.com.br  Mon Mar 27 19:54:37 2006
From: celso at webbertek.com.br (Celso K. Webber)
Date: Mon, 27 Mar 2006 16:54:37 -0300
Subject: [Linux-cluster] Cluster Suite v3 IPMI fencing time
In-Reply-To: <1143482704.4749.8.camel@ayanami.boston.redhat.com>
References: <20060326154228.M36405@webbertek.com.br>
	<1143482704.4749.8.camel@ayanami.boston.redhat.com>
Message-ID: <20060327191930.M95262@webbertek.com.br>

Lon,

Thank you very much for your response, please see below:

On Mon, 27 Mar 2006 13:05:04 -0500, Lon Hohberger wrote
...
> > 2. if it is a correct behaviour that member1 waits for the fencing completion
> > to take over services, how can I reduce the total fencing time? I didn't find
> > any parameters under the cludb man page for this.
> 
> Turn off ACPI on both machines, or else IPMI will try a "graceful"
> shutdown, which is exactly what you do *not* want.  If the machine is
> dead or hung, it may not respond to the ACPI request at all.

Yes, the ACPI is already off.

If I issue a "ipmitool -I lan ... chassis power cycle" command, with
clumanager turned off, the machine resets within a few seconds, with no
graceful shutdown (it does a hard power off/on).

While the fence process is underway, I can go to a command line and issue a
"ipmitool -I lan ... power status" command, so I think there would be no
difficulties with IPMI status checking for the ipmilan stonith module.

If, on the other hand, I issue a "clufence -r <node>", it turns off the
machine immediately but waits almost 2 minutes to turn it on again, and after
that the clufence command waits for another 2 minutes until it returns to the
shell prompt.

So I'm guessing this has something to do with IBM's implementation of IPMI on
these xSeries 366 machines. Under Dell PowerEdges the fence process was quite
quick for me.

By the way, both the BIOS and the BMC firmware codes were updated to the
latest release before setting up all this enrionment.

> Note that with IPMI, you should use the NIC with IPMI for only IPMI
> traffic (or at least separate IPMI traffic from the cluster traffic),
>  or it can become a single point of failure.

Humm ... interesting, maybe I'll have to change my architecture a little bit,
although I'm issuing IPMI commnds through the least used interface (IPMI seems
to work on both onboard interfaces of this server).

If someone has some similar experiences with this long delay with fencing,
please let me know.

Regards,

Celso.


From celso at webbertek.com.br  Mon Mar 27 20:05:57 2006
From: celso at webbertek.com.br (Celso K. Webber)
Date: Mon, 27 Mar 2006 17:05:57 -0300
Subject: [Linux-cluster] Cluster Suite v3 quorum devices over iSCSI
In-Reply-To: <1143483879.4749.28.camel@ayanami.boston.redhat.com>
References: <20060326152411.M4423@webbertek.com.br>
	<1143483879.4749.28.camel@ayanami.boston.redhat.com>
Message-ID: <20060327195534.M91236@webbertek.com.br>

Hello again,

On Mon, 27 Mar 2006 13:24:39 -0500, Lon Hohberger wrote
> On Sun, 2006-03-26 at 12:41 -0300, Celso K. Webber wrote:
> 
> > We had many problems with quorum formation under this environment. If we
> > booted both servers at the same time, quorum would be formed successfully. If
> > after that we rebooted one of the servers, it would not rejoin the cluster
> > until we stopped clumanager on both sides and restarted.
> 
> Are you using disk-based or network-based tiebreaker?
> 
I was using network-based tiebraker, and changing to broadcast heartbeating
solved the problem.

After doing some tests, I changed do disk-based heartbeating, although the GUI
warned me that IP-based heartbeating was recommended for a 2-node
configuration (I have 2 nodes).

> > https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=136553
> 
> I was never convinced that this was an iSCSI / RHCS3 interaction
> problem, for what it's worth ;)
> 

Yes, we are also not sure about these supposed iSCSI / CS3 problem, but the
only references we encountered were those from the Bugzilla.

> 
> http://people.redhat.com/lhh/network-stuff.html
> 
I'll surely take a loot at it! Thanks!

By the way, tell me if you think we should open a case on Red Hat support for
this.

Best regards,

Celso.


From lhh at redhat.com  Mon Mar 27 20:39:44 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 27 Mar 2006 15:39:44 -0500
Subject: [Linux-cluster] Cluster Suite v3 IPMI fencing time
In-Reply-To: <20060327191930.M95262@webbertek.com.br>
References: <20060326154228.M36405@webbertek.com.br>
	<1143482704.4749.8.camel@ayanami.boston.redhat.com>
	<20060327191930.M95262@webbertek.com.br>
Message-ID: <1143491984.4749.121.camel@ayanami.boston.redhat.com>

On Mon, 2006-03-27 at 16:54 -0300, Celso K. Webber wrote:
> Lon,
> 
> Thank you very much for your response, please see below:

> If, on the other hand, I issue a "clufence -r <node>", it turns off the
> machine immediately but waits almost 2 minutes to turn it on again, and after
> that the clufence command waits for another 2 minutes until it returns to the
> shell prompt.

That sounds like a long time, but I don't have those machines, so I
don't know what might be causing the delay.

-- Lon


From lhh at redhat.com  Mon Mar 27 20:41:57 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 27 Mar 2006 15:41:57 -0500
Subject: [Linux-cluster] Re: More CS4 fencing fun
In-Reply-To: <6988786F-0794-4481-B6D1-95B51B056E73@kinetikon.com>
References: <6988786F-0794-4481-B6D1-95B51B056E73@kinetikon.com>
Message-ID: <1143492117.4749.126.camel@ayanami.boston.redhat.com>

On Fri, 2006-03-24 at 11:06 +0100, Matteo Catanese wrote:
> Hi Lon,
> you mail is "music" for my ears :D
> 
> I will try your /sbin/fence_dontcare immediately.

Best wishes!  If it breaks, all of the pieces are yours to keep.

> i dont want to be interrupted in weekends when i play my  
> favourite video game (WOW) just because ONE component broke and all  
> cluster hung :-)

Great game.


> Sure our hardware configuration can sustain also some multi-point  
> failure, but NSPOF is our mail goal

Remember that a redundant remote power switch doesn't obviate the need
for iLO.  iLO is *much* more than a power button.  It has remote console
abilities and other management stuff -- all which is very useful for
system administration and maintenance.

In my opinion, the power-button feature of iLO is the *least* useful
part.  


> In my case WTI should be useful only in case of multiple failure, for  
> example both network switch fails so heartbeat fails and ilo fails  
> too  and with /sbin/fence_dontcare i will have corruption. Is this  
> correct ?

With the dontcare hack, you can have corruption if the node stops
heartbeating (for any reason) and iLO does not respond at the time
fence_ilo is called.

Examples - Live-hang of the node with the iLO disconnected, too much
system load to get out heartbeats, network congestion/saturation, bad
cables, routing problems, internal problem in the switch, ARP storms,
power surges, iLO bugs/failure, too many people logged in to iLO, etc.

I do not know all of the possible the failure case(s).  That is why the
last cluster I set up has a remote power controller, even though all of
the nodes individually have iLO as well.  Call me paranoid if you want,
but please, think about these two points:

(1) Uptime with corrupt data does not equal availability

... and, more importantly ...

(2) It *really* sucks to have to restore from backup when you could be
playing WoW...


> I will need a supplemental NIC for every server to connect to WTI,  

Actually, it should be on the same network as the cluster uses for
communications, especially in two-node CMAN/DLM clusters; check out:

http://people.redhat.com/teigland.sca.pdf

-- Lon


From carlopmart at gmail.com  Tue Mar 28 09:55:46 2006
From: carlopmart at gmail.com (carlopmart)
Date: Tue, 28 Mar 2006 11:55:46 +0200
Subject: [Linux-cluster] About GFS and OCFS2
Message-ID: <44290822.30602@gmail.com>

Hi all,

  I have setup CS 4 on two nodes under  VMWare Server. I would like to 
use this cluster for web and email services. Which one of these 
filesystems recommends me to put on a shared storage: GFS or OCFS2??

  My email traffic and volume is very high ...

Many thanks
-- 
CL Martinez
carlopmart {at} gmail {d0t} com


From Frank.Weyns at ordina.nl  Tue Mar 28 10:40:53 2006
From: Frank.Weyns at ordina.nl (Weyns, Frank)
Date: Tue, 28 Mar 2006 12:40:53 +0200
Subject: [Linux-cluster] RHCS 3:   A NFS only solution possible ?
References: <6988786F-0794-4481-B6D1-95B51B056E73@kinetikon.com>
	<1143492117.4749.126.camel@ayanami.boston.redhat.com>
Message-ID: <DAF5BA614A38C84A99B3167DFB341A02308A83@BA12-0013.work.local>

Hi ,
 
With RHCS 4 it is possible to build a cluster using as shared storage NFS NetApp filers. It does not need a  quorum disk because of the fencing devices.....  But .... 
 
Because of an old application I need to build this cluster on RHEL 3u5 and RHCS 3 ... this means raw shared storage.
(Or do i get confused with mc service guard ?)
 
So can I create a raw device on NFS ? or is there another way around the quorum?/raw disks ?
 
Any other ideas or hints ?
 
Frank  (a) Weyns (.) net
Disclaimer

Dit bericht met eventuele bijlagen is vertrouwelijk en uitsluitend bestemd voor de geadresseerde. Indien u niet de bedoelde ontvanger bent, wordt u verzocht de afzender te waarschuwen en dit bericht met eventuele bijlagen direct te verwijderen en/of te vernietigen. Het is niet toegestaan dit bericht en eventuele bijlagen te vermenigvuldigen, door te sturen, openbaar te maken, op te slaan of op andere wijze te gebruiken. Ordina N.V. en/of haar groepsmaatschappijen accepteren geen verantwoordelijkheid of aansprakelijkheid voor schade die voortvloeit uit de inhoud en/of de verzending van dit bericht.

This e-mail and any attachments are confidential and is solely intended for the addressee only. If you are not the intended recipient, please notify the sender and delete and/or destroy this message and any attachments immediately. It is prohibited to copy, to distribute, to disclose or to use this e-mail and any attachments in any other way. Ordina N.V. and/or its group companies do not accept any responsibility nor liability for any damage resulting from the content of and/or the transmission of this message.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 3437 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060328/b73fe5bf/attachment.bin>

From m.catanese at kinetikon.com  Tue Mar 28 11:01:58 2006
From: m.catanese at kinetikon.com (Matteo Catanese)
Date: Tue, 28 Mar 2006 13:01:58 +0200
Subject: [Linux-cluster] Re: More CS4 fencing fun
Message-ID: <B77BEA71-6DD8-4329-8315-D26AD8F8B33F@kinetikon.com>

Ok the WOW blackmail was enough for me :-)

For a number  reasons i cant buy a WTI or whatever.


I have a MSA1000 with dual controller and dual Storageworks fabric  
switches.

Those switches are OEM version of Brocade Silkworm 2x00.

Each one have a management ethernet port.

Can i use those ports as a 2nd (and 3d) fence device ?


Dl380 have qlogic FC adapter.

Vanilla qlogic kernel driver supplies IP over fibre facility,  
allowing the fence_agent to reach the fabric switch
by means of fibre avoiding all mess with ethernet.

http://download.qlogic.com/drivers/35501/README_qla2xxx2-6.htm

 >>The qla2xip driver will create a network-interface binding to each  
IP-capable recognized HBA. Binding entries can be viewed from the  
messages file after the IP driver has loaded:
 >>qla2xip: QLogic IP via Fibre Channel Network Driver
 >>qla2xip: Driver Version 1.0b2, Entry point: e08e5060
 >>qla2xip: Mapping interface fc0 to HBA 210100e08b20a15b
 >>qla2xip: Mapping interface fc1 to HBA 210200e08b40a25b

Unfortunatelly  neither red hat driver nor HP one support this  
facility so i must use the ethernet port.

 > http://people.redhat.com/teigland.sca.pdf

Page not found 404

Matteo


From jstoner at opsource.net  Tue Mar 28 14:33:06 2006
From: jstoner at opsource.net (Jeff Stoner)
Date: Tue, 28 Mar 2006 15:33:06 +0100
Subject: [Linux-cluster] Re: More CS4 fencing fun
Message-ID: <38A48FA2F0103444906AD22E14F1B5A3030D3E5D@mailxchg01.corp.opsource.net>

> -----Original Message-----
>  > http://people.redhat.com/teigland.sca.pdf
> 
> Page not found 404

http://people.redhat.com/teigland/sca.pdf


--Jeff
SME - UNIX
OpSource Inc.

PGP Key ID 0x6CB364CA 


From lhh at redhat.com  Tue Mar 28 14:34:12 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 28 Mar 2006 09:34:12 -0500
Subject: [Linux-cluster] Re: More CS4 fencing fun
In-Reply-To: <B77BEA71-6DD8-4329-8315-D26AD8F8B33F@kinetikon.com>
References: <B77BEA71-6DD8-4329-8315-D26AD8F8B33F@kinetikon.com>
Message-ID: <1143556452.4749.252.camel@ayanami.boston.redhat.com>

On Tue, 2006-03-28 at 13:01 +0200, Matteo Catanese wrote:

> Each one have a management ethernet port.
> 
> Can i use those ports as a 2nd (and 3d) fence device ?

You can.

I have never set up a Brocade as a fence device, but I am told they
work ;)  The documentation should have better details.

Generally, use the FC switches as the second fence *level* for each
node, and you should be golden.  Fenced will only use the second fence
level if the first one fails (i.e. you pull out the power cables/network
cables/whatever, causing iLO to fail) and will equally protect your
data.  That, and, you don't need any new hardware.

Basically, for fabric fencing to correctly work, each node needs to have
all of its fiber ports zoned off at the FC switches in one fence level.

Something like:
    <method name="1">
       <... ilo />
    </method>
    <method name="2">
       <... switch 1 port 1/>
       <... switch 2 port 1/>
    </method>


>  > http://people.redhat.com/teigland.sca.pdf

Sorry, I made a typo...        slash
                                 |
                                 v
http://people.redhat.com/teigland/sca.pdf

-- Lon


From m.catanese at kinetikon.com  Tue Mar 28 15:15:20 2006
From: m.catanese at kinetikon.com (Matteo Catanese)
Date: Tue, 28 Mar 2006 17:15:20 +0200
Subject: [Linux-cluster] Re: More CS4 fencing fun
Message-ID: <44F84B0D-CE45-4810-98E6-4821B9A8ADB4@kinetikon.com>

So i have your blessing with this configuration ???

-------------- next part --------------
A non-text attachment was scrubbed...
Name: cluster.conf
Type: application/octet-stream
Size: 2286 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060328/fcf75c31/attachment.obj>
-------------- next part --------------


Matteo


From cfeist at redhat.com  Tue Mar 28 17:39:40 2006
From: cfeist at redhat.com (Chris Feist)
Date: Tue, 28 Mar 2006 11:39:40 -0600
Subject: [Linux-cluster] GFS 6u7?
In-Reply-To: <FF2CE0D593AEE34B955FEC77BD5AFBE0079E2F@tmaemail.techma.com>
References: <FF2CE0D593AEE34B955FEC77BD5AFBE0079E2F@tmaemail.techma.com>
Message-ID: <442974DC.7050201@redhat.com>

It should be available now.

Kovacs, Corey J. wrote:
> With RHEL3u7 being out for about a week now, I expect GFS 6u7 to be 
> available
> soon as well. Does anyone have any information as to how much longer it 
> will be
> until it's released?
> 
> 
> ------------------------------------------------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster


From lhh at redhat.com  Tue Mar 28 20:01:46 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 28 Mar 2006 15:01:46 -0500
Subject: [Linux-cluster] Re: More CS4 fencing fun
In-Reply-To: <44F84B0D-CE45-4810-98E6-4821B9A8ADB4@kinetikon.com>
References: <44F84B0D-CE45-4810-98E6-4821B9A8ADB4@kinetikon.com>
Message-ID: <1143576106.3759.55.camel@ayanami.boston.redhat.com>

On Tue, 2006-03-28 at 17:15 +0200, Matteo Catanese wrote:
> So i have your blessing with this configuration ???

It sounds NSPF - like you wanted, using existing hardware (e.g. less
components = less likely any one breaks, and no more money spent), and
it should not weaken in multi-point failures like with other hacks (e.g.
fence_dontcare).

The net effect is that you should be able to play WoW slightly more.

-- Lon


From celso at webbertek.com.br  Tue Mar 28 22:55:06 2006
From: celso at webbertek.com.br (Celso K. Webber)
Date: Tue, 28 Mar 2006 19:55:06 -0300
Subject: [Linux-cluster] RHCS 3:   A NFS only solution possible ?
In-Reply-To: <DAF5BA614A38C84A99B3167DFB341A02308A83@BA12-0013.work.local>
References: <6988786F-0794-4481-B6D1-95B51B056E73@kinetikon.com>	<1143492117.4749.126.camel@ayanami.boston.redhat.com>
	<DAF5BA614A38C84A99B3167DFB341A02308A83@BA12-0013.work.local>
Message-ID: <4429BECA.4030409@webbertek.com.br>

Hello Frank,

I've just returned from a customer installation where we had the very 
same problem.

We set up a meeting to decide between going to CSv4 or not because of a 
NetApp storage.

We ended up staying with CSv3, but creating a LUN over iSCSI on the 
NetApp filer to serve as a LUN for our raw device quorum partitions.

As far as I can tell, this solution is supported by Red Hat, I've found 
some december 2004 news on the Net reporting an agreement between Red 
Hat and NetApp to support Cluster Suite over iSCSI. Can someone from Red 
Hat confirm this?

The only difficulties I've had is that I was forced to configure the 
CSv3 to use broadcast heartbeating. Multicast didn't work in any way for 
the quorum devices, with very strange behaviours.

Another try would be using Disk-based tiebraker, but I'll let this issue 
for an expertise. We had a 2-node cluster, and the CS GUI recommends IP 
tiebraker for a 2-node cluster.

By the way, this configuration was also fully supported by NetApp. Their 
consultant was very positive about using this solution over iSCSI and 
even told us that we would receive support from NetApp.

As a technical note, NetApp recommended using the iSCSI initiator 
provided with RHEL, which was our preferd option.

Hope this helps you.

Celso.

PS: this environment is under production today, supporting Oracle 9i. 
The Oracle datafiles are over NFS on NetApp.


Weyns, Frank escreveu:
> Hi ,
>  
> With RHCS 4 it is possible to build a cluster using as shared storage NFS NetApp filers. It does not need a  quorum disk because of the fencing devices.....  But .... 
>  
> Because of an old application I need to build this cluster on RHEL 3u5 and RHCS 3 ... this means raw shared storage.
> (Or do i get confused with mc service guard ?)
>  
> So can I create a raw device on NFS ? or is there another way around the quorum?/raw disks ?
>  
> Any other ideas or hints ?
>  
> Frank  (a) Weyns (.) net
> Disclaimer
> 
> Dit bericht met eventuele bijlagen is vertrouwelijk en uitsluitend bestemd voor de geadresseerde. Indien u niet de bedoelde ontvanger bent, wordt u verzocht de afzender te waarschuwen en dit bericht met eventuele bijlagen direct te verwijderen en/of te vernietigen. Het is niet toegestaan dit bericht en eventuele bijlagen te vermenigvuldigen, door te sturen, openbaar te maken, op te slaan of op andere wijze te gebruiken. Ordina N.V. en/of haar groepsmaatschappijen accepteren geen verantwoordelijkheid of aansprakelijkheid voor schade die voortvloeit uit de inhoud en/of de verzending van dit bericht.
> 
> This e-mail and any attachments are confidential and is solely intended for the addressee only. If you are not the intended recipient, please notify the sender and delete and/or destroy this message and any attachments immediately. It is prohibited to copy, to distribute, to disclose or to use this e-mail and any attachments in any other way. Ordina N.V. and/or its group companies do not accept any responsibility nor liability for any damage resulting from the content of and/or the transmission of this message.
> 
> 
> ------------------------------------------------------------------------
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
*Celso Kopp Webber*

celso at webbertek.com.br <mailto:celso at webbertek.com.br>

*Webbertek - Opensource Knowledge*
(41) 8813-1919
(41) 3284-3035


From teigland at redhat.com  Wed Mar 29 01:26:18 2006
From: teigland at redhat.com (David Teigland)
Date: Tue, 28 Mar 2006 19:26:18 -0600
Subject: [Linux-cluster] Shutdown/startup
In-Reply-To: <D8063DF686D10247B0A49D01271285690CE91D3E@osdn06.osd.mil>
References: <D8063DF686D10247B0A49D01271285690CE91D3E@osdn06.osd.mil>
Message-ID: <20060329012617.GC23060@redhat.com>

> So I naively ask, why not make "cman_tool leave remove" part of the standard
> system shutdown scripts? Having to manually futz with cluster nodes
> shouldn't be necessary.

I believe the cman init script was recently changed to do remove.
In the past there was some concern about the chance of split-brain
that arises when you reduce expected votes, but we don't think that
should be a problem.

Dave


From saju8 at rediffmail.com  Wed Mar 29 04:31:18 2006
From: saju8 at rediffmail.com (saju  john)
Date: 29 Mar 2006 04:31:18 -0000
Subject: [Linux-cluster] Cluster service restarting Locally
Message-ID: <20060329043118.31570.qmail@webmail8.rediffmail.com>

  
Dear Lon,

Could you please tell me whether my problem with "cluster service restarting localy" is fixed in RHCSU7.

Thanks in advance

Saju John

On Mon, 13 Mar 2006 Lon Hohberger wrote :
>On Sat, 2006-03-11 at 10:50 +0000, saju john wrote:
> >
> > Dear Mr. Hohberger,
> >
> > Thanx for the replay.
> >
> > I saw your comments for the problem I reported. ie lock traffic is
> > getting network-starved.
>
>It could be getting I/O starved too, which might explain more given that
>this seems to happen on one node.  When running just one node and the
>service restarts, are the symptoms the same?  Does it report these kinds
>of errors, or are they different?
>
>[quote from your previous mail]
>clusvcmgrd[1388]: <err> Unable to obtain cluster lock: Connection
>timed out
>clulockd[1378]: <warning> Denied A.B.C.D: Broken pipe
>clulockd[1378]: <err> select error: Broken pipe
>[/quote]
>
>If they're different in the one-node case, what are the errors?  Also,
>are there any other errors in the logs?
>
>
> > My assumption is that, the problem is due to some curruption of meta
> > data information writing to the quroum partition ,as both nodes
> > writing to quroum cuncurrently.
>
>I really doubt that.  In the case of lock information, only one node
>writes at a time anyway...
>
> >  May be due to bug in the rawdeivce driver.I am not sure.Then
> > interesting question is ,how the cluster worked all these days(for me
> > around one year with out any major problem).
>
>The odds of random, block-level corruption going undetected when reading
> from the raw partitions is low - between (2^32):1 and (2^96):1 against
>per block, based on internal consistency checks that clumanager
>performs.  My math might be a little off, but it requires two randomly
>correct 32-bit magic numbers and one randomly valid 32-bit CRC, with
>other data incorrect to cause a problem.
>
>Specifically in the lock case, a lock block which passed all of the
>consistency checks but was *actually* corrupt would almost always cause
>clulockd to crash.
>
>Timeout errors mean that clulockd didn't respond to a request in a given
>amount of time, and can be caused by either network saturation or poor
>raw I/O performance to shared storage.  It looks like it's getting to an
>incoming request too late...
>
>
> > Could you pelase consider this also when releasing the RHCS3U7.
>
>If this is a critical issue for you, then you should file a ticket with
>Red Hat Support if you have not already done so:
>
>    http://www.redhat.com/apps/support/
>
>If you think this is a bug, you can also file a Bugzilla, and we will
>get to it when we can:
>
>    http://bugzilla.redhat.com/bugzilla/
>
>-- Lon
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060329/6c0dcd09/attachment.htm>

From fajar at telkom.co.id  Tue Mar 28 02:20:47 2006
From: fajar at telkom.co.id (Fajar A. Nugraha)
Date: Tue, 28 Mar 2006 09:20:47 +0700
Subject: [Linux-cluster] CS4 / devices naming persistency
In-Reply-To: <4427AF1F.5050004@bull.net>
References: <4427AF1F.5050004@bull.net>
Message-ID: <44289D7F.9080507@telkom.co.id>

Alain Moulle wrote:
> Hi
>
> Just for confirmation or/and suggestions:
> I think that there is a problem of device naming unicity
> and persistency (ie. after reboot) when we configure
> some FS as "to be failovered" ressources in CS4.
> In this case, which solution do you preconise :
> use of LABELs on devices ?
> or solution such as udev, etc. ?
>
>   
Why not LVM?

-- 
Fajar


From SCHITWEE at dsta.gov.sg  Wed Mar 29 07:52:22 2006
From: SCHITWEE at dsta.gov.sg (See Chit Wee)
Date: Wed, 29 Mar 2006 15:52:22 +0800
Subject: [Linux-cluster] CS4 Virtual Ip address Service re-starts 
Message-ID: <3F7B430CA7FF344CBF147DB1315CB2F3038CDFF7@DSTA-MSG-02.dsta.gov.sg>

Message Classification: Restricted

I'm using RHEL CS4 and the objective is to create a 2-node Cluster, IP:
10.10.10.35 and IP: 10.10.10.39, using Active/Passive configuration.  
 
For a start, I want to create a Virtual IP address (10.10.10.22) service
that will failover to the other node when one of the nodes is down.
After creation of the service called "mf", I enabled the service in the
Cluster Configuration GUI and it is started okay. In the
"var/log/messages" file, after about approx. 2 minutes, I get the
following messages : 
 
clurgmgrd [7838]:<notice> status on ip "10.10.10.22" returned 1 (generic
error) 
clurgmgrd [7838]:<notice> Stopping service mf 
clurgmgrd [7838]:<notice> Service mf is recovering 
clurgmgrd [7838]:<notice> Recovering failed service mf 
clurgmgrd [7838]:<notice> Service mf started 
 
After about 2 minutes, the whole affair repeats itself again for
eternity. 
 
Is there any script for the virtual ip address that I can access to set
the status ?  


This e-mail is intended only for the named addressee(s) and may contain confidential and/or privileged information. If you are not the named addressee (or have received this e-mail in error), please notify the sender immediately. The unauthorised use, disclosure, distribution or copying of the contents in this e-mail is prohibited. Thank you.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060329/4bc83ea3/attachment.htm>

From costi at recognos.ro  Wed Mar 29 07:01:47 2006
From: costi at recognos.ro (Constantin Daniel VULTUR)
Date: Wed, 29 Mar 2006 10:01:47 +0300
Subject: [Linux-cluster] dlm problem
Message-ID: <442A30DB.3070905@recognos.ro>

Hi guys,

I got into an problem while using GFS with DLM on Fedora Core 4.
Every thing is working fine, the boot process, the mounting of the GFS 
volume, the reading of the journals,
until I try to read files from that volume. And from then the GFS hangs.
I tried to look into the system logs but there I can't find anything.
The only thing that I find it different from my other cluster wich is 
based on Ubuntu, is the content of
/proc/cluster/dlm_debug

On FC4 :
[root at cluster01 ~]# cat /proc/cluster/dlm_debug
6 finished
clvmd move flags 1,0,0 ids 15,15,15
clvmd move flags 0,1,0 ids 15,20,15
clvmd move use event 20
clvmd recover event 20
clvmd add node 5
clvmd total nodes 5
clvmd rebuild resource directory
clvmd rebuilt 1 resources
clvmd purge requests
clvmd purged 0 requests
clvmd mark waiting requests
clvmd marked 0 requests
clvmd recover event 20 done
clvmd move flags 0,0,1 ids 15,20,20
clvmd process held requests
clvmd processed 0 requests
clvmd resend marked requests
clvmd resent 0 requests
clvmd recover event 20 finished
data move flags 1,0,0 ids 16,16,16
data move flags 0,1,0 ids 16,21,16
data move use event 21
data recover event 21
data add node 5
data total nodes 4
data rebuild resource directory
data rebuilt 8 resources
data purge requests
data purged 0 requests
data mark waiting requests
data marked 0 requests
data recover event 21 done
data move flags 0,0,1 ids 16,21,21
data process held requests
data processed 0 requests
data resend marked requests
data resent 0 requests
data recover event 21 finished
[root at cluster01 ~]#


And on Ubuntu:
root at web1:~# cat /proc/cluster/dlm_debug
        5
data (8066) req reply einval 660c01e6 fr 5 r 5        5
data (8066) req reply einval 660c01e6 fr 5 r 5        5
data (8066) req reply einval 65fb0158 fr 5 r 5        5
data (8066) req reply einval 65fb0158 fr 5 r 5        5
data (8066) req reply einval 65fb0158 fr 5 r 5        5
data (8066) req reply einval 65fb0158 fr 5 r 5        5
data (8066) req reply einval 65fb0158 fr 5 r 5        5
data (8066) req reply einval 65fb0158 fr 5 r 5        5
data (8066) req reply einval 65fb0158 fr 5 r 5        5
data (8066) req reply einval 65fb0158 fr 5 r 5        5
data (8066) req reply einval 65fb0158 fr 5 r 5        5
data (8066) req reply einval 65fb0158 fr 5 r 5        5
data (8066) req reply einval 63b4013a fr 5 r 5        5
data (8066) req reply einval 63b4013a fr 5 r 5        5
data send einval to 7
data send einval to 7
data send einval to 4
data send einval to 4
data send einval to 4
root at web1:~#


So I thought that this is and DLM locking problem, so I started to look 
for clues
[root at cluster01 ~]# lsmod | grep dlm
lock_dlm               42084  1
lock_harness            4392  2 lock_dlm,gfs
dlm                   118220  5 lock_dlm
cman                  130208  21 lock_dlm,dlm

[root at cluster01 ~]# dmesg | grep dlm
dlm: no version for "struct_module" found: kernel tainted.
GFS: Trying to join cluster "lock_dlm", "cluster:data"

The packages that are installed are:
[root at cluster01 ~]# rpm -qa | grep dlm
dlm-kernel-2.6.11.5-20050601.152643.FC4.17
dlm-kernheaders-2.6.11.5-20050601.152643.FC4.17
dlm-devel-1.0.0-3
dlm-1.0.0-3


And the machine is
[root at cluster01 ~]# uname -a
Linux cluster01 2.6.14-1.1653_FC4 #1 Tue Dec 13 21:32:09 EST 2005 i686 
i686 i386 GNU/Linux

Right now I don't know for sure which is the problem. I hope that 
someone can explain me what I did wrong.

Thanks in advance for your help.

Costi.


From Frank.Weyns at ordina.nl  Wed Mar 29 08:25:56 2006
From: Frank.Weyns at ordina.nl (Weyns, Frank)
Date: Wed, 29 Mar 2006 10:25:56 +0200
Subject: [Linux-cluster] CS4 Virtual Ip address Service re-starts 
References: <3F7B430CA7FF344CBF147DB1315CB2F3038CDFF7@DSTA-MSG-02.dsta.gov.sg>
Message-ID: <DAF5BA614A38C84A99B3167DFB341A02308A84@BA12-0013.work.local>

From: linux-cluster-bounces at redhat.com on behalf of See Chit Wee
Subject: [Linux-cluster] CS4 Virtual Ip address Service re-starts 

>clurgmgrd [7838]:<notice> status on ip "10.10.10.22" returned 1 (generic error) 
>clurgmgrd [7838]:<notice> Stopping service mf 
>clurgmgrd [7838]:<notice> Service mf is recovering 
>clurgmgrd [7838]:<notice> Recovering failed service mf 
>clurgmgrd [7838]:<notice> Service mf started 
 
Can you manually add the "10.10.10.22" to the  interface with ifconfig ?
(do you use a "normal" eth interface or a "bonded" one ?)
Does the service stay alive after adding the manually (and leaving it out of the cluster service) ?
 

This e-mail is intended only for the named addressee(s) and may contain confidential and/or privileged information. If you are not the named addressee (or have received this e-mail in error), please notify the sender immediately. The unauthorised use, disclosure, distribution or copying of the contents in this e-mail is prohibited. Thank you.

Disclaimer

Dit bericht met eventuele bijlagen is vertrouwelijk en uitsluitend bestemd voor de geadresseerde. Indien u niet de bedoelde ontvanger bent, wordt u verzocht de afzender te waarschuwen en dit bericht met eventuele bijlagen direct te verwijderen en/of te vernietigen. Het is niet toegestaan dit bericht en eventuele bijlagen te vermenigvuldigen, door te sturen, openbaar te maken, op te slaan of op andere wijze te gebruiken. Ordina N.V. en/of haar groepsmaatschappijen accepteren geen verantwoordelijkheid of aansprakelijkheid voor schade die voortvloeit uit de inhoud en/of de verzending van dit bericht.

This e-mail and any attachments are confidential and is solely intended for the addressee only. If you are not the intended recipient, please notify the sender and delete and/or destroy this message and any attachments immediately. It is prohibited to copy, to distribute, to disclose or to use this e-mail and any attachments in any other way. Ordina N.V. and/or its group companies do not accept any responsibility nor liability for any damage resulting from the content of and/or the transmission of this message.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060329/baa33456/attachment.htm>

From marco.lusini at governo.it  Wed Mar 29 10:59:23 2006
From: marco.lusini at governo.it (Marco Lusini)
Date: Wed, 29 Mar 2006 12:59:23 +0200
Subject: [Linux-cluster] CS4 script resource is looping on error
In-Reply-To: <DAF5BA614A38C84A99B3167DFB341A02308A84@BA12-0013.work.local>
Message-ID: <00a801c6531f$d68611c0$04000100@nicchio>


I am using RHEL4 with CS4 to manage a 2 node cluster with services like
HTTPD, MYSQLD, etc...
 
I have successfully set up CS4 using IPMI as a fence device and I am
creating various scripts to control the different resources (as known 
you can't use those in /etc/init.d due to failures in stop-after-stop
scenarios).
 
As a test I have created script that always return error on status, and set
up a service
with the failing script as the single resource.
 
------- snip ------
#!/bin/bash
case "$1" in
    start)
        exit 0
        ;;
    stop)
        exit 0
        ;;
    status)
        exit 1
        ;;
esac
------ snip -------
 
Now my cluster keeps restarting that resource or, if I change the recover
policy, keeps relocating it forever. If I choose disable as recover policy,
the reource get disabled whitout even trying to restart/recover.
 
Is this the expected behaviour?
 
Is there a way to configure CS4 to first try to restart locally, then
to try relocate and finally disable the service?
 
Regards,
 
Marco Lusini
 

_______________________________________________________
Messaggio analizzato e protetto da tecnologia antivirus

Servizio erogato dal sistema informativo della 
Presidenza del Consiglio dei Ministri
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060329/e5cecebb/attachment.htm>

From jerome.castang at adelpha-lan.org  Thu Mar 30 09:52:28 2006
From: jerome.castang at adelpha-lan.org (Castang Jerome)
Date: Thu, 30 Mar 2006 11:52:28 +0200
Subject: [Linux-cluster] iSCSI and GFS
Message-ID: <442BAA5C.5010500@adelpha-lan.org>

Hello,

I have two scenarii:

1/ an iSCSI SAN accessed by an Open-iSCSI initiator (namely t1) wich 
provides iSCSI-target (IET) to two GFS Cluster nodes. (with DLM) (c1 and c2)

2/ an iSCSI SAN accessed by TWO Open-iSCSI initiators (t1 and t2)  wich 
provide iSCSI-target (IET) to two GFS Cluster nodes. (with DLM) (c1 and c2)
Each initiator porvides target to each cluster node. (t1 provides to c1, 
and t2 provides to c2)

t1 and t2 provide iSCSI target to their respective nodes.
t1 and t2 export scsi drives (/dev/sd*) not the logical volumes.
c1 and c2 are cluster members (not t1 and t2).

In the first case, a modification on the filesystem (creating a file, 
for example) by c1 is reflected on c2.
In the second case, a filesystem modification by c1 (respectively c2) is 
not reflected on c2 (respectively c1).

Any idea ?

Thanks,

-- 
Jerome Castang
mail: jcastang at adelpha-lan.org


From dist-list at LEXUM.UMontreal.CA  Thu Mar 30 15:26:09 2006
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Thu, 30 Mar 2006 10:26:09 -0500
Subject: [Linux-cluster] cluster and Apache tomcat
Message-ID: <442BF891.5000600@lexum.umontreal.ca>

Hello everybody,

Next month, we'll have 6 servers connected to a SAN. Those servers will
be in a web farm. And web documents will on a GFS file system. We will
use the Redhat cluster suite.


I'd like to install Tomcat 5 and/or 5.5 on those servers and use its
cluster's possibilities (mod_jk for example).

Some webapps need sessions. But I read that tomcat will handle that.

But How do you handle webapps installation on all servers ? Do you
deploy the war 6 times ? If not is there a tomcat way to replicate these
webapps ?

Can I put the webapps folder on a GFS partition ?

How about tomcat writing to a gfs file system ?

Thanks !


From grimme at atix.de  Thu Mar 30 15:36:15 2006
From: grimme at atix.de (Marc Grimme)
Date: Thu, 30 Mar 2006 17:36:15 +0200
Subject: [Linux-cluster] cluster and Apache tomcat
In-Reply-To: <442BF891.5000600@lexum.umontreal.ca>
References: <442BF891.5000600@lexum.umontreal.ca>
Message-ID: <200603301736.15558.grimme@atix.de>

Hi,
On Thursday 30 March 2006 17:26, FM wrote:
> Hello everybody,
>
> Next month, we'll have 6 servers connected to a SAN. Those servers will
> be in a web farm. And web documents will on a GFS file system. We will
> use the Redhat cluster suite.
>
>
> I'd like to install Tomcat 5 and/or 5.5 on those servers and use its
> cluster's possibilities (mod_jk for example).
>
> Some webapps need sessions. But I read that tomcat will handle that.
>
> But How do you handle webapps installation on all servers ? Do you
> deploy the war 6 times ? If not is there a tomcat way to replicate these
> webapps ?
No you don't need to, just put it on GFS.
>
> Can I put the webapps folder on a GFS partition ?
Dito. No problem as general answer. But it sometimes depends on the 
application (what webapps you have).
>
> How about tomcat writing to a gfs file system ?
You can put anything on GFS without problem when using tomcat. We have 
installed a 16 node cluster shared root, shared data, shared tomcat vars on 
GFS running without problems. There are some specialities but these are 
dependent on applications and stuff.

Hope that helps
Regards Marc.
>
> Thanks !
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster

-- 
Gruss / Regards,

Marc Grimme
Phone: +49-89 121 409-54
http://www.atix.de/               http://www.open-sharedroot.org/

**
ATIX - Ges. fuer Informationstechnologie und Consulting mbH
Einsteinstr. 10 - 85716 Unterschleissheim - Germany


From lhh at redhat.com  Thu Mar 30 15:40:49 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 30 Mar 2006 10:40:49 -0500
Subject: [Linux-cluster] CS4 Virtual Ip address Service re-starts
In-Reply-To: <3F7B430CA7FF344CBF147DB1315CB2F3038CDFF7@DSTA-MSG-02.dsta.gov.sg>
References: <3F7B430CA7FF344CBF147DB1315CB2F3038CDFF7@DSTA-MSG-02.dsta.gov.sg>
Message-ID: <1143733249.3759.67.camel@ayanami.boston.redhat.com>

On Wed, 2006-03-29 at 15:52 +0800, See Chit Wee wrote:
> Message Classification: Restricted
> 
> I'm using RHEL CS4 and the objective is to create a 2-node Cluster,
> IP: 10.10.10.35 and IP: 10.10.10.39, using Active/Passive
> configuration.  
>  
> For a start, I want to create a Virtual IP address (10.10.10.22)
> service that will failover to the other node when one of the nodes is
> down. After creation of the service called "mf", I enabled the service
> in the Cluster Configuration GUI and it is started okay. In the
> "var/log/messages" file, after about approx. 2 minutes, I get the
> following messages : 
>  
> clurgmgrd [7838]:<notice> status on ip "10.10.10.22" returned 1
> (generic error) 
> clurgmgrd [7838]:<notice> Stopping service mf 
> clurgmgrd [7838]:<notice> Service mf is recovering 
> clurgmgrd [7838]:<notice> Recovering failed service mf 
> clurgmgrd [7838]:<notice> Service mf started 
>  
> After about 2 minutes, the whole affair repeats itself again for
> eternity. 
>  
> Is there any script for the virtual ip address that I can access to
> set the status ?  

This is solved in current CVS - and will be solved with U3.  It's got to
do with the router-pinging code being broken.

To get rid of it for now, delete the depth="20" monitor/status actions
from /usr/share/cluster/ip.sh and restart rgmanager.

-- Lon

> 


From celso at webbertek.com.br  Thu Mar 30 15:41:57 2006
From: celso at webbertek.com.br (Celso K. Webber)
Date: Thu, 30 Mar 2006 12:41:57 -0300
Subject: [Linux-cluster] Logwatch script for CS3 and/or CS4
Message-ID: <442BFC45.5050806@webbertek.com.br>

Hello all,

Does anyone has prepared a logwatch script for analyzing the Cluster 
Suite log messages?

I usually setup the environment to separate the cluster messages at 
/var/log/cluster, so I was planning to create a logwatch config file to 
alert any problems through the logwatch facility that comes setup by 
default in RHEL.

If anyone has something already done, I'd be happy to receive any 
suggestions.

Thank you very much.

Celso.


-- 
*Celso Kopp Webber*

celso at webbertek.com.br <mailto:celso at webbertek.com.br>

*Webbertek - Opensource Knowledge*


From lhh at redhat.com  Thu Mar 30 16:00:47 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 30 Mar 2006 11:00:47 -0500
Subject: [Linux-cluster] CS4 script resource is looping on error
In-Reply-To: <00a801c6531f$d68611c0$04000100@nicchio>
References: <00a801c6531f$d68611c0$04000100@nicchio>
Message-ID: <1143734447.3759.88.camel@ayanami.boston.redhat.com>

On Wed, 2006-03-29 at 12:59 +0200, Marco Lusini wrote:
>  
> As a test I have created script that always return error on status,
> and set up a service
> with the failing script as the single resource.
>  
> ------- snip ------
> #!/bin/bash
> case "$1" in
>     start)
>         exit 0
>         ;;
>     stop)
>         exit 0
>         ;;
>     status)
>         exit 1
>         ;;
> esac
> ------ snip -------
>  
> Now my cluster keeps restarting that resource or, if I change the
> recover
> policy, keeps relocating it forever. If I choose disable as recover
> policy,
> the reource get disabled whitout even trying to restart/recover.
>  
> Is this the expected behaviour?

Yes.

> Is there a way to configure CS4 to first try to restart locally, then
> to try relocate and finally disable the service?

If it fails to start after a failure, it is relocated to another node. 

If it fails to start on all nodes, it is placed in the 'stopped' state.

Tracking the history of nodes where a service has started but at some
point failed is slightly difficult since node IDs are not guaranteed to
be static in linux-cluster right now: a node can leave and a different
node can join and take the vacant node ID.  It is, however, possible to
configure static node IDs in cluster.conf, which would help.

If you are worried about this particular state (where the service is
horribly broken and moving around or restarting a lot), you can perform
checks in your script to see if it just started + crashed locally; this
will help.

You can also file a bugzilla / feature request -- it's certainly not
impossible to implement at some point.

-- Lon


From Matthew.Patton.ctr at osd.mil  Thu Mar 30 18:31:01 2006
From: Matthew.Patton.ctr at osd.mil (Patton, Matthew F, CTR, OSD-PA&E)
Date: Thu, 30 Mar 2006 13:31:01 -0500
Subject: [Linux-cluster] CS4 Virtual Ip address Service re-starts
Message-ID: <D8063DF686D10247B0A49D01271285690CE91D48@osdn06.osd.mil>

Classification: UNCLASSIFIED

> [mailto:linux-cluster-bounces at redhat.com]On Behalf Of Lon Hohberger

> This is solved in current CVS - and will be solved with U3.  
> It's got to do with the router-pinging code being broken.

on a tangentially related note. I've noticed that if I define a default
gateway in at least 2 different ifcfg-eth? files since the interfaces are on
different nets, the routing breaks. The hack is to change the default
routing handler script so it doesn't clobber other entries. It would seem
somebody forgot that machines can be dual-homed and each have interface
specific default gateways. If I bind a socket to one or the other interface
the outbound packet needs to follow the default gateway for that interface.

Ok, so this may not be an altogether common scenario but shouldn't it be
allowed? Our heartbeat/backdoor management network is on a different net
from the customer side and each have their own notions of what the default
should be.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060330/6a27ce43/attachment.htm>

From christiannygaard at gmail.com  Thu Mar 30 19:23:08 2006
From: christiannygaard at gmail.com (Christian Nygaard)
Date: Thu, 30 Mar 2006 21:23:08 +0200
Subject: [Linux-cluster] GNBD using non-shared storage and mirroring
Message-ID: <4d4321660603301123v1c484dddte8f28e043a0fac1c@mail.gmail.com>

A quick question is Figure 7. GNBD serving using non-shared storage and
mirroring in the
cluster<http://www.redhat.com/magazine/008jun05/features/gfs/#fig-mirroring>working
today in practise
with the current RHEL 4 GFS or is it a bit of vaporware schematic that will
be supported in an upcomming future release when multipath
GNBD works?

http://www.redhat.com/magazine/008jun05/features/gfs/

Cheers,
Chris
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060330/220f2c45/attachment.htm>

From dist-list at LEXUM.UMontreal.CA  Thu Mar 30 20:21:09 2006
From: dist-list at LEXUM.UMontreal.CA (FM)
Date: Thu, 30 Mar 2006 15:21:09 -0500
Subject: [Linux-cluster] cluster and Apache tomcat
In-Reply-To: <200603301736.15558.grimme@atix.de>
References: <442BF891.5000600@lexum.umontreal.ca>
	<200603301736.15558.grimme@atix.de>
Message-ID: <442C3DB5.7040603@lexum.umontreal.ca>

Thanks for the response,

So in this scenario :
2 apache servers (webroot : /home/web/site1)
2 tomcat servers (tomcat root : /usr/local/tomcat)

Can I put the entire tomcat root on the SAN (GFS)? Java JVM on each server.


Marc Grimme wrote:
> Hi,
> On Thursday 30 March 2006 17:26, FM wrote:
>> Hello everybody,
>>
>> Next month, we'll have 6 servers connected to a SAN. Those servers will
>> be in a web farm. And web documents will on a GFS file system. We will
>> use the Redhat cluster suite.
>>
>>
>> I'd like to install Tomcat 5 and/or 5.5 on those servers and use its
>> cluster's possibilities (mod_jk for example).
>>
>> Some webapps need sessions. But I read that tomcat will handle that.
>>
>> But How do you handle webapps installation on all servers ? Do you
>> deploy the war 6 times ? If not is there a tomcat way to replicate these
>> webapps ?
> No you don't need to, just put it on GFS.
>> Can I put the webapps folder on a GFS partition ?
> Dito. No problem as general answer. But it sometimes depends on the 
> application (what webapps you have).
>> How about tomcat writing to a gfs file system ?
> You can put anything on GFS without problem when using tomcat. We have 
> installed a 16 node cluster shared root, shared data, shared tomcat vars on 
> GFS running without problems. There are some specialities but these are 
> dependent on applications and stuff.
> 
> Hope that helps
> Regards Marc.
>> Thanks !
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From mag.andersen at gmail.com  Thu Mar 30 20:32:44 2006
From: mag.andersen at gmail.com (Magnus Andersen)
Date: Thu, 30 Mar 2006 15:32:44 -0500
Subject: [Linux-cluster] RHEL 3 U7 update ok?
Message-ID: <5ea165840603301232v1308d2c5l4e8e3b43f93d5bce@mail.gmail.com>

Hi All,

I'm running GFS-6.0.2.27-0.1 on a RHEL 3 AS U6 machine.  I'd like to
update to U7, but I am worried if the kernel update will cause issues
with my GFS install.

Any thoughts?

Thanks,

--
Magnus Andersen
Systems Administrator / Oracle DBA
Walker & Associates, Inc.


From SCHITWEE at dsta.gov.sg  Fri Mar 31 00:57:33 2006
From: SCHITWEE at dsta.gov.sg (See Chit Wee)
Date: Fri, 31 Mar 2006 08:57:33 +0800
Subject: [Linux-cluster] RE: CS4 Virtual Ip address Service re-starts 
Message-ID: <3F7B430CA7FF344CBF147DB1315CB2F3039313A1@DSTA-MSG-02.dsta.gov.sg>

Message Classification: Restricted

Message Classification: Restricted
I'm using RHEL CS4 and the objective is to create a 2-node Cluster, IP:
10.10.10.35 and IP: 10.10.10.39, using Active/Passive configuration.  
 
For a start, I want to create a Virtual IP address (10.10.10.22) service
that will failover to the other node when one of the nodes is down.
After creation of the service called "mf", I enabled the service in the
Cluster Configuration GUI and it is started okay. In the
"var/log/messages" file, after about approx. 2 minutes, I get the
following messages : 
 
clurgmgrd [7838]:<notice> status on ip "10.10.10.22" returned 1 (generic
error) 
clurgmgrd [7838]:<notice> Stopping service mf 
clurgmgrd [7838]:<notice> Service mf is recovering 
clurgmgrd [7838]:<notice> Recovering failed service mf 
clurgmgrd [7838]:<notice> Service mf started 
 
After about 2 minutes, the whole affair repeats itself again for
eternity. 
 
Is there any script for the virtual ip address that I can access to set
the status ?  


This is solved in current CVS - and will be solved with U3. It's got to
do with the router-pinging code being broken.

To get rid of it for now, delete the depth="20" monitor/status actions
from /usr/share/cluster/ip.sh and restart rgmanager.

>> Thanks, Lon, it worked as you said. BTW, I've Redhat support
subscription, any idea where to download CS4 U3 ? Regards.

 
-- Lon

> 


This e-mail is intended only for the named addressee(s) and may contain confidential and/or privileged information. If you are not the named addressee (or have received this e-mail in error), please notify the sender immediately. The unauthorised use, disclosure, distribution or copying of the contents in this e-mail is prohibited. Thank you.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060331/d81e6bc6/attachment.htm>

From Alain.Moulle at bull.net  Fri Mar 31 06:56:56 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Fri, 31 Mar 2006 08:56:56 +0200
Subject: [Linux-cluster] CS4 problem "Missed too many heartbeats"
Message-ID: <442CD2B8.2080203@bull.net>

Hi

I have a configuration where the following test
gives :

1. start CS4 on both nodes of a HA pair
2. start the service (doing just echo in a file) on node1
3. poweroff -f node1
   ==> so the service is migrating on node2 successfully
4. when node1 is up again, after verification of ping on
   heartbeat interface,  I start again CS4 and
   on "Starting fence domain:" , it remains stalled

In the syslog of node2 , I can see :

Mar 31 08:45:26 s_kernel at yack1 kernel: CMAN: node yack0 rejoining
Mar 31 08:45:56 s_kernel at yack1 kernel: CMAN: removing node yack0 from the
cluster : Missed too many heartbeats
Mar 31 08:45:56 s_sys at yack1 fenced[8855]: yack0 not a cluster member after 0 sec
post_fail_delay
Mar 31 08:45:56 s_sys at yack1 fenced[8855]: fencing node "yack0"
Mar 31 08:46:21 s_sys at yack1 fenced[8855]: fence "yack0" success

Any idea on this problem ?

Thanks
Alain


From placid at adelpha-lan.org  Fri Mar 31 07:01:04 2006
From: placid at adelpha-lan.org (Castang Jerome)
Date: Fri, 31 Mar 2006 09:01:04 +0200
Subject: [Linux-cluster] CS4 problem "Missed too many heartbeats"
In-Reply-To: <442CD2B8.2080203@bull.net>
References: <442CD2B8.2080203@bull.net>
Message-ID: <442CD3B0.2030802@adelpha-lan.org>

Alain Moulle a ?crit :
> Hi
>
> I have a configuration where the following test
> gives :
>
> 1. start CS4 on both nodes of a HA pair
> 2. start the service (doing just echo in a file) on node1
> 3. poweroff -f node1
>    ==> so the service is migrating on node2 successfully
> 4. when node1 is up again, after verification of ping on
>    heartbeat interface,  I start again CS4 and
>    on "Starting fence domain:" , it remains stalled
>
> In the syslog of node2 , I can see :
>
> Mar 31 08:45:26 s_kernel at yack1 kernel: CMAN: node yack0 rejoining
> Mar 31 08:45:56 s_kernel at yack1 kernel: CMAN: removing node yack0 from the
> cluster : Missed too many heartbeats
> Mar 31 08:45:56 s_sys at yack1 fenced[8855]: yack0 not a cluster member after 0 sec
> post_fail_delay
> Mar 31 08:45:56 s_sys at yack1 fenced[8855]: fencing node "yack0"
> Mar 31 08:46:21 s_sys at yack1 fenced[8855]: fence "yack0" success
>
> Any idea on this problem ?
>
> Thanks
> Alain
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>   
I'm not a Cluster King ,
but I had this probleme while running iptables. (looks like something 
wrong in iptables definitions)
I stopped it, and everythings works...
but i don't now how to modify iptables, to make it good.

Hope it helps a little :)

-- 
Jerome CASTANG
Tel: 06.85.74.33.02
mail: jerome.castang at adelpha-lan.org

---------------------------------------------
Comme le dit un vieu proverbe chinois: RTFM !


From teigland at redhat.com  Fri Mar 31 07:06:00 2006
From: teigland at redhat.com (David Teigland)
Date: Fri, 31 Mar 2006 01:06:00 -0600
Subject: [Linux-cluster] CS4 problem "Missed too many heartbeats"
In-Reply-To: <442CD2B8.2080203@bull.net>
References: <442CD2B8.2080203@bull.net>
Message-ID: <20060331070600.GA29439@redhat.com>

On Fri, Mar 31, 2006 at 08:56:56AM +0200, Alain Moulle wrote:
> Hi
> 
> I have a configuration where the following test
> gives :
> 
> 1. start CS4 on both nodes of a HA pair
> 2. start the service (doing just echo in a file) on node1
> 3. poweroff -f node1
>    ==> so the service is migrating on node2 successfully
> 4. when node1 is up again, after verification of ping on
>    heartbeat interface,  I start again CS4 and
>    on "Starting fence domain:" , it remains stalled

You'll need to send the output of cman_tool status|nodes|services
from both nodes.

Dave


From l.dardini at comune.prato.it  Fri Mar 31 08:36:32 2006
From: l.dardini at comune.prato.it (Leandro Dardini)
Date: Fri, 31 Mar 2006 10:36:32 +0200
Subject: R: [Linux-cluster] cluster and Apache tomcat
Message-ID: <404AA6666D14D14CA0D410C1BC6CC4C5465903@exchange3.comune.prato.local>

 
> -----Messaggio originale-----
> Da: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] Per conto di FM
> Inviato: gioved? 30 marzo 2006 22.21
> A: Marc Grimme
> Cc: Redhat Cluster
> Oggetto: Re: [Linux-cluster] cluster and Apache tomcat
> 
> Thanks for the response,
> 
> So in this scenario :
> 2 apache servers (webroot : /home/web/site1)
> 2 tomcat servers (tomcat root : /usr/local/tomcat)
> 
> Can I put the entire tomcat root on the SAN (GFS)? Java JVM 
> on each server.
> 

Not completely. Each tomcat needs a distinct logs, work and temp directory. Plus the server.xml contains data specificy to each tomcat (ip address of the replication process). Just work with @hostname links and it is possible to take on GFS the CATALINA_HOME and JAVA_HOME, plus all the context. The Catalina directory can be shared.

Just ad advice, two node clusters are not raccomanded at all (I have your identical situation, two apache + two tomcat) expecially with a SAN fencing device, it is highly possible each node will fence the other.

Leandro

> 
> 
> Marc Grimme wrote:
> > Hi,
> > On Thursday 30 March 2006 17:26, FM wrote:
> >> Hello everybody,
> >>
> >> Next month, we'll have 6 servers connected to a SAN. Those servers 
> >> will be in a web farm. And web documents will on a GFS 
> file system. 
> >> We will use the Redhat cluster suite.
> >>
> >>
> >> I'd like to install Tomcat 5 and/or 5.5 on those servers 
> and use its 
> >> cluster's possibilities (mod_jk for example).
> >>
> >> Some webapps need sessions. But I read that tomcat will 
> handle that.
> >>
> >> But How do you handle webapps installation on all servers ? Do you 
> >> deploy the war 6 times ? If not is there a tomcat way to replicate 
> >> these webapps ?
> > No you don't need to, just put it on GFS.
> >> Can I put the webapps folder on a GFS partition ?
> > Dito. No problem as general answer. But it sometimes depends on the 
> > application (what webapps you have).
> >> How about tomcat writing to a gfs file system ?
> > You can put anything on GFS without problem when using 
> tomcat. We have 
> > installed a 16 node cluster shared root, shared data, shared tomcat 
> > vars on GFS running without problems. There are some 
> specialities but 
> > these are dependent on applications and stuff.
> > 
> > Hope that helps
> > Regards Marc.
> >> Thanks !
> >>
> >> --
> >> Linux-cluster mailing list
> >> Linux-cluster at redhat.com
> >> https://www.redhat.com/mailman/listinfo/linux-cluster
> > 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 


From Alain.Moulle at bull.net  Fri Mar 31 10:58:44 2006
From: Alain.Moulle at bull.net (Alain Moulle)
Date: Fri, 31 Mar 2006 12:58:44 +0200
Subject: [Linux-cluster] CS4 problem "Missed too many heartbeats"
In-Reply-To: <20060331070600.GA29439@redhat.com>
References: <442CD2B8.2080203@bull.net> <20060331070600.GA29439@redhat.com>
Message-ID: <442D0B64.3090106@bull.net>


>>I have a configuration where the following test
>>gives :
>>1. start CS4 on both nodes of a HA pair
>>2. start the service (doing just echo in a file) on node1
>>3. poweroff -f node1
>>   ==> so the service is migrating on node2 successfully
>>4. when node1 is up again, after verification of ping on
>>   heartbeat interface,  I start again CS4 and
>>   on "Starting fence domain:" , it remains stalled

> You'll need to send the output of cman_tool status|nodes|services
> from both nodes.
> Dave

Hi
Thanks Dave but in fact, meanwhile I 've found the problem
which is linked to our scripts at boot time and the Emulex
FC driver ...
Sorry but thanks ;-)
Alain
-- 


mailto:Alain.Moulle at bull.net
+------------------------------+--------------------------------+
|	Alain Moull?	       	| from France :	04 76 29 75 99  |
|                              	| FAX number  : 04 76 29 72 49  |
| Bull SA		       	|				|
| 1, Rue de Provence  		| Adr  : FREC B1-041            |
| B.P. 208			|				|
| 38432 Echirolles - CEDEX     	| Email: Alain.Moulle at bull.net  |
| France                       	| BCOM : 229 7599               |
+-------------------------------+-------------------------------+


From jlbeti at dsic.upv.es  Fri Mar 31 15:12:03 2006
From: jlbeti at dsic.upv.es (Jose Luis Beti)
Date: Fri, 31 Mar 2006 17:12:03 +0200
Subject: [Linux-cluster] RH Cluster Suite 4 + GFS + iptables
Message-ID: <442D46C3.6050807@dsic.upv.es>

Hi everybody,
I'm new in the list, so I apologize if this question has been answered 
before.

Anyone could explain me how to configure IPTABLES  to allow working 
right Redhat Cluster Suite RHEL4 + GFS?
What ports and protocols (tcp, udp) should I configure?

Thanks in advanced.

Jose Beti


From treed at copilotconsulting.com  Mon Mar 13 19:33:44 2006
From: treed at copilotconsulting.com (Tracy R Reed)
Date: Mon, 13 Mar 2006 19:33:44 -0000
Subject: [Linux-cluster] Cluster aware RAID/LVM
Message-ID: <4415C8FC.10102@copilotconsulting.com>


I've seen this asked before and even googled but I only see people
asking and no answers: What is the status of cluster aware RAID/LVM in
Linux?

I would like to build a highly reliable SAN by having 3 storage nodes
exporting their disk as a block device using AoE and then have the 3
compute nodes each RAID 5 the 3 block devices exported by the storage
nodes so that we end up with one block device, the same block device,
seen by all 3 compute nodes. I would like to initialize this as a
physical volume and then create different logical volumes within it with
each compute node mounting a different set of logical volumes.

But my understanding is that this will not currently work because LVM is
not cluster aware. It seems that Linux software RAID is not cluster
aware either.

Perhaps I could use EVMS (which does seem to be cluster aware) on each
of the compute nodes to manage the disk on the storage nodes and then
export a specific volume from each storage node to just one compute node
which would then do RAID 5. This way we have a cluster aware volume
manager exporting volumes to be RAID'd which would only be mounted by
one host each.

Does this sound reasonable?

I would like to avoid the use of GFS anywhere in this particular system
but I might have occasion to use GFS on a different project in the
future. It's been a few years since I have seriously looked into GFS but
it seems to have come a long way towards being usable in a production
environment. I remember it used to have its own volume management. Are
most people doing GFS volume management with EVMS also?

Thanks!

-- 
Tracy R Reed
http://copilotconsulting.com
1-877-MY-COPILOT


From nut4java at yahoo.com  Wed Mar 15 16:36:10 2006
From: nut4java at yahoo.com (John Coffee)
Date: Wed, 15 Mar 2006 16:36:10 -0000
Subject: [Linux-cluster] max filesystem and file size
Message-ID: <20060315163557.62509.qmail@web53210.mail.yahoo.com>

I've seen a few questions about filesystem size and
file size but no answers.

What is the max GFS filesystem size?  I see it says 8T
in the docs.  Is that truely the limit?

What is the max file size?

Does it matter if it's 32 or 64 bit OS?


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From aneesh.kumar at gmail.com  Fri Mar 17 10:59:59 2006
From: aneesh.kumar at gmail.com (Aneesh Kumar)
Date: Fri, 17 Mar 2006 16:29:59 +0530
Subject: [Linux-cluster] standard mechanism to communicate between cluster
	nodes from kernel
Message-ID: <cc723f590603170259v2c57ce5fi781cbe80d5a4c54b@mail.gmail.com>

Hi all,

I was trying to understand whether there is a standard set of API we
are working on for communicating between different nodes in a cluster
inside kernel. I looked at ocfs2 and the ocfs2 dlm code base seems to
use tcp via o2net_send_tcp_msg and the redhat dlm seems to sctp. There
is also tipc (net/tipc) code in the kernel now ( I am not sure about
the details of tipc). This confuses me a lot. If i want to use all
these cluster components what is the standard way. I am right now
looking at clusterproc
(http://www.openssi.org/cgi-bin/view?page=proc-hooks.html ) and
wondering what should be the communication mechanism.  clusterproc was
earlier based on CI which provided a simple easy way to define
different cluster services( more or less like rpcgen style
http://ci-linux.sourceforge.net/ics.shtml ). Does we are looking for a
framework like that ?

NOTE: I am not trying to find out which one is the best. I am trying
to find out if there is a standard way of doing this

-aneesh


From colman at uci.edu  Fri Mar 17 19:11:28 2006
From: colman at uci.edu (Richard Colman)
Date: Fri, 17 Mar 2006 11:11:28 -0800
Subject: [Linux-cluster] Cluster Hosting
In-Reply-To: <200603171959.57644.grimme@atix.de>
Message-ID: <001d01c649f6$9b75f820$79a8a8c0@D98D2G81>

Hi,

I need to establish a back-up cluster for biotech programming, say about 4
nodes plus head node and NFS data. Anyone know of a hosting service that can
do this at a reasonable price?

TNX for any ideas.


Rick Colman


From mag.andersen at gmail.com  Fri Mar 17 21:52:09 2006
From: mag.andersen at gmail.com (Magnus Andersen)
Date: Fri, 17 Mar 2006 16:52:09 -0500
Subject: [Linux-cluster] Uanble to mount GFS
Message-ID: <5ea165840603171352m55d43a29yce0eb4a541e899f9@mail.gmail.com>

Hi All,

I've successfully installed and configured GFS on my three nodes, but when I
try to mount the filesystem the prompt hangs until I kill the mount
command.  All servers are running RHEL 3 AS/ES U6 with the
2.4.21-37.0.1.ELsmp kernel and are connected to a MSA1500 SAN via FC.  I've
installed the following GFS rpms:

[root at oradw root]# rpm -qa | grep -i gfs
GFS-modules-6.0.2.27-0.1
GFS-modules-smp-6.0.2.27-0.1
GFS-6.0.2.27-0.1


Here is my pool configuration files and the output from pool_tool -s

[root at backup gfs]# cat cluster_cca.cfg
poolname cluster_cca
subpools 1
subpool 0 0 1
pooldevice 0 0 /dev/sda1
[root at backup gfs]# cat pool0.cfg
poolname pool_gfs1
subpools 1
subpool 0 0 1
pooldevice 0 0 /dev/sda2
[root at backup gfs]# cat pool1.cfg
poolname pool_gfs2
subpools 1
subpool 0 0 1
pooldevice 0 0 /dev/sdb
[root at backup gfs]# pool_tool -s
  Device                                            Pool Label
  ======                                            ==========
  /dev/pool/cluster_cca                       <- CCA device ->
  /dev/pool/pool_gfs1                     <- GFS filesystem ->
  /dev/pool/pool_gfs2                     <- GFS filesystem ->
  /dev/cciss/c0d0                  <- partition information ->
  /dev/cciss/c0d0p1                    <- EXT2/3 filesystem ->
  /dev/cciss/c0d0p2                          <- swap device ->
  /dev/cciss/c0d0p3                       <- lvm1 subdevice ->
  /dev/sda                         <- partition information ->
  /dev/sda1                                        cluster_cca
  /dev/sda2                                          pool_gfs1
  /dev/sdb                                           pool_gfs2


Here is the command I used to create the filesystem:

gfs_mkfs -p lock_gulm -t cluster_cca:pool_gfs2 -j 10 /dev/pool/pool_gfs2


Mount command that hangs:

mount -t gfs /dev/pool/pool_gfs2 /gfs2

Here is the output I see in my messages log file.  I see the last 5 lines
repeated for each time I tried to mount the filesystem.

Mar 17 15:47:05 backup ccsd[2645]: Starting ccsd 6.0.2.27:
Mar 17 15:47:05 backup ccsd[2645]:  Built: Jan 30 2006 15:28:33
Mar 17 15:47:05 backup ccsd[2645]:  Copyright (C) Red Hat, Inc.  2004  All
rights reserved.
Mar 17 15:48:10 backup lock_gulmd[2652]: Starting lock_gulmd 6.0.2.27.
(built Jan 30 2006 15:28:54) Copyright (C) 2004 Red Hat, Inc.  All rights
reserved.
Mar 17 15:48:10 backup lock_gulmd[2652]: You are running in Fail-over mode.
Mar 17 15:48:10 backup lock_gulmd[2652]: I am (backup) with ip (127.0.0.1)
Mar 17 15:48:10 backup lock_gulmd[2652]: Forked core [2653].
Mar 17 15:48:11 backup lock_gulmd[2652]: Forked locktable [2654].
Mar 17 15:48:12 backup lock_gulmd[2652]: Forked ltpx [2655].
Mar 17 15:48:12 backup lock_gulmd_core[2653]: I see no Masters, So I am
Arbitrating until enough Slaves talk to me.
Mar 17 15:48:12 backup lock_gulmd_core[2653]: Could not send quorum update
to slave backup
Mar 17 15:48:12 backup lock_gulmd_core[2653]: New generation of server
state. (1142628492484630)
Mar 17 15:48:12 backup lock_gulmd_LTPX[2655]: New Master at backup:127.0.0.1
Mar 17 15:52:14 backup kernel: Lock_Harness 6.0.2.27 (built Jan 30 2006
15:32:58) installed
Mar 17 15:52:14 backup kernel: GFS 6.0.2.27 (built Jan 30 2006 15:32:20)
installed
Mar 17 15:52:15 backup kernel: Gulm 6.0.2.27 (built Jan 30 2006 15:32:54)
installed
Mar 17 15:54:51 backup kernel: lock_gulm: ERROR cm_login failed. -512
Mar 17 15:54:51 backup kernel: lock_gulm: ERROR Got a -512 trying to start
the threads.
Mar 17 15:54:51 backup lock_gulmd_core[2653]: Error on xdr (GFS Kernel
Interface:127.0.0.1 idx:3 fd:8): (-104:104:Connection reset by peer)
Mar 17 15:54:51 backup kernel: lock_gulm: fsid=cluster_cca:gfs1: Exiting
gulm_mount with errors -512
Mar 17 15:54:51 backup kernel: GFS: can't mount proto = lock_gulm, table =
cluster_cca:gfs1, hostdata =


Result from gulm_tool:

[root at backup gfs]# gulm_tool nodelist backup
 Name: backup
  ip    = 127.0.0.1
  state = Logged in
  mode = Arbitrating
  missed beats = 0
  last beat = 1142632189718986
  delay avg = 10019686
  max delay = 10019735


I'm a newbie to clusters and I have no clue where to look next.  If any
other information is needed let me know.

Thanks,

--
Magnus Andersen
Systems Administrator / Oracle DBA
Walker & Associates, Inc.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060317/93ef9f1f/attachment.htm>

From lhh at redhat.com  Fri Mar 17 19:34:49 2006
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 17 Mar 2006 14:34:49 -0500
Subject: [Linux-cluster] (no subject)
In-Reply-To: <BD37DA8FEF7D8949A5731245FF66CB2BD47869@POSTOFFICE.cranel.local>
References: <BD37DA8FEF7D8949A5731245FF66CB2BD47869@POSTOFFICE.cranel.local>
Message-ID: <1142624089.8266.64.camel@ayanami.boston.redhat.com>

On Thu, 2006-03-16 at 16:51 -0500, Hendershot, Zach wrote:

>     Also, I'm having trouble with service failover. I have a script that
> accepts {start, stop, status} for Apache. I manually fail Apache, and CS
> basically puts the service in the failed state because it can't stop the
> service (the stop command returns a 1 status, because it cant stop an
> already stopped service).

https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=151104

> And back to the documentation, why is something basic like script
> creation and API left out? Am I looking in the wrong place or
> overlooking some of this information?

Any LSB-compliant init script should work.

http://refspecs.freestandards.org/LSB_2.0.1/LSB-Core/LSB-Core/iniscrptact.html

That said, we will work on improving the documentation in this regard.

-- Lon


From basv at sara.nl  Sat Mar 18 11:18:19 2006
From: basv at sara.nl (Bas van der Vlies)
Date: Sat, 18 Mar 2006 12:18:19 +0100
Subject: [Linux-cluster] lock_dlm kernel panics
In-Reply-To: <20060317224043.GC29244@redhat.com>
References: <441B2C73.3010202@fnal.gov> <20060317224043.GC29244@redhat.com>
Message-ID: <F1E27D0C-231D-4C64-8C8B-1F3E2033B30E@sara.nl>


On Mar 17, 2006, at 11:40 PM, David Teigland wrote:

> On Fri, Mar 17, 2006 at 03:38:59PM -0600, Paul Tader wrote:
>> Mar 17 11:38:02 nodename kernel: d0 unlock fe20017a no id
>
> GFS is trying to unlock a lock that doesn't exist which causes the  
> panic.
> We know this happens if cman shuts down the dlm while it's in use  
> (cman
> does this if it's lost connection with the cluster.)  There's some new
> output in the RHEL4U3 dlm that should tell us if that's in fact what's
> happening or if there's some other cause that we need to uncover.
>
> So, you should look on all nodes for any cman messages in
> /var/log/messages or the console.  And when you're using the latest
> version look for the new dlm message "WARNING:  
> dlm_emergency_shutdown".

Dave,

  We had similar problems on our 4 node GFS cluster:
    * STABLE from cvs
    *  2.6.16-rc5-sara3

I have attached the kern.log's. 1 cman and 3 dlm kernel panics.

Regards
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: cman_kernel_crash.txt
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060318/383926ef/attachment.txt>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: dlm_kernel_crash.txt3.gz
Type: application/x-gzip
Size: 10331 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060318/383926ef/attachment.bin>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: dlm_lock_crash.txt
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060318/383926ef/attachment-0001.txt>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: dlm_lock_crash.txt2
Type: application/octet-stream
Size: 12701 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060318/383926ef/attachment.obj>
-------------- next part --------------

--
Bas van der Vlies
basv at sara.nl


From paul at tkd.co.id  Thu Mar 16 16:15:56 2006
From: paul at tkd.co.id (Paul)
Date: Thu, 16 Mar 2006 16:15:56 -0000
Subject: [Linux-cluster] cluster suit 4.2
References: <TKDNETwspDKicUbvBDx0000019a@tkdnet.tkd.co.id>
	<1142522321.19535.148.camel@ayanami.boston.redhat.com>
Message-ID: <008501c64914$d331d5f0$4ee17bcb@golie>

manual fence, because we have no redundancy PS, thx

Rgds,
Paul
----- Original Message ----- 
From: "Lon Hohberger" <lhh at redhat.com>
To: "linux clustering" <linux-cluster at redhat.com>
Cc: "'Paul'" <paul at tkd.co.id>
Sent: Thursday, March 16, 2006 10:18 PM
Subject: Re: [Linux-cluster] cluster suit 4.2


> On Thu, 2006-03-16 at 22:18 +0700, Theodorus wrote:
>> Hi all, 
>> 
>> 
>> We need your help.
>> 
>> 
>> We have cluster suit 4.2 installed on RedHat AS 4.2.
>> 
>> 
>> The cluster system has been run well. The resource group can be
>> relocated when the one of the nodes is down.
>> 
>> 
>> But, if we disconnect all network cables of one node on purpose, the
>> cluster system stalled, why  ? 
> 
> What kind of fencing are you using?
> 
> -- Lon
> 
> 
>


From paul at tkd.co.id  Thu Mar 16 16:38:38 2006
From: paul at tkd.co.id (Paul)
Date: Thu, 16 Mar 2006 16:38:38 -0000
Subject: [Linux-cluster] cluster suit 4.2
References: <TKDNETwspDKicUbvBDx0000019a@tkdnet.tkd.co.id>
	<1142522321.19535.148.camel@ayanami.boston.redhat.com>
	<007d01c64914$9404bd70$4ee17bcb@golie>
	<1142526462.19535.156.camel@ayanami.boston.redhat.com>
Message-ID: <009301c64918$0921f160$4ee17bcb@golie>

Oh sorry Mr Lon, I mean we have no power redundancy, thanks

Rgds,
Paul
----- Original Message ----- 
From: "Lon Hohberger" <lhh at redhat.com>
To: "Paul" <paul at tkd.co.id>
Cc: <linux-cluster at redhat.com>
Sent: Thursday, March 16, 2006 11:27 PM
Subject: Re: [Linux-cluster] cluster suit 4.2


> On Thu, 2006-03-16 at 23:13 +0700, Paul wrote:
>> manual fence, because we have redundance PS, thx
>> 
> 
> You need to run fence_ack_manual on the surviving node.  Note that
> running manual fencing in production environments is not supported.
> 
> There is plenty of adequate remote power fencing hardware available
> which will handle multiple power supplies.
> 
> -- Lon
>


From paul at tkd.co.id  Fri Mar 17 17:22:09 2006
From: paul at tkd.co.id (Paul)
Date: Sat, 18 Mar 2006 00:22:09 +0700
Subject: [Linux-cluster] cluster suit 4.2
References: <TKDNETwspDKicUbvBDx0000019a@tkdnet.tkd.co.id>
	<1142522321.19535.148.camel@ayanami.boston.redhat.com>
	<007d01c64914$9404bd70$4ee17bcb@golie>
	<1142526462.19535.156.camel@ayanami.boston.redhat.com>
Message-ID: <004c01c649e7$52cd7f30$4ee17bcb@golie>

Mr Lon,

I still face the same problem. If network cable of two NIC of one node are 
disconnected, the cluster system stall/hang.
For your input, I use manual fence for the time being and using two network 
interfaces for each node as a bonding port.
I have no power switch for fencing but my servers have ILO port ( Proliant 
DL380G4 ).
If manual fence is not recommended for this case, can I use ILO port for 
fencing.
Can you tell me how to set ILO fenced and what things are needed for the 
setting.
Is ILO need license. Can you give me the solution for this problem.
Thanks in advance.

Rgds,
paul


would you like give us the solution
----- Original Message ----- 
From: "Lon Hohberger" <lhh at redhat.com>
To: "Paul" <paul at tkd.co.id>
Cc: <linux-cluster at redhat.com>
Sent: Thursday, March 16, 2006 11:27 PM
Subject: Re: [Linux-cluster] cluster suit 4.2


> On Thu, 2006-03-16 at 23:13 +0700, Paul wrote:
>> manual fence, because we have redundance PS, thx
>>
>
> You need to run fence_ack_manual on the surviving node.  Note that
> running manual fencing in production environments is not supported.
>
> There is plenty of adequate remote power fencing hardware available
> which will handle multiple power supplies.
>
> -- Lon
> 


From Matthew.Patton.ctr at osd.mil  Fri Mar 31 17:57:19 2006
From: Matthew.Patton.ctr at osd.mil (Patton, Matthew F, CTR, OSD-PA&E)
Date: Fri, 31 Mar 2006 12:57:19 -0500
Subject: [Linux-cluster] Cluster aware RAID/LVM
Message-ID: <D8063DF686D10247B0A49D01271285690CE91D52@osdn06.osd.mil>

Classification: UNCLASSIFIED

You will be able to sleep at night if you just buy N (N > 1) cheap raid
controllers, a common disk tower and hook N disk servers up to said disk
tower in a cluster config and then set the cluster up such that one and only
one is alive and exporting a GNBD (or NFS/SAMBA/ISCSI etc) device to your
cluster of compute nodes. Or just buy a cheap turn-key SAN which is really
nothing more than a disk enclosure with 1+ RAID controllers and a FC or AoE
card. RAID should be left to the piece of hardware that is in direct
physical contact with the drives.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20060331/7ad8e0d1/attachment.htm>

From pegasus at nerv.eu.org  Fri Mar 31 22:46:51 2006
From: pegasus at nerv.eu.org (Jure =?UTF-8?Q?Pe=C4=8Dar?=)
Date: Sat, 1 Apr 2006 00:46:51 +0200
Subject: [Linux-cluster] stress-testing GFS ?
In-Reply-To: <fd8dc0d20603161039m4b4a2001lf81c278474b2abdb@mail.gmail.com>
References: <fd8dc0d20603161039m4b4a2001lf81c278474b2abdb@mail.gmail.com>
Message-ID: <20060401004651.60792a82.pegasus@nerv.eu.org>

On Thu, 16 Mar 2006 13:39:17 -0500
"Boris Ostrovsky" <baostr at gmail.com> wrote:

> I'd be very interested to hear comments on the numbers and hopefully some
> tuning suggestions.

Rest assured that you're not the only one dissapointed by gfs performance.

If redhat (or anyone else) put up some article explaining what kind of performance we can expect from clustered file systems, we'd at least know what to aim for.
But without such info, we can only compare against local file systems and such comparisons only show gfs in a very bad light.

-- 

Jure Pe?ar
http://jure.pecar.org/


From mwill at penguincomputing.com  Fri Mar 31 23:20:28 2006
From: mwill at penguincomputing.com (Michael Will)
Date: Fri, 31 Mar 2006 15:20:28 -0800
Subject: [Linux-cluster] stress-testing GFS ?
In-Reply-To: <20060401004651.60792a82.pegasus@nerv.eu.org>
References: <fd8dc0d20603161039m4b4a2001lf81c278474b2abdb@mail.gmail.com>
	<20060401004651.60792a82.pegasus@nerv.eu.org>
Message-ID: <442DB93C.4090807@penguincomputing.com>

Jure Pe?ar wrote:
> On Thu, 16 Mar 2006 13:39:17 -0500 "Boris Ostrovsky" <baostr at gmail.com> wrote:
>   
>> I'd be very interested to hear comments on the numbers and hopefully some
>> tuning suggestions.
>>     
>
> Rest assured that you're not the only one dissapointed by gfs performance.
>
> If redhat (or anyone else) put up some article explaining what kind of performance we can expect from clustered file systems, we'd at least know what to aim for.
> But without such info, we can only compare against local file systems and such comparisons only show gfs in a very bad light.
>   
At the bare minimum you can compare yourself to an unclustered single 
NFS server rather than a local filesystem.

In addition there are plenty commercial cluster filesystems out there 
that you can compare yourself too:
1. polyserve matrix server
2. ibrix
3. ibm gpfs
4. lustre
others?

On the open source side there is
1. ocfs2
2. pvfs2
3. older versions of lustre?
others?

Now the benchmark cases you would pick would be interesting as well. 
i.e. I have seen machines with local
sata raid that where benchmarked with bonnie++ and seemed to stream 
happily at close to 100MB/s
but when you actually where extracting a tar-file from across the 
network, it would slow down to 10MB/s
whereas copying that tarfile across the network into one large file 
would give you close to 100MB/s as well.

Conclusion: lots of unique file-creating makes the local filesystem 
performance tank in comparison to bonnie++ or other large
file benchmarks.

Michael Will
-- 

Michael Will
Penguin Computing Corp.
Sales Engineer
415-954-2822
415-954-2899 fx
mwill at penguincomputing.com