From daniel at osdl.org  Tue Feb  1 19:03:57 2005
From: daniel at osdl.org (Daniel McNeil)
Date: Tue, 01 Feb 2005 11:03:57 -0800
Subject: [Linux-cluster] ccsd does not run (latest cvs)
Message-ID: <1107284637.27476.19.camel@ibm-c.pdx.osdl.net>

I updated to the latest cvs and 2.6.10 and ccsd does not run.

I ran make install to install everything, so it all should
be up to date. I recompile ccsd with DEBUG defined and get this:

[root at cl030 cluster]# ccsd
[ccsd.c:287] Entering parse_cli_args
[ccsd.c:447] Exiting parse_cli_args
[ccsd.c:635] Entering daemonize
[ccsd.c:648] Entering daemon mode.
Failed to connect to cluster manager.
Hint: Magma plugins are not in the right spot.

Shouldn't make install put everything in the right spot?
Where is the right spot?
Is this running for everyone else?

Thanks,

Daniel


From jbrassow at redhat.com  Tue Feb  1 23:24:04 2005
From: jbrassow at redhat.com (Jonathan E Brassow)
Date: Tue, 1 Feb 2005 17:24:04 -0600
Subject: [Linux-cluster] ccsd does not run (latest cvs)
In-Reply-To: <1107284637.27476.19.camel@ibm-c.pdx.osdl.net>
References: <1107284637.27476.19.camel@ibm-c.pdx.osdl.net>
Message-ID: <03893e3e7ff7070ba615d1d93cabb34a@redhat.com>

It's not ccsd, it's magma...  The plugins are not in the spot that 
libmagma is expecting them.

There was a new tool added (magma_tool) which may help shed some light.

# here's what mine says
prompt> magma_tool config plugindir
/usr/lib/magma

prompt> ls `magma_tool config plugindir`
magma_gulm.so  magma_sm.so

You may wish to 'cd cluster; make uninstall; make distclean; 
./configure; make install'.

  brassow

On Feb 1, 2005, at 1:03 PM, Daniel McNeil wrote:

> I updated to the latest cvs and 2.6.10 and ccsd does not run.
>
> I ran make install to install everything, so it all should
> be up to date. I recompile ccsd with DEBUG defined and get this:
>
> [root at cl030 cluster]# ccsd
> [ccsd.c:287] Entering parse_cli_args
> [ccsd.c:447] Exiting parse_cli_args
> [ccsd.c:635] Entering daemonize
> [ccsd.c:648] Entering daemon mode.
> Failed to connect to cluster manager.
> Hint: Magma plugins are not in the right spot.
>
> Shouldn't make install put everything in the right spot?
> Where is the right spot?
> Is this running for everyone else?
>
> Thanks,
>
> Daniel
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster
>


From daniel at osdl.org  Wed Feb  2 00:39:34 2005
From: daniel at osdl.org (Daniel McNeil)
Date: Tue, 01 Feb 2005 16:39:34 -0800
Subject: [Linux-cluster] ccsd does not run (latest cvs)
In-Reply-To: <03893e3e7ff7070ba615d1d93cabb34a@redhat.com>
References: <1107284637.27476.19.camel@ibm-c.pdx.osdl.net>
	<03893e3e7ff7070ba615d1d93cabb34a@redhat.com>
Message-ID: <1107304773.10296.27.camel@ibm-c.pdx.osdl.net>

On Tue, 2005-02-01 at 15:24, Jonathan E Brassow wrote:
> It's not ccsd, it's magma...  The plugins are not in the spot that 
> libmagma is expecting them.
> 
> There was a new tool added (magma_tool) which may help shed some light.
> 
> # here's what mine says
> prompt> magma_tool config plugindir
> /usr/lib/magma
> 
> prompt> ls `magma_tool config plugindir`
> magma_gulm.so  magma_sm.so
> 
> You may wish to 'cd cluster; make uninstall; make distclean; 
> ./configure; make install'.
> 
>   brassow
> 

Every looks ok to me:

# magma_tool config plugindir
/usr/lib/magma

# ls -l `magma_tool config plugindir`
total 140
-rwxr-xr-x  2 root root 113399 Feb  1 15:38 magma_gulm.so
-rwxr-xr-x  2 root root  19049 Feb  1 15:38 magma_sm.so

I'll give the rebuilding from scratch a try.

Daniel


From daniel at osdl.org  Wed Feb  2 01:31:11 2005
From: daniel at osdl.org (Daniel McNeil)
Date: Tue, 01 Feb 2005 17:31:11 -0800
Subject: [Linux-cluster] ccsd does not run (latest cvs)
In-Reply-To: <1107304773.10296.27.camel@ibm-c.pdx.osdl.net>
References: <1107284637.27476.19.camel@ibm-c.pdx.osdl.net>
	<03893e3e7ff7070ba615d1d93cabb34a@redhat.com>
	<1107304773.10296.27.camel@ibm-c.pdx.osdl.net>
Message-ID: <1107307871.10296.29.camel@ibm-c.pdx.osdl.net>

On Tue, 2005-02-01 at 16:39, Daniel McNeil wrote:
> On Tue, 2005-02-01 at 15:24, Jonathan E Brassow wrote:
> > It's not ccsd, it's magma...  The plugins are not in the spot that 
> > libmagma is expecting them.
> > 
> > There was a new tool added (magma_tool) which may help shed some light.
> > 
> > # here's what mine says
> > prompt> magma_tool config plugindir
> > /usr/lib/magma
> > 
> > prompt> ls `magma_tool config plugindir`
> > magma_gulm.so  magma_sm.so
> > 
> > You may wish to 'cd cluster; make uninstall; make distclean; 
> > ./configure; make install'.
> > 
> >   brassow
> > 
> 
> Every looks ok to me:
> 
> # magma_tool config plugindir
> /usr/lib/magma
> 
> # ls -l `magma_tool config plugindir`
> total 140
> -rwxr-xr-x  2 root root 113399 Feb  1 15:38 magma_gulm.so
> -rwxr-xr-x  2 root root  19049 Feb  1 15:38 magma_sm.so
> 
> I'll give the rebuilding from scratch a try.
> 
> Daniel

Rebuilding everything from scratch worked.

Thanks,
Daniel


From fmarchal at inf.ethz.ch  Wed Feb  2 12:57:41 2005
From: fmarchal at inf.ethz.ch (Fabrice Marchal)
Date: Wed, 02 Feb 2005 13:57:41 +0100
Subject: [Linux-cluster] Qs about GFS
Message-ID: <4200CE45.2000807@inf.ethz.ch>

Hi,

I am planning to install GFS. Can it run without GNBD servers?
(as in the examples provided in the admin doc in appendix). Is
there some builtin redundancy (i.e. what if one of the nodes dies?) or
is the redundancy totally delegated to the GNBD servers?

Thanks,
Fabrice

-- 
========================================================================
Fabrice Marchal           http://www.inf.ethz.ch/~marchal 
fabrice.marchal at ieee.org  marchal at inf.ethz.ch        +41-(0)44-632-56-79
ETH Zurich, CoLab Computational Laboratory       FAX:+41-(0)44-632-17-03
========================================================================


From mingz at ele.uri.edu  Tue Feb  1 01:17:19 2005
From: mingz at ele.uri.edu (Ming Zhang)
Date: Mon, 31 Jan 2005 20:17:19 -0500
Subject: [Linux-cluster] [Fwd: any special requirement on scsi device?]
Message-ID: <1107220639.3017.4.camel@localhost.localdomain>

Hi folks

I asked this question today, somebody there guided me to here. Can
anybody help me on this? Thanks.

ps, how to subscribe on this list? Thanks.

Pls cc to me, Thanks.

Ming

-----Forwarded Message-----
> From: Ming Zhang <mingz at ele.uri.edu>
> To: opengfs-devel at lists.sourceforge.net
> Subject: any special requirement on scsi device?
> Date: Mon, 31 Jan 2005 16:41:00 -0500
> 
> Hi, folks
> 
> I wonder if opengfs has any special requirements on scsi deivces. For
> example, reserve/release scsi commands.
> 
> the reason i ask this is we are developing a open source iscsi target
> and would like to support opengfs. since we build all scsi response, we
> need to know what extra scsi commands or feature we need to support.
> 
> thanks.
> 
> 
> ming


From sunjw at onewaveinc.com  Thu Feb  3 03:24:11 2005
From: sunjw at onewaveinc.com (=?GB2312?B?y++/oc6w?=)
Date: Thu, 3 Feb 2005 11:24:11 +0800
Subject: [Linux-cluster] what's gfs_scand doing?
Message-ID: <SERVER6YalkXh3wFDxc00000cb3@mail.onewaveinc.com>

Hi,all

I got the gfs' code from cvs on 2004-12-12, which was for kernel 2.6.9, and the OS was FC3.
I created three logic volumes by CLVM, then I test the read performance of the SAN storage.
On one node, I got 150MB/s by reading 50 files on one LV concurrently; But when I readed 
the same 50 files on two nodes started at the same time, the result was 21 + 21 and totally
42 MB/s. So why the performance declined two much? I saw the result of command "top", the 
process "gfs_scand" got the No.1, so what's gfs_scand doing? And iowait got much cpu's idle
time. Is it related to hardwares, such as the RAID controller, the HBA card, and the FC 
switch? Or related to hardware drivers?

Thanks for any reply!
Best regards! 
Luckey


From mshk_00 at hotmail.com  Thu Feb  3 11:51:04 2005
From: mshk_00 at hotmail.com (maria perez)
Date: Thu, 03 Feb 2005 12:51:04 +0100
Subject: [Linux-cluster] GFS and HEARTBEAT
Message-ID: <BAY20-F35C37CEBC0C7D8A71C9E018D7F0@phx.gbl>

Hi, Lon!!

Thank you very much for your help!!

I have the ideas more clear now.
I hope have good luck with this, I tell you later.

Maria

_________________________________________________________________
Acepta el reto MSN Premium: Correos m?s divertidos con fotos y textos 
incre?bles en MSN Premium. Desc?rgalo y pru?balo 2 meses gratis. 
http://join.msn.com?XAPID=1697&DI=1055&HL=Footer_mailsenviados_correosmasdivertidos


From rmayhew at mweb.com  Thu Feb  3 12:53:08 2005
From: rmayhew at mweb.com (Richard Mayhew)
Date: Thu, 3 Feb 2005 14:53:08 +0200
Subject: [Linux-cluster] 10 Node Installation - Loosing Heartbeat
Message-ID: <91C4F1A7C418014D9F88E938C135545801274D4B@mwjdc2.mweb.com>

Hi All,
 
I have been able to successfully setup a 10 node GFS cluster, 4 Locking
Servers and 6 Clients. Each server is a Dell PowerEdge 1750 with 1GB
RAM, Dual P4 2.8 HT running RedHat Enterprise Server V3 Update 4. The
first ethernet interface is used for normal network traffic and the
second ethernet interface is used for GFS heatbeats only. Both
interfaces run at 1GB-FD on 2 separate switches. I have installed the
GFS-6.0.2-25 from http://bender.it.swin.edu.au/centos-3/RHGFS/i686/ on
the latest RedHat ES Kernel.
 
The storage is made available using a EMC CX600 SAN. (4 x 50GB) running
over a McData Fiber Switch. The pool service uses the storage from the
san over the EMC Powepath software using the emcpower pseudo devices.
Pool is able to assemble the pools with no problem. The ccsd service is
able to retrieve the gfs archive with no problems. The lock gulm server
loads with no problem and communicates with the master lock server with
out any errors or missed beats. GFS mounts with out any errors etc. I am
able to access the storage on each server with no problem on throughput.

 
The problem I am experiencing is as follows.
 
Once the GFS system has been running for a few hours with some usage on
each of the servers, some of the servers start missing beats. I
increased the heartbeat rate to test every 60 seconds and to fail after
10 tries. This just prolonged the servers being fenced. The only thing I
can come up with is that the locking server is buggy and stops
responding to heartbeats. On the master server when it detects that the
server has skipped the required number of beats, it tries to fence it
and fails. I have setup the fencing to use the mcdata module and I have
specified the correct login details. When the server that was fenced has
had its lock server restarted it tries to relog in to the master lock
server. This fails for obvious reasons as the master will refuse to
allow it to reconnect due to the previous fencing failures. Manual
fencing works without any problems but I have only tried this on the cmd
line.
 
Does anyone have an idea as to why the locking servers are hanging up
when it comes to sending heartbeat beats and possibly why the fencing
isnt working?
 
Here are my configs with some of the privileged information changed.
 
fence.css
fence_devices {

	EMC_Switch_01 {
	agent = "fence_mcdata"
	ipaddr = "xxx.xxx.xxx.xxx"
	login = "XXXXXXXX"
	password = "xxxxxx"
	}

I	EMC_Switch_02 {
	agent = "fence_mcdata"
	ipaddr = "xxx.xxx.xxx.xxx"
	login = "XXXXXXXXXX"
	password = "xxxxxx"
	}
}


Cluster.ccs
cluster {
        name = "mail"
        lock_gulm 
        servers =
["store-01.mc.mweb.net","store-02.mc.mweb.net","store-03.mc.mweb.net","s
tore-04.mc.mweb.net"]
        heartbeat_rate = 60
        allowed_misses = 10
        }
}

Nodes.ccs
nodes   {
   store-01.mc.mweb.net    {
       ip_interfaces   {
             eth1 = "xxx.xxx.xxx.xxx"
       }
       fence {
         san {
          EMC_Switch_01   {
              port = 3
          }
         }
       }
    }

   store-02.mc.mweb.net  {
        ip_interfaces {
              eth1 = "xxx.xxx.xxx.xxx"
        }
        fence {
          san {
            EMC_Switch_01 {
                port = 27
            }
          }
        }
    }

   store-03.mc.mweb.net  {
        ip_interfaces {
              eth1 = "xxx.xxx.xxx.xxx"
        }
        fence {
          san {
            EMC_Switch_01 {
                port = 9
            }
          }
        }
    }
   store-04.mc.mweb.net  {
        ip_interfaces {
              eth1 = "xxx.xxx.xxx.xxx"
        }
        fence {
          san {
            EMC_Switch_01 {
                port = 31
            }
          }
        }
    }

        serv-01.mc.mweb.net {
                                ip_interfaces {
                                                        eth1 =
"xxx.xxx.xxx.xxx"
                                }
                                fence {
                                        san {
                                                        EMC_Switch_01 {
                                                                port =
19
                                                        }
                                        }
                                }
                        }
        serv-02.mc.mweb.net {
                                ip_interfaces {
                                                eth1 = "xxx.xxx.xxx.xxx"
                                }
                                fence {
                                        san {
                                                        EMC_Switch_01 {
                                                                port =
27
                                        }
                               }
                        }
                }
        serv-03.mc.mweb.net {
                        ip_interfaces {
                                                eth1 = "xxx.xxx.xxx.xxx"
                        }
                        fence {
                                san {
                                                EMC_Switch_01 {
                                                        port = 31
                                                }
                                        }
                                }
                        }
        serv-04.mc.mweb.net {
                        ip_interfaces {
                                                eth1 = "xxx.xxx.xxx.xxx"
                        }
                        fence {
                                san {
                                                EMC_Switch_02 {
                                                        port = 3
                                                }
                                        }
                                }
                        }
        serv-05.mc.mweb.net {
                        ip_interfaces {
                                                eth1 = "xxx.xxx.xxx.xxx"
                        }
                        fence {
                                san {
                                                EMC_Switch_02 {
                                                        port = 9
                                                }
                       ip_interfaces {
                                                eth1 = "xxx.xxx.xxx.xxx"
                        }
                        fence {
                                san {
                                                EMC_Switch_02 {
                                                        port = 3
                                                }
                                        }
                                }
                        }
        serv-05.mc.mweb.net {
                        ip_interfaces {
                                                eth1 = "xxx.xxx.xxx.xxx"
                        }
                        fence {
                                san {
                                                EMC_Switch_02 {
                                                        port = 9
                                                }
                                        }
                                }
                        }
        serv-06.mc.mweb.net {
                        ip_interfaces {
                                        eth1 = "xxx.xxx.xxx.xxx"
                        }
                        fence {
                                san {
                                                EMC_Switch_02 {
                                                                port =
19
                                                }
                                        }
                                }
                        }
}
--

Regards

Richard Mayhew
Unix Specialist

MWEB Business
Tel:  + 27 11 340 7200
Fax:  + 27 11 340 7288
Website: www.mwebbusiness.co.za


From mtilstra at redhat.com  Thu Feb  3 14:24:00 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Thu, 3 Feb 2005 08:24:00 -0600
Subject: [Linux-cluster] 10 Node Installation - Loosing Heartbeat
In-Reply-To: <91C4F1A7C418014D9F88E938C135545801274D4B@mwjdc2.mweb.com>
References: <91C4F1A7C418014D9F88E938C135545801274D4B@mwjdc2.mweb.com>
Message-ID: <20050203142400.GA9385@redhat.com>

On Thu, Feb 03, 2005 at 02:53:08PM +0200, Richard Mayhew wrote:
> The problem I am experiencing is as follows.
>  
> Once the GFS system has been running for a few hours with some usage on
> each of the servers, some of the servers start missing beats. I
> increased the heartbeat rate to test every 60 seconds and to fail after
> 10 tries. This just prolonged the servers being fenced. The only thing I
> can come up with is that the locking server is buggy and stops
> responding to heartbeats. On the master server when it detects that the
> server has skipped the required number of beats, it tries to fence it
> and fails. I have setup the fencing to use the mcdata module and I have
> specified the correct login details. When the server that was fenced has
> had its lock server restarted it tries to relog in to the master lock
> server. This fails for obvious reasons as the master will refuse to
> allow it to reconnect due to the previous fencing failures. Manual
> fencing works without any problems but I have only tried this on the cmd
> line.
>  
> Does anyone have an idea as to why the locking servers are hanging up
> when it comes to sending heartbeat beats and possibly why the fencing
> isnt working?

First test the mcdata fencing.  Get your configs installed and ccsd
running.  Then run `fence_node store-01.mc.mweb.net`  (or one of the
other nodes).  Make sure that this has infact work.  (look on the switch
and what not.  I don't know anything about the mcdata, so I cannot say
much more here.)

This will let you test and see how fencing is working.  

Don't continue until you can call fence_node for every node and it does
in fact fence the node.

So, on to the missed heartbeats.  First, what is 'some usage'.  Not that
it should matter much, but just as an example, I can get missed
heartbeats by syncing large (~1G or so) amounts of data to the internal
drive.  (but to miss 11 at 60s apiece, it probably not this.)

Also, are there any messages from the Master or the Clients about the
time they start missing heartbeats? (other than the missed heartbeat
messages.)  If so, might give some clues as to what's happening.

Best thing to do when debugging heartbeats is to turn on those messages.
So add cluster.ccs:cluster{ lock_gulm { verbosity = "Default,Heartbeat" } }
to your config, and run things again.  (might also want to turn the
heartbeat rate back down for this.)  There will now be messages in
syslog for every heartbeat sent and received.

Hopefully this will unveil something.
-- 
Michael Conrad Tadpol Tilstra
To understand recursion, you must first understand recursion.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050203/ded2f351/attachment.sig>

From cristiano at develop.com.br  Thu Feb  3 19:10:28 2005
From: cristiano at develop.com.br (Cristiano da Costa)
Date: Thu, 3 Feb 2005 17:10:28 -0200 (BRST)
Subject: [Linux-cluster] gnbd server as shared storage
Message-ID: <2758704.1107457828765.JavaMail.SYSTEM@serraria>

Hello list
Im trying to configure Red Hat Cluster Suite in an enviroment without shared storage to learn how to admin and configure this. Then I using gnbd to simulate a shared storage and I mapped the exported devices in other two RHEL, that are members of the cluster, and bind gnbd devices imported to raw devices to use as quorum shared storage, but when start clumanager the machines reboot, informing the following error in log: 
Feb  3 14:40:55 parati clusvcmgrd[5133]: <crit> Couldn't read configuration from shared state: Success
Feb  3 14:40:55 parati clusvcmgrd[5133]: <alert> Shared State Error: REBOOTING

Then I make tests to simultaneos write data in a gnbd device mapped and I note that the files created by one node only appear to other node when  the device is reimported and remounted.
There are other way to configure RHCS without a shared storage like EMC storages or Sun storages?
That are my configuration files for gnbd
[root at ilhabela mnt]# cat /etc/sistina/ccs-build/angra.cca
#CCA:angra
#nigeb=cluster.ccs mtime=1107192495 size=147
cluster {
  name = "angra"
        lock_gulm {
             servers = ["angra"]
             heartbeat_rate = 0.3
             allowed_misses = 1
         }
}
#dne=cluster.ccs hash=BD5C824C
#nigeb=nodes.ccs mtime=1107192495 size=177
nodes {
  angra {
    ip_interfaces {
      eth0 = "192.168.1.52"
    }
    fence {
      server {
        gnbd0 {
        ipaddr = "192.168.1.52"
        }
      }
    }
  }
}
#dne=nodes.ccs hash=72704281
#nigeb=fence.ccs mtime=1107192495 size=78
fence_devices {
  gnbd0 {
    agent = "fence_gnbd"
    server = "angra"
  }
}
#dne=fence.ccs hash=9E0446A4

[root at ilhabela mnt]# cat /root/angra/cluster.ccs
cluster {
  name = "angra"
        lock_gulm {
             servers = ["angra"]
             heartbeat_rate = 0.3
             allowed_misses = 1
         }
}

[root at ilhabela mnt]# cat /root/angra/fence.ccs
fence_devices {
  gnbd0 {
    agent = "fence_gnbd"
    server = "angra"
  }
}

[root at ilhabela mnt]# cat /root/angra/nodes.ccs
nodes {
  angra {
    ip_interfaces {
      eth0 = "192.168.1.52"
    }
    fence {
      server {
        gnbd0 {
        ipaddr = "192.168.1.52"
        }
      }
    }
  }
  parati.localdomain {
    ip_interfaces {
      eth0 = "192.168.1.56"
    }
    fence {
      server {
        gnbd0 {
        ipaddr = "192.168.1.52"
        }
      }
    }
  }
  ilhabela.localdomain {
    ip_interfaces {
      eth0 = "192.168.1.57"
    }
    fence {
      server {
        gnbd0 {
        ipaddr = "192.168.1.52"
        }
      }
    }
  }
}

This is the output of gnbd_import -l in both nodes of the cluster
[root at ilhabela root]# gnbd_import -l
gnbd_import: Device name : hda10
----------------------
    Minor # : 1
  Proc name : gnbdb
         IP : 192.168.1.52
       Port : 14243
      State : Close Disconnected Clear
   Readonly : No
gnbd_import: Device name : hda9
----------------------
    Minor # : 2
  Proc name : gnbdc
         IP : 192.168.1.52
       Port : 14243
      State : Close Disconnected Clear
   Readonly : No
gnbd_import: Device name : hda11
----------------------
    Minor # : 3
  Proc name : gnbdd
         IP : 192.168.1.52
       Port : 14243
      State : Open Connected Clear
   Readonly : No

This is the content of /etc/sysconfig/rawdevices
[root at ilhabela root]# cat /etc/sysconfig/rawdevices
# raw device bindings
# format:  <rawdev> <major> <minor>
#          <rawdev> <blockdev>
# example: /dev/raw/raw1 /dev/sda1
#          /dev/raw/raw2 8 5
/dev/raw/raw1 /dev/gnbd/hda9
/dev/raw/raw2 /dev/gnbd/hda10

Partition /dev/gnbd/had11 is ext3 and are mounted in both nodes, but the data write for one node only is acessible to other when the device is reimported and remounted.
Grateful


_________________________ 
Cristiano da Costa 
Consultor 
Develop IT Solutions 
www.develop.com.br 
Fone: (51) 3386-6620 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050203/dcfb6636/attachment.htm>

From nigel.jewell at pixexcel.co.uk  Fri Feb  4 10:26:36 2005
From: nigel.jewell at pixexcel.co.uk (Nigel Jewell)
Date: Fri, 04 Feb 2005 10:26:36 +0000
Subject: [Linux-cluster] GNBD & Network Outage
In-Reply-To: <20050125205538.GD13289@phlogiston.msp.redhat.com>
References: <41F622D8.1060102@pixexcel.co.uk>
	<20050125205538.GD13289@phlogiston.msp.redhat.com>
Message-ID: <42034DDC.9010002@pixexcel.co.uk>

Dear Ben,

Thank you for your detailed reply.  It is always refreshing to get a 
decent response on a mailing list ;) .

> Sure. You see the -c in you export line.  Don't put it there.  That puts
> the device in (the very poorly named) uncached mode.  This does two 
> things.
> One: It causes the server to use direct IO to write to the exported 
> device,
> so your read performance will take a hit.  Two: It will time out after
> a period (default to 10 sec).  After gnbd times out, it must be able 
> to fence
> the server before it will let the requests fail.  This is so that you 
> know
> that the server isn't simply stalled and might write out the requests 
> later
> (if gnbd failed out, and the requests were rerouted to the backend 
> storage over
> another gnbd server, if the first server wrote it's requests out 
> later, it
> could cause data corruption).
>  
>

My understanding was that the "-c" put the device in cached mode, as 
described here:

http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-gnbd-commands.html

Or are you saying that by not putting the "-c" put its in uncached mode?

> This means that to run in uncached mode, you need to have a cluster 
> manager and
> fencing devices, which I'm not certain that you have.
>  
>

No we don't as we didn't really see the need, given what we want to do.

> I've got some questions about your setup.  Will this be part of a 
> clustered
> filesystem setup? If it will, I see some problems with your mirror.  When
> other nodes (including the gnbd server node A) write to the exported 
> device,
> these writes will not appear on the local partion of B.  So won't your 
> mirror
> get out of sync?  If only B will write to the exported device, (and 
> that's
> the only way I see this working) you can probably get by with nbd, which
> simply fails out if it loses connection.
>  
>

The intention of the setup was to have two hosts both exporting an 
unmounted device, and the alternative device using it as a RAID-1 
device.  Then to use heartbeat to mount and unmount the partitions as 
required.  For example:

HOST A:

/dev/hda1 (md0, ext3, mounted)
/dev/hda2 (ext3, unmounted, gnbd_exported as A)
/dev/gnbd/B (md0, ext3, mounted)

HOST B:

/dev/hda1 (ext3, unmounted, gnbd_exported as B)
/dev/hda2 (md0, ext3, mounted)
/dev/gnbd/A (md0, ext3, mounted)

I hope that makes sense.

If so, does what we are trying to achieve sound sensible?  Any 
gotchas/advice?

-- 
Nige.

PixExcel Limited
URL: http://www.pixexcel.co.uk


From lhh at redhat.com  Fri Feb  4 16:29:24 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 04 Feb 2005 11:29:24 -0500
Subject: [Linux-cluster] ccsd does not run (latest cvs)
In-Reply-To: <1107307871.10296.29.camel@ibm-c.pdx.osdl.net>
References: <1107284637.27476.19.camel@ibm-c.pdx.osdl.net>
	<03893e3e7ff7070ba615d1d93cabb34a@redhat.com>
	<1107304773.10296.27.camel@ibm-c.pdx.osdl.net>
	<1107307871.10296.29.camel@ibm-c.pdx.osdl.net>
Message-ID: <1107534564.14020.32.camel@ayanami.boston.redhat.com>

On Tue, 2005-02-01 at 17:31 -0800, Daniel McNeil wrote:
> > Daniel
> 
> Rebuilding everything from scratch worked.

Chances are that the plugins were built before the version of magma.  I
bumped the API version when I added magma_tool (just to be safe) because
it required some changes to the build system.

For reference, "magma_tool list" will tell you if the plugins were built
against a different API version of magma.

-- Lon


From lhh at redhat.com  Fri Feb  4 16:33:20 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 04 Feb 2005 11:33:20 -0500
Subject: [Linux-cluster] gnbd server as shared storage
In-Reply-To: <2758704.1107457828765.JavaMail.SYSTEM@serraria>
References: <2758704.1107457828765.JavaMail.SYSTEM@serraria>
Message-ID: <1107534800.14020.35.camel@ayanami.boston.redhat.com>

On Thu, 2005-02-03 at 17:10 -0200, Cristiano da Costa wrote:

> Feb  3 14:40:55 parati clusvcmgrd[5133]: <crit> Couldn't read
> configuration from shared state: Success
> Feb  3 14:40:55 parati clusvcmgrd[5133]: <alert> Shared State Error:
> REBOOTING

Try running 'shutil -i'.  However, be sure you're running the latest
version of clumanager.

-- Lon


From filip.sergeys at verzekeringen.be  Fri Feb  4 16:52:31 2005
From: filip.sergeys at verzekeringen.be (Filip Sergeys)
Date: 04 Feb 2005 17:52:31 +0100
Subject: [Linux-cluster] clvm mirroring target status
Message-ID: <1107535951.14182.75.camel@wavebreaker.eccent.be>

Hi,

We are going to install a linux cluster with 2 gnbd servers (no SPOF)
and gfs + clvm on the cluster nodes (4 nodes). I have two options, if I
read the docs well, for duplicating data on the gnbd servers:
1) using clvm target mirroring on the cluster nodes
2) use drbd underneath to mirror discs. Basically two disks per machine:
1 live disk which is mirrored with drbd to the second disk in the second
machine and the other way around in the second machine 
(so the second disk in the first machine is thus the mirror from the
first (="live") disk in the second machine(sounds complicated, but it is
just hard to write down)). 
Both live disks from each machine will be combined as one logical disk
(If I understood well, this is possible).

Question: what is the status of clvm mirroring? Is it stable? 
Suppose it is stable, so I have a choice: which one of the options would
any of you choose? Reason? (Stability, performance, ...)

I found two hits on google concerning clvm mirroring, but both say it is
not finished yet. However the most recent one is from june 2004. 
I cannot test either because we have no spare machine. I'm going to buy
two machine so I need to know which disk configuration I will be using.

Thanks in advance,

Regards,

Filip Sergeys


http://64.233.183.104/search?q=cache:r1Icx--aI2YJ:www.spinics.net/lists/gfs/msg03439.html+clvm+mirroring+gfs&hl=nl&start=12
https://www.redhat.com/archives/linux-cluster/2004-June/msg00028.html

-- 
*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
* System Engineer, Verzekeringen NV *
* www.verzekeringen.be              *
* Oostkaai 23 B-2170 Merksem        *
* 03/6416673 - 0477/340942          *
*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050204/a1e0abdc/attachment.htm>

From bmarzins at redhat.com  Fri Feb  4 17:34:43 2005
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Fri, 4 Feb 2005 11:34:43 -0600
Subject: [Linux-cluster] GNBD & Network Outage
In-Reply-To: <42034DDC.9010002@pixexcel.co.uk>
References: <41F622D8.1060102@pixexcel.co.uk>
	<20050125205538.GD13289@phlogiston.msp.redhat.com>
	<42034DDC.9010002@pixexcel.co.uk>
Message-ID: <20050204173443.GC3666@phlogiston.msp.redhat.com>

On Fri, Feb 04, 2005 at 10:26:36AM +0000, Nigel Jewell wrote:
> Dear Ben,
> 
> Thank you for your detailed reply.  It is always refreshing to get a 
> decent response on a mailing list ;) .
>

<snip>
 
> My understanding was that the "-c" put the device in cached mode, as 
> described here:

You are correct.
 
> http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-gnbd-commands.html
> 
> Or are you saying that by not putting the "-c" put its in uncached mode?

Yes, that's what I meant to say.  so much for decent responses :P 

> The intention of the setup was to have two hosts both exporting an 
> unmounted device, and the alternative device using it as a RAID-1 
> device.  Then to use heartbeat to mount and unmount the partitions as 
> required.  For example:
> 
> HOST A:
> 
> /dev/hda1 (md0, ext3, mounted)
> /dev/hda2 (ext3, unmounted, gnbd_exported as A)
> /dev/gnbd/B (md0, ext3, mounted)
> 
> HOST B:
> 
> /dev/hda1 (ext3, unmounted, gnbd_exported as B)
> /dev/hda2 (md0, ext3, mounted)
> /dev/gnbd/A (md0, ext3, mounted)
> 
> I hope that makes sense.

O.k. let me see if I get this.

hostA is setting up a mirror on /dev/hda1 and /dev/gnbd/B
hostB is setting up a mirror on /dev/hda2 and /dev/gnbd/A

So, if hostA goes down, you will be able to access its data on /dev/hda1
of hostB. If hostB goes down, you will be able to access its data on /dev/hda2
of hostA.

> If so, does what we are trying to achieve sound sensible?

Yeah, you can do that.

>Any gotchas/advice?

The obvious issue is, for example, if hostB goes down and you are accessing
it's data through /dev/hda2 on hostA, when hostB comes back up, you must
unmount /dev/hda2 on hostA before you export it. Alos, even if hostA never does
any writing to /dev/hda2, the data on it may not be in sync with the data
on /dev/hda2 of hostB, so you will need to resync them when hostB comes back.

For this, there isn't any real reason to use gnbd over nbd... nbd will
failout right away, which is annoying when it's caused by some transient
network issue, but you don't need to have a clustermanager set up to
use it. 

Another option which might work is to use gnbd in cached mode, but when you
decide that the other node really isn't there, run
# gnbd_import -rO <device>
This will flush the requests from the device. The /dev/gnbd/<device> file will
also be removed, which may piss off your mirror. However, if this works, you
get the benefit of retrying the connection until heartbeat decides the other
node is really dead.

Hope this helps.

-Ben
> 
> -- 
> Nige.
> 
> PixExcel Limited
> URL: http://www.pixexcel.co.uk
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster


From bmarzins at redhat.com  Fri Feb  4 17:41:10 2005
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Fri, 4 Feb 2005 11:41:10 -0600
Subject: [Linux-cluster] GNBD & Network Outage
In-Reply-To: <42034DDC.9010002@pixexcel.co.uk>
References: <41F622D8.1060102@pixexcel.co.uk>
	<20050125205538.GD13289@phlogiston.msp.redhat.com>
	<42034DDC.9010002@pixexcel.co.uk>
Message-ID: <20050204174110.GD3666@phlogiston.msp.redhat.com>

On Fri, Feb 04, 2005 at 10:26:36AM +0000, Nigel Jewell wrote:
> Dear Ben,

Oh yeah, or you could take a look at http://www.drbd.org/

-Ben
 
> Thank you for your detailed reply.  It is always refreshing to get a 
> decent response on a mailing list ;) .
> 
> >Sure. You see the -c in you export line.  Don't put it there.  That puts
> >the device in (the very poorly named) uncached mode.  This does two 
> >things.
> >One: It causes the server to use direct IO to write to the exported 
> >device,
> >so your read performance will take a hit.  Two: It will time out after
> >a period (default to 10 sec).  After gnbd times out, it must be able 
> >to fence
> >the server before it will let the requests fail.  This is so that you 
> >know
> >that the server isn't simply stalled and might write out the requests 
> >later
> >(if gnbd failed out, and the requests were rerouted to the backend 
> >storage over
> >another gnbd server, if the first server wrote it's requests out 
> >later, it
> >could cause data corruption).
> > 
> >
> 
> My understanding was that the "-c" put the device in cached mode, as 
> described here:
> 
> http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-gnbd-commands.html
> 
> Or are you saying that by not putting the "-c" put its in uncached mode?
> 
> >This means that to run in uncached mode, you need to have a cluster 
> >manager and
> >fencing devices, which I'm not certain that you have.
> > 
> >
> 
> No we don't as we didn't really see the need, given what we want to do.
> 
> >I've got some questions about your setup.  Will this be part of a 
> >clustered
> >filesystem setup? If it will, I see some problems with your mirror.  When
> >other nodes (including the gnbd server node A) write to the exported 
> >device,
> >these writes will not appear on the local partion of B.  So won't your 
> >mirror
> >get out of sync?  If only B will write to the exported device, (and 
> >that's
> >the only way I see this working) you can probably get by with nbd, which
> >simply fails out if it loses connection.
> > 
> >
> 
> The intention of the setup was to have two hosts both exporting an 
> unmounted device, and the alternative device using it as a RAID-1 
> device.  Then to use heartbeat to mount and unmount the partitions as 
> required.  For example:
> 
> HOST A:
> 
> /dev/hda1 (md0, ext3, mounted)
> /dev/hda2 (ext3, unmounted, gnbd_exported as A)
> /dev/gnbd/B (md0, ext3, mounted)
> 
> HOST B:
> 
> /dev/hda1 (ext3, unmounted, gnbd_exported as B)
> /dev/hda2 (md0, ext3, mounted)
> /dev/gnbd/A (md0, ext3, mounted)
> 
> I hope that makes sense.
> 
> If so, does what we are trying to achieve sound sensible?  Any 
> gotchas/advice?
> 
> -- 
> Nige.
> 
> PixExcel Limited
> URL: http://www.pixexcel.co.uk
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster


From bmarzins at redhat.com  Fri Feb  4 17:53:20 2005
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Fri, 4 Feb 2005 11:53:20 -0600
Subject: [Linux-cluster] gnbd server as shared storage
In-Reply-To: <2758704.1107457828765.JavaMail.SYSTEM@serraria>
References: <2758704.1107457828765.JavaMail.SYSTEM@serraria>
Message-ID: <20050204175320.GE3666@phlogiston.msp.redhat.com>

On Thu, Feb 03, 2005 at 05:10:28PM -0200, Cristiano da Costa wrote:
> Hello list

<snip>

> Then I make tests to simultaneos write data in a gnbd device mapped and I note that the files created by one node only appear to other node when  the device is reimported and remounted.

<snip>

> Partition /dev/gnbd/had11 is ext3 and are mounted in both nodes, but the data write for one node only is acessible to other when the device is reimported and remounted.

If you actually had shared storage, and did this test, the results would be
the same.  ext3 is not a clustered filesystem.  It never expects the data
on disk to change out from under it, so it always trusts its cache.
In order to share storage between multiple machines, you need a clustered
filesystem, like GFS.

-Ben

> Grateful
> 
> 
> 
> 
> _________________________ 
> Cristiano da Costa 
> Consultor 
> Develop IT Solutions 
> www.develop.com.br 
> Fone: (51) 3386-6620 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster


From yazan at ccs.com.jo  Fri Feb  4 18:04:39 2005
From: yazan at ccs.com.jo (Yazan Al-Sheyyab)
Date: Fri, 4 Feb 2005 20:04:39 +0200
Subject: [Linux-cluster] a question about gfs file system
Message-ID: <000901c50ae3$feb22970$69050364@yazanz>

hello everybody,

 how much space that gfs file system can take from a partion ?

 i mean that if i have a 500 MB partition and want to format it as gfs file
system , so how much space will remain for me in this partition after format
it as gfs file system?


Thanks.


From cristiano at develop.com.br  Fri Feb  4 18:21:15 2005
From: cristiano at develop.com.br (Cristiano da Costa)
Date: Fri, 4 Feb 2005 16:21:15 -0200 (BRST)
Subject: [Linux-cluster] gnbd server as shared storage
Message-ID: <1002123.1107541275046.JavaMail.SYSTEM@serraria>

Hi Ben

Tanks in advance, really Im making wrong test, you are very right, but ...

I update the clumager package to last version, and now the nodes aren't rebooting when start clumanager, but the quorum aren't not recognized and in this case Im using 2 raws that bind to 2 gnbd imported devices.

There are the log of the cluster when start

Feb  4 16:22:54 parati clumanager: [5283]: <notice> Starting Red Hat Cluster Manager...
Feb  4 16:22:54 parati cluquorumd[5303]: <debug> changing loglevel from 5 to 7
Feb  4 16:22:54 parati cluquorumd[5303]: <info> IPv4-TB: 192.168.1.52
Feb  4 16:22:54 parati cluquorumd[5303]: <warning> STONITH: No drivers configured for host '192.168.1.56'!
Feb  4 16:22:54 parati cluquorumd[5303]: <warning> STONITH: Data integrity may be compromised!
Feb  4 16:22:54 parati cluquorumd[5303]: <warning> STONITH: No drivers configured for host '192.168.1.57'!
Feb  4 16:22:54 parati cluquorumd[5303]: <warning> STONITH: Data integrity may be compromised!
Feb  4 16:22:54 parati cluquorumd[5303]: <debug> spawn_daemon: starting /usr/sbin/clumembd.
Feb  4 16:22:54 parati cluquorumd[5303]: <debug> IP tie-breaker in use, not starting disk thread.
Feb  4 16:22:54 parati clumembd[5313]: <debug> Changing loglevel from 6 to 7
Feb  4 16:22:54 parati clumembd[5313]: <debug> Transmit thread set to ON
Feb  4 16:22:54 parati clumembd[5313]: <debug> Overriding TKO count to be 20
Feb  4 16:22:54 parati clumembd[5313]: <debug> Broadcast hearbeating set to ON
Feb  4 16:22:54 parati clumembd[5313]: <debug> Multicast heartbeat OFF
Feb  4 16:22:54 parati clumembd[5313]: <debug> I am member #0
Feb  4 16:22:54 parati clumembd[5313]: <debug> Interface IP is 127.0.0.1
Feb  4 16:22:54 parati cluquorumd[5303]: <debug> Cluster I/F: eth0 [192.168.1.56]
Feb  4 16:22:54 parati clumembd[5313]: <debug> broadcast is 127.255.255.255
Feb  4 16:22:54 parati clumembd[5313]: <debug> Interface IP is 192.168.1.56
Feb  4 16:22:54 parati clumembd[5313]: <debug> broadcast is 192.168.1.255
Feb  4 16:22:54 parati clumembd[5313]: <debug> Interface IP is 10.1.1.1
Feb  4 16:22:54 parati clumembd[5313]: <debug> broadcast is 10.255.255.255
Feb  4 16:22:54 parati clumembd[5313]: <debug> Cluster I/F: eth0 [192.168.1.56]
Feb  4 16:22:54 parati clumembd[5313]: <debug> clumembd_start_watchdog: set duration to 14.
Feb  4 16:22:54 parati clumembd[5313]: <debug> Waiting for requests.
Feb  4 16:22:54 parati clumembd[5313]: <debug> Transmit thread: pulsar
Feb  4 16:22:55 parati clumembd[5313]: <debug> Verified connect from member #0 (127.0.0.1)
Feb  4 16:22:55 parati clumembd[5313]: <debug> MB: New connect: fd9
Feb  4 16:22:55 parati clumembd[5313]: <debug> MB: Received EV_REGISTER, fd9
Feb  4 16:22:55 parati cluquorumd[5303]: <debug> spawn_daemon: starting /usr/sbin/clulockd.
Feb  4 16:22:55 parati clulockd[5316]: <debug> /usr/sbin/clulockd starting
Feb  4 16:22:55 parati clulockd[5316]: <debug> Cluster I/F: eth0 [192.168.1.56]
Feb  4 16:22:55 parati cluquorumd[5303]: <debug> Verified connect from member #0 (127.0.0.1)
Feb  4 16:22:55 parati cluquorumd[5303]: <debug> Q: Received EV_REGISTER, fd6
Feb  4 16:22:55 parati cluquorumd[5303]: <debug> Q: Received EV_MEMB_UPDATE, fd5
Feb  4 16:22:55 parati clulockd[5316]: <debug> Quorum Event: NO QUORUM
Feb  4 16:22:55 parati clulockd[5316]: <debug> Lock Keeper = Member #-1
Feb  4 16:23:05 parati clumembd[5313]: <notice> Member 192.168.1.56 UP
Feb  4 16:23:05 parati clumembd[5313]: <debug> MB: Initiating vote on: 0x00000001
Feb  4 16:23:05 parati clumembd[5320]: <debug> VF: Connecting to member #0
Feb  4 16:23:05 parati clumembd[5313]: <debug> Verified connect from member #0 (192.168.1.56)
Feb  4 16:23:05 parati clumembd[5313]: <debug> MB: New connect: fd11
Feb  4 16:23:05 parati clumembd[5320]: <debug> VF: Push 0.5320 #1
Feb  4 16:23:05 parati clumembd[5320]: <debug> VF: Sending to member #0
Feb  4 16:23:05 parati clumembd[5320]: <debug> VF: Checking for consensus...
Feb  4 16:23:05 parati clumembd[5313]: <debug> MB: Received VF_MESSAGE, fd11
Feb  4 16:23:05 parati clumembd[5313]: <debug> VF_JOIN_VIEW from member #0! Key: 0x27456381 #1
Feb  4 16:23:05 parati clumembd[5313]: <debug> VF: Voting YES
Feb  4 16:23:05 parati clumembd[5320]: <debug> VF: Member #0 voted YES
Feb  4 16:23:05 parati clumembd[5320]: <debug> VF: Broadcasting FORMED
Feb  4 16:23:05 parati clumembd[5320]: <debug> VF: Converge Time: 0.000000
Feb  4 16:23:05 parati clumembd[5313]: <debug> MB: Received VF_MESSAGE, fd11
Feb  4 16:23:05 parati clumembd[5313]: <debug> VF: Received VF_VIEW_FORMED, fd11
Feb  4 16:23:05 parati clumembd[5313]: <debug> VF: Commit Key 0x27456381 #1 from member #0
Feb  4 16:23:05 parati clumembd[5313]: <info> Membership View #1:0x00000001
Feb  4 16:23:06 parati clumembd[5313]: <debug> VF: pid 5320 exited, status 0
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> Q: Received EV_MEMB_UPDATE, fd5
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> Need 1 more members for quorum!
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> process_memb_update: Starting VF
Feb  4 16:23:06 parati cluquorumd[5321]: <debug> VF: Connecting to member #0
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> VF: Key 0x12345678 Still running
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> Verified connect from member #0 (192.168.1.56)
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> VF: Key 0x12345678 Still running
Feb  4 16:23:06 parati cluquorumd[5321]: <debug> VF: Push 0.5321 #1
Feb  4 16:23:06 parati cluquorumd[5321]: <debug> VF: Sending to member #0
Feb  4 16:23:06 parati cluquorumd[5321]: <debug> VF: Checking for consensus...
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> Q: Received VF_MESSAGE, fd8
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> VF_JOIN_VIEW from member #0! Key: 0x12345678 #1
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> VF: Voting YES
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> VF: Key 0x12345678 Still running
Feb  4 16:23:06 parati cluquorumd[5321]: <debug> VF: Member #0 voted YES
Feb  4 16:23:06 parati cluquorumd[5321]: <debug> VF: Broadcasting FORMED
Feb  4 16:23:06 parati cluquorumd[5321]: <debug> VF: Converge Time: 0.010000
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> Q: Received VF_MESSAGE, fd8
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> VF: Received VF_VIEW_FORMED, fd8
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> VF: Commit Key 0x12345678 #1 from member #0
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> Lack of Quorum Maintained
Feb  4 16:23:06 parati clulockd[5316]: <debug> Quorum Event: NO QUORUM
Feb  4 16:23:06 parati cluquorumd[5303]: <debug> VF: pid 5321 exited, status 0
Feb  4 16:23:06 parati clulockd[5316]: <debug> I am the new lock keeper


Grateful

Cristiano da Costa


--- Original Message ---
> On Thu, Feb 03, 2005 at 05:10:28PM -0200, 
> Cristiano da Costa wrote:
> > Hello list
> 
> <snip>
> 
> > Then I make tests to simultaneos write data in 
> a gnbd device mapped and I note that the files 
> created by one node only appear to other node 
> when  the device is reimported and remounted.
> 
> <snip>
> 
> > Partition /dev/gnbd/had11 is ext3 and are 
> mounted in both nodes, but the data write for 
> one node only is acessible to other when the 
> device is reimported and remounted.
> 
> If you actually had shared storage, and did this 
> test, the results would be
> the same.  ext3 is not a clustered filesystem.  
> It never expects the data
> on disk to change out from under it, so it 
> always trusts its cache.
> In order to share storage between multiple 
> machines, you need a clustered
> filesystem, like GFS.
> 
> -Ben
> 
> > Grateful
> > 
> > 
> > 
> > 
> > _________________________ 
> > Cristiano da Costa 
> > Consultor 
> > Develop IT Solutions 
> > www.develop.com.br 
> > Fone: (51) 3386-6620 
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > 
> http://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> 
> http://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 


From bmarzins at redhat.com  Fri Feb  4 19:03:09 2005
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Fri, 4 Feb 2005 13:03:09 -0600
Subject: [Linux-cluster] [Fwd: any special requirement on scsi device?]
In-Reply-To: <1107220639.3017.4.camel@localhost.localdomain>
References: <1107220639.3017.4.camel@localhost.localdomain>
Message-ID: <20050204190309.GG3666@phlogiston.msp.redhat.com>

On Mon, Jan 31, 2005 at 08:17:19PM -0500, Ming Zhang wrote:
> Hi folks
> 
> I asked this question today, somebody there guided me to here. Can
> anybody help me on this? Thanks.
> 
> ps, how to subscribe on this list? Thanks.

I believe you can get to a subscription page from
http://sources.redhat.com/cluster
 
> Pls cc to me, Thanks.
> 
> Ming
> 
> -----Forwarded Message-----
> > From: Ming Zhang <mingz at ele.uri.edu>
> > To: opengfs-devel at lists.sourceforge.net
> > Subject: any special requirement on scsi device?
> > Date: Mon, 31 Jan 2005 16:41:00 -0500
> > 
> > Hi, folks
> > 
> > I wonder if opengfs has any special requirements on scsi deivces. For
> > example, reserve/release scsi commands.
> > 
> > the reason i ask this is we are developing a open source iscsi target
> > and would like to support opengfs. since we build all scsi response, we
> > need to know what extra scsi commands or feature we need to support.

Persistent reservations would be really useful for fencing.

-Ben

> > 
> > thanks.
> > 
> > 
> > ming
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster


From mingz at ele.uri.edu  Fri Feb  4 19:15:00 2005
From: mingz at ele.uri.edu (Ming Zhang)
Date: Fri, 04 Feb 2005 14:15:00 -0500
Subject: [Linux-cluster] [Fwd: any special requirement on scsi device?]
In-Reply-To: <20050204190309.GG3666@phlogiston.msp.redhat.com>
References: <1107220639.3017.4.camel@localhost.localdomain>
	<20050204190309.GG3666@phlogiston.msp.redhat.com>
Message-ID: <1107544500.3009.10.camel@localhost.localdomain>

On Fri, 2005-02-04 at 14:03, Benjamin Marzinski wrote:
> On Mon, Jan 31, 2005 at 08:17:19PM -0500, Ming Zhang wrote:
> > Hi folks
> > 
> > I asked this question today, somebody there guided me to here. Can
> > anybody help me on this? Thanks.
> > 
> > ps, how to subscribe on this list? Thanks.
> 
> I believe you can get to a subscription page from
> http://sources.redhat.com/cluster
thx a lot. though it is very slow to connect. :P


>  
> > Pls cc to me, Thanks.
> > 
> > Ming
> > 
> > -----Forwarded Message-----
> > > From: Ming Zhang <mingz at ele.uri.edu>
> > > To: opengfs-devel at lists.sourceforge.net
> > > Subject: any special requirement on scsi device?
> > > Date: Mon, 31 Jan 2005 16:41:00 -0500
> > > 
> > > Hi, folks
> > > 
> > > I wonder if opengfs has any special requirements on scsi deivces. For
> > > example, reserve/release scsi commands.
> > > 
> > > the reason i ask this is we are developing a open source iscsi target
> > > and would like to support opengfs. since we build all scsi response, we
> > > need to know what extra scsi commands or feature we need to support.
> 
> Persistent reservations would be really useful for fencing.
u mean this is "useful". but is this "enough"?

because many end users on our list ask for this feature. so i want to
know what we need to do in order to get it done. thanks.

ps, what is the minimum # of pc i need in order to install a gfs and
test?

ming

> 
> -Ben
> 
> > > 
> > > thanks.
> > > 
> > > 
> > > ming
> > 
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > http://www.redhat.com/mailman/listinfo/linux-cluster


From tony at sybaspace.com  Fri Feb  4 19:20:15 2005
From: tony at sybaspace.com (Tony Fraser)
Date: Fri, 04 Feb 2005 11:20:15 -0800
Subject: [Linux-cluster] GNBD & Network Outage
In-Reply-To: <42034DDC.9010002@pixexcel.co.uk>
References: <41F622D8.1060102@pixexcel.co.uk>
	<20050125205538.GD13289@phlogiston.msp.redhat.com>
	<42034DDC.9010002@pixexcel.co.uk>
Message-ID: <1107544814.1863.33.camel@sybaws1.office.sybaspace.com>

On Fri, 2005-02-04 at 02:26, Nigel Jewell wrote:
> The intention of the setup was to have two hosts both exporting an 
> unmounted device, and the alternative device using it as a RAID-1 
> device.  Then to use heartbeat to mount and unmount the partitions as 
> required.  For example:
> 
> HOST A:
> 
> /dev/hda1 (md0, ext3, mounted)
> /dev/hda2 (ext3, unmounted, gnbd_exported as A)
> /dev/gnbd/B (md0, ext3, mounted)
> 
> HOST B:
> 
> /dev/hda1 (ext3, unmounted, gnbd_exported as B)
> /dev/hda2 (md0, ext3, mounted)
> /dev/gnbd/A (md0, ext3, mounted)
> 
> I hope that makes sense.
> 
> If so, does what we are trying to achieve sound sensible?  Any 
> gotchas/advice?

That sounds to me just like what DRBD (http://www.drbd.org/) was deigned
to do.

You might want to take a look at it.

-- 
Tony Fraser
tony at sybaspace.com
Sybaspace Internet Solutions                        System Administrator
phone: (250) 246-5368                                fax: (250) 246-5398


From bmarzins at redhat.com  Fri Feb  4 20:00:47 2005
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Fri, 4 Feb 2005 14:00:47 -0600
Subject: [Linux-cluster] clvm mirroring target status
In-Reply-To: <1107535951.14182.75.camel@wavebreaker.eccent.be>
References: <1107535951.14182.75.camel@wavebreaker.eccent.be>
Message-ID: <20050204200047.GH3666@phlogiston.msp.redhat.com>

On Fri, Feb 04, 2005 at 05:52:31PM +0100, Filip Sergeys wrote:
> Hi,
> 
> We are going to install a linux cluster with 2 gnbd servers (no SPOF)
> and gfs + clvm on the cluster nodes (4 nodes). I have two options, if I
> read the docs well, for duplicating data on the gnbd servers:
> 1) using clvm target mirroring on the cluster nodes
> 2) use drbd underneath to mirror discs. Basically two disks per machine:
> 1 live disk which is mirrored with drbd to the second disk in the second
> machine and the other way around in the second machine 
> (so the second disk in the first machine is thus the mirror from the
> first (="live") disk in the second machine(sounds complicated, but it is
> just hard to write down)). 
> Both live disks from each machine will be combined as one logical disk
> (If I understood well, this is possible).
> 
> Question: what is the status of clvm mirroring? Is it stable? 
> Suppose it is stable, so I have a choice: which one of the options would
> any of you choose? Reason? (Stability, performance, ...)

I'm still not sure if cluster mirroring is available for testing (I don't
think that it is). It's defintely not considered stable.

I'm also sort of unsure about your drbd solution.
As far as I know, drbd only allows write access on one node at a time. So,
if the first machine uses drbd to write to a local device and one on the second
machine, the second machine cannot write to that device. drbd is only useful
for active passive setups.  If you are using pool multipathing to multipath
between the two gnbd servers, you could set it to failover mode, and modify
the fencing agent that you are using to fence the gnbd_server, to make it tell
drbd to fail over when you fence the server.

I have never tried this, but it seems reasonable. One issue would be how to
bring the failed server back up, since the devices are going to be out of sync.

http://www.drbd.org/start.html says that drbd still only allows write access
to one node at a time.

sorry :(

-Ben

> I found two hits on google concerning clvm mirroring, but both say it is
> not finished yet. However the most recent one is from june 2004. 
> I cannot test either because we have no spare machine. I'm going to buy
> two machine so I need to know which disk configuration I will be using.
> 
> Thanks in advance,
> 
> Regards,
> 
> Filip Sergeys
> 
> 
> 
> http://64.233.183.104/search?q=cache:r1Icx--aI2YJ:www.spinics.net/lists/gfs/msg03439.html+clvm+mirroring+gfs&hl=nl&start=12
> https://www.redhat.com/archives/linux-cluster/2004-June/msg00028.html
> 
> -- 
> *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
> * System Engineer, Verzekeringen NV *
> * www.verzekeringen.be              *
> * Oostkaai 23 B-2170 Merksem        *
> * 03/6416673 - 0477/340942          *
> *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*

> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster


From lhh at redhat.com  Fri Feb  4 20:02:16 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 04 Feb 2005 15:02:16 -0500
Subject: [Linux-cluster] gnbd server as shared storage
In-Reply-To: <1002123.1107541275046.JavaMail.SYSTEM@serraria>
References: <1002123.1107541275046.JavaMail.SYSTEM@serraria>
Message-ID: <1107547336.14020.155.camel@ayanami.boston.redhat.com>

On Fri, 2005-02-04 at 16:21 -0200, Cristiano da Costa wrote:

> I update the clumager package to last version, and now the nodes aren't rebooting when start clumanager, but the quorum aren't not recognized and in this case Im using 2 raws that bind to 2 gnbd imported devices.

They're not used for quorum; you're using an IP tiebreaker.  

> Feb  4 16:23:06 parati cluquorumd[5303]: <debug> Lack of Quorum Maintained
> Feb  4 16:23:06 parati clulockd[5316]: <debug> Quorum Event: NO QUORUM
> Feb  4 16:23:06 parati cluquorumd[5303]: <debug> VF: pid 5321 exited, status 0
> Feb  4 16:23:06 parati clulockd[5316]: <debug> I am the new lock keeper

Try running 'cluforce' or booting both nodes (see 'man cluforce' and
'man cludb' for more details).

-- Lon


From lhh at redhat.com  Fri Feb  4 20:03:37 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 04 Feb 2005 15:03:37 -0500
Subject: [Linux-cluster] a question about gfs file system
In-Reply-To: <000901c50ae3$feb22970$69050364@yazanz>
References: <000901c50ae3$feb22970$69050364@yazanz>
Message-ID: <1107547417.14020.157.camel@ayanami.boston.redhat.com>

On Fri, 2005-02-04 at 20:04 +0200, Yazan Al-Sheyyab wrote:
> hello everybody,
> 
>  how much space that gfs file system can take from a partion ?
> 
>  i mean that if i have a 500 MB partition and want to format it as gfs file
> system , so how much space will remain for me in this partition after format
> it as gfs file system?

Not sure of the exact metadata overhead, but it varies depending how
many journals you create (i.e. how many nodes will access the file
system).

-- Lon


From bmarzins at redhat.com  Fri Feb  4 20:20:31 2005
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Fri, 4 Feb 2005 14:20:31 -0600
Subject: [Linux-cluster] [Fwd: any special requirement on scsi device?]
In-Reply-To: <1107544500.3009.10.camel@localhost.localdomain>
References: <1107220639.3017.4.camel@localhost.localdomain>
	<20050204190309.GG3666@phlogiston.msp.redhat.com>
	<1107544500.3009.10.camel@localhost.localdomain>
Message-ID: <20050204202031.GI3666@phlogiston.msp.redhat.com>

On Fri, Feb 04, 2005 at 02:15:00PM -0500, Ming Zhang wrote:
> On Fri, 2005-02-04 at 14:03, Benjamin Marzinski wrote:
> > On Mon, Jan 31, 2005 at 08:17:19PM -0500, Ming Zhang wrote:
> > > Hi folks
> > > 
> > > I asked this question today, somebody there guided me to here. Can
> > > anybody help me on this? Thanks.
> > > 
> > > ps, how to subscribe on this list? Thanks.
> > 
> > I believe you can get to a subscription page from
> > http://sources.redhat.com/cluster
> thx a lot. though it is very slow to connect. :P
> 
> 
> >  
> > > Pls cc to me, Thanks.
> > > 
> > > Ming
> > > 
> > > -----Forwarded Message-----
> > > > From: Ming Zhang <mingz at ele.uri.edu>
> > > > To: opengfs-devel at lists.sourceforge.net
> > > > Subject: any special requirement on scsi device?
> > > > Date: Mon, 31 Jan 2005 16:41:00 -0500
> > > > 
> > > > Hi, folks
> > > > 
> > > > I wonder if opengfs has any special requirements on scsi deivces. For
> > > > example, reserve/release scsi commands.
> > > > 
> > > > the reason i ask this is we are developing a open source iscsi target
> > > > and would like to support opengfs. since we build all scsi response, we
> > > > need to know what extra scsi commands or feature we need to support.
> > 
> > Persistent reservations would be really useful for fencing.
> u mean this is "useful". but is this "enough"?

We have a fencing agent based on SCSI 3 persistent reservation. So if you
implement persistent reservation, that should be enough for GFS fencing
purposes.
 
> because many end users on our list ask for this feature. so i want to
> know what we need to do in order to get it done. thanks.
> 
> ps, what is the minimum # of pc i need in order to install a gfs and
> test?

two machines.  I'm not sure that we have made the persistent reservation
fencing agent generally available, but if you would like to test it, we can
defintely do that.

-Ben

> ming
> 
> > 
> > -Ben
> > 
> > > > 
> > > > thanks.
> > > > 
> > > > 
> > > > ming
> > > 
> > > --
> > > Linux-cluster mailing list
> > > Linux-cluster at redhat.com
> > > http://www.redhat.com/mailman/listinfo/linux-cluster


From mingz at ele.uri.edu  Fri Feb  4 21:11:29 2005
From: mingz at ele.uri.edu (Ming Zhang)
Date: Fri, 04 Feb 2005 16:11:29 -0500
Subject: [Linux-cluster] [Fwd: any special requirement on scsi device?]
In-Reply-To: <20050204202031.GI3666@phlogiston.msp.redhat.com>
References: <1107220639.3017.4.camel@localhost.localdomain>
	<20050204190309.GG3666@phlogiston.msp.redhat.com>
	<1107544500.3009.10.camel@localhost.localdomain>
	<20050204202031.GI3666@phlogiston.msp.redhat.com>
Message-ID: <1107551489.3003.9.camel@localhost.localdomain>

Thanks so much for the information. This is really helpful.

Ming


On Fri, 2005-02-04 at 15:20, Benjamin Marzinski wrote:
> On Fri, Feb 04, 2005 at 02:15:00PM -0500, Ming Zhang wrote:
> > On Fri, 2005-02-04 at 14:03, Benjamin Marzinski wrote:
> > > On Mon, Jan 31, 2005 at 08:17:19PM -0500, Ming Zhang wrote:
> > > > Hi folks
> > > > 
> > > > I asked this question today, somebody there guided me to here. Can
> > > > anybody help me on this? Thanks.
> > > > 
> > > > ps, how to subscribe on this list? Thanks.
> > > 
> > > I believe you can get to a subscription page from
> > > http://sources.redhat.com/cluster
> > thx a lot. though it is very slow to connect. :P
> > 
> > 
> > >  
> > > > Pls cc to me, Thanks.
> > > > 
> > > > Ming
> > > > 
> > > > -----Forwarded Message-----
> > > > > From: Ming Zhang <mingz at ele.uri.edu>
> > > > > To: opengfs-devel at lists.sourceforge.net
> > > > > Subject: any special requirement on scsi device?
> > > > > Date: Mon, 31 Jan 2005 16:41:00 -0500
> > > > > 
> > > > > Hi, folks
> > > > > 
> > > > > I wonder if opengfs has any special requirements on scsi deivces. For
> > > > > example, reserve/release scsi commands.
> > > > > 
> > > > > the reason i ask this is we are developing a open source iscsi target
> > > > > and would like to support opengfs. since we build all scsi response, we
> > > > > need to know what extra scsi commands or feature we need to support.
> > > 
> > > Persistent reservations would be really useful for fencing.
> > u mean this is "useful". but is this "enough"?
> 
> We have a fencing agent based on SCSI 3 persistent reservation. So if you
> implement persistent reservation, that should be enough for GFS fencing
> purposes.
>  
> > because many end users on our list ask for this feature. so i want to
> > know what we need to do in order to get it done. thanks.
> > 
> > ps, what is the minimum # of pc i need in order to install a gfs and
> > test?
> 
> two machines.  I'm not sure that we have made the persistent reservation
> fencing agent generally available, but if you would like to test it, we can
> defintely do that.
> 
> -Ben
> 
> > ming
> > 
> > > 
> > > -Ben
> > > 
> > > > > 
> > > > > thanks.
> > > > > 
> > > > > 
> > > > > ming
> > > > 
> > > > --
> > > > Linux-cluster mailing list
> > > > Linux-cluster at redhat.com
> > > > http://www.redhat.com/mailman/listinfo/linux-cluster


From alewis at redhat.com  Fri Feb  4 21:32:26 2005
From: alewis at redhat.com (AJ Lewis)
Date: Fri, 04 Feb 2005 15:32:26 -0600 (CST)
Subject: [Linux-cluster] Re: any special requirement on scsi device?
In-Reply-To: <1107551942.3003.19.camel@localhost.localdomain>
References: <1107551942.3003.19.camel@localhost.localdomain>
Message-ID: <20050204.153226.59659031.alewis@redhat.com>

From: Ming Zhang <mingz at ele.uri.edu>
Subject: [Iscsitarget-devel] [Fwd: Re: [Linux-cluster] [Fwd: any special requirement on scsi device?]]
Date: Fri, 04 Feb 2005 16:19:02 -0500

> This is what I got from linux-cluster list.
> 
> We need to have the persistent reserve support in order to support GFS.

Ming,

Persistant reservations *can* be used for fencing with GFS, but it is
not *required* for use with GFS.  AFAIK, there are no SCSI specific
commands that GFS requires at this time.

Regards,
-
AJ Lewis                                   Voice:  612-638-0500
Red Hat Inc.                               E-Mail: alewis at redhat.com
720 Washington Ave. SE, Suite 200
Minneapolis, MN 55414
   
Current GPG fingerprint = D9F8 EDCE 4242 855F A03D  9B63 F50C 54A8 578C 8715
Grab the key at: http://people.redhat.com/alewis/gpg.html or one of the
many keyservers out there...

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050204/e8c4ada1/attachment.sig>

From mingz at ele.uri.edu  Fri Feb  4 21:41:42 2005
From: mingz at ele.uri.edu (Ming Zhang)
Date: Fri, 04 Feb 2005 16:41:42 -0500
Subject: [Linux-cluster] Re: any special requirement on scsi device?
In-Reply-To: <20050204.153226.59659031.alewis@redhat.com>
References: <1107551942.3003.19.camel@localhost.localdomain>
	<20050204.153226.59659031.alewis@redhat.com>
Message-ID: <1107553301.3003.30.camel@localhost.localdomain>

hehe. confused.

so this is optional, but not mandatory. but how gfs decide whether to
use it?

seems u are on this list as well. so do u have experience on iet+gfs?

ming


On Fri, 2005-02-04 at 16:32, AJ Lewis wrote:
> From: Ming Zhang <mingz at ele.uri.edu>
> Subject: [Iscsitarget-devel] [Fwd: Re: [Linux-cluster] [Fwd: any special requirement on scsi device?]]
> Date: Fri, 04 Feb 2005 16:19:02 -0500
> 
> > This is what I got from linux-cluster list.
> > 
> > We need to have the persistent reserve support in order to support GFS.
> 
> Ming,
> 
> Persistant reservations *can* be used for fencing with GFS, but it is
> not *required* for use with GFS.  AFAIK, there are no SCSI specific
> commands that GFS requires at this time.
> 
> Regards,
> -
> AJ Lewis                                   Voice:  612-638-0500
> Red Hat Inc.                               E-Mail: alewis at redhat.com
> 720 Washington Ave. SE, Suite 200
> Minneapolis, MN 55414
>    
> Current GPG fingerprint = D9F8 EDCE 4242 855F A03D  9B63 F50C 54A8 578C 8715
> Grab the key at: http://people.redhat.com/alewis/gpg.html or one of the
> many keyservers out there...


From alewis at redhat.com  Fri Feb  4 21:55:43 2005
From: alewis at redhat.com (AJ Lewis)
Date: Fri, 04 Feb 2005 15:55:43 -0600 (CST)
Subject: [Linux-cluster] Re: any special requirement on scsi device?
In-Reply-To: <1107553301.3003.30.camel@localhost.localdomain>
References: <1107551942.3003.19.camel@localhost.localdomain>
	<20050204.153226.59659031.alewis@redhat.com>
	<1107553301.3003.30.camel@localhost.localdomain>
Message-ID: <20050204.155543.21920164.alewis@redhat.com>

From: Ming Zhang <mingz at ele.uri.edu>
Subject: Re: any special requirement on scsi device?
Date: Fri, 04 Feb 2005 16:41:42 -0500

> hehe. confused.
> 
> so this is optional, but not mandatory. but how gfs decide whether to
> use it?

There are cluster configuration file(s) that the cluster
infrastructure on uses to handle this.  GFS itself (the filesystem)
relies on the cluster infrastructure to handle things like fencing.

> seems u are on this list as well. so do u have experience on iet+gfs?

I've not used iet yet, but I have run GFS on iSCSI with hardware iSCSI targets and the linux-iscsi initiator from sourceforge.

Regards,
--
AJ Lewis                                   Voice:  612-638-0500
Red Hat Inc.                               E-Mail: alewis at redhat.com
720 Washington Ave. SE, Suite 200
Minneapolis, MN 55414
   
Current GPG fingerprint = D9F8 EDCE 4242 855F A03D  9B63 F50C 54A8 578C 8715
Grab the key at: http://people.redhat.com/alewis/gpg.html or one of the
many keyservers out there...

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050204/d04f6d06/attachment.sig>

From mingz at ele.uri.edu  Fri Feb  4 22:01:27 2005
From: mingz at ele.uri.edu (Ming Zhang)
Date: Fri, 04 Feb 2005 17:01:27 -0500
Subject: [Linux-cluster] Re: any special requirement on scsi device?
In-Reply-To: <20050204.155543.21920164.alewis@redhat.com>
References: <1107551942.3003.19.camel@localhost.localdomain>
	<20050204.153226.59659031.alewis@redhat.com>
	<1107553301.3003.30.camel@localhost.localdomain>
	<20050204.155543.21920164.alewis@redhat.com>
Message-ID: <1107554486.3003.38.camel@localhost.localdomain>

On Fri, 2005-02-04 at 16:55, AJ Lewis wrote:
> From: Ming Zhang <mingz at ele.uri.edu>
> Subject: Re: any special requirement on scsi device?
> Date: Fri, 04 Feb 2005 16:41:42 -0500
> 
> > hehe. confused.
> > 
> > so this is optional, but not mandatory. but how gfs decide whether to
> > use it?
> 
> There are cluster configuration file(s) that the cluster
> infrastructure on uses to handle this.  GFS itself (the filesystem)
> relies on the cluster infrastructure to handle things like fencing.
ic. thx.

> 
> > seems u are on this list as well. so do u have experience on iet+gfs?
> 
> I've not used iet yet, but I have run GFS on iSCSI with hardware iSCSI targets and the linux-iscsi initiator from sourceforge.
it is fine. i am thinking if i can run GFS on 2 linux running in vmware.

> 
> Regards,
> --
> AJ Lewis                                   Voice:  612-638-0500
> Red Hat Inc.                               E-Mail: alewis at redhat.com
> 720 Washington Ave. SE, Suite 200
> Minneapolis, MN 55414
>    
> Current GPG fingerprint = D9F8 EDCE 4242 855F A03D  9B63 F50C 54A8 578C 8715
> Grab the key at: http://people.redhat.com/alewis/gpg.html or one of the
> many keyservers out there...


From yazan at ccs.com.jo  Sat Feb  5 06:43:06 2005
From: yazan at ccs.com.jo (Yazan Al-Sheyyab)
Date: Sat, 5 Feb 2005 08:43:06 +0200
Subject: [Linux-cluster] a question about gfs file system
References: <000901c50ae3$feb22970$69050364@yazanz>
	<1107547417.14020.157.camel@ayanami.boston.redhat.com>
Message-ID: <005a01c50b4d$f310d0c0$69050364@yazanz>

Hi,

  OK if not sure , please can you tell me the minimum space for a partition
to put if i want to format it as gfs ?

 because i am using ORACLE and i need to make raw devices and i will give
each raw device a partition  and some raw devices have a space of 50 MB and
that in not enough for gfs , so if didi not sure about the space for gfs ,
please can you tell me the minimum space that i can use to be in a safe side
for gfs file system.

 THANKS.
----- Original Message ----- 
From: "Lon Hohberger" <lhh at redhat.com>
To: "linux clistering" <linux-cluster at redhat.com>
Sent: Friday, February 04, 2005 10:03 PM
Subject: Re: [Linux-cluster] a question about gfs file system


> On Fri, 2005-02-04 at 20:04 +0200, Yazan Al-Sheyyab wrote:
> > hello everybody,
> >
> >  how much space that gfs file system can take from a partion ?
> >
> >  i mean that if i have a 500 MB partition and want to format it as gfs
file
> > system , so how much space will remain for me in this partition after
format
> > it as gfs file system?
>
> Not sure of the exact metadata overhead, but it varies depending how
> many journals you create (i.e. how many nodes will access the file
> system).
>
> -- Lon
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster


From filip.sergeys at verzekeringen.be  Sat Feb  5 09:55:13 2005
From: filip.sergeys at verzekeringen.be (Filip Sergeys)
Date: Sat, 5 Feb 2005 10:55:13 +0100
Subject: [Linux-cluster] clvm mirroring target status
In-Reply-To: <20050204200047.GH3666@phlogiston.msp.redhat.com>
References: <1107535951.14182.75.camel@wavebreaker.eccent.be>
	<20050204200047.GH3666@phlogiston.msp.redhat.com>
Message-ID: <200502051055.13507.filip.sergeys@verzekeringen.be>

I'll try to explain it a bit more structured

Host A
--------
Disk A 
Disk Bm (mirrored disk of disk B in host B, unmounted)

Host B
--------
Disk B
Disk Am (mirrored disk of disk A in host A, unmounted)


Normal working situation:
---------------------------------
Disk A and Disk B are exported with GNBD. If I understood well, I can combine 
them into one logical disk for the clusternodes with clvm (striped maybe, 
don't know, need to read more about it).
Disk Am and Bm are basically only used as mirroring for A and B. THis is  done 
with drbd. So they are not taking part in rw actions in any way.

Host B goes down:
------------------------
Heartbeat says it is down, I cut the power.
This is what I think needs to be done:
-Heartbeat moves the virtual IP address of host B to Host A. This is the IP 
address by which disk B was exported
-Mount disk Bm read/write. 
-Export Bm with GNBD. The clusters should now be able to continue working, I 
think transparently (need to test that to know).

Concequences:
-------------------
Bringing host B back in the game needs a manual intervention. 
-Basically al services on the cluster nodes need to stop writing. 
-Sync the disk from Bm to B
-Give host B back its virtual ip address
-mount B read/write
-umount Bm in host A
-start all services again on the nodes.
=> I know this is not perfect. But we can live with that. This will need to 
happen after office hours. The thing is that we don't have the budget for 
shared storage and certainly not for a redundant shared storage solution 
because most entry level shared storages are SPOFs. 

I need to find out more about that multipathing. I am not sure how to use it 
in this configuration. 
If you have idea's for improvement, they are welcome. 

Regards,

Filip

PS. Thanx for your answer on the clvm mirroring state.

 
On Friday 04 February 2005 21:00, Benjamin Marzinski wrote:
> On Fri, Feb 04, 2005 at 05:52:31PM +0100, Filip Sergeys wrote:
> > Hi,
> >
> > We are going to install a linux cluster with 2 gnbd servers (no SPOF)
> > and gfs + clvm on the cluster nodes (4 nodes). I have two options, if I
> > read the docs well, for duplicating data on the gnbd servers:
> > 1) using clvm target mirroring on the cluster nodes
> > 2) use drbd underneath to mirror discs. Basically two disks per machine:
> > 1 live disk which is mirrored with drbd to the second disk in the second
> > machine and the other way around in the second machine
> > (so the second disk in the first machine is thus the mirror from the
> > first (="live") disk in the second machine(sounds complicated, but it is
> > just hard to write down)).
> > Both live disks from each machine will be combined as one logical disk
> > (If I understood well, this is possible).
> >
> > Question: what is the status of clvm mirroring? Is it stable?
> > Suppose it is stable, so I have a choice: which one of the options would
> > any of you choose? Reason? (Stability, performance, ...)
>
> I'm still not sure if cluster mirroring is available for testing (I don't
> think that it is). It's defintely not considered stable.
>
> I'm also sort of unsure about your drbd solution.
> As far as I know, drbd only allows write access on one node at a time. So,
> if the first machine uses drbd to write to a local device and one on the
> second machine, the second machine cannot write to that device. drbd is
> only useful for active passive setups.  If you are using pool multipathing
> to multipath between the two gnbd servers, you could set it to failover
> mode, and modify the fencing agent that you are using to fence the
> gnbd_server, to make it tell drbd to fail over when you fence the server.
>
> I have never tried this, but it seems reasonable. One issue would be how to
> bring the failed server back up, since the devices are going to be out of
> sync.
>
> http://www.drbd.org/start.html says that drbd still only allows write
> access to one node at a time.
>
> sorry :(
>
> -Ben
>
> > I found two hits on google concerning clvm mirroring, but both say it is
> > not finished yet. However the most recent one is from june 2004.
> > I cannot test either because we have no spare machine. I'm going to buy
> > two machine so I need to know which disk configuration I will be using.
> >
> > Thanks in advance,
> >
> > Regards,
> >
> > Filip Sergeys
> >
> >
> >
> > http://64.233.183.104/search?q=cache:r1Icx--aI2YJ:www.spinics.net/lists/g
> >fs/msg03439.html+clvm+mirroring+gfs&hl=nl&start=12
> > https://www.redhat.com/archives/linux-cluster/2004-June/msg00028.html
> >
> > --
> > *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
> > * System Engineer, Verzekeringen NV *
> > * www.verzekeringen.be              *
> > * Oostkaai 23 B-2170 Merksem        *
> > * 03/6416673 - 0477/340942          *
> > *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > http://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster


From bastian at waldi.eu.org  Mon Feb  7 10:09:49 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Mon, 7 Feb 2005 11:09:49 +0100
Subject: [Linux-cluster] [PATCH] dlm: Never link with ld -- Don't install
	headers executable
Message-ID: <20050207100949.GA32586@wavehammer.waldi.eu.org>

The following patch fixes two bugs in dlm.
- ld is used for linking which regulary fails.
- Headers are installed executable.

Bastian

-- 
Those who hate and fight must stop themselves -- otherwise it is not stopped.
		-- Spock, "Day of the Dove", stardate unknown
-------------- next part --------------
--- dlm/lib/Makefile	2005-02-06 18:53:03.000000000 +0100
+++ dlm/lib/Makefile	2005-02-06 19:28:17.000000000 +0100
@@ -46,7 +46,7 @@
 	${RANLIB} libdlm_lt.a 
 
 $(LIBNAME).so.${RELEASE_MAJOR}.${RELEASE_MINOR}: libdlm.po libaislock.po
-	$(LD) -shared -o $@ -soname=$(LIBNAME).so.$(RELEASE_MAJOR) $^
+	$(CC) -shared -o $@ -Wl,-soname=$(LIBNAME).so.$(RELEASE_MAJOR) $^
 
 $(LIBNAME)_lt.so.${RELEASE_MAJOR}.${RELEASE_MINOR}: libdlm_lt.po
 	$(CC) -shared -o $@ -Wl,-soname=$(LIBNAME)_lt.so.$(RELEASE_MAJOR) $^
@@ -74,7 +74,7 @@
 
 install: all
 	install -d ${incdir}
-	install libdlm.h ${incdir}
+	install -m644 libdlm.h ${incdir}
 	install -d ${libdir}
 	install $(LIBNAME).a ${libdir}
 	install $(LIBNAME)_lt.a ${libdir}
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050207/ae7ad323/attachment.sig>

From pcaulfie at redhat.com  Mon Feb  7 10:35:29 2005
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Mon, 7 Feb 2005 10:35:29 +0000
Subject: [Linux-cluster] [PATCH] dlm: Never link with ld -- Don't install
	headers executable
In-Reply-To: <20050207100949.GA32586@wavehammer.waldi.eu.org>
References: <20050207100949.GA32586@wavehammer.waldi.eu.org>
Message-ID: <20050207103529.GC25122@tykepenguin.com>

On Mon, Feb 07, 2005 at 11:09:49AM +0100, Bastian Blank wrote:
> The following patch fixes two bugs in dlm.
> - ld is used for linking which regulary fails.
> - Headers are installed executable.
> 

Applied.

Thanks.
-- 

patrick


From mtilstra at redhat.com  Mon Feb  7 14:09:30 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Mon, 7 Feb 2005 08:09:30 -0600
Subject: [Linux-cluster] a question about gfs file system
In-Reply-To: <005a01c50b4d$f310d0c0$69050364@yazanz>
References: <000901c50ae3$feb22970$69050364@yazanz>
	<1107547417.14020.157.camel@ayanami.boston.redhat.com>
	<005a01c50b4d$f310d0c0$69050364@yazanz>
Message-ID: <20050207140930.GA3020@redhat.com>

On Sat, Feb 05, 2005 at 08:43:06AM +0200, Yazan Al-Sheyyab wrote:
> Hi,
> 
>   OK if not sure , please can you tell me the minimum space for a partition
> to put if i want to format it as gfs ?

it varies.  by default, each journal is 128M, but you can change that
with mkfs.gfs.  The minimum is 32M.  The rest of the meta data?  I duno,
but it is rather dwarfed by the size of the journals.  I'm not sure
there has been much, if any, testing with devices smaller than a
gigabyte.


-- 
Michael Conrad Tadpol Tilstra
Discretion will protect you, and understanding will guard you.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050207/d531210b/attachment.sig>

From alain at tessiot.info  Mon Feb  7 08:56:20 2005
From: alain at tessiot.info (Alain TESSIOT)
Date: Mon, 7 Feb 2005 09:56:20 +0100
Subject: [Linux-cluster] Cluster Suite an network interfaces
Message-ID: <03ee01c50cf2$f90cc5f0$1b00a8c0@owtessiotalain>

hi all,

Excuse me for my bad english, I am french.
I have two servers each one has four network interfaces.
Server 1 : two bonded for the network : 222.10.3.1, two bonded for the cluster network : 192.168.0.1.
Server 2 : two bonded for the network : 222.10.3.2, two bonded for the cluster network : 192.168.0.2.
I have two rawdevices that works fine and a shared file system.
I try to make a cluster for httpd which ip is 222.10.3.10.
Everything works fine : if I stop the server 1, httpd, the virtual ip and the file system strats to run on the server 2.
But, if httpd runs on the the Server 1and I unplug the 2 network interfaces (222.10.3.1), nothing happens ....
Did I do something wrong ?

Many thanks for your help

Alain
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050207/573b6d82/attachment.htm>

From lhh at redhat.com  Mon Feb  7 15:04:30 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 07 Feb 2005 10:04:30 -0500
Subject: [Linux-cluster] Cluster Suite an network interfaces
In-Reply-To: <03ee01c50cf2$f90cc5f0$1b00a8c0@owtessiotalain>
References: <03ee01c50cf2$f90cc5f0$1b00a8c0@owtessiotalain>
Message-ID: <1107788670.9794.2.camel@ayanami.boston.redhat.com>

On Mon, 2005-02-07 at 09:56 +0100, Alain TESSIOT wrote:

> But, if httpd runs on the the Server 1and I unplug the 2 network
> interfaces (222.10.3.1), nothing happens ....
> Did I do something wrong ?

See:

https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=144488

-- Lon


From erwan at seanodes.com  Mon Feb  7 16:59:01 2005
From: erwan at seanodes.com (Velu Erwan)
Date: Mon, 07 Feb 2005 17:59:01 +0100
Subject: [Linux-cluster] About tuning GFS
Message-ID: <1107795541.32581.28.camel@R1.seanodes.com>

Hi folks,
I've been trying to understand the way I can tune my GFS 6.0.2
configuration.
I've found I can play using the setflag options which can change the 
exhash, jdata, unused, ea_indirect, directio, inherit_directio &
inherit_jdata

I know about direct_io and its inherit_directio regarding
http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-manage-direct-io.html

jdata are also decribes by
http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-manage-data-journal.html

But I didn't find anything for the other values.
What are theses options ?

I have the same questions regarding the gettune option which print a
lots of value which seems undocumented.

Could someone give me some help for understanding their behaviours ?
Regards,
-- 
Erwan Velu
Consultant - Seanodes SA
http://www.seanodes.com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: Ceci est une partie de message num?riquement sign?e
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050207/5932fd25/attachment.sig>

From danderso at redhat.com  Mon Feb  7 17:36:54 2005
From: danderso at redhat.com (Derek Anderson)
Date: Mon, 7 Feb 2005 11:36:54 -0600
Subject: [Linux-cluster] About tuning GFS
In-Reply-To: <1107795541.32581.28.camel@R1.seanodes.com>
References: <1107795541.32581.28.camel@R1.seanodes.com>
Message-ID: <200502071136.54157.danderso@redhat.com>

On Monday 07 February 2005 10:59, Velu Erwan wrote:
> Hi folks,
> I've been trying to understand the way I can tune my GFS 6.0.2
> configuration.
> I've found I can play using the setflag options which can change the
> exhash, jdata, unused, ea_indirect, directio, inherit_directio &
> inherit_jdata
>
> I know about direct_io and its inherit_directio regarding
> http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-manage-direct-io.ht
>ml
>
> jdata are also decribes by
> http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-manage-data-journal
>.html
>

See also section 9.4:
http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-manage-quota.html

If you are not using quotas you can turn this off which should reduce overhead 
some.

Another thing you can try is mounting with noatime (-o noatime):
http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-manage-atimeconf.html#S2-MANAGE-MOUNTNOATIME

> But I didn't find anything for the other values.
> What are theses options ?
>
> I have the same questions regarding the gettune option which print a
> lots of value which seems undocumented.
>
> Could someone give me some help for understanding their behaviours ?
> Regards,


From fmarchal at inf.ethz.ch  Mon Feb  7 21:41:52 2005
From: fmarchal at inf.ethz.ch (Fabrice Marchal)
Date: Mon, 07 Feb 2005 22:41:52 +0100
Subject: [Linux-cluster] GFS newbie question
In-Reply-To: <200502071136.54157.danderso@redhat.com>
References: <1107795541.32581.28.camel@R1.seanodes.com>
	<200502071136.54157.danderso@redhat.com>
Message-ID: <4207E0A0.5000809@inf.ethz.ch>

Hello,

I would like to know if it is possible to run GFS on a cluster *without*
external storage, i.e. without a NAS and with each GFS node a GNBD
server itself. Is it possible to implement disk redundancy in such a 
configuration,
as pictured on Fig 1.3 of the GFS manual? The simplest configuration
being:
node #1: Disk A, Disk B' (mirror of B on node #2)
node #2: Disk B, Disk A' (mirror of A on node #1)

Thanks,
Fabrice

-- 
========================================================================
Fabrice Marchal           http://www.inf.ethz.ch/~marchal 
fabrice.marchal at ieee.org  marchal at inf.ethz.ch        +41-(0)44-632-56-79
ETH Zurich, CoLab Computational Laboratory       FAX:+41-(0)44-632-17-03
========================================================================


From bmarzins at redhat.com  Mon Feb  7 21:50:17 2005
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Mon, 7 Feb 2005 15:50:17 -0600
Subject: [Linux-cluster] clvm mirroring target status
In-Reply-To: <200502051055.13507.filip.sergeys@verzekeringen.be>
References: <1107535951.14182.75.camel@wavebreaker.eccent.be>
	<20050204200047.GH3666@phlogiston.msp.redhat.com>
	<200502051055.13507.filip.sergeys@verzekeringen.be>
Message-ID: <20050207215017.GK3666@phlogiston.msp.redhat.com>

On Sat, Feb 05, 2005 at 10:55:13AM +0100, Filip Sergeys wrote:
> I'll try to explain it a bit more structured
> 
> Host A
> --------
> Disk A 
> Disk Bm (mirrored disk of disk B in host B, unmounted)
> 
> Host B
> --------
> Disk B
> Disk Am (mirrored disk of disk A in host A, unmounted)
> 
> 
> Normal working situation:
> ---------------------------------
> Disk A and Disk B are exported with GNBD. If I understood well, I can combine 
> them into one logical disk for the clusternodes with clvm (striped maybe, 
> don't know, need to read more about it).
> Disk Am and Bm are basically only used as mirroring for A and B. THis is  done 
> with drbd. So they are not taking part in rw actions in any way.
> 
> Host B goes down:
> ------------------------
> Heartbeat says it is down, I cut the power.
> This is what I think needs to be done:
> -Heartbeat moves the virtual IP address of host B to Host A. This is the IP 
> address by which disk B was exported
> -Mount disk Bm read/write. 
> -Export Bm with GNBD. The clusters should now be able to continue working, I 
> think transparently (need to test that to know).

O.k. The way you plan to use drbd makes sense.  The only issue is this: GFS
doesn't use Heartbeat, the cluster manager does its own heartbeating.  If you
have two different heartbeating mechanisms controlling failover, things won't
fail over all at once.  Ideally, for all the stuff below the filesystem layer,
including gnbd, you wouldn't use Heartbeat at all, but simply rely on the
cluster manager.  To do this, you would have to make drbd switch over when the
cluster manager detected a node failure.  This could be done by hacking a
fencing agent, as I mentioned in my previous e-mail. If you must use heartbeat
for the block device failover, you need to recoginze that this could happen
before, during, or after the gfs failover, which may (probably will) cause
problems occasionally.

Unfortunately, I'm not sure that your multipathing setup will work. I am
assuming that you are using pool for the multipathing.  pool multipathing
has two modes, round-robin, and failover. Obviously round-robin (which is where
pool uses all the paths) won't work, because you only have one path available at
once.  However, failover mode probably won't work either, in the setup you
explained. You would need to force pool to use Disk A from host A and Disk
B from host B.  Getting that to work right is probably possible, but not
easy or reliable.  The easiest way to do it is to make host A to have both
disk A and disk B, and make host B have disk Am and Bm. To do this, GNBD import
the disks from host A, assemble the pool, GNBD import the disks from host B,
and use pool_mp to integrate them into the pool. This should automatically
set you up in failover mode, with disks A and B as the primary disks and disks
Am and Bm as the backups. I realize that this means that hostB is usually
sitting idle.

If you name your devices correctly, or import them in a specific order, you
might be able to get pool to use the correct devices in the setup you
described, but I'm not certain.

What your design actually wants is for pool to not do multipathing at all, but
to simply retry on failed IO.  That way, when the virtual IP switches, gnbd
will just automatically pick up the device at its new location. Unfortunately,
pool and gnbd cannot do this.

-Ben
 
> Concequences:
> -------------------
> Bringing host B back in the game needs a manual intervention. 
> -Basically al services on the cluster nodes need to stop writing. 
> -Sync the disk from Bm to B
> -Give host B back its virtual ip address
> -mount B read/write
> -umount Bm in host A
> -start all services again on the nodes.
> => I know this is not perfect. But we can live with that. This will need to 
> happen after office hours. The thing is that we don't have the budget for 
> shared storage and certainly not for a redundant shared storage solution 
> because most entry level shared storages are SPOFs. 
> 
> I need to find out more about that multipathing. I am not sure how to use it 
> in this configuration. 
> If you have idea's for improvement, they are welcome. 
> 
> Regards,
> 
> Filip
> 
> PS. Thanx for your answer on the clvm mirroring state.
> 
>  
> 
> 
> 
> 
> On Friday 04 February 2005 21:00, Benjamin Marzinski wrote:
> > On Fri, Feb 04, 2005 at 05:52:31PM +0100, Filip Sergeys wrote:
> > > Hi,
> > >
> > > We are going to install a linux cluster with 2 gnbd servers (no SPOF)
> > > and gfs + clvm on the cluster nodes (4 nodes). I have two options, if I
> > > read the docs well, for duplicating data on the gnbd servers:
> > > 1) using clvm target mirroring on the cluster nodes
> > > 2) use drbd underneath to mirror discs. Basically two disks per machine:
> > > 1 live disk which is mirrored with drbd to the second disk in the second
> > > machine and the other way around in the second machine
> > > (so the second disk in the first machine is thus the mirror from the
> > > first (="live") disk in the second machine(sounds complicated, but it is
> > > just hard to write down)).
> > > Both live disks from each machine will be combined as one logical disk
> > > (If I understood well, this is possible).
> > >
> > > Question: what is the status of clvm mirroring? Is it stable?
> > > Suppose it is stable, so I have a choice: which one of the options would
> > > any of you choose? Reason? (Stability, performance, ...)
> >
> > I'm still not sure if cluster mirroring is available for testing (I don't
> > think that it is). It's defintely not considered stable.
> >
> > I'm also sort of unsure about your drbd solution.
> > As far as I know, drbd only allows write access on one node at a time. So,
> > if the first machine uses drbd to write to a local device and one on the
> > second machine, the second machine cannot write to that device. drbd is
> > only useful for active passive setups.  If you are using pool multipathing
> > to multipath between the two gnbd servers, you could set it to failover
> > mode, and modify the fencing agent that you are using to fence the
> > gnbd_server, to make it tell drbd to fail over when you fence the server.
> >
> > I have never tried this, but it seems reasonable. One issue would be how to
> > bring the failed server back up, since the devices are going to be out of
> > sync.
> >
> > http://www.drbd.org/start.html says that drbd still only allows write
> > access to one node at a time.
> >
> > sorry :(
> >
> > -Ben
> >
> > > I found two hits on google concerning clvm mirroring, but both say it is
> > > not finished yet. However the most recent one is from june 2004.
> > > I cannot test either because we have no spare machine. I'm going to buy
> > > two machine so I need to know which disk configuration I will be using.
> > >
> > > Thanks in advance,
> > >
> > > Regards,
> > >
> > > Filip Sergeys
> > >
> > >
> > >
> > > http://64.233.183.104/search?q=cache:r1Icx--aI2YJ:www.spinics.net/lists/g
> > >fs/msg03439.html+clvm+mirroring+gfs&hl=nl&start=12
> > > https://www.redhat.com/archives/linux-cluster/2004-June/msg00028.html
> > >
> > > --
> > > *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
> > > * System Engineer, Verzekeringen NV *
> > > * www.verzekeringen.be              *
> > > * Oostkaai 23 B-2170 Merksem        *
> > > * 03/6416673 - 0477/340942          *
> > > *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
> > >
> > > --
> > > Linux-cluster mailing list
> > > Linux-cluster at redhat.com
> > > http://www.redhat.com/mailman/listinfo/linux-cluster
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster at redhat.com
> > http://www.redhat.com/mailman/listinfo/linux-cluster
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster


From qwejohn at hotmail.com  Tue Feb  8 07:58:12 2005
From: qwejohn at hotmail.com (John Que)
Date: Tue, 08 Feb 2005 09:58:12 +0200
Subject: [Linux-cluster] page and buffer cache in Linux File Systems
Message-ID: <BAY14-F33D3E159D53FF95351B49AAF740@phx.gbl>

Hello,

I am a newbie to Linux Clusters and to Clusters File Systems in
particular.

I am reading a book which was recently published, named: "Building Clustered
Linux Systems" By Robert Lucke.(Published by Prentice Hall PTR /HP 
Professional
Series.)
(http://www.phptr.com/title/0131448536)

I have a question regarding buffer or page cache on cluster file systems.

I saw in Chapter 14 (page 429), the follwing text
(in 14.4 Commercially Available Cluster section):

"There is a set of common requirements that you'll find in all parallel
file systems used in Linux Clusters...
For datatbas use,you will find that the file systems needs to provide some 
form
of direct access mode,in which the database software can read and write data
blocks without using the Linux System's buffer or page cache.
The database software performs this caching itself and removes the 
associated
overhead".

The author here talks about a subset of Linux Clusters File Systems,
(parallel file systems);Specifically he mentioned PVFS1 and PVFS2.

What I do not understand is this:
Is this requirment specific to Cluster File Systems?
Because as I understand, this same requirement should exist also
for ordinary (non clustered) file systems like ext3.

Does anybody know of a clustered file system which implements such 
requirement?
Is there a way to configure / setup a clustered file system to avoid
page/buffer cache ? (to enable what the author calls "direct access mode").

Any pointers to info on this are welcomed.

And , I am not an expert in ext3 , but:
in case such a requirement DOES exist for non clustered file systems
(like ext3) - does ext3 implement this requirement ? How?

Regards,
John

_________________________________________________________________
Express yourself instantly with MSN Messenger! Download today it's FREE! 
http://messenger.msn.click-url.com/go/onm00200471ave/direct/01/


From filip.sergeys at verzekeringen.be  Tue Feb  8 08:40:32 2005
From: filip.sergeys at verzekeringen.be (Filip Sergeys)
Date: 08 Feb 2005 09:40:32 +0100
Subject: [Linux-cluster] clvm mirroring target status
In-Reply-To: <20050207215017.GK3666@phlogiston.msp.redhat.com>
References: <1107535951.14182.75.camel@wavebreaker.eccent.be>
	<20050204200047.GH3666@phlogiston.msp.redhat.com>
	<200502051055.13507.filip.sergeys@verzekeringen.be> 
	<20050207215017.GK3666@phlogiston.msp.redhat.com>
Message-ID: <1107852032.4838.53.camel@wavebreaker.eccent.be>

On Mon, 2005-02-07 at 22:50, Benjamin Marzinski wrote:

    On Sat, Feb 05, 2005 at 10:55:13AM +0100, Filip Sergeys wrote:
    > I'll try to explain it a bit more structured
    > 
    > Host A
    > --------
    > Disk A 
    > Disk Bm (mirrored disk of disk B in host B, unmounted)
    > 
    > Host B
    > --------
    > Disk B
    > Disk Am (mirrored disk of disk A in host A, unmounted)
    > 
    > 
    > Normal working situation:
    > ---------------------------------
    > Disk A and Disk B are exported with GNBD. If I understood well, I can combine 
    > them into one logical disk for the clusternodes with clvm (striped maybe, 
    > don't know, need to read more about it).
    > Disk Am and Bm are basically only used as mirroring for A and B. THis is  done 
    > with drbd. So they are not taking part in rw actions in any way.
    > 
    > Host B goes down:
    > ------------------------
    > Heartbeat says it is down, I cut the power.
    > This is what I think needs to be done:
    > -Heartbeat moves the virtual IP address of host B to Host A. This is the IP 
    > address by which disk B was exported
    > -Mount disk Bm read/write. 
    > -Export Bm with GNBD. The clusters should now be able to continue working, I 
    > think transparently (need to test that to know).
    
    O.k. The way you plan to use drbd makes sense.  The only issue is this: GFS
    doesn't use Heartbeat, the cluster manager does its own heartbeating.  If you
    have two different heartbeating mechanisms controlling failover, things won't
    fail over all at once.  Ideally, for all the stuff below the filesystem layer,
    including gnbd, you wouldn't use Heartbeat at all, but simply rely on the
    cluster manager.  To do this, you would have to make drbd switch over when the
    cluster manager detected a node failure.  This could be done by hacking a
    fencing agent, as I mentioned in my previous e-mail. If you must use heartbeat
    for the block device failover, you need to recoginze that this could happen
    before, during, or after the gfs failover, which may (probably will) cause
    problems occasionally.
    
    Unfortunately, I'm not sure that your multipathing setup will work. I am
    assuming that you are using pool for the multipathing.  pool multipathing
    has two modes, round-robin, and failover. Obviously round-robin (which is where
    pool uses all the paths) won't work, because you only have one path available at
    once.  However, failover mode probably won't work either, in the setup you
    explained. You would need to force pool to use Disk A from host A and Disk
    B from host B.  Getting that to work right is probably possible, but not
    easy or reliable.  The easiest way to do it is to make host A to have both
    disk A and disk B, and make host B have disk Am and Bm. To do this, GNBD import
    the disks from host A, assemble the pool, GNBD import the disks from host B,
    and use pool_mp to integrate them into the pool. This should automatically
    set you up in failover mode, with disks A and B as the primary disks and disks
    Am and Bm as the backups. I realize that this means that hostB is usually
    sitting idle.
    

This sounds hopefull

    If you name your devices correctly, or import them in a specific order, you
    might be able to get pool to use the correct devices in the setup you
    described, but I'm not certain.
    

"might be able..."-> Now I lost hope: I borrowed the idea of this setup
from the GFS admin guide, "the economy and performance" setup. 
(http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-ov-perform.html#S2-OV-ECONOMY)
Probably I misinterpreted the figure1-3, especially the disk part.
Can you elaborate a bit more on: "if you name your devices correctly or
import them in a specific order"?
If I put Am and Bm on one machine, export them with gnbd and join them
in the pool in failover mode, can I be sure there will be no writing on
them? Because drbd won't let that happen.

Thanx,

Filip Sergeys

    What your design actually wants is for pool to not do multipathing at all, but
    to simply retry on failed IO.  That way, when the virtual IP switches, gnbd
    will just automatically pick up the device at its new location. Unfortunately,
    pool and gnbd cannot do this.
    
    -Ben
     
    > Concequences:
    > -------------------
    > Bringing host B back in the game needs a manual intervention. 
    > -Basically al services on the cluster nodes need to stop writing. 
    > -Sync the disk from Bm to B
    > -Give host B back its virtual ip address
    > -mount B read/write
    > -umount Bm in host A
    > -start all services again on the nodes.
    > => I know this is not perfect. But we can live with that. This will need to 
    > happen after office hours. The thing is that we don't have the budget for 
    > shared storage and certainly not for a redundant shared storage solution 
    > because most entry level shared storages are SPOFs. 
    > 
    > I need to find out more about that multipathing. I am not sure how to use it 
    > in this configuration. 
    > If you have idea's for improvement, they are welcome. 
    > 
    > Regards,
    > 
    > Filip
    > 
    > PS. Thanx for your answer on the clvm mirroring state.
    > 
    >  
    > 
    > 
    > 
    > 
    > On Friday 04 February 2005 21:00, Benjamin Marzinski wrote:
    > > On Fri, Feb 04, 2005 at 05:52:31PM +0100, Filip Sergeys wrote:
    > > > Hi,
    > > >
    > > > We are going to install a linux cluster with 2 gnbd servers (no SPOF)
    > > > and gfs + clvm on the cluster nodes (4 nodes). I have two options, if I
    > > > read the docs well, for duplicating data on the gnbd servers:
    > > > 1) using clvm target mirroring on the cluster nodes
    > > > 2) use drbd underneath to mirror discs. Basically two disks per machine:
    > > > 1 live disk which is mirrored with drbd to the second disk in the second
    > > > machine and the other way around in the second machine
    > > > (so the second disk in the first machine is thus the mirror from the
    > > > first (="live") disk in the second machine(sounds complicated, but it is
    > > > just hard to write down)).
    > > > Both live disks from each machine will be combined as one logical disk
    > > > (If I understood well, this is possible).
    > > >
    > > > Question: what is the status of clvm mirroring? Is it stable?
    > > > Suppose it is stable, so I have a choice: which one of the options would
    > > > any of you choose? Reason? (Stability, performance, ...)
    > >
    > > I'm still not sure if cluster mirroring is available for testing (I don't
    > > think that it is). It's defintely not considered stable.
    > >
    > > I'm also sort of unsure about your drbd solution.
    > > As far as I know, drbd only allows write access on one node at a time. So,
    > > if the first machine uses drbd to write to a local device and one on the
    > > second machine, the second machine cannot write to that device. drbd is
    > > only useful for active passive setups.  If you are using pool multipathing
    > > to multipath between the two gnbd servers, you could set it to failover
    > > mode, and modify the fencing agent that you are using to fence the
    > > gnbd_server, to make it tell drbd to fail over when you fence the server.
    > >
    > > I have never tried this, but it seems reasonable. One issue would be how to
    > > bring the failed server back up, since the devices are going to be out of
    > > sync.
    > >
    > > http://www.drbd.org/start.html says that drbd still only allows write
    > > access to one node at a time.
    > >
    > > sorry :(
    > >
    > > -Ben
    > >
    > > > I found two hits on google concerning clvm mirroring, but both say it is
    > > > not finished yet. However the most recent one is from june 2004.
    > > > I cannot test either because we have no spare machine. I'm going to buy
    > > > two machine so I need to know which disk configuration I will be using.
    > > >
    > > > Thanks in advance,
    > > >
    > > > Regards,
    > > >
    > > > Filip Sergeys
    > > >
    > > >
    > > >
    > > > http://64.233.183.104/search?q=cache:r1Icx--aI2YJ:www.spinics.net/lists/g
    > > >fs/msg03439.html+clvm+mirroring+gfs&hl=nl&start=12
    > > > https://www.redhat.com/archives/linux-cluster/2004-June/msg00028.html
    > > >
    > > > --
    > > > *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
    > > > * System Engineer, Verzekeringen NV *
    > > > * www.verzekeringen.be              *
    > > > * Oostkaai 23 B-2170 Merksem        *
    > > > * 03/6416673 - 0477/340942          *
    > > > *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
    > > >
    > > > --
    > > > Linux-cluster mailing list
    > > > Linux-cluster at redhat.com
    > > > http://www.redhat.com/mailman/listinfo/linux-cluster
    > >
    > > --
    > > Linux-cluster mailing list
    > > Linux-cluster at redhat.com
    > > http://www.redhat.com/mailman/listinfo/linux-cluster
    > 
    > --
    > Linux-cluster mailing list
    > Linux-cluster at redhat.com
    > http://www.redhat.com/mailman/listinfo/linux-cluster
    
    --
    Linux-cluster mailing list
    Linux-cluster at redhat.com

http://www.redhat.com/mailman/listinfo/linux-cluster
-- 
*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
* System Engineer, Verzekeringen NV *
* www.verzekeringen.be              *
* Oostkaai 23 B-2170 Merksem        *
* 03/6416673 - 0477/340942          *
*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050208/515faf36/attachment.htm>

From bastian at waldi.eu.org  Tue Feb  8 13:54:33 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Tue, 8 Feb 2005 14:54:33 +0100
Subject: [Linux-cluster] static libs with PIC-code
Message-ID: <20050208135433.GC31051@wavehammer.waldi.eu.org>

cman, dlm and magma build static libraries with PIC-code, this makes it
impossible to link that code into binaries.

Bastian

-- 
Killing is stupid; useless!
		-- McCoy, "A Private Little War", stardate 4211.8
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050208/8a1230c6/attachment.sig>

From pcaulfie at redhat.com  Tue Feb  8 16:11:32 2005
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Tue, 8 Feb 2005 16:11:32 +0000
Subject: [Linux-cluster] static libs with PIC-code
In-Reply-To: <20050208135433.GC31051@wavehammer.waldi.eu.org>
References: <20050208135433.GC31051@wavehammer.waldi.eu.org>
Message-ID: <20050208161132.GA2354@tykepenguin.com>

On Tue, Feb 08, 2005 at 02:54:33PM +0100, Bastian Blank wrote:
> cman, dlm and magma build static libraries with PIC-code, this makes it
> impossible to link that code into binaries.


  revision 1.7
  date: 2004/10/25 17:52:29;  author: lhh;  state: Exp;  lines: +3 -3
  branches:  1.7.2;
  Make all the different libdlm targets use -fPIC during builds for proper symbol 
  relocation on x86_64

lon ??

-- 

patrick


From lhh at redhat.com  Tue Feb  8 16:17:47 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 08 Feb 2005 11:17:47 -0500
Subject: [Linux-cluster] static libs with PIC-code
In-Reply-To: <20050208135433.GC31051@wavehammer.waldi.eu.org>
References: <20050208135433.GC31051@wavehammer.waldi.eu.org>
Message-ID: <1107879467.9794.45.camel@ayanami.boston.redhat.com>

On Tue, 2005-02-08 at 14:54 +0100, Bastian Blank wrote:
> cman, dlm and magma build static libraries with PIC-code, this makes it
> impossible to link that code into binaries.

Eh?  On what architecture?

-- Lon


From lhh at redhat.com  Tue Feb  8 16:58:23 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 08 Feb 2005 11:58:23 -0500
Subject: [Linux-cluster] static libs with PIC-code
In-Reply-To: <1107879467.9794.45.camel@ayanami.boston.redhat.com>
References: <20050208135433.GC31051@wavehammer.waldi.eu.org>
	<1107879467.9794.45.camel@ayanami.boston.redhat.com>
Message-ID: <1107881903.9794.59.camel@ayanami.boston.redhat.com>

On Tue, 2005-02-08 at 11:17 -0500, Lon Hohberger wrote:
> On Tue, 2005-02-08 at 14:54 +0100, Bastian Blank wrote:
> > cman, dlm and magma build static libraries with PIC-code, this makes it
> > impossible to link that code into binaries.
> 
> Eh?  On what architecture?

Building -fPIC/-fpic generates position independent code, which is
needed on some architectures in order to create dynamically-loadable
plugins/libraries which use static libraries during linking.  Building
without -fPIC precludes this use.

-- Lon


From bmarzins at redhat.com  Tue Feb  8 17:37:02 2005
From: bmarzins at redhat.com (Benjamin Marzinski)
Date: Tue, 8 Feb 2005 11:37:02 -0600
Subject: [Linux-cluster] clvm mirroring target status
In-Reply-To: <1107852032.4838.53.camel@wavebreaker.eccent.be>
References: <1107535951.14182.75.camel@wavebreaker.eccent.be>
	<20050204200047.GH3666@phlogiston.msp.redhat.com>
	<200502051055.13507.filip.sergeys@verzekeringen.be>
	<20050207215017.GK3666@phlogiston.msp.redhat.com>
	<1107852032.4838.53.camel@wavebreaker.eccent.be>
Message-ID: <20050208173702.GL3666@phlogiston.msp.redhat.com>

>     easy or reliable.  The easiest way to do it is to make host A to have both
>     disk A and disk B, and make host B have disk Am and Bm. To do this, GNBD import
>     the disks from host A, assemble the pool, GNBD import the disks from host B,
>     and use pool_mp to integrate them into the pool. This should automatically
>     set you up in failover mode, with disks A and B as the primary disks and disks
>     Am and Bm as the backups. I realize that this means that hostB is usually
>     sitting idle.
>     
> 
> This sounds hopefull
> 
>     If you name your devices correctly, or import them in a specific order, you
>     might be able to get pool to use the correct devices in the setup you
>     described, but I'm not certain.
>     
> 
> "might be able..."-> Now I lost hope: I borrowed the idea of this setup
> from the GFS admin guide, "the economy and performance" setup. 
> (http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-ov-perform.html#S2-OV-ECONOMY)
> Probably I misinterpreted the figure1-3, especially the disk part.
> Can you elaborate a bit more on: "if you name your devices correctly or
> import them in a specific order"?
> If I put Am and Bm on one machine, export them with gnbd and join them
> in the pool in failover mode, can I be sure there will be no writing on
> them? Because drbd won't let that happen.

If you put Am and Bm on one machine, pool should be fine.  That method
should always work. The downside is that one machine is sitting idle.

If you fink around with names and ordering of imports and stuff, you can
probably get the setup you originally described to work. The benefit of
this setup is that both machines are in use until one goes down.  However,
getting this setup to work may be trickier, and without looking at the pool
code, I don't know exactly what you need to do.

Sorry if my email wasn't clear.

And about the admin-guide.... Um.... If you misinterpreted figure 1-3, then
I did too. I wrote GNBD. I know all the testing that QA has done on it, and
I have never heard of this setup being tested.  I expect that somewhere,
there is a marketing person to blame. That's not to say that it won't work. The
tricky thing is to get pool to select the correct devices as primary ones and
drbd to failover before pool does (which happens right after the failed node
is fenced).

Thanks for pointing this out to me.  I was wondering why I was getting so
many questions about drbd under gnbd. And this explains it.

-Ben
 
> Thanx,
> 
> Filip Sergeys
> 
>     What your design actually wants is for pool to not do multipathing at all, but
>     to simply retry on failed IO.  That way, when the virtual IP switches, gnbd
>     will just automatically pick up the device at its new location. Unfortunately,
>     pool and gnbd cannot do this.
>     
>     -Ben
>      
>     > Concequences:
>     > -------------------
>     > Bringing host B back in the game needs a manual intervention. 
>     > -Basically al services on the cluster nodes need to stop writing. 
>     > -Sync the disk from Bm to B
>     > -Give host B back its virtual ip address
>     > -mount B read/write
>     > -umount Bm in host A
>     > -start all services again on the nodes.
>     > => I know this is not perfect. But we can live with that. This will need to 
>     > happen after office hours. The thing is that we don't have the budget for 
>     > shared storage and certainly not for a redundant shared storage solution 
>     > because most entry level shared storages are SPOFs. 
>     > 
>     > I need to find out more about that multipathing. I am not sure how to use it 
>     > in this configuration. 
>     > If you have idea's for improvement, they are welcome. 
>     > 
>     > Regards,
>     > 
>     > Filip
>     > 
>     > PS. Thanx for your answer on the clvm mirroring state.
>     > 
>     >  
>     > 
>     > 
>     > 
>     > 
>     > On Friday 04 February 2005 21:00, Benjamin Marzinski wrote:
>     > > On Fri, Feb 04, 2005 at 05:52:31PM +0100, Filip Sergeys wrote:
>     > > > Hi,
>     > > >
>     > > > We are going to install a linux cluster with 2 gnbd servers (no SPOF)
>     > > > and gfs + clvm on the cluster nodes (4 nodes). I have two options, if I
>     > > > read the docs well, for duplicating data on the gnbd servers:
>     > > > 1) using clvm target mirroring on the cluster nodes
>     > > > 2) use drbd underneath to mirror discs. Basically two disks per machine:
>     > > > 1 live disk which is mirrored with drbd to the second disk in the second
>     > > > machine and the other way around in the second machine
>     > > > (so the second disk in the first machine is thus the mirror from the
>     > > > first (="live") disk in the second machine(sounds complicated, but it is
>     > > > just hard to write down)).
>     > > > Both live disks from each machine will be combined as one logical disk
>     > > > (If I understood well, this is possible).
>     > > >
>     > > > Question: what is the status of clvm mirroring? Is it stable?
>     > > > Suppose it is stable, so I have a choice: which one of the options would
>     > > > any of you choose? Reason? (Stability, performance, ...)
>     > >
>     > > I'm still not sure if cluster mirroring is available for testing (I don't
>     > > think that it is). It's defintely not considered stable.
>     > >
>     > > I'm also sort of unsure about your drbd solution.
>     > > As far as I know, drbd only allows write access on one node at a time. So,
>     > > if the first machine uses drbd to write to a local device and one on the
>     > > second machine, the second machine cannot write to that device. drbd is
>     > > only useful for active passive setups.  If you are using pool multipathing
>     > > to multipath between the two gnbd servers, you could set it to failover
>     > > mode, and modify the fencing agent that you are using to fence the
>     > > gnbd_server, to make it tell drbd to fail over when you fence the server.
>     > >
>     > > I have never tried this, but it seems reasonable. One issue would be how to
>     > > bring the failed server back up, since the devices are going to be out of
>     > > sync.
>     > >
>     > > http://www.drbd.org/start.html says that drbd still only allows write
>     > > access to one node at a time.
>     > >
>     > > sorry :(
>     > >
>     > > -Ben
>     > >
>     > > > I found two hits on google concerning clvm mirroring, but both say it is
>     > > > not finished yet. However the most recent one is from june 2004.
>     > > > I cannot test either because we have no spare machine. I'm going to buy
>     > > > two machine so I need to know which disk configuration I will be using.
>     > > >
>     > > > Thanks in advance,
>     > > >
>     > > > Regards,
>     > > >
>     > > > Filip Sergeys
>     > > >
>     > > >
>     > > >
>     > > > http://64.233.183.104/search?q=cache:r1Icx--aI2YJ:www.spinics.net/lists/g
>     > > >fs/msg03439.html+clvm+mirroring+gfs&hl=nl&start=12
>     > > > https://www.redhat.com/archives/linux-cluster/2004-June/msg00028.html
>     > > >
>     > > > --
>     > > > *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
>     > > > * System Engineer, Verzekeringen NV *
>     > > > * www.verzekeringen.be              *
>     > > > * Oostkaai 23 B-2170 Merksem        *
>     > > > * 03/6416673 - 0477/340942          *
>     > > > *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
>     > > >
>     > > > --
>     > > > Linux-cluster mailing list
>     > > > Linux-cluster at redhat.com
>     > > > http://www.redhat.com/mailman/listinfo/linux-cluster
>     > >
>     > > --
>     > > Linux-cluster mailing list
>     > > Linux-cluster at redhat.com
>     > > http://www.redhat.com/mailman/listinfo/linux-cluster
>     > 
>     > --
>     > Linux-cluster mailing list
>     > Linux-cluster at redhat.com
>     > http://www.redhat.com/mailman/listinfo/linux-cluster
>     
>     --
>     Linux-cluster mailing list
>     Linux-cluster at redhat.com
> 
> http://www.redhat.com/mailman/listinfo/linux-cluster
> -- 
> *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
> * System Engineer, Verzekeringen NV *
> * www.verzekeringen.be              *
> * Oostkaai 23 B-2170 Merksem        *
> * 03/6416673 - 0477/340942          *
> *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*

> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster


From lhh at redhat.com  Tue Feb  8 18:02:39 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Tue, 08 Feb 2005 13:02:39 -0500
Subject: [Linux-cluster] page and buffer cache in Linux File Systems
In-Reply-To: <BAY14-F33D3E159D53FF95351B49AAF740@phx.gbl>
References: <BAY14-F33D3E159D53FF95351B49AAF740@phx.gbl>
Message-ID: <1107885759.9794.65.camel@ayanami.boston.redhat.com>

On Tue, 2005-02-08 at 09:58 +0200, John Que wrote:

> And , I am not an expert in ext3 , but:
> in case such a requirement DOES exist for non clustered file systems
> (like ext3) - does ext3 implement this requirement ? How?

I believe O_DIRECT is what you're looking for on ext3.

-- Lon


From mtilstra at redhat.com  Tue Feb  8 18:38:31 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Tue, 8 Feb 2005 12:38:31 -0600
Subject: [Linux-cluster] page and buffer cache in Linux File Systems
In-Reply-To: <1107885759.9794.65.camel@ayanami.boston.redhat.com>
References: <BAY14-F33D3E159D53FF95351B49AAF740@phx.gbl>
	<1107885759.9794.65.camel@ayanami.boston.redhat.com>
Message-ID: <20050208183831.GA3893@redhat.com>

On Tue, Feb 08, 2005 at 01:02:39PM -0500, Lon Hohberger wrote:
> On Tue, 2005-02-08 at 09:58 +0200, John Que wrote:
> 
> > And , I am not an expert in ext3 , but:
> > in case such a requirement DOES exist for non clustered file systems
> > (like ext3) - does ext3 implement this requirement ? How?
> 
> I believe O_DIRECT is what you're looking for on ext3.

And gfs.

-- 
Michael Conrad Tadpol Tilstra
I've a memory like a sponge. It's soft, squishy and full of holes.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050208/ecfd8594/attachment.sig>

From vahram at Broadspire.com  Tue Feb  8 21:43:19 2005
From: vahram at Broadspire.com (vahram)
Date: Tue, 08 Feb 2005 13:43:19 -0800
Subject: [Linux-cluster] cluster architecture
Message-ID: <42093277.8090906@broadspire.com>

Hi all,

I'm planning to put together a production web server farm that will 
consist of at least 6 servers.  They will all be running Apache and 
Postfix, and will be sharing a 4+TB storage device.  Horizontal 
scalability is a major issue for us.

I just wanted to get some general recommendations on who to go with for 
our storage needs.  We were considering a Netapp appliance, but the cost 
is extremely high and their solution is probably a bit overkill for our 
needs.  Cost is a major issue for us.

How does the performance of a Netapp appliance running NFS compare to a 
fibre-based storage device (such as an Apple XServe RAID or similar 
unit) running GFS?  Is anyone here running GFS on a production server 
farm?  Thanks!

-vahram


From bastian at waldi.eu.org  Wed Feb  9 11:27:01 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Wed, 9 Feb 2005 12:27:01 +0100
Subject: [Linux-cluster] static libs with PIC-code
In-Reply-To: <1107879467.9794.45.camel@ayanami.boston.redhat.com>
References: <20050208135433.GC31051@wavehammer.waldi.eu.org>
	<1107879467.9794.45.camel@ayanami.boston.redhat.com>
Message-ID: <20050209112701.GB14220@wavehammer.waldi.eu.org>

On Tue, Feb 08, 2005 at 11:17:47AM -0500, Lon Hohberger wrote:
> On Tue, 2005-02-08 at 14:54 +0100, Bastian Blank wrote:
> > cman, dlm and magma build static libraries with PIC-code, this makes it
> > impossible to link that code into binaries.
> Eh?  On what architecture?

ARM for example.

Bastian

-- 
You're too beautiful to ignore.  Too much woman.
		-- Kirk to Yeoman Rand, "The Enemy Within", stardate unknown
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050209/e2341425/attachment.sig>

From bastian at waldi.eu.org  Wed Feb  9 11:28:38 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Wed, 9 Feb 2005 12:28:38 +0100
Subject: [Linux-cluster] static libs with PIC-code
In-Reply-To: <1107881903.9794.59.camel@ayanami.boston.redhat.com>
References: <20050208135433.GC31051@wavehammer.waldi.eu.org>
	<1107879467.9794.45.camel@ayanami.boston.redhat.com>
	<1107881903.9794.59.camel@ayanami.boston.redhat.com>
Message-ID: <20050209112838.GC14220@wavehammer.waldi.eu.org>

On Tue, Feb 08, 2005 at 11:58:23AM -0500, Lon Hohberger wrote:
> Building -fPIC/-fpic generates position independent code, which is
> needed on some architectures in order to create dynamically-loadable
> plugins/libraries which use static libraries during linking.  Building
> without -fPIC precludes this use.

Please take a look at X11 and/or the debian version of sdl, they build
special pic static libs for this case.

Bastian

-- 
The heart is not a logical organ.
		-- Dr. Janet Wallace, "The Deadly Years", stardate 3479.4
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050209/07a96293/attachment.sig>

From filip.sergeys at verzekeringen.be  Wed Feb  9 12:14:25 2005
From: filip.sergeys at verzekeringen.be (Filip Sergeys)
Date: 09 Feb 2005 13:14:25 +0100
Subject: [Linux-cluster] clvm mirroring target status
In-Reply-To: <20050208173702.GL3666@phlogiston.msp.redhat.com>
References: <1107535951.14182.75.camel@wavebreaker.eccent.be>
	<20050204200047.GH3666@phlogiston.msp.redhat.com>
	<200502051055.13507.filip.sergeys@verzekeringen.be>
	<20050207215017.GK3666@phlogiston.msp.redhat.com>
	<1107852032.4838.53.camel@wavebreaker.eccent.be> 
	<20050208173702.GL3666@phlogiston.msp.redhat.com>
Message-ID: <1107951265.3919.92.camel@wavebreaker.eccent.be>

Thank you for your clarification. 
If you are getting so many questions about this setup it's probably
because people find it an appealing configuration. A good sign maybe for
redhat to accomodate the pool a make it work that way. That would be
true customer driven development. Just hinting ;)

Thanx for your help so far,

Regards,

Filip Sergeys


On Tue, 2005-02-08 at 18:37, Benjamin Marzinski wrote:

    >     easy or reliable.  The easiest way to do it is to make host A to have both
    >     disk A and disk B, and make host B have disk Am and Bm. To do this, GNBD import
    >     the disks from host A, assemble the pool, GNBD import the disks from host B,
    >     and use pool_mp to integrate them into the pool. This should automatically
    >     set you up in failover mode, with disks A and B as the primary disks and disks
    >     Am and Bm as the backups. I realize that this means that hostB is usually
    >     sitting idle.
    >     
    > 
    > This sounds hopefull
    > 
    >     If you name your devices correctly, or import them in a specific order, you
    >     might be able to get pool to use the correct devices in the setup you
    >     described, but I'm not certain.
    >     
    > 
    > "might be able..."-> Now I lost hope: I borrowed the idea of this setup
    > from the GFS admin guide, "the economy and performance" setup. 
    > (http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-ov-perform.html#S2-OV-ECONOMY)
    > Probably I misinterpreted the figure1-3, especially the disk part.
    > Can you elaborate a bit more on: "if you name your devices correctly or
    > import them in a specific order"?
    > If I put Am and Bm on one machine, export them with gnbd and join them
    > in the pool in failover mode, can I be sure there will be no writing on
    > them? Because drbd won't let that happen.
    
    If you put Am and Bm on one machine, pool should be fine.  That method
    should always work. The downside is that one machine is sitting idle.
    
    If you fink around with names and ordering of imports and stuff, you can
    probably get the setup you originally described to work. The benefit of
    this setup is that both machines are in use until one goes down.  However,
    getting this setup to work may be trickier, and without looking at the pool
    code, I don't know exactly what you need to do.
    
    Sorry if my email wasn't clear.
    
    And about the admin-guide.... Um.... If you misinterpreted figure 1-3, then
    I did too. I wrote GNBD. I know all the testing that QA has done on it, and
    I have never heard of this setup being tested.  I expect that somewhere,
    there is a marketing person to blame. That's not to say that it won't work. The
    tricky thing is to get pool to select the correct devices as primary ones and
    drbd to failover before pool does (which happens right after the failed node
    is fenced).
    
    Thanks for pointing this out to me.  I was wondering why I was getting so
    many questions about drbd under gnbd. And this explains it.
    
    -Ben
     
    > Thanx,
    > 
    > Filip Sergeys
    > 
    >     What your design actually wants is for pool to not do multipathing at all, but
    >     to simply retry on failed IO.  That way, when the virtual IP switches, gnbd
    >     will just automatically pick up the device at its new location. Unfortunately,
    >     pool and gnbd cannot do this.
    >     
    >     -Ben
    >      
    >     > Concequences:
    >     > -------------------
    >     > Bringing host B back in the game needs a manual intervention. 
    >     > -Basically al services on the cluster nodes need to stop writing. 
    >     > -Sync the disk from Bm to B
    >     > -Give host B back its virtual ip address
    >     > -mount B read/write
    >     > -umount Bm in host A
    >     > -start all services again on the nodes.
    >     > => I know this is not perfect. But we can live with that. This will need to 
    >     > happen after office hours. The thing is that we don't have the budget for 
    >     > shared storage and certainly not for a redundant shared storage solution 
    >     > because most entry level shared storages are SPOFs. 
    >     > 
    >     > I need to find out more about that multipathing. I am not sure how to use it 
    >     > in this configuration. 
    >     > If you have idea's for improvement, they are welcome. 
    >     > 
    >     > Regards,
    >     > 
    >     > Filip
    >     > 
    >     > PS. Thanx for your answer on the clvm mirroring state.
    >     > 
    >     >  
    >     > 
    >     > 
    >     > 
    >     > 
    >     > On Friday 04 February 2005 21:00, Benjamin Marzinski wrote:
    >     > > On Fri, Feb 04, 2005 at 05:52:31PM +0100, Filip Sergeys wrote:
    >     > > > Hi,
    >     > > >
    >     > > > We are going to install a linux cluster with 2 gnbd servers (no SPOF)
    >     > > > and gfs + clvm on the cluster nodes (4 nodes). I have two options, if I
    >     > > > read the docs well, for duplicating data on the gnbd servers:
    >     > > > 1) using clvm target mirroring on the cluster nodes
    >     > > > 2) use drbd underneath to mirror discs. Basically two disks per machine:
    >     > > > 1 live disk which is mirrored with drbd to the second disk in the second
    >     > > > machine and the other way around in the second machine
    >     > > > (so the second disk in the first machine is thus the mirror from the
    >     > > > first (="live") disk in the second machine(sounds complicated, but it is
    >     > > > just hard to write down)).
    >     > > > Both live disks from each machine will be combined as one logical disk
    >     > > > (If I understood well, this is possible).
    >     > > >
    >     > > > Question: what is the status of clvm mirroring? Is it stable?
    >     > > > Suppose it is stable, so I have a choice: which one of the options would
    >     > > > any of you choose? Reason? (Stability, performance, ...)
    >     > >
    >     > > I'm still not sure if cluster mirroring is available for testing (I don't
    >     > > think that it is). It's defintely not considered stable.
    >     > >
    >     > > I'm also sort of unsure about your drbd solution.
    >     > > As far as I know, drbd only allows write access on one node at a time. So,
    >     > > if the first machine uses drbd to write to a local device and one on the
    >     > > second machine, the second machine cannot write to that device. drbd is
    >     > > only useful for active passive setups.  If you are using pool multipathing
    >     > > to multipath between the two gnbd servers, you could set it to failover
    >     > > mode, and modify the fencing agent that you are using to fence the
    >     > > gnbd_server, to make it tell drbd to fail over when you fence the server.
    >     > >
    >     > > I have never tried this, but it seems reasonable. One issue would be how to
    >     > > bring the failed server back up, since the devices are going to be out of
    >     > > sync.
    >     > >
    >     > > http://www.drbd.org/start.html says that drbd still only allows write
    >     > > access to one node at a time.
    >     > >
    >     > > sorry :(
    >     > >
    >     > > -Ben
    >     > >
    >     > > > I found two hits on google concerning clvm mirroring, but both say it is
    >     > > > not finished yet. However the most recent one is from june 2004.
    >     > > > I cannot test either because we have no spare machine. I'm going to buy
    >     > > > two machine so I need to know which disk configuration I will be using.
    >     > > >
    >     > > > Thanks in advance,
    >     > > >
    >     > > > Regards,
    >     > > >
    >     > > > Filip Sergeys
    >     > > >
    >     > > >
    >     > > >
    >     > > > http://64.233.183.104/search?q=cache:r1Icx--aI2YJ:www.spinics.net/lists/g
    >     > > >fs/msg03439.html+clvm+mirroring+gfs&hl=nl&start=12
    >     > > > https://www.redhat.com/archives/linux-cluster/2004-June/msg00028.html
    >     > > >
    >     > > > --
    >     > > > *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
    >     > > > * System Engineer, Verzekeringen NV *
    >     > > > * www.verzekeringen.be              *
    >     > > > * Oostkaai 23 B-2170 Merksem        *
    >     > > > * 03/6416673 - 0477/340942          *
    >     > > > *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
    >     > > >
    >     > > > --
    >     > > > Linux-cluster mailing list
    >     > > > Linux-cluster at redhat.com
    >     > > > http://www.redhat.com/mailman/listinfo/linux-cluster
    >     > >
    >     > > --
    >     > > Linux-cluster mailing list
    >     > > Linux-cluster at redhat.com
    >     > > http://www.redhat.com/mailman/listinfo/linux-cluster
    >     > 
    >     > --
    >     > Linux-cluster mailing list
    >     > Linux-cluster at redhat.com
    >     > http://www.redhat.com/mailman/listinfo/linux-cluster
    >     
    >     --
    >     Linux-cluster mailing list
    >     Linux-cluster at redhat.com
    > 
    > http://www.redhat.com/mailman/listinfo/linux-cluster
    > -- 
    > *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
    > * System Engineer, Verzekeringen NV *
    > * www.verzekeringen.be              *
    > * Oostkaai 23 B-2170 Merksem        *
    > * 03/6416673 - 0477/340942          *
    > *-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
    
    > --
    > Linux-cluster mailing list
    > Linux-cluster at redhat.com
    > http://www.redhat.com/mailman/listinfo/linux-cluster
    
    --
    Linux-cluster mailing list
    Linux-cluster at redhat.com

http://www.redhat.com/mailman/listinfo/linux-cluster
-- 
*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
* System Engineer, Verzekeringen NV *
* www.verzekeringen.be              *
* Oostkaai 23 B-2170 Merksem        *
* 03/6416673 - 0477/340942          *
*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050209/b058ab85/attachment.htm>

From rstevens at vitalstream.com  Wed Feb  9 18:32:09 2005
From: rstevens at vitalstream.com (Rick Stevens)
Date: Wed, 09 Feb 2005 10:32:09 -0800
Subject: [Linux-cluster] cluster architecture
In-Reply-To: <42093277.8090906@broadspire.com>
References: <42093277.8090906@broadspire.com>
Message-ID: <420A5729.80403@vitalstream.com>

vahram wrote:
> Hi all,
> 
> I'm planning to put together a production web server farm that will 
> consist of at least 6 servers.  They will all be running Apache and 
> Postfix, and will be sharing a 4+TB storage device.  Horizontal 
> scalability is a major issue for us.
> 
> I just wanted to get some general recommendations on who to go with for 
> our storage needs.  We were considering a Netapp appliance, but the cost 
> is extremely high and their solution is probably a bit overkill for our 
> needs.  Cost is a major issue for us.
> 
> How does the performance of a Netapp appliance running NFS compare to a 
> fibre-based storage device (such as an Apple XServe RAID or similar 
> unit) running GFS?  Is anyone here running GFS on a production server 
> farm?  Thanks!

We use NetApps a lot.  Their performance is terrific, but it is NFS over
gigabit ethernet with all that entails and isn't as high as it would be
on a SAN or other block-level device (this is true for any NAS).

I will say that NetApps are bulletproof, easy to expand and software
updates are very, very simple.  Licensing is not cheap, but the fact you
can run CIFS and NFS simultaneously is a plus.  Yes, they cost money,
but you get what you pay for.  You could simulate a NetApp by getting a
really beefy server with a FC or SCSI SAN attached to it and making it
an NFS (and possibly Samba) server.  I won't swear to what kind of
performance you'd get, but you could possibly get 80% of wire speed,
depending on your network architecture and other features.

If you're using any NFS or NAS as a common file system, make sure you
have "noac" set for the mounts or you may miss files put on the storage
by other systems.  Unfortunately, this eats into performance, but that's
the nature of the beast.

As far as SANs are concerned, you'll probably need a fiberchannel system
for 6 nodes unless you can find a 6-port SCSI unit (doubtful).  If you
choose FC, you'll need to think about the switch fabric and whether you
will have to deal with multipathing.  If that's true, you have to make
sure your vendor has multipathing modules for your kernel.  You also
need to look at bandwidth and whether the SAN you're looking at can
sustain the I/O bandwidth you want.  You also need to figure out how
you're going to share that storage among the nodes in the cluster.

We are evaluating several fairly large SANs for use with GFS, but our
bandwidth needs are a bit, well, over-the-top.  We need 9Gbps aggregate
throughput.  We're looking at IBM as well as Hitachi FC SAN solutions.
----------------------------------------------------------------------
- Rick Stevens, Senior Systems Engineer     rstevens at vitalstream.com -
- VitalStream, Inc.                       http://www.vitalstream.com -
-                                                                    -
-                 IGNORE that man behind the keyboard!               -
-                                                - The Wizard of OS  -
----------------------------------------------------------------------


From vahram at Broadspire.com  Wed Feb  9 18:59:09 2005
From: vahram at Broadspire.com (vahram)
Date: Wed, 09 Feb 2005 10:59:09 -0800
Subject: [Linux-cluster] cluster architecture
In-Reply-To: <420A5729.80403@vitalstream.com>
References: <42093277.8090906@broadspire.com> <420A5729.80403@vitalstream.com>
Message-ID: <420A5D7D.8060200@broadspire.com>

Raw throughput isn't really an issue for us.  We're more interested in 
seek times.  My biggest concern with GFS is stability and 
performance...any feedback in regards to that would be greatly 
appreciated.  Thanks!

Rick Stevens wrote:
> vahram wrote:
> 
>> Hi all,
>>
>> I'm planning to put together a production web server farm that will 
>> consist of at least 6 servers.  They will all be running Apache and 
>> Postfix, and will be sharing a 4+TB storage device.  Horizontal 
>> scalability is a major issue for us.
>>
>> I just wanted to get some general recommendations on who to go with 
>> for our storage needs.  We were considering a Netapp appliance, but 
>> the cost is extremely high and their solution is probably a bit 
>> overkill for our needs.  Cost is a major issue for us.
>>
>> How does the performance of a Netapp appliance running NFS compare to 
>> a fibre-based storage device (such as an Apple XServe RAID or similar 
>> unit) running GFS?  Is anyone here running GFS on a production server 
>> farm?  Thanks!
> 
> 
> We use NetApps a lot.  Their performance is terrific, but it is NFS over
> gigabit ethernet with all that entails and isn't as high as it would be
> on a SAN or other block-level device (this is true for any NAS).
> 
> I will say that NetApps are bulletproof, easy to expand and software
> updates are very, very simple.  Licensing is not cheap, but the fact you
> can run CIFS and NFS simultaneously is a plus.  Yes, they cost money,
> but you get what you pay for.  You could simulate a NetApp by getting a
> really beefy server with a FC or SCSI SAN attached to it and making it
> an NFS (and possibly Samba) server.  I won't swear to what kind of
> performance you'd get, but you could possibly get 80% of wire speed,
> depending on your network architecture and other features.
> 
> If you're using any NFS or NAS as a common file system, make sure you
> have "noac" set for the mounts or you may miss files put on the storage
> by other systems.  Unfortunately, this eats into performance, but that's
> the nature of the beast.
> 
> As far as SANs are concerned, you'll probably need a fiberchannel system
> for 6 nodes unless you can find a 6-port SCSI unit (doubtful).  If you
> choose FC, you'll need to think about the switch fabric and whether you
> will have to deal with multipathing.  If that's true, you have to make
> sure your vendor has multipathing modules for your kernel.  You also
> need to look at bandwidth and whether the SAN you're looking at can
> sustain the I/O bandwidth you want.  You also need to figure out how
> you're going to share that storage among the nodes in the cluster.
> 
> We are evaluating several fairly large SANs for use with GFS, but our
> bandwidth needs are a bit, well, over-the-top.  We need 9Gbps aggregate
> throughput.  We're looking at IBM as well as Hitachi FC SAN solutions.
> ----------------------------------------------------------------------
> - Rick Stevens, Senior Systems Engineer     rstevens at vitalstream.com -
> - VitalStream, Inc.                       http://www.vitalstream.com -
> -                                                                    -
> -                 IGNORE that man behind the keyboard!               -
> -                                                - The Wizard of OS  -
> ----------------------------------------------------------------------
> 
> -- 
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster


From rstevens at vitalstream.com  Wed Feb  9 19:21:34 2005
From: rstevens at vitalstream.com (Rick Stevens)
Date: Wed, 09 Feb 2005 11:21:34 -0800
Subject: [Linux-cluster] cluster architecture
In-Reply-To: <420A5D7D.8060200@broadspire.com>
References: <42093277.8090906@broadspire.com> <420A5729.80403@vitalstream.com>
	<420A5D7D.8060200@broadspire.com>
Message-ID: <420A62BE.2050804@vitalstream.com>

vahram wrote:
> Raw throughput isn't really an issue for us.  We're more interested in 
> seek times.  My biggest concern with GFS is stability and 
> performance...any feedback in regards to that would be greatly 
> appreciated.  Thanks!

So far, GFS has worked quite well under our tests.  We have yet to have
it break.  Our current GFS implementation is only on two nodes with gulm
running on a separate lock server.  I intend to update the kernels on
those nodes sometime this week (to the 2.6.11 variety) and change the
locking from gulm to cman (since that seems to be fixed at this point).

The SAN that's attached is an off-brand FC unit via a dual-port switch
and the nodes are using QLogic QLA2300 HBAs.

The application we've been using to test hasn't stressed it that badly
(it's been pooping out long before the servers were stressed), so I
can't say a whole lot.  I can say that rsyncing the entire filesystem
(20GB) didn't cause any problems, either pitching or catching.
----------------------------------------------------------------------
- Rick Stevens, Senior Systems Engineer     rstevens at vitalstream.com -
- VitalStream, Inc.                       http://www.vitalstream.com -
-                                                                    -
-   You possess a mind not merely twisted, but actually sprained.    -
----------------------------------------------------------------------


From ivan.ivanyi at isb-sib.ch  Thu Feb 10 11:02:17 2005
From: ivan.ivanyi at isb-sib.ch (IVANYI Ivan)
Date: Thu, 10 Feb 2005 12:02:17 +0100
Subject: [Linux-cluster] cluster architecture
In-Reply-To: <420A62BE.2050804@vitalstream.com>
References: <42093277.8090906@broadspire.com>
	<420A5729.80403@vitalstream.com>	<420A5D7D.8060200@broadspire.com>
	<420A62BE.2050804@vitalstream.com>
Message-ID: <420B3F39.4000204@isb-sib.ch>

I've got 3 nodes direct attached to SAN. The performance of GFS has disappointed 
me a bit so far. Maybe I've got something wrong but then again documentation is 
lacking... unless I'm looking in the wrong places.

Previously in a slightly different configuration I had only a slight performance 
hit with IBM's GPFS.

Rick Stevens wrote:
> vahram wrote:
> 
>> Raw throughput isn't really an issue for us.  We're more interested in 
>> seek times.  My biggest concern with GFS is stability and 
>> performance...any feedback in regards to that would be greatly 
>> appreciated.  Thanks!
> 
> 
> So far, GFS has worked quite well under our tests.  We have yet to have
> it break.  Our current GFS implementation is only on two nodes with gulm
> running on a separate lock server.  I intend to update the kernels on
> those nodes sometime this week (to the 2.6.11 variety) and change the
> locking from gulm to cman (since that seems to be fixed at this point).

Again not too sure about the different locking mechanisms .. do you mean 
cman/dlm? will this work better for you?


-- 
************************************************************

Ivan Ivanyi

Swiss Institute of Bioinformatics
     1, rue Michel Servet
     CH-1211 Gen?ve 4
     Switzerland

     Tel: (+41 22) 379 58 33
     Fax: (+41 22) 379 58 58
     E-mail: Ivan.Ivanyi at isb-sib.ch


************************************************************
PGP signature
http://www.expasy.org/people/Ivan.Ivanyi.gpg


From rstevens at vitalstream.com  Thu Feb 10 17:22:14 2005
From: rstevens at vitalstream.com (Rick Stevens)
Date: Thu, 10 Feb 2005 09:22:14 -0800
Subject: [Linux-cluster] cluster architecture
In-Reply-To: <420B3F39.4000204@isb-sib.ch>
References: <42093277.8090906@broadspire.com>	<420A5729.80403@vitalstream.com>	<420A5D7D.8060200@broadspire.com>	<420A62BE.2050804@vitalstream.com>
	<420B3F39.4000204@isb-sib.ch>
Message-ID: <420B9846.9@vitalstream.com>

IVANYI Ivan wrote:
> I've got 3 nodes direct attached to SAN. The performance of GFS has 
> disappointed me a bit so far. Maybe I've got something wrong but then 
> again documentation is lacking... unless I'm looking in the wrong places.
> 
> Previously in a slightly different configuration I had only a slight 
> performance hit with IBM's GPFS.
> 
> Rick Stevens wrote:
> 
>> vahram wrote:
>>
>>> Raw throughput isn't really an issue for us.  We're more interested 
>>> in seek times.  My biggest concern with GFS is stability and 
>>> performance...any feedback in regards to that would be greatly 
>>> appreciated.  Thanks!
>>
>>
>>
>> So far, GFS has worked quite well under our tests.  We have yet to have
>> it break.  Our current GFS implementation is only on two nodes with gulm
>> running on a separate lock server.  I intend to update the kernels on
>> those nodes sometime this week (to the 2.6.11 variety) and change the
>> locking from gulm to cman (since that seems to be fixed at this point).
> 
> 
> Again not too sure about the different locking mechanisms .. do you mean 
> cman/dlm? will this work better for you?

Currently we use cman to do the LVM locking/clustering stuff and gulm
to do the GFS locking as cman wasn't reliable handling GFS.  The gods
that write the stuff now tell me that cman can handle GFS properly, so
I'm going to give it a whirl.
> 
> 


-- 
----------------------------------------------------------------------
- Rick Stevens, Senior Systems Engineer     rstevens at vitalstream.com -
- VitalStream, Inc.                       http://www.vitalstream.com -
-                                                                    -
-  You know the old saying--any technology sufficiently advanced is  -
-               indistinguishable from a Perl script                 -
-                                 --Programming Perl, 2nd Edition    -
----------------------------------------------------------------------


From tretkowski at inittab.de  Fri Feb 11 14:21:38 2005
From: tretkowski at inittab.de (Norbert Tretkowski)
Date: Fri, 11 Feb 2005 15:21:38 +0100
Subject: [Linux-cluster] GFS 6.0 patches for kernel 2.4
Message-ID: <20050211142138.GB1383@rollcage.inittab.de>

Hi,

I'm searching GFS 6.0 kernel patches for 2.4 kernels (to be honest,
the 2.4.21 kernel from SuSE Linux Enterprise Server 8, but a patch
against a vanilla kernel would also help). I found patches for RHEL3
kernels as SRPMs but these patches don't apply and/or don't build with
the SLES8 kernel.

Thanks, Norbert


From daniel at osdl.org  Sat Feb 12 00:47:38 2005
From: daniel at osdl.org (Daniel McNeil)
Date: Fri, 11 Feb 2005 16:47:38 -0800
Subject: [Linux-cluster] cluster lost quorum after 11 hours
Message-ID: <1108169257.5927.12.camel@ibm-c.pdx.osdl.net>

I was running my test on a 3 node cluster and it died
after 11 hours.  cl030 lost quorum with the other 2 nodes
kicked out of the cluster.  cl031 also hit a bunch of asserts
like
    lock_dlm:  Assertion failed on line 352 of file  
    /Views/redhat-cluster/cluster/gfs-kernel/src/dlm/lock.c
    lock_dlm:  assertion:  "!error"
    lock_dlm:  time = 291694516
    stripefs: error=-22 num=2,19
I assume is caused by the cluster shutting down.


/var/log/messages showed:

cl030:
Feb 11 02:44:33 cl030 kernel: CMAN: removing node cl032a from the cluster : No response to messages
Feb 11 02:44:33 cl030 kernel: CMAN: removing node cl031a from the cluster : No response to messages
Feb 11 02:44:33 cl030 kernel: CMAN: quorum lost, blocking activity
Feb 11 14:40:33 cl030 sshd(pam_unix)[27323]: session opened for user root by (uid=0)

cl031:
Feb 11 02:44:33 cl031 kernel: CMAN: node cl032a has been removed from the cluster : No response to messages
Feb 11 02:44:33 cl031 kernel: CMAN: node cl031a has been removed from the cluster : No response to messages
Feb 11 02:44:33 cl031 kernel: CMAN: killed by NODEDOWN message
Feb 11 02:44:33 cl031 kernel: CMAN: we are leaving the cluster.
Feb 11 02:44:34 cl031 kernel: lowcomms_get_buffer: accepting is 0
Feb 11 02:44:34 cl031 kernel: dlm: stripefs: remote_stage error -105 2019c
Feb 11 02:44:34 cl031 ccsd[3823]: [cluster_mgr.c:387] Cluster manager shutdown.
 Attemping to reconnect...
Feb 11 02:44:34 cl031 kernel: SM: 00000001 sm_stop: SG still joined
Feb 11 02:44:34 cl031 kernel: SM: 0100041e sm_stop: SG still joined
Feb 11 02:44:34 cl031 kernel: SM: 0200041f sm_stop: SG still joined
Feb 11 02:44:37 cl031 ccsd[3823]: [cluster_mgr.c:346] Unable to connect to cluster infrastructure after 30 seconds.
Feb 11 02:45:07 cl031 ccsd[3823]: [cluster_mgr.c:346] Unable to connect to cluster infrastructure after 60 seconds.

cl032:
Feb 11 02:44:33 cl032 kernel: CMAN: node cl032a has been removed from the cluster : No response to messages
Feb 11 02:44:33 cl032 kernel: CMAN: killed by NODEDOWN message
Feb 11 02:44:33 cl032 kernel: CMAN: we are leaving the cluster.
Feb 11 02:44:34 cl032 kernel: lowcomms_get_buffer: accepting is 0
Feb 11 02:44:34 cl032 kernel: dlm: stripefs: remote_stage error -105 102bd
Feb 11 02:44:34 cl032 kernel: lowcomms_get_buffer: accepting is 0
Feb 11 02:44:34 cl032 ccsd[22909]: [cluster_mgr.c:387] Cluster manager shutdown.  Attemping to reconnect...
Feb 11 02:44:34 cl032 kernel: SM: 00000001 sm_stop: SG still joined
Feb 11 02:44:34 cl032 kernel: SM: 0100041e sm_stop: SG still joined
Feb 11 02:44:34 cl032 kernel: SM: 0200041f sm_stop: SG still joined
Feb 11 02:44:53 cl032 ccsd[22909]: [cluster_mgr.c:346] Unable to connect to cluster infrastructure after 90 seconds.

More info available here:
http://developer.osdl.org/daniel/GFS/test.10feb2005/

I usually get closer to 50 hours before problems. Any ideas?

Daniel


From bastian at waldi.eu.org  Sat Feb 12 15:42:16 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Sat, 12 Feb 2005 16:42:16 +0100
Subject: [Linux-cluster] config update kills cluster
Message-ID: <20050212154216.GA3682@wavehammer.waldi.eu.org>

Hi all

I just tried config update via ccs_tool in a cman cluster. Each of the
nodes got the new config but the kernel rejects joins with
| CMAN: Join request from gfs1 rejected, config version local 1 remote 2

The cluster is running CVS from 2005-02-06.

Bastian

-- 
Vulcans believe peace should not depend on force.
		-- Amanda, "Journey to Babel", stardate 3842.3
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050212/f8ce4126/attachment.sig>

From bastian at waldi.eu.org  Sat Feb 12 16:39:57 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Sat, 12 Feb 2005 17:39:57 +0100
Subject: [Linux-cluster] gfs_grow/_jadd don't accept mointpoint
Message-ID: <20050212163957.GA17954@wavehammer.waldi.eu.org>

Hi all

gfs_grow/_jadd don't accept mountpoints as arguments.

| # gfs_grow /mnt/
| GFS Filesystem /mnt/ not found
| # grep /mnt /proc/mounts 
| /dev/sda2 /mnt gfs rw,noatime,nodiratime 0 0

This is CVS HEAD from 2005-02-06.

Bastian

-- 
Pain is a thing of the mind.  The mind can be controlled.
		-- Spock, "Operation -- Annihilate!" stardate 3287.2
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050212/3474c3d1/attachment.sig>

From kpreslan at redhat.com  Sat Feb 12 19:34:59 2005
From: kpreslan at redhat.com (Ken Preslan)
Date: Sat, 12 Feb 2005 13:34:59 -0600
Subject: [Linux-cluster] gfs_grow/_jadd don't accept mointpoint
In-Reply-To: <20050212163957.GA17954@wavehammer.waldi.eu.org>
References: <20050212163957.GA17954@wavehammer.waldi.eu.org>
Message-ID: <20050212193459.GA14807@potassium.msp.redhat.com>

On Sat, Feb 12, 2005 at 05:39:57PM +0100, Bastian Blank wrote:
> Hi all
> 
> gfs_grow/_jadd don't accept mountpoints as arguments.
> 
> | # gfs_grow /mnt/
> | GFS Filesystem /mnt/ not found
> | # grep /mnt /proc/mounts 
> | /dev/sda2 /mnt gfs rw,noatime,nodiratime 0 0
> 
> This is CVS HEAD from 2005-02-06.

Get rid of the trailing "/".  i.e.  gfs_grow /mnt

-- 
Ken Preslan <kpreslan at redhat.com>


From bastian at waldi.eu.org  Sun Feb 13 19:54:16 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Sun, 13 Feb 2005 20:54:16 +0100
Subject: [Linux-cluster] config update kills cluster
In-Reply-To: <20050212154216.GA3682@wavehammer.waldi.eu.org>
References: <20050212154216.GA3682@wavehammer.waldi.eu.org>
Message-ID: <20050213195416.GD16192@wavehammer.waldi.eu.org>

On Sat, Feb 12, 2005 at 04:42:16PM +0100, Bastian Blank wrote:
> | CMAN: Join request from gfs1 rejected, config version local 1 remote 2

Bah, I should read correctly.

Bastian

-- 
You!  What PLANET is this!
		-- McCoy, "The City on the Edge of Forever", stardate 3134.0
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050213/1d1e69f1/attachment.sig>

From bastian at waldi.eu.org  Sun Feb 13 19:59:55 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Sun, 13 Feb 2005 20:59:55 +0100
Subject: [Linux-cluster] possible to wait on fence domain startup
Message-ID: <20050213195955.GE16192@wavehammer.waldi.eu.org>

Hi folks

Is it possible to wait for the startup of the fence domain?

If I call "fence_tool join" and mount of a gfs volume without a sleep
between, I get permission denied and the kernel log reports the missing
fence domain.

The time which is needed between this two calls seems to be related to
the number of nodes in the fence domain.

Bastian

-- 
Without freedom of choice there is no creativity.
		-- Kirk, "The return of the Archons", stardate 3157.4
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050213/646e78fc/attachment.sig>

From bastian at waldi.eu.org  Sun Feb 13 22:26:59 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Sun, 13 Feb 2005 23:26:59 +0100
Subject: [Linux-cluster] ccs - fix -p parameter of ccsd
Message-ID: <20050213222659.GA28716@wavehammer.waldi.eu.org>

Hi folks

The attached patch fixes the -p parameter of ccsd.

Bastian

-- 
It would be illogical to assume that all conditions remain stable.
		-- Spock, "The Enterprise Incident", stardate 5027.3
-------------- next part --------------
=== daemon/ccsd.c
==================================================================
--- daemon/ccsd.c  (revision 315)
+++ daemon/ccsd.c  (local)
@@ -296,7 +296,7 @@
 
   memset(buff, 0, buff_size);
 
-  while((c = getopt(argc, argv, "46cdf:hlm:nP:t:sV")) != -1){
+  while((c = getopt(argc, argv, "46cdf:hlm:np:P:t:sV")) != -1){
     switch(c){
     case '4':
       if(IPv6 == 1){
@@ -378,6 +378,7 @@
       lockfile_location = optarg;
       buff_index += snprintf(buff+buff_index, buff_size-buff_index,
 			     "  Lock file location:: %s\n", optarg);
+      break;
     case 'P':
       if(optarg[1] != ':'){
 	fprintf(stderr, "Bad argument to '-P' option.\n"
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050213/9687ec33/attachment.sig>

From bastian at waldi.eu.org  Sun Feb 13 22:39:47 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Sun, 13 Feb 2005 23:39:47 +0100
Subject: [Linux-cluster] dlm - update dlm32 layer
Message-ID: <20050213223947.GB28716@wavehammer.waldi.eu.org>

Hi folks

The attached patch adds biarch support to the dlm32 layer of libdlm. It
is currently only enabled on s390 and sparc.

Bastian

-- 
Those who hate and fight must stop themselves -- otherwise it is not stopped.
		-- Spock, "Day of the Dove", stardate unknown
-------------- next part --------------
=== lib/Makefile
==================================================================
--- lib/Makefile  (revision 308)
+++ lib/Makefile  (local)
@@ -22,7 +22,7 @@
 
 include ${top_srcdir}/make/defines.mk
 
-CFLAGS += -g -O -I. -fPIC
+CFLAGS += -g -O -I.
 
 ifneq (${KERNEL_SRC}, )
 # Use the kernel tree if patched, otherwise, look where cluster headers
@@ -37,38 +37,35 @@
 
 all: $(STATICLIB) $(SHAREDLIB)
 
-$(LIBNAME).a: libdlm.o libaislock.o
-	${AR} cr libdlm.a libdlm.o libaislock.o
+lib_SOURCES = libdlm.c libaislock.c dlm32.c
+lib_lt_SOURCES = libdlm_lt.c dlm32.c
+
+$(LIBNAME).a: $(lib_SOURCES:.c=.po)
+	${AR} cr libdlm.a $^
 	${RANLIB} libdlm.a 
 
-$(LIBNAME)_lt.a: libdlm_lt.o 
-	${AR} r libdlm_lt.a libdlm_lt.o 
+$(LIBNAME)_lt.a: $(lib_lt_SOURCES:.c=.po)
+	${AR} r libdlm_lt.a $^
 	${RANLIB} libdlm_lt.a 
 
-$(LIBNAME).so.${RELEASE_MAJOR}.${RELEASE_MINOR}: libdlm.po libaislock.po
-	$(LD) -shared -o $@ -soname=$(LIBNAME).so.$(RELEASE_MAJOR) $^
+$(LIBNAME).so.${RELEASE_MAJOR}.${RELEASE_MINOR}: $(lib_SOURCES:.c=.po)
+	$(CC) -shared -o $@ -Wl,-soname=$(LIBNAME).so.$(RELEASE_MAJOR) $^
 
-$(LIBNAME)_lt.so.${RELEASE_MAJOR}.${RELEASE_MINOR}: libdlm_lt.po
+$(LIBNAME)_lt.so.${RELEASE_MAJOR}.${RELEASE_MINOR}: $(lib_lt_SOURCES:.c=.po)
 	$(CC) -shared -o $@ -Wl,-soname=$(LIBNAME)_lt.so.$(RELEASE_MAJOR) $^
 
-libdlm.po: libdlm.c
-	$(CC) $(CFLAGS) -D_REENTRANT -c -o $@ $<
+%_lt.o: %.c
+	$(CC) $(CFLAGS) -c -o $@ $<
 
-libdlm.o: libdlm.c
-	$(CC) $(CFLAGS) -D_REENTRANT -c -o $@ $<
+%_lt.po: %.c
+	$(CC) $(CFLAGS) -fPIC -c -o $@ $<
 
-libaislock.po: libaislock.c
+%.o: %.c
 	$(CC) $(CFLAGS) -D_REENTRANT -c -o $@ $<
 
-libaislock.o: libaislock.c
-	$(CC) $(CFLAGS) -D_REENTRANT -c -o $@ $<
+%.po: %.c
+	$(CC) $(CFLAGS) -fPIC -D_REENTRANT -c -o $@ $<
 
-libdlm_lt.po: libdlm.c
-	$(CC) $(CFLAGS) -c -o $@ $<
-
-libdlm_lt.o: libdlm.c
-	$(CC) $(CFLAGS) -c -o $@ $<
-
 copytobin: all
 
 
=== lib/dlm32.c
==================================================================
--- lib/dlm32.c  (revision 308)
+++ lib/dlm32.c  (local)
@@ -23,7 +23,23 @@
 /* Convert 32 bit userland reads & writes to something suitable for
    a 64 bit kernel */
 
+#include <stdbool.h>
+#include <stdint.h>
+#include <sys/utsname.h>
+#include <string.h>
+#include <unistd.h>
+#include "dlm.h"
+#include "dlm_device.h"
 
+#if (defined(__s390__) || defined(__sparc__)) && __WORDSIZE == 32
+# define BUILD_BIARCH
+#endif
+
+extern ssize_t dlm_read(int, struct dlm_lock_result *);
+extern ssize_t dlm_read_data(int, struct dlm_lock_result *, size_t);
+extern ssize_t dlm_write(int, struct dlm_write_request *, size_t);
+
+#ifdef BUILD_BIARCH
 /* 64 bit versions of the structs */
 struct dlm_lock_params64 {
 	uint8_t mode;
@@ -32,10 +48,10 @@
 	uint32_t parent;
 	struct dlm_range range;
 	uint8_t namelen;
-        uint64_t castparam;
+	uint64_t castparam;
 	uint64_t castaddr;
 	uint64_t bastparam;
-        uint64_t bastaddr;
+	uint64_t bastaddr;
 	uint64_t lksb;
 	char lvb[DLM_LVB_LEN];
 	char name[1];
@@ -73,10 +89,10 @@
 
 struct dlm_lksb64
 {
-        int      sb_status;
-        uint32_t sb_lkid;
-        char     sb_flags;
-        uint64_t sb_lvbptr;
+	int      sb_status;
+	uint32_t sb_lkid;
+	char     sb_flags;
+	uint64_t sb_lvbptr;
 };
 
 /* struct read from the "device" fd,
@@ -104,16 +120,38 @@
 	int gqi_lockcount;	/* output */
 };
 
+static bool check_biarch_convert(void)
+{
+	static enum { undefined, native, convert } status;
+	if (status == undefined)
+	{
+		struct utsname buf;
+		if (uname(&buf) != 0)
+			status = native;
+		else if (strcmp(buf.machine,
+#ifdef __s390__
+				"s390x"
+#endif
+#ifdef __sparc__
+				"sparc64"
+#endif
+			  ) == 0)
+			status = convert;
+		else
+			status = native;
+	}
+	if (status == convert)
+		return true;
+	return false;
+}
 
-int dlm_write(int fd, void *buf, int len)
+static ssize_t _dlm_write_convert(int fd, struct dlm_write_request *req32, size_t len32)
 {
 	char buf64[sizeof(struct dlm_write_request64) + DLM_RESNAME_MAXLEN];
 	struct dlm_write_request64 *req64;
-	struct dlm_write_request   *req32;
 	int len64;
 	int ret;
 
-	req32 = (struct dlm_write_request *)buf;
 	req64 = (struct dlm_write_request64 *)buf64;
 	len64 = sizeof(struct dlm_write_request64);
 
@@ -167,37 +205,33 @@
 
 	/* Fake the return length */
 	if (ret == len64)
-		ret = len;
+		ret = len32;
 
 	return ret;
 }
 
-
-int dlm_read(int fd, void *buf, int len)
+static ssize_t _dlm_read_convert(bool data, int fd, struct dlm_lock_result *res32, ssize_t len32)
 {
-	int ret;
-	int len64;
-	struct dlm_lock_result *res32;
+	ssize_t ret;
+	size_t len64;
 	struct dlm_lock_result64 *res64;
 	struct dlm_lock_result64 buf64;
 
-	res32 = (struct dlm_lock_result *)buf;
-
 	/* There are two types of read done here, the first just gets the structure, for that
 	   we need our own buffer because the 64 bit one is larger than the 32bit.
 	   When the user wants the extended information it has already been told the full (64bit)
 	   size of the buffer by the kernel so we can use that buffer for reading, that
 	   also avoids the need to copy the extended data blocks too.
 	*/
-	if (len == sizeof(struct dlm_lock_result))
+	if (!data)
 	{
-		len64 = sizeof(struct dlm_lock_result64);
+		len64 = sizeof(buf64);
 		res64 = &buf64;
 	}
 	else
 	{
-		len64 = len;
-		res64 = (struct dlm_lock_result64 *)buf;
+		len64 = len32;
+		res64 = (struct dlm_lock_result64 *)res32;
 	}
 
 	ret = read(fd, res64, len64);
@@ -222,10 +256,47 @@
 			struct dlm_queryinfo64 *qinfo64;
 			struct dlm_queryinfo *qinfo32;
 
-			qinfo64 = (struct dlm_queryinfo64 *)(buf+res32->qinfo_offset);
-			qinfo32 = (struct dlm_queryinfo *)(buf+res32->qinfo_offset);
+			qinfo64 = (struct dlm_queryinfo64 *)(res32+res32->qinfo_offset);
+			qinfo32 = (struct dlm_queryinfo *)(res32+res32->qinfo_offset);
 			qinfo32->gqi_lockcount = qinfo64->gqi_lockcount;
 		}
 	}
 	return ret;
 }
+
+ssize_t dlm_read(int fd, struct dlm_lock_result *res)
+{
+	if (check_biarch_convert())
+		return _dlm_read_convert(false, fd, res, 0);
+	return read(fd, res, sizeof(struct dlm_lock_result));
+}
+
+ssize_t dlm_read_data(int fd, struct dlm_lock_result *res, size_t len)
+{
+	if (check_biarch_convert())
+		return _dlm_read_convert(true, fd, res, len);
+	return read(fd, res, len);
+}
+
+ssize_t dlm_write(int fd, struct dlm_write_request *req, size_t len)
+{
+	if (check_biarch_convert())
+		return _dlm_write_convert(fd, req, len);
+	return write(fd, req, len);
+}
+#else /* BUILD_BIARCH */
+ssize_t dlm_read(int fd, struct dlm_lock_result *res)
+{
+	return read(fd, res, sizeof(struct dlm_lock_result));
+}
+
+ssize_t dlm_read_data(int fd, struct dlm_lock_result *res, size_t len)
+{
+	return read(fd, res, len);
+}
+
+ssize_t dlm_write(int fd, struct dlm_write_request *req, size_t len)
+{
+	return write(fd, req, len);
+}
+#endif /* BUILD_BIARCH */
=== lib/libdlm.c
==================================================================
--- lib/libdlm.c  (revision 308)
+++ lib/libdlm.c  (local)
@@ -46,15 +46,6 @@
 #include "libdlm.h"
 #include "dlm_device.h"
 
-/* Add other grotesqueries here as they arise */
-#if defined(__sparc__) && __WORDSIZE == 32
-#include "dlm32.c"
-#else
-#define dlm_write write
-#define dlm_read read
-#endif
-
-
 #define MISC_PREFIX "/dev/misc/"
 #define PROC_MISC "/proc/misc"
 #define DLM_PREFIX "dlm_"
@@ -76,6 +67,10 @@
 #endif
 };
 
+extern ssize_t dlm_read(int, struct dlm_lock_result *);
+extern ssize_t dlm_read_data(int, struct dlm_lock_result *, size_t);
+extern ssize_t dlm_write(int, struct dlm_write_request *, size_t);
+
 /* The default lockspace.
    I've resisted putting locking around this as the user should be
    "sensible" and only do lockspace operations either in the
@@ -399,14 +394,13 @@
 
 static int do_dlm_dispatch(int fd)
 {
-    char resultbuf[sizeof(struct dlm_lock_result)];
-    struct dlm_lock_result *result = (struct dlm_lock_result *)resultbuf;
-    char *fullresult=NULL;
+    struct dlm_lock_result resultbuf;
+    struct dlm_lock_result *result = &resultbuf, *fullresult = NULL;
     int status;
     void (*astaddr)(void *astarg);
 
     /* Just read the header first */
-    status = dlm_read(fd, result, sizeof(struct dlm_lock_result));
+    status = dlm_read(fd, result);
     if (status <= 0)
 	return -1;
 
@@ -418,7 +412,7 @@
 	if (!fullresult)
 	    return -1;
 
-	newstat = dlm_read(fd, fullresult, result->length);
+	newstat = dlm_read_data(fd, fullresult, result->length);
 
 	/* If it read OK then use the new data. otherwise we can
 	   still deliver the AST, it just might not have all the
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050213/83d13ff2/attachment.sig>

From pcaulfie at redhat.com  Mon Feb 14 08:46:52 2005
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Mon, 14 Feb 2005 08:46:52 +0000
Subject: [Linux-cluster] dlm - update dlm32 layer
In-Reply-To: <20050213223947.GB28716@wavehammer.waldi.eu.org>
References: <20050213223947.GB28716@wavehammer.waldi.eu.org>
Message-ID: <20050214084652.GC5724@tykepenguin.com>

On Sun, Feb 13, 2005 at 11:39:47PM +0100, Bastian Blank wrote:
> Hi folks
> 
> The attached patch adds biarch support to the dlm32 layer of libdlm. It
> is currently only enabled on s390 and sparc.

I'm working on this, but the patch you sent last seems to break queries (even on
i386) and I haven't got the bottom of why yet.
-- 

patrick


From adingman at cook-inc.com  Mon Feb 14 14:37:37 2005
From: adingman at cook-inc.com (Andrew C. Dingman)
Date: Mon, 14 Feb 2005 09:37:37 -0500
Subject: [Linux-cluster] fencing agent for Dell 1855 blades
Message-ID: <1108391857.1075.10.camel@adingman.marcomm>

Hi, all

We're working with Dell 1855 blades in a GFS cluster. They can't be
power fenced with an external power switch, and they don't use quite the
same DRAC as other Dell systems. I've therefore written a fencing agent
to use with them. So far, I've only tested it from the command line. If
all goes well, I'll be able to test it with a live GFS cluster this
afternoon.

If anyone is interested, please take a look and let me know what you
think. Ultimately, we'd like to get this included in the main fence
distribution. Thanks in advance for any feedback.

--
Andrew C. Dingman
adingman at cook-inc dot com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: fence_dracmc
Type: application/x-perl
Size: 6201 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050214/cd6568a1/attachment.pl>

From bastian at waldi.eu.org  Sun Feb 13 21:56:30 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Sun, 13 Feb 2005 22:56:30 +0100
Subject: [Linux-cluster] fence - convert manpages to the man macro package
Message-ID: <20050213215630.GA9873@wavehammer.waldi.eu.org>

Hi folks

The current manpages are written in plain nroff which is not parsable by
many scripts.

The attached patch converts the manpages in the fence package to the man
macro package.

Bastian

-- 
Schshschshchsch.
		-- The Gorn, "Arena", stardate 3046.2
-------------- next part --------------
=== man/fence.8
==================================================================
--- man/fence.8  (revision 317)
+++ man/fence.8  (local)
@@ -5,45 +5,66 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence(8)''fence(8)'
+.TH fence 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 I/O Fencing reference guide
 
-.in
-\fBSYNOPSIS\fP
-.in +7
+.SH SYNOPSIS
 Overview of related manual pages
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 The I/O Fencing documentation has been split into a number of sections.  Please
 refer to the table below to determine which man page coincides with the
 command/feature you are looking for.
-.sp
- fence          I/O Fencing overview (this man page)
- fenced         I/O Fencing daemon
 
-I/O Fencing agents
+.TP 10
+fence
+I/O Fencing overview (this man page)
+.TP
+fenced
+I/O Fencing daemon
 
- fence_apc           for APC MasterSwitch and APC 79xx models
- fence_bladecenter   for IBM Bladecenters w/ telnet interface
- fence_brocade       for Brocade fibre channel switches (PortDisable)
- fence_egenera       for Egenera blades
- fence_gnbd          for GNBD-based GFS clusters
- fence_ilo           for HP ILO interfaces (formerly fence_rib)
- fence_manual        for manual intervention
- fence_mcdata        for McData fibre channel switches
- fence_ack_manual    for manual intervention
- fence_sanbox2       for Qlogic SAN Box fibre channel switches
- fence_vixel         for Vixel switches (PortDisable)
- fence_wti           for WTI Network Power Switch
- fence_node          for use by lock_gulmd
+.SS I/O Fencing agents
 
-.sp
-.in
-\fBSEE ALSO\fP
-.in +7
+.TP 20
+fence_apc
+for APC MasterSwitch and APC 79xx models
+.TP
+fence_bladecenter
+for IBM Bladecenters w/ telnet interface
+.TP
+fence_brocade
+for Brocade fibre channel switches (PortDisable)
+.TP
+fence_egenera
+for Egenera blades
+.TP
+fence_gnbd
+for GNBD-based GFS clusters
+.TP
+fence_ilo
+for HP ILO interfaces (formerly fence_rib)
+.TP
+fence_manual
+for manual intervention
+.TP
+fence_mcdata
+for McData fibre channel switches
+.TP
+fence_ack_manual
+for manual intervention
+.TP
+fence_sanbox2
+for Qlogic SAN Box fibre channel switches
+.TP
+fence_vixel
+for Vixel switches (PortDisable)
+.TP
+fence_wti
+for WTI Network Power Switch
+.TP
+fence_node
+for use by lock_gulmd
+
+.SH SEE ALSO
 gnbd(8), gfs(8)
=== man/fence_ack_manual.8
==================================================================
--- man/fence_ack_manual.8  (revision 317)
+++ man/fence_ack_manual.8  (local)
@@ -5,57 +5,39 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_ack_manual(8)''fence_ack_manual(8)'
+.TH fence_ack_manual 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_ack_manual - program run by an operator as a part of manual I/O Fencing
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_ack_manual -n\fP \fInodename\fP
+.SH SYNOPSIS
+.B
+fence_ack_manual
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_ack_manual is run by an operator on the same node that fence_manual(8) 
 was run after the operator has reset a node which required fencing.  A message 
 in the system log indicates to the operator that they must reset a machine and 
 then run fence_ack_manual.  Running fence_ack_manual allows the cluster to 
 continue with recovery of the fenced machine.  The victim may be disconnected 
 from storage rather than resetting it.
-.sp
-.in 
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-O\fP
-.in +7
 Run without prompting for user confirmation.
-.sp
-.in
+.TP
 \fB-n\fP \fInodename\fP
-.in +7
 Name of node that has been reset or disconnected from storage.
-.sp
-.in
+.TP
 \fB-s\fP \fIIPaddress\fP
-.in +7
 IP address of the machine which has been reset or disconnected from storage.  (Deprecated; use -n instead.)
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.sp
-.in
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8)
=== man/fence_apc.8
==================================================================
--- man/fence_apc.8  (revision 317)
+++ man/fence_apc.8  (local)
@@ -5,132 +5,85 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_apc(8)''fence_apc(8)'
+.TH fence_apc 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_apc - I/O Fencing agent for APC MasterSwitch
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_apc -a\fP \fIIPaddress\fR \fB-l\fP \fIlogin\fR \fB-p\fP \fIpassword\fR \fB-n\fP \fIoutlet\fR 
-[\fB-o\fP action] [\fB-T\fP] [\fB-v\fP]
+.SH SYNOPSIS
+.B 
+Bfence_apc
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_apc is an I/O Fencing agent which can be used with the APC MasterSwitch
 network power switch.  It logs into a MasterSwitch via telnet and reboots
 a specified outlet.  Lengthy telnet connections to the MasterSwitch should
 be avoided while a GFS cluster is running because the connection will
 block any necessary fencing actions.
-.sp
+
 fence_apc accepts options on the command line as well as from stdin.  
 Fenced sends parameters through stdin when it execs the agent.  fence_apc 
 can be run by itself with command line options.  This is useful for testing 
 and for turning outlets on or off from scripts.
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-a\fP \fIIPaddress\fR
-.in +7
 IP address or hostname of the switch.
-.sp
-.in
+.TP
 \fB-h\fP 
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-l\fP \fIlogin\fR
-.in +7
 Login name.
-.sp
-.in
+.TP
 \fB-n\fP \fI[<switch>:]outlet\fR
-.in +7
 The outlet number to act upon.  
-.sp
-.in
+.TP
 \fB-o\fP \fIaction\fR
-.in +7
 The action required.  Reboot (default), Off or On.
-.in
-.sp
+.TP
 \fB-p\fP \fIpassword\fR
-.in +7
 Password for login.
-.sp
-.in
+.TP
 \fB-T\fP
-.in +7
 Test only.  Answer NO to the confirmation prompt instead of YES.
-.in
-.sp
+.TP
 \fB-v\fP
-.in +7
 Verbose.  Record telnet session in /tmp/apclog.
-.in
-.sp
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.in
-.sp
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
 
-\fI agent = < param >\fR
-.sp
+.SH STDIN PARAMETERS
+.TP
+\fIagent = < param >\fR
 This option is used by fence_node(8) and is ignored by fence_apc.
-.sp
-
+.TP
 \fIipaddr = < hostname | ip >\fR
-.sp
 IP address or hostname of the switch.
-.sp
-
-\fI login = < param >\fR
-.sp
+.TP
+\fIlogin = < param >\fR
 Login name.
-.sp
-
-\fI option = < param >\fR
-.sp
+.TP
+\fIoption = < param >\fR
 The action required.  Reboot (default), Off or On.
-.sp
-
-\fI passwd = < param >\fR
-.sp
+.TP
+\fIpasswd = < param >\fR
 Password for login.
-.sp
-
-\fI port = < param >\fR
-.sp
+.TP
+\fIport = < param >\fR
 The outlet number to act upon.
-.sp
-
-\fI switch = < param >\fR
-.sp
+.TP
+\fIswitch = < param >\fR
 The switch to operate on.  Defaults to "1" if not specified.
-.sp
-
-\fI test = < param >\fR
-.sp
+.TP
+\fItest = < param >\fR
 Test only.  Answer NO to the confirmation prompt instead of YES.
-.sp
-
-\fI verbose = < param >\fR
-.sp
+.TP
+\fIverbose = < param >\fR
 Verbose.  Record telnet session in /tmp/apclog.
-.sp
 
-
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8)
=== man/fence_baytech.8
==================================================================
--- man/fence_baytech.8  (revision 317)
+++ man/fence_baytech.8  (local)
@@ -5,21 +5,17 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_baytech(8)''fence_baytech(8)'
+.TH fence_baytech 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_baytech - I/O Fencing agent for Baytech RPC switches in combination with a Cyclades Terminal Server
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_baytech -a\fP \fIhost\fR \fB-l\fP \fIlogin\fR \fB-p\fP \fIpassword\fR \fB-n\fP \fIoutletname\fR \fB-o\fP \fIaction\fR
+.SH SYNOPSIS
+.B
+fence_baytech
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 
 This fencing agent is written for the Baytech RPC27-20nc in combination with
 a Cyclades terminal server.  The Cyclades TS exports the RPC's serial port
@@ -27,95 +23,61 @@
 However, this script relys upon the assumption that Telnet is used.  Future
 features to this agent would allow the agent to work with a mulitude of 
 different communication protocols such as Telnet, SSH or Kermit.
-.sp
+
 The other assumption that is made is that Outlet names do not end in space.
 The name "Foo" and "Foo    " are identical when the RPC prints them with
 the status command.
-.sp
+
 fence_baytech accepts options on the command line as well as from stdin.
 fenced sends parameters through stdin when it execs the agent.  fence_baytech
 can be run by itself with command line options which is useful for testing.
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-a\fP \fIhost\fP
-.in +7 
 IP address or hostname to connect to.
-.sp
-.in
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-l\fP \fIlogin\fP
-.in +7
 Username name for the switch.
-.sp
-.in
+.TP
 \fB-n\fP \fIport\fP
-.in +7
 The name of the outlet to act upon.
-.in
-.sp
+.TP
 \fB-o\fP \fIaction\fP
-.in +7
 The action required.  disable (default) or enable.
-.in
-.sp
+.TP
 \fB-p\fP \fIpassword\fP
-.in +7
 Password for login.
-.sp
-.in
+.TP
 \fB-q\fP
-.in +7
 Quiet mode: print only error messages.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.sp
-.in
 
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
+.SH STDIN PARAMETERS
 
-\fI agent = < param >\fR
-.sp
+.TP
+\fIagent = < param >\fR
 This option is used by fence_node(8) and is ignored by fence_baytech.
-.sp
-
-\fI host = < hostname | ip >\fR
-.sp
+.TP
+\fIhost = < hostname | ip >\fR
 IP address or hostname to connect to.
-.sp
-
-\fI login = < param >\fR
-.sp
+.TP
+\fIlogin = < param >\fR
 Login name.
-.sp
-
-\fI action = < param >\fR
-.sp
+.TP
+\fIaction = < param >\fR
 The action required.  On, Off, Status or Reboot (default)
-.sp
-
-\fI passwd = < param >\fR
-.sp
+.TP
+\fIpasswd = < param >\fR
 Password for login.
-.sp
-
-\fI outlet = < param >\fR
-.sp
+.TP
+\fIoutlet = < param >\fR
 The name of the outlet to act upon.
-.sp
 
-
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8)
=== man/fence_bladecenter.8
==================================================================
--- man/fence_bladecenter.8  (revision 317)
+++ man/fence_bladecenter.8  (local)
@@ -5,115 +5,73 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_bladecenter(8)''fence_bladecenter(8)'
+.TH fence_bladecenter 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_brocade - I/O Fencing agent for IBM Bladecenter
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_bladecenter-a\fP \fIIPaddress\fR \fB-l\fP \fIlogin\fR \fB-p\fP \fIpassword\fR \fB-n\fP \fIblade\fR \fB-o\fP \fIaction\fR
+.SH SYNOPSIS
+.B
+fence_bladecenter
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_bladecenter is an I/O Fencing agent which can be used with IBM Bladecenters 
 with recent enough firmware that includes telnet support.  It logs into a Brocade 
 chasis via telnet and uses the command line interface to power on and off blades.
 fence_bladecenter accepts options on the command line or from stdin.
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-a\fP \fIIPaddress\fP
-.in +7 
 IP address of the Bladecenter.
-.sp
-.in
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-l\fP \fIlogin\fP
-.in +7
 Login name for the Bladecenter.
-.sp
-.in
+.TP
 \fB-n\fP \fIblade\fP
-.in +7
 The blade to operate on.
-.in
-.sp
+.TP
 \fB-o\fP \fIaction\fP
-.in +7
 The action required.  Valid actions are on, off, reboot (default) and status.
-.in
-.sp
+.TP
 \fB-p\fP \fIpassword\fP
-.in +7
 Password for login.
-.sp
-.in
+.TP
 \fB-q\fP
-.in +7
 Quiet mode: print only error messages.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.sp
-.in
+.TP
 \fB-v\fP \fIdebuglog\fP
-.in +7
 Log the telnet session to \fIdebuglog\fP for debugging purposes.
-.sp
-.in
 
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
-
-\fI agent = < param >\fR
-.sp
-This option is used by fence_node(8) and is ignored by fence_brocade.
-.sp
-
+.SH STDIN PARAMETERS
+.TP
+\fIagent = < param >\fR
+This option is used by fence_node(8) and is ignored by fence_bladecenter.
+.TP
 \fIipaddr = < hostname | ip >\fR
-.sp
 IP address or hostname of the switch.
-.sp
-
-\fI login = < param >\fR
-.sp
+.TP
+\fIlogin = < param >\fR
 Login name.
-.sp
-
-\fI option = < param >\fR
-.sp
+.TP
+\fIoption = < param >\fR
 The action required.  disable (default) or enable.
-.sp
-
-\fI passwd = < param >\fR
-.sp
+.TP
+\fIpasswd = < param >\fR
 Password for login.
-.sp
-
-\fI blade = < param >\fR
-.sp
+.TP
+\fIblade = < param >\fR
 The blade to operate on.
-.sp
-
-\fI debuglog = < param>\fR
-.sp
+.TP
+\fIdebuglog = < param>\fR
 Optional parameter to send debug transcript of the telnet session to a log file
-.sp
 
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8)
=== man/fence_brocade.8
==================================================================
--- man/fence_brocade.8  (revision 317)
+++ man/fence_brocade.8  (local)
@@ -5,118 +5,79 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_brocade(8)''fence_brocade(8)'
+.TH fence_brocade 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_brocade - I/O Fencing agent for Brocade FC switches
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_brocade -a\fP \fIIPaddress\fR \fB-l\fP \fIlogin\fR \fB-p\fP \fIpassword\fR \fB-n\fP \fIport\fR \fB-o\fP \fIaction\fR
+.SH SYNOPSIS
+.B
+fence_brocade
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_brocade is an I/O Fencing agent which can be used with Brocade FC 
 switches.  It logs into a Brocade switch via telnet and disables a specified 
 port.  Disabling the port which a machine is connected to effectively fences 
 that machine.  Lengthy telnet connections to the switch should be avoided 
 while a GFS cluster is running because the connection will block any necessary 
 fencing actions.
-.sp
+
 fence_brocade accepts options on the command line as well as from stdin.
 fenced sends parameters through stdin when it execs the agent.  fence_brocade 
 can be run by itself with command line options which is useful for testing.
-.sp
+
 After a fence operation has taken place the fenced machine can no longer connect
 to the Brocade FC switch.  When the fenced machine is ready to be brought back 
 into the GFS cluster (after reboot) the port on the Brocade FC switch needs to 
 be enabled. This can be done by running fence_brocade and specifying the 
 enable action.
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-a\fP \fIIPaddress\fP
-.in +7 
 IP address of the switch.
-.sp
-.in
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-l\fP \fIlogin\fP
-.in +7
 Login name for the switch.
-.sp
-.in
+.TP
 \fB-n\fP \fIport\fP
-.in +7
 The port number to disable on the switch.
-.in
-.sp
+.TP
 \fB-o\fP \fIaction\fP
-.in +7
 The action required.  disable (default) or enable.
-.in
-.sp
+.TP
 \fB-p\fP \fIpassword\fP
-.in +7
 Password for login.
-.sp
-.in
+.TP
 \fB-q\fP
-.in +7
 Quiet mode: print only error messages.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.sp
-.in
 
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
-
-\fI agent = < param >\fR
-.sp
+.SH STDIN PARAMETERS
+.TP
+\fIagent = < param >\fR
 This option is used by fence_node(8) and is ignored by fence_brocade.
-.sp
-
+.TP
 \fIipaddr = < hostname | ip >\fR
-.sp
 IP address or hostname of the switch.
-.sp
-
-\fI login = < param >\fR
-.sp
+.TP
+\fIlogin = < param >\fR
 Login name.
-.sp
-
-\fI option = < param >\fR
-.sp
+.TP
+\fIoption = < param >\fR
 The action required.  disable (default) or enable.
-.sp
-
-\fI passwd = < param >\fR
-.sp
+.TP
+\fIpasswd = < param >\fR
 Password for login.
-.sp
-
-\fI port = < param >\fR
-.sp
+.TP
+\fIport = < param >\fR
 The port number to disable on the switch.
-.sp
 
-
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8)
=== man/fence_cpint.8
==================================================================
--- man/fence_cpint.8  (revision 317)
+++ man/fence_cpint.8  (local)
@@ -5,21 +5,17 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_cpint(8)''fence_cpint(8)'
+.TH fence_cpint 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_cpint - I/O Fencing agent for GFS on s390 and zSeries VM clusters
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_cpint -u\fP \fIuserid\fP
+.SH SYNOPSIS
+.B
+fence_cpint
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_cpint is an I/O Fencing agent used on a virtual machine running GFS in a
 s390 or zSeries VM cluster.
 It uses the cpint package to send a CP LOGOFF command to the specified virtual
@@ -31,55 +27,33 @@
 the secondary user of the virtual machine to be fenced.  This means that unless
 all of you gulm server nodes are privilege class C, fence_cpint can only be
 used with SLM.
-.sp
 
 fence_cpint accepts options on the command line as well as from stdin.
 fence_node sends the options through stdin when it execs the agent.
 fence_cpint can be run by itself with command line options which is useful for
 testing.
-.sp
 
-.in
-\fBOPTIONS\fP
-.in
+.SH OPTIONS
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-
-.in
+.TP
 \fB-u\fP \fIuserid\fP
-.in +7
 userid of the virtual machine to fence (required).
-.sp
-
-.in
+.TP
 \fB-q\fP
-.in +7
 quiet mode, no output.
-.sp
-
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.sp
 
-.in
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
+.SH STDIN PARAMETERS
+.TP
 \fIagent = < param >\fR
-.sp
 This option is used by fence_node(8) and is ignored by fence_cpint.
-.sp
-
+.TP
 \fIuserid = < parm >\fP
-.sp
 userid of the virtual machine to fence (required).
-.sp
 
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8)
=== man/fence_egenera.8
==================================================================
--- man/fence_egenera.8  (revision 317)
+++ man/fence_egenera.8  (local)
@@ -5,22 +5,17 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_egenera(8)''fence_egenera(8)'
+.TH fence_egenera 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_egenera - I/O Fencing agent for the Egenera BladeFrame
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_egenera -c\fP \fIcserver\fR \fB-l\fP \fIlpan\fR \fB-p\fP \fIpserver\fR [\fB-o\fP action] 
-[\fB-q\fP] 
+.SH SYNOPSIS
+.B
+fence_egenera
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_egenera is an I/O Fencing agent which can be used with the Egenera
 BladeFrame.  It logs into a control blade (cserver) via ssh and operates
 on a processing blade (pserver) identified by the pserver name and the 
@@ -28,87 +23,55 @@
 that ssh keys have been setup so that the fence_egenera does not require
 a password to authenticate.  Refer to ssh(8) for more information on setting
 up ssh keys.
-.sp
+
 fence_egenera accepts options on the command line as well as from stdin.  
 Fenced sends parameters through stdin when it execs the agent.  fence_egenera 
 can also be run by itself with command line options.  
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-c\fP \fIcserver\fR
-.in +7
 The cserver to ssh to.  cserver can be in the form user at hostname to
 specify a different user to login as.
-.sp
-.in
+.TP
 \fB-h\fP 
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-l\fP \fIlpan\fR
-.in +7
 the lpan to operate on
-.sp
-.in
+.TP
 \fB-o\fP \fIaction\fR
-.in +7
 The action required.  reboot (default), off, on or status.
-.in
-.sp
+.TP
 \fB-p\fP \fIpserver\fR
-.in +7
 the pserver to operate on
-.sp
-.in
+.TP
 \fB-q\fP
-.in +7
 quite mode.  supress output.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.in
-.sp
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
 
-\fI action = < param >\fR
-.sp
+.SH STDIN PARAMETERS
+.TP
+\fIaction = < param >\fR
 The action required.  reboot (default), off, on or status.
-.sp
-
-\fI agent = < param >\fR
-.sp
+.TP
+\fIagent = < param >\fR
 This option is used by fence_node(8) and is ignored by fence_apc.
-.sp
-
-\fI cserver = < param >\fR
-.sp
+.TP
+\fIcserver = < param >\fR
 The cserver to ssh to.  cserver can be in the form user at hostname to
 specify a different user to login as.
-.sp
-
-\fI lpan = < param >\fR
-.sp
+.TP
+\fIlpan = < param >\fR
 The lpan to operate on
-.sp
-
-\fI pserver = < param >\fR
-.sp
+.TP
+\fIpserver = < param >\fR
 The pserver to operate on
-.sp
+.TP
+\fIesh = < param >\fR
+The path to the esh command on the cserver (default is /opt/panmgr/bin/esh)
 
-\fI esh = < param >\fR
-.sp
-The path to the esh command on the cserver (default is /opt/panmgr/bin/esh )
-.sp
-
-
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8), ssh(8)
=== man/fence_ibmblade.8
==================================================================
--- man/fence_ibmblade.8  (revision 317)
+++ man/fence_ibmblade.8  (local)
@@ -5,99 +5,64 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_ibmblade(8)''fence_ibmblade(8)'
+.TH fence_ibmblade 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_ibmblade - I/O Fencing agent for IBM BladeCenter 
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_ibmblade -a\fP \fIIPaddress\fR \fB-c\fP \fIcommunity\fR \fB-n\fP \fIport\fR \fB-o\fP \fIaction\fR
+.SH SYNOPSIS
+.B
+fence_ibmblade
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_ibmblade is an I/O Fencing agent which can be used with IBM BladeCenter 
 chassis. It issues SNMP Set request to BladeCenter chassins, rebooting, powering
 up or down the specified Blade Server. 
-.sp
+
 fence_ibmblade accepts options on the command line as well as from stdin.
 fenced sends parameters through stdin when it execs the agent.  fence_ibmblade 
 can be run by itself with command line options which is useful for testing.
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-a\fP \fIIPaddress\fP
-.in +7 
 IP address of the BladeCenter chassis. 
-.sp
-.in
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-c\fP \fIcommunity\fP
-.in +7
 SNMP community string to use.
-.sp
-.in
+.TP
 \fB-n\fP \fIport\fP
-.in +7
 The Blade port number to disable.
-.in
-.sp
+.TP
 \fB-o\fP \fIaction\fP
-.in +7
 The action required.  Reboot (default), On or off.
-.in
-.sp
+.TP
 \fB-q\fP
-.in +7
 Quiet mode: print only error messages.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.sp
-.in
 
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
-
-\fI agent = < param >\fR
-.sp
+.SH STDIN PARAMETERS
+.TP
+\fIagent = < param >\fR
 This option is used by fence_node(8) and is ignored by fence_ibmblade.
-.sp
-
+.TP
 \fIipaddr = < hostname | ip >\fR
-.sp
 IP address or hostname of the switch.
-.sp
-
-\fI community = < param >\fR
-.sp
+.TP
+\fIcommunity = < param >\fR
 SNMP community.
-.sp
-
-\fI option = < param >\fR
-.sp
+.TP
+\fIoption = < param >\fR
 The action required.  reboot (default), on or off.
-.sp
-
-\fI port = < param >\fR
-.sp
+.TP
+\fIport = < param >\fR
 The Blade port number to disable.
-.sp
 
-
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8)
=== man/fence_ilo.8
==================================================================
--- man/fence_ilo.8  (revision 317)
+++ man/fence_ilo.8  (local)
@@ -5,110 +5,72 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_ilo(8)''fence_ilo(8)'
+.TH fence_ilo 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_ilo - I/O Fencing agent for HP Integrated Lights Out card
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_ilo -a\fP \fIIPaddress[:SSLport]\fR \fB-l\fP \fIlogin\fR \fB-p\fP \fIpassword\fR  
-[\fB-o\fP action] [\fB-vqVh\fP] 
+.SH SYNOPSIS
+.B
+fence_ilo
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_ilo is an I/O Fencing agent used for HP servers with the Integrated Light 
 Out (iLO) PCI card.  The agent opens an SSL connection to the iLO card.  Once the
 SSL connection is established, the agent is able to communicate with the iLO
 card through an XML stream.  
-.sp
+
 fence_ilo depends on the Net::SSLeay or Net::SSL perl module in order to establish 
 the SSL connection to the iLO card.  Net::SSL is available in the perl-Crypt-SSLeay package
 on RHN (http://rhn.redhat.com). Net::SSLeay is available at http://www.cpan.org.
-.sp
+
 NOTE: fence_ilo deprecates fence_rib.  
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-a\fP \fIIPaddress[:port]\fR
-.in +7
 IP address or hostname of the iLO card.  If the SSL server of the card is
 not running on the default SSL port, 443, then [:port] will also need to be
 specified.
-.sp
-.in
+.TP
 \fB-h\fP 
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-l\fP \fIlogin\fR
-.in +7
 Login name.
-.sp
-.in
+.TP
 \fB-o\fP \fIaction\fR
-.in +7
 The action required.  reboot (default), off, on or status.
-.in
-.sp
+.TP
 \fB-p\fP \fIpassword\fR
-.in +7
 Password for login.
-.sp
-.in
+.TP
 \fB-v\fP
-.in +7
 Verbose.  
-.in
-.sp
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.in
-.sp
 
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
-
+.SH STDIN PARAMETERS
+.TP
 \fIaction = < param >\fR
-.sp
 The action required.  reboot (default), off, on or status.
-.sp
-
+.TP
 \fIagent = < param >\fR
-.sp
 This option is used by fence_node(8) and is ignored by fence_ilo.
-.sp
-
+.TP
 \fIhostname = < hostname | ip >\fR
-.sp
 IP address or hostname of the iLO card.
-.sp
-
+.TP
 \fIlogin = < param >\fR
-.sp
 Login name.
-.sp
-
+.TP
 \fIpasswd = < param >\fR
-.sp
 Password for login.
-.sp
-
+.TP
 \fIverbose = < param >\fR
-.sp
 Verbose mode.
-.sp
 
-
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8), fence_rib(8)
=== man/fence_manual.8
==================================================================
--- man/fence_manual.8  (revision 317)
+++ man/fence_manual.8  (local)
@@ -5,76 +5,52 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_manual(8)''fence_manual(8)'
+.TH fence_manual 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_manual - program run by fenced as a part of manual I/O Fencing
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_manual -n\fP \fInodename\fP
+.SH SYNOPSIS
+.B
+fence_manual
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_manual is run by fenced.  It creates a fifo and waits for 
 its counter-part fence_ack_manual(8) to acknowledge that a failed node
 has been reset.  fence_ack_manual(8) should only be run after the operator
 has reset the faulty node.  While waiting for the manual acknowledgement,
 fence_manual also watches for the faulty node to rejoin the cluster;
 if it does, it's taken as an acknowledgement and completes.
-.sp
+
 Note:  fence_manual is provided for use during testing and evaluation
 only.  Sites should not use fence_manual as the primary fencing method
 on a production cluster.
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-q\fP
-.in +7
 quiet mode, no output.
-.sp
-.in
+.TP
 \fB-n\fP \fInodename\fP
-.in +7
 The node name (usually hostname) of the machine that needs to be reset or disconnected from shared storage.
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.sp
-.in
-.in -7
 
-\fBSTDIN PARAMETERS\fP
-.in
-
-\fI agent = < param >\fR
-.sp
+.SH STDIN PARAMETERS
+.TP
+\fIagent = < param >\fR
 This option is used by fence_node(8) and is ignored by fence_manual.
-.sp
-
-\fI nodename = < param >\fR
-.sp
+.TP
+\fInodename = < param >\fR
 The node name (usually hostname) of the machine that needs to be reset or disconnected from storage.
-.sp
-
-\fI ipaddr = < param >\fR
-.sp
+.TP
+\fIipaddr = < param >\fR
 IP address or hostname of the machine that needs to be reset or disconnected from storage.  (Deprecated; use nodename instead.)
 
-.in -7
-
-
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8), fence_ack_manual(8)
=== man/fence_mcdata.8
==================================================================
--- man/fence_mcdata.8  (revision 317)
+++ man/fence_mcdata.8  (local)
@@ -5,118 +5,79 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_mcdata(8)''fence_mcdata(8)'
+.TH fence_mcdata 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_mcdata - I/O Fencing agent for McData FC switches
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_mcdata -a\fP \fIIPaddress\fR \fB-l\fP \fIlogin\fR \fB-p\fP \fIpassword\fR \fB-n\fP \fIport\fR \fB-o\fP \fIaction\fR
+.SH SYNOPSIS
+.B
+fence_mcdata
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_mcdata is an I/O Fencing agent which can be used with McData FC 
 switches.  It logs into a McData switch via telnet and disables a specified 
 port.  Disabling the port which a machine is connected to effectively fences 
 that machine.  Lengthy telnet connections to the switch should be avoided 
 while a GFS cluster is running because the connection will block any necessary 
 fencing actions.
-.sp
+
 fence_mcdata accepts options on the command line as well as from stdin.
 fenced sends parameters through stdin when it execs the agent.  fence_mcdata 
 can be run by itself with command line options which is useful for testing.
-.sp
+
 After a fence operation has taken place the fenced machine can no longer connect
 to the McData FC switch.  When the fenced machine is ready to be brought back 
 into the GFS cluster (after reboot) the port on the McData FC switch needs to 
 be enabled. This can be done by running fence_mcdata and specifying the 
 enable action.
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-a\fP \fIIPaddress\fP
-.in +7 
 IP address of the switch.
-.sp
-.in
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-l\fP \fIlogin\fP
-.in +7
 Username name for the switch.
-.sp
-.in
+.TP
 \fB-n\fP \fIport\fP
-.in +7
 The port number to disable on the switch.
-.in
-.sp
+.TP
 \fB-o\fP \fIaction\fP
-.in +7
 The action required.  disable (default) or enable.
-.in
-.sp
+.TP
 \fB-p\fP \fIpassword\fP
-.in +7
 Password for login.
-.sp
-.in
+.TP
 \fB-q\fP
-.in +7
 Quiet mode: print only error messages.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.sp
-.in
 
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
-
-\fI agent = < param >\fR
-.sp
+.SH STDIN PARAMETERS
+.TP
+\fIagent = < param >\fR
 This option is used by fence_node(8) and is ignored by fence_mcdata.
-.sp
-
+.TP
 \fIipaddr = < hostname | ip >\fR
-.sp
 IP address or hostname of the switch.
-.sp
-
-\fI login = < param >\fR
-.sp
+.TP
+\fIlogin = < param >\fR
 Login name.
-.sp
-
-\fI option = < param >\fR
-.sp
+.TP
+\fIoption = < param >\fR
 The action required.  disable (default) or enable.
-.sp
-
-\fI passwd = < param >\fR
-.sp
+.TP
+\fIpasswd = < param >\fR
 Password for login.
-.sp
-
-\fI port = < param >\fR
-.sp
+.TP
+\fIport = < param >\fR
 The port number to disable on the switch.
-.sp
 
-
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8)
=== man/fence_node.8
==================================================================
--- man/fence_node.8  (revision 317)
+++ man/fence_node.8  (local)
@@ -4,50 +4,35 @@
 .\"  This copyrighted material is made available to anyone wishing to use,
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
-.tl 'fence_node(8)''fence_node(8)'
-'\"     View with 'groff -t -e -mandoc -Tlatin1 fence_node.8 | less'
 
-\fBNAME\fP
-.in +7
+.TH fence_node 8
+
+.SH NAME
 fence_node - A program which performs I/O fencing on a single node.
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_node\fP [\fBoptions\fP] <\fBnode\fP>
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH SYNOPSIS
+.B
+fence_node
+[\fIOPTION\fR]...
+
+.SH DESCRIPTION
 \fBfence_node\fP is a program which accumulates all the necessary information
 for I/O fencing a particular node and then performs the fencing action by
 issuing a call to the proper fencing agent.  \fBfence_node\fP gets the
 necessary information from the Cluster Configuration System (CCS).  CCS must
 be running and properly configured for \fBfence_node\fP to work properly.
-.in
 
-\fBOPTIONS\fP
-.in +7
+.SH OPTIONS
+.TP
 \fB-h\fP
-.in +7
 Help.  Print out the usage syntax.
-.sp
-.in
-
+.TP
 \fB-V\fP
-.in +7
 Print version information.
-.sp
-.in -7
-.in -7
-\fBEXAMPLES\fP
-.in +7
+
+.SH EXAMPLES
 To fence a node called ``bellerophon'':
-.in +7
 prompt> fence_node bellerophon
-.in -7
-.sp
-.in -7
-\fBSEE ALSO\fP
-.in +7
+
+.SH SEE ALSO
 fence(8), ccs(7)
=== man/fence_rackswitch.8
==================================================================
--- man/fence_rackswitch.8  (revision 317)
+++ man/fence_rackswitch.8  (local)
@@ -5,102 +5,65 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_rackswitch(8)''fence_rackswitch(8)'
+.TH fence_rackswitch 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_rackswitch - I/O Fencing agent for RackSaver RackSwitch
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_rackswitch -a\fP \fIIPaddress\fR \fB-p\fP \fIpassword\fR \fB-l\fP \fIusername\fR \fB-n\fP \fIplug\fR 
+.SH SYNOPSIS
+.B
+fence_rackswitch
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_rackswitch is an I/O Fencing agent which can be used with the RackSaver 
 RackSwitch.  It logs into the RackSwitch and boots a specified plug.
 Using the http interface to the RackSwitch should be avoided while a GFS cluster is 
 running because the connection may interfere with the operation of this agent.
-.sp
+
 fence_rackswitch accepts options on the command line as well as from stdin.  
 fenced sends the options through stdin when it execs the agent.  fence_rackswitch 
 can be run by itself with command line options which is useful for testing.
-.sp
-.in 
-\fBOPTIONS\fP
-.in 
+
+.SH OPTIONS
+.TP
 \fB-a\fP \fIIPaddress\fP
-.in +7
 IP address of the switch.
-.sp
-.in
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.in
-.sp
+.TP
 \fB-n\fP \fIplug\fP
-.in +7
 The plug number to power cycle.
-.in
-.sp
+.TP
 \fB-p\fP \fIpassword\fP
-.in +7
 Password for login.
-.sp
-.in
-.sp
+.TP
 \fB-l\fP \fIusername\fP
-.in +7
 Username for login.
-.in
-.sp
+.TP
 \fB-q\fP
-.in +7
 Quiet operation.  Only print out error messages.
-.in
-.sp
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.in
-.sp
 
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
-
-\fI agent = < param >\fR
-.sp
+.SH STDIN PARAMETERS
+.TP
+\fIagent = < param >\fR
 This option is used by fence_node(8) and is ignored by fence_rackswitch.
-.sp
-
+.TP
 \fIipaddr = < ip >\fR
-.sp
 IP address of the switch.
-.sp
-
-\fI username = < param >\fR
-.sp
+.TP
+\fIusername = < param >\fR
 Username for login.
-.sp
-
-\fI password = < param >\fR
-.sp
+.TP
+\fIpassword = < param >\fR
 Password for login.
-.sp
-
-\fI port = < param >\fR
-.sp
+.TP
+\fIport = < param >\fR
 The port (outlet) number to act upon.
-.sp
 
-.in
-.sp
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8)
=== man/fence_rib.8
==================================================================
--- man/fence_rib.8  (revision 317)
+++ man/fence_rib.8  (local)
@@ -5,21 +5,13 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_rib(8)''fence_rib(8)'
+.TH fence_rib 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_rib - I/O Fencing agent for Compaq Remote Insight Lights Out card
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
-.sp
+.SH DESCRIPTION
 fence_rib is deprecated.  fence_ilo should be used instead
-.sp
-.in
 
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence_ilo(8)
=== man/fence_sanbox2.8
==================================================================
--- man/fence_sanbox2.8  (revision 317)
+++ man/fence_sanbox2.8  (local)
@@ -5,118 +5,79 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_sanbox2(8)''fence_sanbox2(8)'
+.TH fence_sanbox2 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_sanbox2 - I/O Fencing agent for QLogic SANBox2 FC switches
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_sanbox2 -a\fP \fIIPaddress\fR \fB-l\fP \fIlogin\fR \fB-p\fP \fIpassword\fR \fB-n\fP \fIport\fR \fB-o\fP \fIaction\fR
+.SH SYNOPSIS
+.B
+fence_sanbox2
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_sanbox2 is an I/O Fencing agent which can be used with QLogic SANBox2 FC 
 switches.  It logs into a SANBox2 switch via telnet and disables a specified 
 port.  Disabling the port which a machine is connected to effectively fences 
 that machine.  Lengthy telnet connections to the switch should be avoided 
 while a GFS cluster is running because the connection will block any necessary 
 fencing actions.
-.sp
+
 fence_sanbox2 accepts options on the command line as well as from stdin.
 fenced sends parameters through stdin when it execs the agent.  fence_sanbox2 
 can be run by itself with command line options which is useful for testing.
-.sp
+
 After a fence operation has taken place the fenced machine can no longer connect
 to the switch.  When the fenced machine is ready to be brought back 
 into the GFS cluster (after reboot) the port on the FC switch needs to 
 be enabled. This can be done by running fence_sanbox2 and specifying the 
 enable action.
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-a\fP \fIIPaddress\fP
-.in +7 
 IP address of the switch.
-.sp
-.in
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-l\fP \fIlogin\fP
-.in +7
 Login name for the switch.
-.sp
-.in
+.TP
 \fB-n\fP \fIport\fP
-.in +7
 The port number to disable on the switch.
-.in
-.sp
+.TP
 \fB-o\fP \fIaction\fP
-.in +7
 The action required.  disable (default) or enable.
-.in
-.sp
+.TP
 \fB-p\fP \fIpassword\fP
-.in +7
 Password for login.
-.sp
-.in
+.TP
 \fB-q\fP
-.in +7
 Quiet mode: print only error messages.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.sp
-.in
 
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
-
-\fI agent = < param >\fR
-.sp
+.SH STDIN PARAMETERS
+.TP
+\fIagent = < param >\fR
 This option is used by fence_node(8) and is ignored by fence_sanbox2.
-.sp
-
+.TP
 \fIipaddr = < hostname | ip >\fR
-.sp
 IP address or hostname of the switch.
-.sp
-
-\fI login = < param >\fR
-.sp
+.TP
+\fIlogin = < param >\fR
 Login name.
-.sp
-
-\fI option = < param >\fR
-.sp
+.TP
+\fIoption = < param >\fR
 The action required.  disable (default) or enable.
-.sp
-
-\fI passwd = < param >\fR
-.sp
+.TP
+\fIpasswd = < param >\fR
 Password for login.
-.sp
-
-\fI port = < param >\fR
-.sp
+.TP
+\fIport = < param >\fR
 The port number to disable on the switch.
-.sp
 
-
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8)
=== man/fence_tool.8
==================================================================
--- man/fence_tool.8  (revision 317)
+++ man/fence_tool.8  (local)
@@ -4,21 +4,19 @@
 .\"  This copyrighted material is made available to anyone wishing to use,
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
-.tl 'fence_tool(8)''fence_tool(8)'
-'\"     View with 'groff -t -e -mandoc -Tlatin1 fence_tool.8 | less'
 
-\fBNAME\fP
-.in +7
+.TH fence_tool 8
+
+.SH NAME
 fence_tool - A program to join and leave the fence domain
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_tool\fP <\fBjoin | leave\fP> [\fBoptions\fP]
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH SYNOPSIS
+.B
+fence_tool
+<\fBjoin | leave\fP> 
+[\fIOPTION\fR]...
+
+.SH DESCRIPTION
 \fBfence_tool\fP is a program used to join or leave the default fence
 domain.  Specifically, it starts the fence daemon (fenced) to join the
 domain and kills fenced to leave the domain.  Fenced can be started
@@ -42,54 +40,29 @@
 
 A node must not leave the fence domain (fenced must not be terminated)
 while CLVM or GFS are in use.
-.in
 
-\fBOPTIONS\fP
-
-.in +7
-
+.SH OPTIONS
+.TP
 \fB-h\fP
-.in +7
 Help.  Print out the usage syntax.
-.sp
-.in
-
+.TP
 \fB-V\fP
-.in +7
 Print version information.
-.sp
-.in
-
+.TP
 \fB-S\fP
-.in +7
 Skip self unfencing before joining.
-.sp
-.in
-
+.TP
 \fB-D\fP
-.in +7
 Enable debugging output and don't fork (also passed to fenced)
-.sp
-.in
-
+.TP
 \fB-j\fP \fIsecs\fP
-.in +7
 Post-join fencing delay (passed to fenced)
-.sp
-.in
-
+.TP
 \fB-f\fP \fIsecs\fP
-.in +7
 Post-fail fencing delay (passed to fenced)
-.sp
-.in
-
+.TP
 \fB-c\fP
-.in +7
 All nodes are in a clean state to start (passed to fenced)
-.sp
-.in -7
-.in -7
-\fBSEE ALSO\fP
-.in +7
+
+.SH SEE ALSO
 fenced(8), fence(8), fence_node(8)
=== man/fence_vixel.8
==================================================================
--- man/fence_vixel.8  (revision 317)
+++ man/fence_vixel.8  (local)
@@ -5,97 +5,67 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_vixel(8)''fence_vixel(8)'
+.TH fence_vixel 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_vixel - I/O Fencing agent for Vixel FC switches
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_vixel [-hV] -a\fP \fIIPaddress\fR \fB-p\fP \fIpassword\fR \fB-n\fP \fIport\fR
+.SH SYNOPSIS
+.B
+fence_vixel
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_vixel is an I/O Fencing agent which can be used with Vixel FC switches.
 It logs into a Vixel switch via telnet and removes the specified port from the
 zone.  Removing the zone access from the port disables the port from being able
 to access the storage.  
-.sp
+
 fence_vixel accepts options on the command line as well as from stdin.
 fenced sends parameters through stdin when it execs the agent.  fence_vixel 
 can be run by itself with command line options which is useful for testing.
-.sp
+
 After a fence operation has taken place the fenced machine can no longer 
 connect to the Vixel FC switch.  When the fenced machine is ready to be brought
 back into the GFS cluster (after reboot) the port on the Vixel FC switch needs 
 to be enabled. In order to do this, log into the Vixel FC switch. Then go to:
-.in
-.in +7
+
 config->zones->config <port> <comma-separated-list-of-ports-in-the-zone>
-.in
-.in +7
+
 Then apply
-.sp
+
 Consult the Vixel manual for details
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-a\fP \fIIPaddress\fP
-.in +7
 IP address of the switch.
-.sp
-.in
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-n\fP \fIport\fP
-.in +7
 The port number to remove zoning from on the switch.
-.in
-.sp
+.TP
 \fB-p\fP \fIpassword\fP
-.in +7
 Password for login.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.sp
-.in
 
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
-
+.SH STDIN PARAMETERS
+.TP
 \fIagent = < param >\fR
-.sp
 This option is used by fence_node(8) and is ignored by fence_vixel.
-.sp
-
+.TP
 \fIipaddr = < hostname | ip >\fR
-.sp
 IP address or hostname of the switch.
-.sp
-
+.TP
 \fIpasswd = < param >\fR
-.sp
 Password for login.
-.sp
-
+.TP
 \fIport = < param >\fR
-.sp
 The port number to remove zoning from on the switch.
-.sp
 
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH BSEE ALSO
 fence(8), fence_node(8)
=== man/fence_wti.8
==================================================================
--- man/fence_wti.8  (revision 317)
+++ man/fence_wti.8  (local)
@@ -5,99 +5,65 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_wti(8)''fence_wti(8)'
+.TH fence_wti 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_wti - I/O Fencing agent for WTI Network Power Switch
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_wti -a\fP \fIIPaddress\fR \fB-p\fP \fIpassword\fR \fB-n\fP \fIplug\fR [\fB-T\fP]
+.SH SYNOPSIS
+.B
+fence_wti
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_wti is an I/O Fencing agent which can be used with the WTI Network 
 Power Switch (NPS).  It logs into an NPS via telnet and boots a specified plug.
 Lengthy telnet connections to the NPS should be avoided while a GFS cluster is 
 running because the connection will block any necessary fencing actions.
-.sp
+
 fence_wti accepts options on the command line as well as from stdin.  
 fenced sends the options through stdin when it execs the agent.  fence_wti 
 can be run by itself with command line options which is useful for testing.
-.sp
-.in 
-\fBOPTIONS\fP
-.in 
+
+.SH OPTIONS
+.TP
 \fB-a\fP \fIIPaddress\fP
-.in +7
 IP address of the switch.
-.sp
-.in
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.in
-.sp
+.TP
 \fB-n\fP \fIplug\fP
-.in +7
 The plug number to power cycle.
-.in
-.sp
+.TP
 \fB-p\fP \fIpassword\fP
-.in +7
 Password for login.
-.sp
-.in
+.TP
 \fB-T\fP
-.in +7
 Test only.  Do not power cycle.  Reports state of the plug.
-.in
-.sp
+.TP
 \fB-q\fP
-.in +7
 Quiet operation.  Only print out error messages.
-.in
-.sp
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.in
-.sp
 
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
-
-\fI agent = < param >\fR
-.sp
+.SH STDIN PARAMETERS
+.TP
+\fIagent = < param >\fR
 This option is used by fence_node(8) and is ignored by fence_wti.
-.sp
-
+.TP
 \fIipaddr = < hostname | ip >\fR
-.sp
 IP address or hostname of the switch.
-.sp
-
-\fI passwd = < param >\fR
-.sp
+.TP
+\fIpasswd = < param >\fR
 Password for login.
-.sp
-
-\fI port = < param >\fR
-.sp
+.TP
+\fIport = < param >\fR
 The outlet number to act upon.
-.sp
-
-\fI test = < param >\fR
-.sp
+.TP
+\fItest = < param >\fR
 Test only.  Answer NO to the confirmation prompt instead of YES.
-.sp
 
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8)
=== man/fence_xcat.8
==================================================================
--- man/fence_xcat.8  (revision 317)
+++ man/fence_xcat.8  (local)
@@ -1,94 +1,63 @@
 .\"  Copyright (C) Sistina Software, Inc.  1997-2003  All rights reserved.
 
-.tl 'fence_xcat(8)''fence_xcat(8)'
+.TH fence_xcat 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_xcat - I/O Fencing agent for xcat environments
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_xcat -n\fP \fInodename\fR \fB -o\fP \fIaction\fR \fB -r\fP \fIrpower\fR 
+.SH SYNOPSIS
+.B
+fence_xcat
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_xcat is a wrapper to the rpower(1) command that is distributed
 with the xCAT project available at http://www.xcat.org.  Use of 
 fence_xcat requires that xcat has already been properlly configfured
 for your environment.  Refer to xCAT(1) for more information on 
 configuring xCAT.
-.sp
+
 fence_xcat accepts options on the command line as well as from stdin.
 fenced sends parameters through stdin when it execs the agent.  fence_xcat 
 can be run by itself with command line options which is useful for testing.
-.sp
+
 NOTE: It is recommended that fence_bladecenter(8) is used instead of fence_xcat if
 the bladecenter firmware supports telnet.  This interface is much cleaner and
 easier to setup.
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-n\fP \fInodename\fP
-.in +7
 The nodename as defined in nodelist.tab of the xCAT setup.
-.in
-.sp
+.TP
 \fB-o\fP \fIaction\fP
-.in +7
 The action required.  on, off, reset (default) or stat.
-.in
-.sp
+.TP
 \fB-r\fP \fIrpower\fP
-.in +7
 The path to the rpower binary.
-.sp
-.in
+.TP
 \fB-q\fP
-.in +7
 Quiet mode: print only error messages.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.sp
-.in
 
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
-
-\fI agent = < param >\fR
-.sp
+.SH STDIN PARAMETERS
+.TP
+\fIagent = < param >\fR
 This option is used by fence_node(8) and is ignored by fence_xcat.
-.sp
-
-\fI nodename = < param >\fR
-.sp
+.TP
+\fInodename = < param >\fR
 The nodename as defined in nodelist.tab of the xCAT setup.
-.sp
-
-\fI action = < param >\fR
-.sp
+.TP
+\fIaction = < param >\fR
 The action required.  on, off, reset (default) or stat.
-.sp
-
-\fI rpower = < param >\fR
-.sp
+.TP
+\fIrpower = < param >\fR
 The path to the rpower binary.
-.sp
 
-
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fence_node(8), fence_bladecenter(8), nodelist.tab(8), rpower(1), xCAT(1)
=== man/fence_zvm.8
==================================================================
--- man/fence_zvm.8  (revision 317)
+++ man/fence_zvm.8  (local)
@@ -5,89 +5,59 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fence_zvm(8)''fence_zvm(8)'
+.TH fence_zvm 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_zvm - I/O Fencing agent for GFS on s390 and zSeries VM clusters
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_zvm -a\fP \fIIPaddress\fP \fB-u\fP \fIuserid\fP \fB-p\fP \fIpassword\fP
+.SH SYNOPSIS
+.B
+fence_zvm
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_zvm is an I/O Fencing agent used on a GFS virtual machine in a s390 or zSeries VM cluster.
 It uses the s3270 program to log the specified virtual machine out of VM.
 For fence_zvm to execute correctly, you must have s3270 in your PATH.
 
-.sp
 fence_zvm accepts options on the command line as well as from stdin.
 fence_node sends the options through stdin when it execs the agent.
 fence_zvm can be run by itself with command line options which is useful
 for testing.
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-a\fP \fIIPaddress\fP
-.in +7
 IP address or hostname of the Physical machine (required).
-.sp
-.in
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-u\fP \fIuserid\fP
-.in +7
 userid of the virtual machine to fence (required).
-.sp
-.in
+.TP
 \fB-p\fP \fIpassword\fP
-.in +7
 password of the virtual machine to fence (required).
-.sp
-.in
+.TP
 \fB-q\fP
-.in +7
 quiet mode, no output.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.sp
-.in
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in
 
+.SH STDIN PARAMETERS
+.TP
 \fIagent = < param >\fP
-.sp
 This option is used by fence_node(8) and is ignored by fence_zvm.
-.sp
-
+.TP
 \fIipaddr = < hostname | ip >\fP
-.sp
 IP address or hostname of the Physical machine (required).
-.sp
-
+.TP
 \fIpasswd = < param >\fP
-.sp
 password of the virtual machine to fence (required).
-.sp
-
+.TP
 \fIuserid = < param >\fP
-.sp
 userid of the virtual machine to fence (required).
-.sp
 
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 fence(8), fenced(8), fence_node(8)
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050213/a4f003b5/attachment.sig>

From pcaulfie at redhat.com  Mon Feb 14 15:12:32 2005
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Mon, 14 Feb 2005 15:12:32 +0000
Subject: [Linux-cluster] cluster lost quorum after 11 hours
In-Reply-To: <1108169257.5927.12.camel@ibm-c.pdx.osdl.net>
References: <1108169257.5927.12.camel@ibm-c.pdx.osdl.net>
Message-ID: <20050214151232.GC13778@tykepenguin.com>

On Fri, Feb 11, 2005 at 04:47:38PM -0800, Daniel McNeil wrote:
> I was running my test on a 3 node cluster and it died
> after 11 hours.  cl030 lost quorum with the other 2 nodes
> kicked out of the cluster.  cl031 also hit a bunch of asserts
> like
>     lock_dlm:  Assertion failed on line 352 of file  
>     /Views/redhat-cluster/cluster/gfs-kernel/src/dlm/lock.c
>     lock_dlm:  assertion:  "!error"
>     lock_dlm:  time = 291694516
>     stripefs: error=-22 num=2,19
> I assume is caused by the cluster shutting down.
> 
> 
> /var/log/messages showed:
> 
> cl030:
> Feb 11 02:44:33 cl030 kernel: CMAN: removing node cl032a from the cluster : No response to messages
> Feb 11 02:44:33 cl030 kernel: CMAN: removing node cl031a from the cluster : No response to messages
> Feb 11 02:44:33 cl030 kernel: CMAN: quorum lost, blocking activity
> Feb 11 14:40:33 cl030 sshd(pam_unix)[27323]: session opened for user root by (uid=0)


You should only get nodes dying from "No response to messages" during a state 
transition of some sort (eg a node leaving or joining or possibly a GFS
mount/dismount). In which case the DLM has to do recovery. I recently checked in
a couple of changes that will stop the DLM recovery from taking over the
machine when there are several thousand locks to recover, that might help.

During a normal "steady" state, a node should not die from 
"No response to messages" because the only messages that are being sent are
HELLO heartbeat messages and they are not acked.
-- 

patrick


From danderso at redhat.com  Mon Feb 14 15:15:25 2005
From: danderso at redhat.com (Derek Anderson)
Date: Mon, 14 Feb 2005 09:15:25 -0600
Subject: [Linux-cluster] config update kills cluster
In-Reply-To: <20050212154216.GA3682@wavehammer.waldi.eu.org>
References: <20050212154216.GA3682@wavehammer.waldi.eu.org>
Message-ID: <200502140915.25156.danderso@redhat.com>

On Saturday 12 February 2005 09:42, Bastian Blank wrote:
> Hi all
>
> I just tried config update via ccs_tool in a cman cluster. Each of the
> nodes got the new config but the kernel rejects joins with
>
> | CMAN: Join request from gfs1 rejected, config version local 1 remote 2

After updating the config file with ccs_tool you should notify cman of the new 
config version.  So, like 'cman_tool version -r 2', where 2 is your updated 
version.  Please try this and see if nodes can then join.

>
> The cluster is running CVS from 2005-02-06.
>
> Bastian


From yazan at ccs.com.jo  Mon Feb 14 15:36:09 2005
From: yazan at ccs.com.jo (Yazan Al-Sheyyab)
Date: Mon, 14 Feb 2005 17:36:09 +0200
Subject: [Linux-cluster] mail on cluster
Message-ID: <001901c512aa$e7d45600$69050364@yazanz>

hi all,

  i want to begin a project about making a mail server on tow clustered
servers.
 i want to know if i need the GFS during my work (because i dont have a
shared storage here) and if any software from erdhat can i buy for
mailserver or is the send mail service built on the system enough to do so
in use?

 i want any to help me or to give me what i need to have and to do before i
begin this task.

 what i need to begin?

  i know some of you are using this project , i really need to know what i
need exactlly befor begin?

  (GFS,suites,raws........ ????).

Regards.


From ialberdi at histor.fr  Mon Feb 14 16:59:05 2005
From: ialberdi at histor.fr (Ion Alberdi)
Date: Mon, 14 Feb 2005 17:59:05 +0100
Subject: [Linux-cluster] Unique device on each node
Message-ID: <4210D8D9.6060201@histor.fr>

I'm working with rgmanager, and i have a question.
Does a device needed by a service put in failover, must have the same 
name on each node the service could eventually run?
More precisely is this configuration:
a service that needs to mount
*/dev/hda1 in /mnt/fs when he runs on node1
*/dev/hdb1 in /mnt/fs running on node2
impossible to implement with this cluster?

The configuration file cluster.conf is the same for all the nodes and I 
haven't already found a way to say in the cluster.conf:
 in node1 do that,in node2 do that, so I assume it is imposible.

I would like to have a confirmation for that.

Thanks in advance for all the eventual answers.


From bastian at waldi.eu.org  Mon Feb 14 17:06:51 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Mon, 14 Feb 2005 18:06:51 +0100
Subject: [Linux-cluster] dlm - update dlm32 layer
In-Reply-To: <20050214084652.GC5724@tykepenguin.com>
References: <20050213223947.GB28716@wavehammer.waldi.eu.org>
	<20050214084652.GC5724@tykepenguin.com>
Message-ID: <20050214170651.GD20443@wavehammer.waldi.eu.org>

On Mon, Feb 14, 2005 at 08:46:52AM +0000, Patrick Caulfield wrote:
> I'm working on this, but the patch you sent last seems to break queries (even on
> i386) and I haven't got the bottom of why yet.

My first patch was broken, I know. But how can I check if it works?

clvmd does not report errors and seems to work correctly on i386.

Bastian

-- 
You!  What PLANET is this!
		-- McCoy, "The City on the Edge of Forever", stardate 3134.0
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050214/f8c1bd5e/attachment.sig>

From bujan at isqsolutions.com  Mon Feb 14 18:06:54 2005
From: bujan at isqsolutions.com (Manuel Bujan)
Date: Mon, 14 Feb 2005 13:06:54 -0500
Subject: [Linux-cluster] ccsd error in new cvs
References: <20050213223947.GB28716@wavehammer.waldi.eu.org><20050214084652.GC5724@tykepenguin.com>
	<20050214170651.GD20443@wavehammer.waldi.eu.org>
Message-ID: <00ef01c512bf$f71c07b0$5001a8c0@pcbujan>

Helo

We are trying to install a gfs cluster from scratch based on current cvs 
code and we are getting the following when we tried to run ccsd:

Unable to connect to cluster infrastructure after 30 seconds.
Unable to connect to cluster infrastructure after 60 seconds.
Unable to connect to cluster infrastructure after 90 seconds.
Unable to connect to cluster infrastructure after 120 seconds.
Unable to connect to cluster infrastructure after 150 seconds.
................

Any hints ?

We are running kernel 2.6.10, and we succesfully compiled everything inside.

Regards
Bujan


From yazan at ccs.com.jo  Mon Feb 14 18:33:49 2005
From: yazan at ccs.com.jo (Yazan Al-Sheyyab)
Date: Mon, 14 Feb 2005 20:33:49 +0200
Subject: [Linux-cluster] raw device
Message-ID: <011701c512c3$ba1a84f0$69050364@yazanz>

hi all,

  maybe i asked this question before So please execuse me about this.

 can i put more than one rawdevice in the same partition, i mean can i put
in the /etc/sysconfig/rawdevices file more than one raw related with same
partition as :

  /dev/raw/raw5 /dev/cciss/c0d0p3
  /dev/raw/raw6 /dev/cciss/c0d0p3
 .......... etc.

  ????????????????/

   and if so or not    can i use partition generated by LVM for rawdevices
???????

  it maybe a rediculus question for you but i really need to know about.

  Thanks alot if you can help me.

  Regards.


From jbrassow at redhat.com  Mon Feb 14 18:41:46 2005
From: jbrassow at redhat.com (Jonathan E Brassow)
Date: Mon, 14 Feb 2005 12:41:46 -0600
Subject: [Linux-cluster] ccs - fix -p parameter of ccsd
In-Reply-To: <20050213222659.GA28716@wavehammer.waldi.eu.org>
References: <20050213222659.GA28716@wavehammer.waldi.eu.org>
Message-ID: <b6e214ad345b9701c5caec85e949cb24@redhat.com>

Did you need the -p parameter?

Right now, the code is there to handle it, but it is disabled.  It is 
also not mentioned in the man page or in the usage summary.  This 
parameter was something that we thought about including but no 
consensus was ever reached - which is why the code is there, but the 
option is not handled.

  brassow

On Feb 13, 2005, at 4:26 PM, Bastian Blank wrote:

> Hi folks
>
> The attached patch fixes the -p parameter of ccsd.
>
> Bastian
>
> -- 
> It would be illogical to assume that all conditions remain stable.
> 		-- Spock, "The Enterprise Incident", stardate 5027.3
> <diff>--
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster


From bastian at waldi.eu.org  Mon Feb 14 18:46:20 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Mon, 14 Feb 2005 19:46:20 +0100
Subject: [Linux-cluster] ccs - fix -p parameter of ccsd
In-Reply-To: <b6e214ad345b9701c5caec85e949cb24@redhat.com>
References: <20050213222659.GA28716@wavehammer.waldi.eu.org>
	<b6e214ad345b9701c5caec85e949cb24@redhat.com>
Message-ID: <20050214184620.GA28838@wavehammer.waldi.eu.org>

On Mon, Feb 14, 2005 at 12:41:46PM -0600, Jonathan E Brassow wrote:
> Did you need the -p parameter?

I currently use it in the debian packages to overwrite the pid file
location.

Bastian

-- 
In the strict scientific sense we all feed on death -- even vegetarians.
		-- Spock, "Wolf in the Fold", stardate 3615.4
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050214/275fb45b/attachment.sig>

From jbrassow at redhat.com  Mon Feb 14 18:48:14 2005
From: jbrassow at redhat.com (Jonathan E Brassow)
Date: Mon, 14 Feb 2005 12:48:14 -0600
Subject: [Linux-cluster] ccsd error in new cvs
In-Reply-To: <00ef01c512bf$f71c07b0$5001a8c0@pcbujan>
References: <20050213223947.GB28716@wavehammer.waldi.eu.org><20050214084652.GC5724@tykepenguin.com>
	<20050214170651.GD20443@wavehammer.waldi.eu.org>
	<00ef01c512bf$f71c07b0$5001a8c0@pcbujan>
Message-ID: <3d015bc4b486a840262cc7e97ae4358f@redhat.com>

This is not an error.  It simply means that you haven't started the 
cluster manager yet.  (That is, you haven't run 'cman_tool join -c 
<cluster name>' yet -- or lock_gulmd if you are using gulm.)

  brassow

On Feb 14, 2005, at 12:06 PM, Manuel Bujan wrote:

> Helo
>
> We are trying to install a gfs cluster from scratch based on current 
> cvs code and we are getting the following when we tried to run ccsd:
>
> Unable to connect to cluster infrastructure after 30 seconds.
> Unable to connect to cluster infrastructure after 60 seconds.
> Unable to connect to cluster infrastructure after 90 seconds.
> Unable to connect to cluster infrastructure after 120 seconds.
> Unable to connect to cluster infrastructure after 150 seconds.
> ................
>
> Any hints ?
>
> We are running kernel 2.6.10, and we succesfully compiled everything 
> inside.
>
> Regards
> Bujan
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster
>


From bastian at waldi.eu.org  Mon Feb 14 18:47:47 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Mon, 14 Feb 2005 19:47:47 +0100
Subject: [Linux-cluster] raw device
In-Reply-To: <011701c512c3$ba1a84f0$69050364@yazanz>
References: <011701c512c3$ba1a84f0$69050364@yazanz>
Message-ID: <20050214184747.GB28838@wavehammer.waldi.eu.org>

On Mon, Feb 14, 2005 at 08:33:49PM +0200, Yazan Al-Sheyyab wrote:
>  can i put more than one rawdevice in the same partition

Raw devices are deprecated, use O_DIRECT on the standard device.

Bastian

-- 
If there are self-made purgatories, then we all have to live in them.
		-- Spock, "This Side of Paradise", stardate 3417.7
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050214/08237bc2/attachment.sig>

From bujan at isqsolutions.com  Mon Feb 14 19:14:17 2005
From: bujan at isqsolutions.com (Manuel Bujan)
Date: Mon, 14 Feb 2005 14:14:17 -0500
Subject: [Linux-cluster] ccsd error in new cvs
References: <20050213223947.GB28716@wavehammer.waldi.eu.org><20050214084652.GC5724@tykepenguin.com><20050214170651.GD20443@wavehammer.waldi.eu.org><00ef01c512bf$f71c07b0$5001a8c0@pcbujan>
	<3d015bc4b486a840262cc7e97ae4358f@redhat.com>
Message-ID: <013c01c512c9$6297d010$5001a8c0@pcbujan>

Hello,

 I'm sorry may be our problem is related with cman instead of with ccsd. Any 
way in any case when I tried to run cman I get a segmentation fault

Regards
Bujan

---------ccsd trace---------------------
Starting ccsd DEVEL.1108401572:
 Built: Feb 14 2005 12:20:28
 Copyright (C) Red Hat, Inc.  2004  All rights reserved.
  No Daemon:: SET

cluster.conf (cluster name = ISQCLUSTER, version = 14) found.
Unable to connect to cluster infrastructure after 30 seconds.

-------cman trace------------------------
# cman_tool join -c ISQCLUSTER
multicast address 224.0.0.9
multicast address 224.0.0.1
multicast address 224.0.0.9
multicast address 224.0.0.1
multicast address 224.0.0.9
multicast address 224.0.0.1
..........

Segmentation fault

That's what are we getting

Regards

####################
cluster.conf:
####################
<?xml version="1.0"?>
<cluster name="ISQCLUSTER" config_version="14">
        <cman two_node="1" expected_votes="1">
                <multicast addr="224.0.0.1"/>
                <multicast addr="224.0.0.9"/>
        </cman>
        <clusternodes>
                <clusternode name="atmail-1" nodeid="1" votes="1">
                        <altname name="cluster1"/>
                        <multicast addr="224.0.0.1" interface="eth0"/>
                        <multicast addr="224.0.0.9" interface="eth2"/>
                        <fence>
                                <method name="apcshut">
                                        <device name="apcswitch" port="1"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="atmail-2" nodeid="2" votes="1">
                        <altname name="cluster2"/>
                        <multicast addr="224.0.0.1" interface="eth0"/>
                        <multicast addr="224.0.0.9" interface="eth2"/>
                        <fence>
                                <method name="apcshut">
                                        <device name="apcswitch" port="2"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <fencedevices>
                <fencedevice name="apcswitch" agent="fence_apc" 
ipaddr="192.168.
0.120" login="apc" passwd="r3hd3"/>
        </fencedevices>
</cluster>

#########################################


----- Original Message ----- 
From: "Jonathan E Brassow" <jbrassow at redhat.com>
To: "linux clistering" <linux-cluster at redhat.com>
Sent: Monday, February 14, 2005 1:48 PM
Subject: Re: [Linux-cluster] ccsd error in new cvs


> This is not an error.  It simply means that you haven't started the 
> cluster manager yet.  (That is, you haven't run 'cman_tool join -c 
> <cluster name>' yet -- or lock_gulmd if you are using gulm.)
>
>  brassow
>
> On Feb 14, 2005, at 12:06 PM, Manuel Bujan wrote:
>
>> Helo
>>
>> We are trying to install a gfs cluster from scratch based on current cvs 
>> code and we are getting the following when we tried to run ccsd:
>>
>> Unable to connect to cluster infrastructure after 30 seconds.
>> Unable to connect to cluster infrastructure after 60 seconds.
>> Unable to connect to cluster infrastructure after 90 seconds.
>> Unable to connect to cluster infrastructure after 120 seconds.
>> Unable to connect to cluster infrastructure after 150 seconds.
>> ................
>>
>> Any hints ?
>>
>> We are running kernel 2.6.10, and we succesfully compiled everything 
>> inside.
>>
>> Regards
>> Bujan
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> http://www.redhat.com/mailman/listinfo/linux-cluster
>>
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster
> 


From jbrassow at redhat.com  Mon Feb 14 19:53:31 2005
From: jbrassow at redhat.com (Jonathan E Brassow)
Date: Mon, 14 Feb 2005 13:53:31 -0600
Subject: [Linux-cluster] ccsd error in new cvs
In-Reply-To: <013c01c512c9$6297d010$5001a8c0@pcbujan>
References: <20050213223947.GB28716@wavehammer.waldi.eu.org><20050214084652.GC5724@tykepenguin.com><20050214170651.GD20443@wavehammer.waldi.eu.org><00ef01c512bf$f71c07b0$5001a8c0@pcbujan>
	<3d015bc4b486a840262cc7e97ae4358f@redhat.com>
	<013c01c512c9$6297d010$5001a8c0@pcbujan>
Message-ID: <f9b7aa17c5220ef926a7650f9e1a3e51@redhat.com>

Try the attached patch.

This patch has been applied to the code in cvs

  brassow

-------------- next part --------------
A non-text attachment was scrubbed...
Name: join_ccs.c-patch
Type: application/octet-stream
Size: 637 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050214/2502b738/attachment.obj>
-------------- next part --------------


On Feb 14, 2005, at 1:14 PM, Manuel Bujan wrote:

> Hello,
>
> I'm sorry may be our problem is related with cman instead of with 
> ccsd. Any way in any case when I tried to run cman I get a 
> segmentation fault
>
> Regards
> Bujan
>
> ---------ccsd trace---------------------
> Starting ccsd DEVEL.1108401572:
> Built: Feb 14 2005 12:20:28
> Copyright (C) Red Hat, Inc.  2004  All rights reserved.
>  No Daemon:: SET
>
> cluster.conf (cluster name = ISQCLUSTER, version = 14) found.
> Unable to connect to cluster infrastructure after 30 seconds.
>
> -------cman trace------------------------
> # cman_tool join -c ISQCLUSTER
> multicast address 224.0.0.9
> multicast address 224.0.0.1
> multicast address 224.0.0.9
> multicast address 224.0.0.1
> multicast address 224.0.0.9
> multicast address 224.0.0.1
> ..........
>
> Segmentation fault
>
> That's what are we getting
>
> Regards
>
> ####################
> cluster.conf:
> ####################
> <?xml version="1.0"?>
> <cluster name="ISQCLUSTER" config_version="14">
>        <cman two_node="1" expected_votes="1">
>                <multicast addr="224.0.0.1"/>
>                <multicast addr="224.0.0.9"/>
>        </cman>
>        <clusternodes>
>                <clusternode name="atmail-1" nodeid="1" votes="1">
>                        <altname name="cluster1"/>
>                        <multicast addr="224.0.0.1" interface="eth0"/>
>                        <multicast addr="224.0.0.9" interface="eth2"/>
>                        <fence>
>                                <method name="apcshut">
>                                        <device name="apcswitch" 
> port="1"/>
>                                </method>
>                        </fence>
>                </clusternode>
>                <clusternode name="atmail-2" nodeid="2" votes="1">
>                        <altname name="cluster2"/>
>                        <multicast addr="224.0.0.1" interface="eth0"/>
>                        <multicast addr="224.0.0.9" interface="eth2"/>
>                        <fence>
>                                <method name="apcshut">
>                                        <device name="apcswitch" 
> port="2"/>
>                                </method>
>                        </fence>
>                </clusternode>
>        </clusternodes>
>        <fencedevices>
>                <fencedevice name="apcswitch" agent="fence_apc" 
> ipaddr="192.168.
> 0.120" login="apc" passwd="r3hd3"/>
>        </fencedevices>
> </cluster>
>
> #########################################
>
>
>
>
> ----- Original Message ----- From: "Jonathan E Brassow" 
> <jbrassow at redhat.com>
> To: "linux clistering" <linux-cluster at redhat.com>
> Sent: Monday, February 14, 2005 1:48 PM
> Subject: Re: [Linux-cluster] ccsd error in new cvs
>
>
>> This is not an error.  It simply means that you haven't started the 
>> cluster manager yet.  (That is, you haven't run 'cman_tool join -c 
>> <cluster name>' yet -- or lock_gulmd if you are using gulm.)
>>
>>  brassow
>>
>> On Feb 14, 2005, at 12:06 PM, Manuel Bujan wrote:
>>
>>> Helo
>>>
>>> We are trying to install a gfs cluster from scratch based on current 
>>> cvs code and we are getting the following when we tried to run ccsd:
>>>
>>> Unable to connect to cluster infrastructure after 30 seconds.
>>> Unable to connect to cluster infrastructure after 60 seconds.
>>> Unable to connect to cluster infrastructure after 90 seconds.
>>> Unable to connect to cluster infrastructure after 120 seconds.
>>> Unable to connect to cluster infrastructure after 150 seconds.
>>> ................
>>>
>>> Any hints ?
>>>
>>> We are running kernel 2.6.10, and we succesfully compiled everything 
>>> inside.
>>>
>>> Regards
>>> Bujan
>>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> http://www.redhat.com/mailman/listinfo/linux-cluster
>>>
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> http://www.redhat.com/mailman/listinfo/linux-cluster
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster
>

From bastian at waldi.eu.org  Mon Feb 14 20:35:06 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Mon, 14 Feb 2005 21:35:06 +0100
Subject: [Linux-cluster] fence - fence_ack_manual does not check nodename
Message-ID: <20050214203506.GA7595@wavehammer.waldi.eu.org>

Hi folks

Is it expected that fence_ack_manual wants a nodename but do't checks
them?

Bastian

-- 
I'm a soldier, not a diplomat.  I can only tell the truth.
		-- Kirk, "Errand of Mercy", stardate 3198.9
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050214/7503414e/attachment.sig>

From bujan at isqsolutions.com  Mon Feb 14 20:45:14 2005
From: bujan at isqsolutions.com (Manuel Bujan)
Date: Mon, 14 Feb 2005 15:45:14 -0500
Subject: [Linux-cluster] ccsd error in new cvs
References: <20050213223947.GB28716@wavehammer.waldi.eu.org><20050214084652.GC5724@tykepenguin.com><20050214170651.GD20443@wavehammer.waldi.eu.org><00ef01c512bf$f71c07b0$5001a8c0@pcbujan><3d015bc4b486a840262cc7e97ae4358f@redhat.com><013c01c512c9$6297d010$5001a8c0@pcbujan>
	<f9b7aa17c5220ef926a7650f9e1a3e51@redhat.com>
Message-ID: <017c01c512d6$17e593b0$5001a8c0@pcbujan>


Thanks for the patch,
Now everything is working fine and we are able to start the cluster and 
mount a GFS partition of around 100 GB.

I checked the logs and everything appears to be OK, except one line related 
to the LVM2 that state:

Feb 14 15:31:36 atmail-1 lvm[4546]: locking_type not set correctly in 
lvm.conf, cluster operations will not work.
Feb 14 15:34:15 atmail-1 /sbin/hotplug: no runnable /etc/hotplug/block.agent 
is installed

I'm testing only with one of our nodes before upgrade to the second one 
because we one to be sure everything is working.
What that message means ?

Globally our lvm.conf has the following setup:
> locking_type = 1
> locking_dir = "/var/lock/lvm"

Which value of locking_type has to be used with a two node cman based 
cluster ?

Regards
Bujan


----- Original Message ----- 
From: "Jonathan E Brassow" <jbrassow at redhat.com>
To: "linux clistering" <linux-cluster at redhat.com>
Sent: Monday, February 14, 2005 2:53 PM
Subject: Re: [Linux-cluster] ccsd error in new cvs


> Try the attached patch.
>
> This patch has been applied to the code in cvs
>
>  brassow
>
>


--------------------------------------------------------------------------------


>
>
>
> On Feb 14, 2005, at 1:14 PM, Manuel Bujan wrote:
>
>> Hello,
>>
>> I'm sorry may be our problem is related with cman instead of with
>> ccsd. Any way in any case when I tried to run cman I get a
>> segmentation fault
>>
>> Regards
>> Bujan
>>
>> ---------ccsd trace---------------------
>> Starting ccsd DEVEL.1108401572:
>> Built: Feb 14 2005 12:20:28
>> Copyright (C) Red Hat, Inc.  2004  All rights reserved.
>>  No Daemon:: SET
>>
>> cluster.conf (cluster name = ISQCLUSTER, version = 14) found.
>> Unable to connect to cluster infrastructure after 30 seconds.
>>
>> -------cman trace------------------------
>> # cman_tool join -c ISQCLUSTER
>> multicast address 224.0.0.9
>> multicast address 224.0.0.1
>> multicast address 224.0.0.9
>> multicast address 224.0.0.1
>> multicast address 224.0.0.9
>> multicast address 224.0.0.1
>> ..........
>>
>> Segmentation fault
>>
>> That's what are we getting
>>
>> Regards
>>
>> ####################
>> cluster.conf:
>> ####################
>> <?xml version="1.0"?>
>> <cluster name="ISQCLUSTER" config_version="14">
>>        <cman two_node="1" expected_votes="1">
>>                <multicast addr="224.0.0.1"/>
>>                <multicast addr="224.0.0.9"/>
>>        </cman>
>>        <clusternodes>
>>                <clusternode name="atmail-1" nodeid="1" votes="1">
>>                        <altname name="cluster1"/>
>>                        <multicast addr="224.0.0.1" interface="eth0"/>
>>                        <multicast addr="224.0.0.9" interface="eth2"/>
>>                        <fence>
>>                                <method name="apcshut">
>>                                        <device name="apcswitch"
>> port="1"/>
>>                                </method>
>>                        </fence>
>>                </clusternode>
>>                <clusternode name="atmail-2" nodeid="2" votes="1">
>>                        <altname name="cluster2"/>
>>                        <multicast addr="224.0.0.1" interface="eth0"/>
>>                        <multicast addr="224.0.0.9" interface="eth2"/>
>>                        <fence>
>>                                <method name="apcshut">
>>                                        <device name="apcswitch"
>> port="2"/>
>>                                </method>
>>                        </fence>
>>                </clusternode>
>>        </clusternodes>
>>        <fencedevices>
>>                <fencedevice name="apcswitch" agent="fence_apc"
>> ipaddr="192.168.
>> 0.120" login="apc" passwd="r3hd3"/>
>>        </fencedevices>
>> </cluster>
>>
>> #########################################
>>
>>
>>
>>
>> ----- Original Message ----- From: "Jonathan E Brassow"
>> <jbrassow at redhat.com>
>> To: "linux clistering" <linux-cluster at redhat.com>
>> Sent: Monday, February 14, 2005 1:48 PM
>> Subject: Re: [Linux-cluster] ccsd error in new cvs
>>
>>
>>> This is not an error.  It simply means that you haven't started the
>>> cluster manager yet.  (That is, you haven't run 'cman_tool join -c
>>> <cluster name>' yet -- or lock_gulmd if you are using gulm.)
>>>
>>>  brassow
>>>
>>> On Feb 14, 2005, at 12:06 PM, Manuel Bujan wrote:
>>>
>>>> Helo
>>>>
>>>> We are trying to install a gfs cluster from scratch based on current
>>>> cvs code and we are getting the following when we tried to run ccsd:
>>>>
>>>> Unable to connect to cluster infrastructure after 30 seconds.
>>>> Unable to connect to cluster infrastructure after 60 seconds.
>>>> Unable to connect to cluster infrastructure after 90 seconds.
>>>> Unable to connect to cluster infrastructure after 120 seconds.
>>>> Unable to connect to cluster infrastructure after 150 seconds.
>>>> ................
>>>>
>>>> Any hints ?
>>>>
>>>> We are running kernel 2.6.10, and we succesfully compiled everything
>>>> inside.
>>>>
>>>> Regards
>>>> Bujan
>>>>
>>>> --
>>>> Linux-cluster mailing list
>>>> Linux-cluster at redhat.com
>>>> http://www.redhat.com/mailman/listinfo/linux-cluster
>>>>
>>>
>>> --
>>> Linux-cluster mailing list
>>> Linux-cluster at redhat.com
>>> http://www.redhat.com/mailman/listinfo/linux-cluster
>>
>> --
>> Linux-cluster mailing list
>> Linux-cluster at redhat.com
>> http://www.redhat.com/mailman/listinfo/linux-cluster
>>
>


--------------------------------------------------------------------------------


> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster 


From danderso at redhat.com  Mon Feb 14 20:58:29 2005
From: danderso at redhat.com (Derek Anderson)
Date: Mon, 14 Feb 2005 14:58:29 -0600
Subject: [Linux-cluster] ccsd error in new cvs
In-Reply-To: <017c01c512d6$17e593b0$5001a8c0@pcbujan>
References: <20050213223947.GB28716@wavehammer.waldi.eu.org>
	<f9b7aa17c5220ef926a7650f9e1a3e51@redhat.com>
	<017c01c512d6$17e593b0$5001a8c0@pcbujan>
Message-ID: <200502141458.29093.danderso@redhat.com>

On Monday 14 February 2005 14:45, Manuel Bujan wrote:
> Thanks for the patch,
> Now everything is working fine and we are able to start the cluster and
> mount a GFS partition of around 100 GB.
>
> I checked the logs and everything appears to be OK, except one line related
> to the LVM2 that state:
>
> Feb 14 15:31:36 atmail-1 lvm[4546]: locking_type not set correctly in
> lvm.conf, cluster operations will not work.
> Feb 14 15:34:15 atmail-1 /sbin/hotplug: no runnable
> /etc/hotplug/block.agent is installed
>
> I'm testing only with one of our nodes before upgrade to the second one
> because we one to be sure everything is working.
> What that message means ?
>
> Globally our lvm.conf has the following setup:
> > locking_type = 1
> > locking_dir = "/var/lock/lvm"
>
> Which value of locking_type has to be used with a two node cman based
> cluster ?

    locking_type = 2
    locking_library = "/usr/lib/liblvm2clusterlock.so"

The locking_dir can be left the way it is.

You will still get a warning when starting clvmd, but it can be ignored, and 
should be fixed for future builds:
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=147819


From amanthei at redhat.com  Mon Feb 14 21:09:05 2005
From: amanthei at redhat.com (Adam Manthei)
Date: Mon, 14 Feb 2005 15:09:05 -0600
Subject: [Linux-cluster] fence - fence_ack_manual does not check nodename
In-Reply-To: <20050214203506.GA7595@wavehammer.waldi.eu.org>
References: <20050214203506.GA7595@wavehammer.waldi.eu.org>
Message-ID: <20050214210905.GB29436@redhat.com>

On Mon, Feb 14, 2005 at 09:35:06PM +0100, Bastian Blank wrote:
> Hi folks
> 
> Is it expected that fence_ack_manual wants a nodename but do't checks
> them?

yes.

In fact, you can put whatever you want in "ipaddr" and I believe that it
should work provided that a unique key is used for all nodes that are using
fence_manual.  I'm not 100% sure though since it's been a while since I
looked at that code.  The key "ipaddr" is a misleading name. 

-- 
Adam Manthei  <amanthei at redhat.com>


From pbruna at linuxcenterla.com  Mon Feb 14 21:41:51 2005
From: pbruna at linuxcenterla.com (Patricio Bruna V)
Date: Mon, 14 Feb 2005 18:41:51 -0300
Subject: [Linux-cluster] where i have to look?
Message-ID: <1108417311.2733.17.camel@p.linuxcenter.cl>

im seen a lot of ha projects, im a little miss, so im asking a bit of
tips.
what are the tecnologys that rules?

openssi or ultramonkey?
gfs, dlm or lustre?

what book, site, etc, i have to study.

thx
-- 
Patricio Bruna                                  http://www.linuxcenterla.com
Ingeniero de Proyectos                  Mariano S?nchez Fontecilla 310
Red Hat Certified Engineer              Las Condes, Santiago - CHILE
Linux Center Latinoamerica              Fono: 4834041
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050214/3c834394/attachment.sig>

From teigland at redhat.com  Tue Feb 15 04:05:49 2005
From: teigland at redhat.com (David Teigland)
Date: Tue, 15 Feb 2005 12:05:49 +0800
Subject: [Linux-cluster] possible to wait on fence domain startup
In-Reply-To: <20050213195955.GE16192@wavehammer.waldi.eu.org>
References: <20050213195955.GE16192@wavehammer.waldi.eu.org>
Message-ID: <20050215040549.GA5395@redhat.com>

On Sun, Feb 13, 2005 at 08:59:55PM +0100, Bastian Blank wrote:
> Hi folks
> 
> Is it possible to wait for the startup of the fence domain?
> 
> If I call "fence_tool join" and mount of a gfs volume without a sleep
> between, I get permission denied and the kernel log reports the missing
> fence domain.
> 
> The time which is needed between this two calls seems to be related to
> the number of nodes in the fence domain.

Now fixed by doing "fence_tool join -w"

-- 
Dave Teigland  <teigland at redhat.com>


From teigland at redhat.com  Tue Feb 15 04:37:59 2005
From: teigland at redhat.com (David Teigland)
Date: Tue, 15 Feb 2005 12:37:59 +0800
Subject: [Linux-cluster] fence - convert manpages to the man macro package
In-Reply-To: <20050213215630.GA9873@wavehammer.waldi.eu.org>
References: <20050213215630.GA9873@wavehammer.waldi.eu.org>
Message-ID: <20050215043759.GB5395@redhat.com>

On Sun, Feb 13, 2005 at 10:56:30PM +0100, Bastian Blank wrote:
> Hi folks
> 
> The current manpages are written in plain nroff which is not parsable by
> many scripts.
> 
> The attached patch converts the manpages in the fence package to the man

Thanks, added.


From fajar at telkom.co.id  Tue Feb 15 06:52:21 2005
From: fajar at telkom.co.id (Fajar A. Nugraha)
Date: Tue, 15 Feb 2005 13:52:21 +0700
Subject: [Linux-cluster] cluster latest cvs does not fence dead nodes
	automatically
Message-ID: <42119C25.2000207@telkom.co.id>

Hi,

I'm building two-node cluster using today's cvs from 
sources.redhat.com:/cvs/cluster.
Shared storage is located on FC shared disk.
All work as expected up to using gfs.

When I simulated a node crash (I did ifcfg eth0 down on node 2),
node 1 simply says (on syslog):

Feb 15 13:33:35 hosting-cl02-01 CMAN: removing node hosting-cl02-02 from 
the cluster : Missed too many heartbeats

However, NO fencing occured. Not even a "fence failed" message. I use 
fence_ibmblade.
After that, access to gfs device blocked (df -k still works though), and 
/proc/cluster/nodes show

Node  Votes Exp Sts  Name
   1    1    1   M   node-01
   2    1    1   X   node-02

there's an "X" on node 2, but /proc/cluster/service shows

Service          Name                              GID LID State     Code
Fence Domain:    "default"                           1   2 run       -
[1 2]

DLM Lock Space:  "clvmd"                             2   3 run       -
[1 2]

DLM Lock Space:  "data"                              3   4 run       -
[1 2]

DLM Lock Space:  "config"                            5   6 run       -
[1 2]

GFS Mount Group: "data"                              4   5 run       -
[1 2]

GFS Mount Group: "config"                            6   7 run       -
[1 2]

which is the same content with before node 2 is dead.
AFAIK, state should be "recover" or "waiting to recover" instead of run.

If I reboot node 2 (which is the same thing if you exceute
fence_ibmblade manually), and restart cluster services on that node, all 
is back to normal,
and these messages show on syslog :

Feb 15 13:38:40 node-01 CMAN: node node-02 rejoining
Feb 15 13:38:40 node-01 fenced[25486]: node-02 not a cluster member 
after 0 sec post_fail_delay
Feb 15 13:38:42 node-01 GFS: fsid=node:config.0: jid=1: Trying to 
acquire journal lock...
Feb 15 13:38:42 node-01 GFS: fsid=node:data.0: jid=1: Trying to acquire 
journal lock...
Feb 15 13:38:43 node-01 GFS: fsid=node:config.0: jid=1: Looking at 
journal...
Feb 15 13:38:43 node-01 GFS: fsid=node:data.0: jid=1: Looking at journal...
Feb 15 13:38:43 node-01 GFS: fsid=node:config.0: jid=1: Acquiring the 
transaction lock...
Feb 15 13:38:43 node-01 GFS: fsid=node:config.0: jid=1: Replaying journal...
Feb 15 13:38:43 node-01 GFS: fsid=node:config.0: jid=1: Replayed 0 of 0 
blocks
Feb 15 13:38:43 node-01 GFS: fsid=node:config.0: jid=1: replays = 0, 
skips = 0, sames = 0
Feb 15 13:38:43 node-01 GFS: fsid=node:data.0: jid=1: Acquiring the 
transaction lock...
Feb 15 13:38:43 node-01 GFS: fsid=node:data.0: jid=1: Replaying journal...
Feb 15 13:38:43 node-01 GFS: fsid=node:data.0: jid=1: Replayed 0 of 0 blocks
Feb 15 13:38:43 node-01 GFS: fsid=node:data.0: jid=1: replays = 0, skips 
= 0, sames = 0
Feb 15 13:38:43 node-01 GFS: fsid=node:data.0: jid=1: Journal replayed in 1s
Feb 15 13:38:43 node-01 GFS: fsid=node:data.0: jid=1: Done
Feb 15 13:38:43 node-01 GFS: fsid=node:config.0: jid=1: Journal replayed 
in 1s
Feb 15 13:38:43 node-01 GFS: fsid=node:config.0: jid=1: Done

Any idea what's wrong?

Regards,

Fajar


From fajar at telkom.co.id  Tue Feb 15 06:58:12 2005
From: fajar at telkom.co.id (Fajar A. Nugraha)
Date: Tue, 15 Feb 2005 13:58:12 +0700
Subject: [Linux-cluster] cluster latest cvs does not fence dead nodes
	automatically
In-Reply-To: <42119C25.2000207@telkom.co.id>
References: <42119C25.2000207@telkom.co.id>
Message-ID: <42119D84.6030403@telkom.co.id>

Fajar A. Nugraha wrote:

> node 1 simply says (on syslog):
>
> Feb 15 13:33:35 hosting-cl02-01 CMAN: removing node hosting-cl02-02 
> from the cluster : Missed too many heartbeats
>
sorry, wrong message (this was another pair of nodes) :)
should be

Feb 15 13:36:03 node-01 CMAN: removing node node-02 from the cluster : 
Missed too many heartbeats

Other messages are correct.


From teigland at redhat.com  Tue Feb 15 07:13:59 2005
From: teigland at redhat.com (David Teigland)
Date: Tue, 15 Feb 2005 15:13:59 +0800
Subject: [Linux-cluster] cluster latest cvs does not fence dead nodes
	automatically
In-Reply-To: <42119C25.2000207@telkom.co.id>
References: <42119C25.2000207@telkom.co.id>
Message-ID: <20050215071359.GC5395@redhat.com>

On Tue, Feb 15, 2005 at 01:52:21PM +0700, Fajar A. Nugraha wrote:
> Hi,
> 
> I'm building two-node cluster using today's cvs from 
> sources.redhat.com:/cvs/cluster.
> Shared storage is located on FC shared disk.
> All work as expected up to using gfs.
> 
> When I simulated a node crash (I did ifcfg eth0 down on node 2),
> node 1 simply says (on syslog):
> 
> Feb 15 13:33:35 hosting-cl02-01 CMAN: removing node hosting-cl02-02 from 
> the cluster : Missed too many heartbeats
> 
> However, NO fencing occured. Not even a "fence failed" message. I use 
> fence_ibmblade.
> After that, access to gfs device blocked (df -k still works though), and 
> /proc/cluster/nodes show
> 
> Node  Votes Exp Sts  Name
>   1    1    1   M   node-01
>   2    1    1   X   node-02

It looks like the node names are wrong.  There were some recent changes to
how we deal with node names, but I don't see how this setup could ever
have worked, even with previous code.

The node names you put in cluster.conf must match the name on the network
interface you want cman to use.  It looks like the machine's name is
"hosting-cl02-02".  Is that the hostname on the network interface you want
cman to use?  If so, then that's the name you should enter in
cluster.conf, and that's the name that should appear when you run
"cman_tool status" and "cman_tool nodes".


> there's an "X" on node 2, but /proc/cluster/service shows
> 
> Service          Name                              GID LID State     Code
> Fence Domain:    "default"                           1   2 run       -
> [1 2]
> 
> DLM Lock Space:  "clvmd"                             2   3 run       -
> [1 2]
> 
> DLM Lock Space:  "data"                              3   4 run       -
> [1 2]
> 
> DLM Lock Space:  "config"                            5   6 run       -
> [1 2]
> 
> GFS Mount Group: "data"                              4   5 run       -
> [1 2]
> 
> GFS Mount Group: "config"                            6   7 run       -
> [1 2]
> 
> which is the same content with before node 2 is dead.
> AFAIK, state should be "recover" or "waiting to recover" instead of run.
> 
> If I reboot node 2 (which is the same thing if you exceute
> fence_ibmblade manually), and restart cluster services on that node, all 
> is back to normal,
> and these messages show on syslog :
> 
> Feb 15 13:38:40 node-01 CMAN: node node-02 rejoining
> Feb 15 13:38:40 node-01 fenced[25486]: node-02 not a cluster member 
> after 0 sec post_fail_delay

Above the names were "hosting-cl02-01" and "hosting-cl02-02".  Could you
clear that up and if there are still problems send your cluster.conf file?
Thanks

-- 
Dave Teigland  <teigland at redhat.com>


From fajar at telkom.co.id  Tue Feb 15 07:32:59 2005
From: fajar at telkom.co.id (Fajar A. Nugraha)
Date: Tue, 15 Feb 2005 14:32:59 +0700
Subject: [Linux-cluster] cluster latest cvs does not fence dead nodes
	automatically
In-Reply-To: <20050215071359.GC5395@redhat.com>
References: <42119C25.2000207@telkom.co.id> <20050215071359.GC5395@redhat.com>
Message-ID: <4211A5AB.6000701@telkom.co.id>

David Teigland wrote:

>It looks like the node names are wrong.  There were some recent changes to
>how we deal with node names, but I don't see how this setup could ever
>have worked, even with previous code.
>
>The node names you put in cluster.conf must match the name on the network
>interface you want cman to use. 
>
It is. cluster node names is identical to each node's hostname, and those
ip addresses are in /etc/hosts.

> It looks like the machine's name is
>"hosting-cl02-02". 
>
My mistake. That particular line was from the wrong server :)

> Is that the hostname on the network interface you want
>cman to use?  If so, then that's the name you should enter in
>cluster.conf, and that's the name that should appear when you run
>"cman_tool status" and "cman_tool nodes".
>
>  
>
I'll make a new setup, using new node names (in cluster.conf too, just 
in case),
try cman_tool and post the result in a few minutes.

Regards,

Fajar


From fajar at telkom.co.id  Tue Feb 15 08:34:44 2005
From: fajar at telkom.co.id (Fajar A. Nugraha)
Date: Tue, 15 Feb 2005 15:34:44 +0700
Subject: [Linux-cluster] cluster latest cvs does not fence dead nodes
	automatically
In-Reply-To: <20050215071359.GC5395@redhat.com>
References: <42119C25.2000207@telkom.co.id> <20050215071359.GC5395@redhat.com>
Message-ID: <4211B424.4090202@telkom.co.id>

David Teigland wrote:

>Above the names were "hosting-cl02-01" and "hosting-cl02-02".  Could you
>clear that up and if there are still problems send your cluster.conf file?
>Thanks
>
>  
>
Here's how it is now.
Using new hostnames and cluster.conf (blade center's IP address and 
community string removed):
==================================
<?xml version="1.0"?>
<cluster name="cluster" config_version="3">

<cman two_node="1" expected_votes="1">
</cman>

<clusternodes>
<clusternode name="cluster-node2" votes="1">
        <fence>
        <method name="single">
                <device name="ibmblade" port="7"/>
        </method>
        </fence>
</clusternode>
<clusternode name="cluster-node1" votes="1">
        <fence>
        <method name="single">
                <device name="ibmblade" port="6"/>
        </method>
        </fence>
</clusternode>


</clusternodes>

<fencedevices>
        <fencedevice name="ibmblade" agent="fence_ibmblade" 
ipaddr="IP_ADDRESS_HERE" community="COMMUNITY_HERE"/>
</fencedevices>

</cluster>
===========================================

Commands and their output (console or syslog):

# modprobe gfs
# modprobe lock_dlm

Feb 15 15:10:04 cluster-node1 Lock_Harness <CVS> (built Feb 15 2005 
12:00:38) installed
Feb 15 15:10:04 cluster-node1 GFS <CVS> (built Feb 15 2005 12:00:52) 
installed
Feb 15 15:10:08 cluster-node1 CMAN <CVS> (built Feb 15 2005 12:00:31) 
installed
Feb 15 15:10:08 cluster-node1 NET: Registered protocol family 30
Feb 15 15:10:08 cluster-node1 DLM <CVS> (built Feb 15 2005 12:00:34) 
installed
Feb 15 15:10:08 cluster-node1 Lock_DLM (built Feb 15 2005 12:00:39) 
installed

dm-mod is built-in in the kernel (not a module)

# ccsd -V
ccsd DEVEL.1108443619 (built Feb 15 2005 12:01:01)
Copyright (C) Red Hat, Inc.  2004  All rights reserved.

# ccsd -4
Feb 15 15:10:58 cluster-node1 ccsd[8556]: Starting ccsd DEVEL.1108443619:
Feb 15 15:10:58 cluster-node1 ccsd[8556]:  Built: Feb 15 2005 12:01:01
Feb 15 15:10:58 cluster-node1 ccsd[8556]:  Copyright (C) Red Hat, Inc.  
2004  All rights reserved.
Feb 15 15:10:58 cluster-node1 ccsd[8556]:   IP Protocol:: IPv4 only

# cman_tool join
Feb 15 15:12:27 cluster-node1 ccsd[8556]: cluster.conf (cluster name = 
cluster, version = 3) found.
Feb 15 15:12:28 cluster-node1 CMAN: Waiting to join or form a Linux-cluster
Feb 15 15:12:28 cluster-node1 ccsd[8558]: Connected to cluster 
infrastruture via: CMAN/SM Plugin v1.1
Feb 15 15:12:28 cluster-node1 ccsd[8558]: Initial status:: Inquorate
Feb 15 15:13:00 cluster-node1 CMAN: forming a new cluster
Feb 15 15:13:00 cluster-node1 CMAN: quorum regained, resuming activity
Feb 15 15:13:00 cluster-node1 ccsd[8558]: Cluster is quorate.  Allowing 
connections.

# cman_tool status
Protocol version: 5.0.1
Config version: 3
Cluster name: cluster
Cluster ID: 13364
Membership state: Cluster-Member
Nodes: 1
Expected_votes: 1
Total_votes: 1
Quorum: 1
Active subsystems: 0
Node name: cluster-node1
Node addresses: 192.168.192.146

# cman_tool nodes
Node  Votes Exp Sts  Name
   1    1    1   M   cluster-node1

# fence_tool join
Feb 15 15:14:26 cluster-node1 fenced[8847]: cluster-node2 not a cluster 
member after 6 sec post_join_delay
Feb 15 15:14:26 cluster-node1 fenced[8847]: fencing node "cluster-node2"
Feb 15 15:14:32 cluster-node1 fenced[8847]: fence "cluster-node2" success

at this point "cluster-node2" was fenced and automatically rebooted, 
which is good.

Now I join the cluster-node2 to the cluster :
# modprobe gfs
# modprobe lock_dlm
# cman_tool join
# fence_tool join

Feb 15 15:18:30 cluster-node2 ccsd[8376]: Starting ccsd DEVEL.1108443619:
Feb 15 15:18:30 cluster-node2 ccsd[8376]:  Built: Feb 15 2005 12:01:01
Feb 15 15:18:30 cluster-node2 ccsd[8376]:  Copyright (C) Red Hat, Inc.  
2004  All rights reserved.
Feb 15 15:18:30 cluster-node2 ccsd[8376]:   IP Protocol:: IPv4 only
Feb 15 15:18:34 cluster-node2 ccsd[8376]: cluster.conf (cluster name = 
cluster, version = 3) found.
Feb 15 15:18:34 cluster-node2 ccsd[8376]: Remote copy of cluster.conf is 
from quorate node.
Feb 15 15:18:34 cluster-node2 ccsd[8376]:  Local version # : 3
Feb 15 15:18:34 cluster-node2 ccsd[8376]:  Remote version #: 3
Feb 15 15:18:41 cluster-node2 Lock_Harness <CVS> (built Feb 15 2005 
12:00:38) installed
Feb 15 15:18:41 cluster-node2 GFS <CVS> (built Feb 15 2005 12:00:52) 
installed
Feb 15 15:18:44 cluster-node2 CMAN <CVS> (built Feb 15 2005 12:00:31) 
installed
Feb 15 15:18:44 cluster-node2 NET: Registered protocol family 30
Feb 15 15:18:44 cluster-node2 DLM <CVS> (built Feb 15 2005 12:00:34) 
installed
Feb 15 15:18:44 cluster-node2 Lock_DLM (built Feb 15 2005 12:00:39) 
installed
Feb 15 15:18:47 cluster-node2 ccsd[8376]: Remote copy of cluster.conf is 
from quorate node.
Feb 15 15:18:47 cluster-node2 ccsd[8376]:  Local version # : 3
Feb 15 15:18:47 cluster-node2 ccsd[8376]:  Remote version #: 3
Feb 15 15:18:47 cluster-node2 CMAN: Waiting to join or form a Linux-cluster
Feb 15 15:18:48 cluster-node2 ccsd[8378]: Connected to cluster 
infrastruture via: CMAN/SM Plugin v1.1
Feb 15 15:18:48 cluster-node2 ccsd[8378]: Initial status:: Inquorate
Feb 15 15:18:50 cluster-node2 CMAN: sending membership request
Feb 15 15:18:50 cluster-node2 CMAN: got node cluster-node1
Feb 15 15:18:50 cluster-node2 CMAN: quorum regained, resuming activity
Feb 15 15:18:50 cluster-node2 ccsd[8378]: Cluster is quorate.  Allowing 
connections.

on node 1 :
# clvmd
Feb 15 15:24:56 cluster-node1 CMAN: WARNING no listener for port 11 on 
node cluster-node2

on node 2 :
# clvmd
Feb 15 15:25:03 cluster-node2 clvmd: Cluster LVM daemon started - 
connected to CMAN

on node 1 :
# cman_tool nodes
Node  Votes Exp Sts  Name
   1    1    1   M   cluster-node1
   2    1    1   M   cluster-node2

# cman_tool services
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           1   2 run       -
[1 2]

DLM Lock Space:  "clvmd"                             3   3 run       -
[1 2]

# cman_tool status
Protocol version: 5.0.1
Config version: 3
Cluster name: cluster
Cluster ID: 13364
Membership state: Cluster-Member
Nodes: 2
Expected_votes: 1
Total_votes: 2
Quorum: 1
Active subsystems: 3
Node name: cluster-node1
Node addresses: 192.168.192.146

Now I shutdown node2's network interface.

On node 2 :
# ifconfig eth0 down

On node 1 :
Feb 15 15:29:50 cluster-node1 CMAN: removing node cluster-node2 from the 
cluster : Missed too many heartbeats

# cman_tool status
Protocol version: 5.0.1
Config version: 3
Cluster name: cluster
Cluster ID: 13364
Membership state: Cluster-Member
Nodes: 2
Expected_votes: 1
Total_votes: 2
Quorum: 1
Active subsystems: 3
Node name: cluster-node1
Node addresses: 192.168.192.146

# cman_tool status
Protocol version: 5.0.1
Config version: 3
Cluster name: cluster
Cluster ID: 13364
Membership state: Cluster-Member
Nodes: 1
Expected_votes: 1
Total_votes: 1
Quorum: 1
Active subsystems: 3
Node name: cluster-node1
Node addresses: 192.168.192.146

# cman_tool nodes
Node  Votes Exp Sts  Name
   1    1    1   M   cluster-node1
   2    1    1   X   cluster-node2

# cman_tool services
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           1   2 run       -
[1 2]

DLM Lock Space:  "clvmd"                             3   3 run       -
[1 2]

No note about fencing whatsoever, and node 2 is not automatically rebooted.
Shouldn't node 2 get fenced here?

Regards,

Fajar


From pcaulfie at redhat.com  Tue Feb 15 08:38:35 2005
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Tue, 15 Feb 2005 08:38:35 +0000
Subject: [Linux-cluster] dlm - update dlm32 layer
In-Reply-To: <20050214170651.GD20443@wavehammer.waldi.eu.org>
References: <20050213223947.GB28716@wavehammer.waldi.eu.org>
	<20050214084652.GC5724@tykepenguin.com>
	<20050214170651.GD20443@wavehammer.waldi.eu.org>
Message-ID: <20050215083835.GA11831@tykepenguin.com>

On Mon, Feb 14, 2005 at 06:06:51PM +0100, Bastian Blank wrote:
> On Mon, Feb 14, 2005 at 08:46:52AM +0000, Patrick Caulfield wrote:
> > I'm working on this, but the patch you sent last seems to break queries (even on
> > i386) and I haven't got the bottom of why yet.
> 
> My first patch was broken, I know. But how can I check if it works?

The second one was too - I've fixed it up and committed it now.
 
> clvmd does not report errors and seems to work correctly on i386.
> 

clvmd doesn't use the query interface so, yes, it will work fine. 
dlm/tests/usertest/dlmtest -Q exercises (in a limited way) the query facility.

patrick


From teigland at redhat.com  Tue Feb 15 09:42:29 2005
From: teigland at redhat.com (David Teigland)
Date: Tue, 15 Feb 2005 17:42:29 +0800
Subject: [Linux-cluster] cluster latest cvs does not fence dead nodes
	automatically
In-Reply-To: <4211B424.4090202@telkom.co.id>
References: <42119C25.2000207@telkom.co.id> <20050215071359.GC5395@redhat.com>
	<4211B424.4090202@telkom.co.id>
Message-ID: <20050215094229.GG5395@redhat.com>

On Tue, Feb 15, 2005 at 03:34:44PM +0700, Fajar A. Nugraha wrote:

> # cman_tool nodes
> Node  Votes Exp Sts  Name
>   1    1    1   M   cluster-node1
>   2    1    1   X   cluster-node2
> 
> # cman_tool services
> Service          Name                              GID LID State     Code
> Fence Domain:    "default"                           1   2 run       -
> [1 2]
> 
> DLM Lock Space:  "clvmd"                             3   3 run       -
> [1 2]
> 
> No note about fencing whatsoever, and node 2 is not automatically rebooted.
> Shouldn't node 2 get fenced here?

Yes, a checkin last Friday left out a line to wake up cman_serviced.
Fixed now.  Thanks

-- 
Dave Teigland  <teigland at redhat.com>


From crsurf at terra.com.br  Tue Feb 15 16:03:58 2005
From: crsurf at terra.com.br (crsurf)
Date: Tue, 15 Feb 2005 13:03:58 -0300
Subject: [Linux-cluster] kernel 2.4.21-20.EL and GNBD
Message-ID: <IBYNAM$88DB40BA29F1BB138FC1EED400E045BA@terra.com.br>

Hello

I'm trying to configure GNBD to use as shared device (raw) to simulate
a shared storage to use with RHCS, but I'm having problems because the
two nodes of the cluster are rebooting after start clumanager.

I can use the packges in sources.redhat.com/cluster with kernel
2.4.21-20 to run GNBD and other packages requisiteds?

Someone can help me to configure RHCS with GNBD as shared storage?

Grateful

Cristiano


From ggilyeat at jhsph.edu  Tue Feb 15 16:09:30 2005
From: ggilyeat at jhsph.edu (Gerald G. Gilyeat)
Date: Tue, 15 Feb 2005 11:09:30 -0500
Subject: [Linux-cluster] GFS 6.0 Questions
Message-ID: <DF33F4DAC09B3048AA095F1B5C368915A57A55@XCH-VN02.sph.ad.jhsph.edu>

Good morning, y'all:

A bit of background on our current setup before I get into the issues we've been having, and my questions for the list :)
We've been running a 32node computer cluster with two head nodes attached to a 4TB EMC CX300 for ~9 months now. Attached to the storage device are two additional four-way systems. The 2 heads (call them f0 and1) and one of the four-ways (call it e0) are currently configured as GFS servers, and the second four-way is strictly a client to the GFS system (call it c0).  The 4TB are divided into 4 RAID5 arrays: 2x 800GB, 1x730GB, 1x 1.4TB (roughly), each formatted and included in the GFS side of things. f0 and f1 are dual Opteron 248s w/4GB RAM, e0 is a quad Opty 848 w/16GB RAM and c0 is a . An additional system (t0) is planned to replace e0 as the third GFS server, relegating e0 to client-only status, as soon as I can finish building it (it's a dual Opty 248 w/6GB RAM). 

First, the GFS side of things is currently sharing the cluster's internal network for it's communications, mostly because we didn't have a second switch to dedicate to the task. While the cluster is currently lightly used, how sub-optimal is this? I'm currently searching for another switch that a partnering department has/had, but I don't know if they even know where it is at this point.

Second: GFS likes to fence "e0" off on a fairly regular/common basis (once every other week or so, if not more often). This is really rather bad for us, from an operational standpoint - e0  is vital to the operation of our Biostatistics Department (Samba/NFS, user authentication, etc...). There is also some pretty nasty latency on occasion, with logins taking upwards of 30seconds to return to a prompt, providing it doesn't time out to begin with.

In trying to figure out -why- it's constantly being fenced off, and in trying to solve the latency/performance issues, I've noticed a -very- large number of "notices" from GFS like the following:
Feb 15 10:56:10 front-1 lock_gulmd_LT000[4073]: Lock count is at 1124832 which is more than the max 1048576. Sending Drop all req to clients 

Easy enough to gather that we're blowing away the current lock highwater mark. 
Is upping the highwater point a feasable thing to do -and- would it have an affect on performance, and what would that affect be?

This weekend, we also noticed another weirdness (for us, anyways...) - e0 was fenced off on Saturday morning at 0504.09am, almost precisely 24 hours later e0 decided that the problem was the previous GFS master (f0), arbitrated itself to be Master, took over, fenced off F0 and then proceeded to hose the entire thing by the time I heard about things and was able to get on-site to bring it all back up (at 1am Monday morning). What is this apparent 24-hour timer, and is this expected behaviour?

Finally - would increasing the heartbeat timer and the number of acceptable misses an appropriate and acceptable way to help decreases the frequency of e0 being fenced off?

Sorry for the long post and the rambling. I've been fighting with this mess off and on for 6 months now. It's only now coming to a head because people are no longer able to get an "real" work done, computationally, on the system. And with researchers, not being able to get your computations done in time for your presentation next week is a really -=bad=- thing.

Thanks!

--
Jerry Gilyeat, RHCE
Systems Administrator
Molecular Microbiology and Immunology
Johns Hopkins Bloomberg School of Public Health
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050215/05ae5de8/attachment.htm>

From KaiSpeckmann at gmx.de  Tue Feb 15 10:53:22 2005
From: KaiSpeckmann at gmx.de (KaiSpeckmann at gmx.de)
Date: Tue, 15 Feb 2005 11:53:22 +0100 (MET)
Subject: [Linux-cluster] mounting gfs fs on the first node locks all journals
Message-ID: <17729.1108464802@www37.gmx.net>

Hi,

The cluster consists of two nodes:
- a partition on node1 is configured as pool device. It should be used for
cluster storage
- node1 exports its device via gnbd and serves as lock_gulm master
- node2 imports this device and is logged in as lock_gulm client on node1
- ccs config archives are stored locally on both machines.

Until i try to mount the gfs fs on the first node everythings seems to be
find to me. But when i try to mount the pool device (/dev/pool/pool1) on the
first node, it acquires journal lock for all existing journals on the gfs
fs, making it imposible to mount it from another node.
Can someone tell me why this happens ? 

Here are my ccs config files:

fence_devices {
        c1-locksrv {
                agent = "fence_gnbd"
                server = "cluster1"
        }
        manual-reset {
                agent = "fence_manual"
        }
}

nodes {
        cluster1 {
                ip_interfaces {
                        eth0 = "192.168.0.1"
                }
                fence {
                        fenceCluster1 {
                                manual-reset {
                                        ipaddr = "192.168.0.1"
                                }
                        }
                }
        }
        cluster2 {
                ip_interfaces {
                        eth0 = "192.168.0.2"
                }
                fence {
                        fenceCluster2 {
                                c1-locksrv {
                                        ipaddr = "192.168.0.2"
                                }
                        }
                }
        }
}

cluster {
        name = "rac"
        lock_gulm {
                servers = ["cluster1"]
                heartbeat_rate = 3
                allowed_misses = 10
         }
}


-- 
DSL Komplett von GMX +++ Superg?nstig und stressfrei einsteigen!
AKTION "Kein Einrichtungspreis" nutzen: http://www.gmx.net/de/go/dsl


From bastian at waldi.eu.org  Tue Feb 15 16:47:14 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Tue, 15 Feb 2005 17:47:14 +0100
Subject: [Linux-cluster] Re: cluster/fence/fence_tool fence_tool.c
In-Reply-To: <20050215035441.19043.qmail@sourceware.org>
References: <20050215035441.19043.qmail@sourceware.org>
Message-ID: <20050215164656.GB5824@wavehammer.waldi.eu.org>

On Tue, Feb 15, 2005 at 03:54:41AM -0000, teigland at sourceware.org wrote:
> Log message:
> 	Add option to fence_tool to wait for the node to complete its join and
> 	be a member of the fence domain.  Two options:
> 	fence_tool join -w
> 	fence_tool join; fence_tool wait

I think it may be better to implement the -w argument in fenced. It just
need to fork after the kernel reports the successfull join not before.

This will also remove one dependency to the not machine readable output
of procfs.

Bastian

-- 
Intuition, however illogical, is recognized as a command prerogative.
		-- Kirk, "Obsession", stardate 3620.7
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050215/6b02c966/attachment.sig>

From teigland at redhat.com  Tue Feb 15 17:01:46 2005
From: teigland at redhat.com (David Teigland)
Date: Wed, 16 Feb 2005 01:01:46 +0800
Subject: [Linux-cluster] Re: cluster/fence/fence_tool fence_tool.c
In-Reply-To: <20050215164656.GB5824@wavehammer.waldi.eu.org>
References: <20050215035441.19043.qmail@sourceware.org>
	<20050215164656.GB5824@wavehammer.waldi.eu.org>
Message-ID: <20050215170146.GB17487@redhat.com>

On Tue, Feb 15, 2005 at 05:47:14PM +0100, Bastian Blank wrote:
> On Tue, Feb 15, 2005 at 03:54:41AM -0000, teigland at sourceware.org wrote:
> > Log message:
> > 	Add option to fence_tool to wait for the node to complete its join and
> > 	be a member of the fence domain.  Two options:
> > 	fence_tool join -w
> > 	fence_tool join; fence_tool wait
> 
> I think it may be better to implement the -w argument in fenced. It just
> need to fork after the kernel reports the successfull join not before.
> 
> This will also remove one dependency to the not machine readable output
> of procfs.

No, the join (the ioctl performed by fenced) is asynchronous.  The only
way to really tell that it's complete is to monitor the proc file.

(I know this may not be very nice, but it's not going to change in this
generation of the code.  If you don't want to use proc, don't use -w.)

-- 
Dave Teigland  <teigland at redhat.com>


From bastian at waldi.eu.org  Tue Feb 15 16:59:32 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Tue, 15 Feb 2005 17:59:32 +0100
Subject: [Linux-cluster] commit mails
Message-ID: <20050215165932.GC5824@wavehammer.waldi.eu.org>

Hi folks

Is it possible to get commit mails inclusive diffs? As I often want to
know what really changed, I have to check each file by hand.

Bastian

-- 
I'm a soldier, not a diplomat.  I can only tell the truth.
		-- Kirk, "Errand of Mercy", stardate 3198.9
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050215/fe0bd331/attachment.sig>

From mtilstra at redhat.com  Tue Feb 15 17:43:26 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Tue, 15 Feb 2005 11:43:26 -0600
Subject: [Linux-cluster] GFS 6.0 Questions
In-Reply-To: <DF33F4DAC09B3048AA095F1B5C368915A57A55@XCH-VN02.sph.ad.jhsph.edu>
References: <DF33F4DAC09B3048AA095F1B5C368915A57A55@XCH-VN02.sph.ad.jhsph.edu>
Message-ID: <421234BE.90207@redhat.com>

Gerald G. Gilyeat wrote:

[snip]
> First, the GFS side of things is currently sharing the cluster's 
> internal network for it's communications, mostly because we didn't have 
> a second switch to dedicate to the task. While the cluster is currently 
> lightly used, how sub-optimal is this? I'm currently searching for 
> another switch that a partnering department has/had, but I don't know if 
> they even know where it is at this point.

It really depends on how much the actual link is used.  The more data 
that the other apps are pushing over the ethernet, the less of it gulm 
can use.  It is also rather (unfortunately) difficult to tell gulm to 
use a different network device in the current releases.  There is a fix 
pending for this, but its not out yet.

> Second: GFS likes to fence "e0" off on a fairly regular/common basis 
> (once every other week or so, if not more often). This is really rather 
> bad for us, from an operational standpoint - e0  is vital to the 
> operation of our Biostatistics Department (Samba/NFS, user 
> authentication, etc...). There is also some pretty nasty latency on 
> occasion, with logins taking upwards of 30seconds to return to a prompt, 
> providing it doesn't time out to begin with.

If the machine is getting this kind of delay, it is completely possible 
that the delay is also causing heartbeats to be missed.

> In trying to figure out -why- it's constantly being fenced off, and in 
> trying to solve the latency/performance issues, I've noticed a -very- 
> large number of "notices" from GFS like the following:
> Feb 15 10:56:10 front-1 lock_gulmd_LT000[4073]: Lock count is at 1124832 
> which is more than the max 1048576. Sending Drop all req to clients
> 
> Easy enough to gather that we're blowing away the current lock highwater 
> mark.
> Is upping the highwater point a feasable thing to do -and- would it have 
> an affect on performance, and what would that affect be?

cluster.ccs:
cluster {
  lock_gulm {
    ....
    lt_high_locks = <int>
  }
}

The highwater mark is an attempt to keep the amount of memory lock_gulmd 
uses down.  When the highwater is hit, the lock server tells all gfs 
mounts to try and release locks.  It does this every 10 seconds until 
the lock count falls below the highwater mark.  This requires cycles, 
and so not doing it means less cycles used.  The higher the highwater 
mark is, the more memory the gulm lock servers and gfs will use to store 
locks.  The number is just the count of locks (in <=6.0) and not an 
actual representation of ram used.

In short summery, in your case, a higher highwater mark may give some 
performance gained, at the loss of some memory available to other programs.


> This weekend, we also noticed another weirdness (for us, anyways...) - 
> e0 was fenced off on Saturday morning at 0504.09am, almost precisely 24 
> hours later e0 decided that the problem was the previous GFS master 
> (f0), arbitrated itself to be Master, took over, fenced off F0 and then 
> proceeded to hose the entire thing by the time I heard about things and 
> was able to get on-site to bring it all back up (at 1am Monday morning). 
> What is this apparent 24-hour timer, and is this expected behaviour?

No, it sounds like some kind of freak chance.  A very icky thing indeed. 
  Very much sounds like a higher heartbeat_rate is needed.

> Finally - would increasing the heartbeat timer and the number of 
> acceptable misses an appropriate and acceptable way to help decreases 
> the frequency of e0 being fenced off?

Certainly.  The default values for the heartbeat_rate and allowed_misses 
are just suggestions.  Certain setups may require different values, and 
as far as I know the only way to figure this out is to try it.  Sounds 
very much like you could use larger values.

-- 
michael conrad tadpol tilstra
<my wit is my doom>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 256 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050215/2a2cd526/attachment.sig>

From ggilyeat at jhsph.edu  Tue Feb 15 17:59:46 2005
From: ggilyeat at jhsph.edu (Gerald G. Gilyeat)
Date: Tue, 15 Feb 2005 12:59:46 -0500
Subject: [Linux-cluster] GFS 6.0 Questions
Message-ID: <DF33F4DAC09B3048AA095F1B5C368915A57A57@XCH-VN02.sph.ad.jhsph.edu>

Thanks a bunch. 
The direction I was leaning on going, then, seems appropriate. I love it when things start coming together.

Is there anyway to get some of these undocumented tunable features, well, documented? I couldn't for the life of me find anything indicating if the lock highwater mark was runtime tunable, for example. 
There is -some- concern about memory usage tanking things, but that will probably end up leading us to simply moving to dedicated locking servers instead of having them on the actual shared production machines (and really, we'd only need two...'f1' is strictly for management type work and backups...)

Finally - so while it's -possible- to have the GFS "stuff" on a separate interface (and yes, it was a royal PITA getting it to work in the first place what with multiple NICs already...), it's not somthing that's at all easy to do, at least until the mentioned fix drops?  bleh.

Thanks!

--
Jerry Gilyeat, RHCE
Systems Administrator
Molecular Microbiology and Immunology
Johns Hopkins Bloomberg School of Public Health


-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of Michael Conrad Tadpol Tilstra
Sent: Tue 2/15/2005 12:43 PM
To: linux clistering
Subject: Re: [Linux-cluster] GFS 6.0 Questions
 
Gerald G. Gilyeat wrote:

[snip]
> First, the GFS side of things is currently sharing the cluster's 
> internal network for it's communications, mostly because we didn't have 
> a second switch to dedicate to the task. While the cluster is currently 
> lightly used, how sub-optimal is this? I'm currently searching for 
> another switch that a partnering department has/had, but I don't know if 
> they even know where it is at this point.

It really depends on how much the actual link is used.  The more data 
that the other apps are pushing over the ethernet, the less of it gulm 
can use.  It is also rather (unfortunately) difficult to tell gulm to 
use a different network device in the current releases.  There is a fix 
pending for this, but its not out yet.

> Second: GFS likes to fence "e0" off on a fairly regular/common basis 
> (once every other week or so, if not more often). This is really rather 
> bad for us, from an operational standpoint - e0  is vital to the 
> operation of our Biostatistics Department (Samba/NFS, user 
> authentication, etc...). There is also some pretty nasty latency on 
> occasion, with logins taking upwards of 30seconds to return to a prompt, 
> providing it doesn't time out to begin with.

If the machine is getting this kind of delay, it is completely possible 
that the delay is also causing heartbeats to be missed.

> In trying to figure out -why- it's constantly being fenced off, and in 
> trying to solve the latency/performance issues, I've noticed a -very- 
> large number of "notices" from GFS like the following:
> Feb 15 10:56:10 front-1 lock_gulmd_LT000[4073]: Lock count is at 1124832 
> which is more than the max 1048576. Sending Drop all req to clients
> 
> Easy enough to gather that we're blowing away the current lock highwater 
> mark.
> Is upping the highwater point a feasable thing to do -and- would it have 
> an affect on performance, and what would that affect be?

cluster.ccs:
cluster {
  lock_gulm {
    ....
    lt_high_locks = <int>
  }
}

The highwater mark is an attempt to keep the amount of memory lock_gulmd 
uses down.  When the highwater is hit, the lock server tells all gfs 
mounts to try and release locks.  It does this every 10 seconds until 
the lock count falls below the highwater mark.  This requires cycles, 
and so not doing it means less cycles used.  The higher the highwater 
mark is, the more memory the gulm lock servers and gfs will use to store 
locks.  The number is just the count of locks (in <=6.0) and not an 
actual representation of ram used.

In short summery, in your case, a higher highwater mark may give some 
performance gained, at the loss of some memory available to other programs.


> This weekend, we also noticed another weirdness (for us, anyways...) - 
> e0 was fenced off on Saturday morning at 0504.09am, almost precisely 24 
> hours later e0 decided that the problem was the previous GFS master 
> (f0), arbitrated itself to be Master, took over, fenced off F0 and then 
> proceeded to hose the entire thing by the time I heard about things and 
> was able to get on-site to bring it all back up (at 1am Monday morning). 
> What is this apparent 24-hour timer, and is this expected behaviour?

No, it sounds like some kind of freak chance.  A very icky thing indeed. 
  Very much sounds like a higher heartbeat_rate is needed.

> Finally - would increasing the heartbeat timer and the number of 
> acceptable misses an appropriate and acceptable way to help decreases 
> the frequency of e0 being fenced off?

Certainly.  The default values for the heartbeat_rate and allowed_misses 
are just suggestions.  Certain setups may require different values, and 
as far as I know the only way to figure this out is to try it.  Sounds 
very much like you could use larger values.

-- 
michael conrad tadpol tilstra
<my wit is my doom>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 5574 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050215/044d1fd5/attachment.bin>

From mtilstra at redhat.com  Tue Feb 15 18:24:26 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Tue, 15 Feb 2005 12:24:26 -0600
Subject: [Linux-cluster] GFS 6.0 Questions
In-Reply-To: <DF33F4DAC09B3048AA095F1B5C368915A57A57@XCH-VN02.sph.ad.jhsph.edu>
References: <DF33F4DAC09B3048AA095F1B5C368915A57A57@XCH-VN02.sph.ad.jhsph.edu>
Message-ID: <42123E5A.9070801@redhat.com>

Gerald G. Gilyeat wrote:
> Thanks a bunch. 
> The direction I was leaning on going, then, seems appropriate. I love
> it when things start coming together.
> 
> Is there anyway to get some of these undocumented tunable features,
> well, documented? I couldn't for the life of me find anything indicating
> if the lock highwater mark was runtime tunable, for example.

erm, get me not busy enough that I have time to document stuff?
its on my list of things todo, really.  its just at the bottom somewhere.

> There is -some- concern about memory usage tanking things, but that
> will probably end up leading us to simply moving to dedicated locking
> servers instead of having them on the actual shared production machines
> (and really, we'd only need two...'f1' is strictly for management type
> work and backups...)

> Finally - so while it's -possible- to have the GFS "stuff" on a
> separate interface (and yes, it was a royal PITA getting it to work in
> the first place what with multiple NICs already...), it's not somthing
> that's at all easy to do, at least until the mentioned fix drops? bleh.

Work around is described here:
https://bugzilla.redhat.com/beta/show_bug.cgi?id=131142
Not that difficult, just ugly.  The problem is that gulm wants 
hostname==ip==node, and with multiple NICs, that's not the case any more.

-- 
michael conrad tadpol tilstra
<my wit is my doom>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 256 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050215/fdb2587f/attachment.sig>

From ggilyeat at jhsph.edu  Tue Feb 15 19:48:49 2005
From: ggilyeat at jhsph.edu (Gerald G. Gilyeat)
Date: Tue, 15 Feb 2005 14:48:49 -0500
Subject: [Linux-cluster] GFS 6.0 Questions
Message-ID: <DF33F4DAC09B3048AA095F1B5C368915A57A5D@XCH-VN02.sph.ad.jhsph.edu>


<snip>

cluster.ccs:
cluster {
  lock_gulm {
    ....
    lt_high_locks = <int>
  }
}

The highwater mark is an attempt to keep the amount of memory lock_gulmd 
uses down.  When the highwater is hit, the lock server tells all gfs 
mounts to try and release locks.  It does this every 10 seconds until 
the lock count falls below the highwater mark.  This requires cycles, 
and so not doing it means less cycles used.  The higher the highwater 
mark is, the more memory the gulm lock servers and gfs will use to store 
locks.  The number is just the count of locks (in <=6.0) and not an 
actual representation of ram used.

In short summery, in your case, a higher highwater mark may give some 
performance gained, at the loss of some memory available to other programs.

</snip>

I just bounced the storage servers using the lt_high_locks directive as above. The cluster.ccs looks like the following:
cluster {
        name = "hopkins"
        lock_gulm {
                servers = ["front-0", "front-1", "enigma"]
        }
        lt_high_locks = 2097152
        heartbeat_rate = 30
        allowed_misses = 4
}

gulm_tool getstats front-1:lt000 returns the following:

[root at front-0 root]# gulm_tool getstats front-1:lt000
I_am = Master
run time = 831
pid = 4073
verbosity = Default
id = 0
partitions = 1
out_queue = 0
drpb_queue = 0
locks = 80640
unlocked = 9267
exclusive = 19
shared = 71354
deferred = 0
lvbs = 9274
expired = 0
lock ops = 1805398
conflicts = 3
incomming_queue = 0
conflict_queue = 0
reply_queue = 0
free_locks = 87162
free_lkrqs = 60
used_lkrqs = 0
free_holders = 125909
used_holders = 81895
highwater = 1048576


Unless I'm mis-reading this, the lt_high_locks directive didn't do anything, unless the bottom number will change once it's breached?

My apologies to the list for my verbosity, btw - I'm just under the gun trying to get this stable and working.

--
Jerry Gilyeat, RHCE
Systems Administrator
Molecular Microbiology and Immunology
Johns Hopkins Bloomberg School of Public Health
-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 3618 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050215/fbd93225/attachment.bin>

From mtilstra at redhat.com  Tue Feb 15 20:00:42 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Tue, 15 Feb 2005 14:00:42 -0600
Subject: [Linux-cluster] GFS 6.0 Questions
In-Reply-To: <DF33F4DAC09B3048AA095F1B5C368915A57A5D@XCH-VN02.sph.ad.jhsph.edu>
References: <DF33F4DAC09B3048AA095F1B5C368915A57A5D@XCH-VN02.sph.ad.jhsph.edu>
Message-ID: <421254EA.6010103@redhat.com>

Gerald G. Gilyeat wrote:

> Unless I'm mis-reading this, the lt_high_locks directive didn't do
> anything,

you read that right.  damn.  a bug.  I'll look into it.

Just to make sure I'm on the right page, which version are you running?


-- 
michael conrad tadpol tilstra
<WIT, DOOM!>

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 256 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050215/38e413c4/attachment.sig>

From ggilyeat at jhsph.edu  Tue Feb 15 20:07:28 2005
From: ggilyeat at jhsph.edu (Gerald G. Gilyeat)
Date: Tue, 15 Feb 2005 15:07:28 -0500
Subject: [Linux-cluster] GFS 6.0 Questions
Message-ID: <DF33F4DAC09B3048AA095F1B5C368915A57A61@XCH-VN02.sph.ad.jhsph.edu>

GFS-6.0.0-7.1
At least, those are the RPMs currently on the box. We -are- planning an upgrade, but can move that forward if this bug is known to have been fixed in a newer version of 6.0.

yay. I found a bug.

--
Jerry Gilyeat, RHCE
Systems Administrator
Molecular Microbiology and Immunology
Johns Hopkins Bloomberg School of Public Health


-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of Michael Conrad Tadpol Tilstra
Sent: Tue 2/15/2005 3:00 PM
To: linux clistering
Subject: Re: [Linux-cluster] GFS 6.0 Questions
 
Gerald G. Gilyeat wrote:

> Unless I'm mis-reading this, the lt_high_locks directive didn't do
> anything,

you read that right.  damn.  a bug.  I'll look into it.

Just to make sure I'm on the right page, which version are you running?


-- 
michael conrad tadpol tilstra
<WIT, DOOM!>

-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 2970 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050215/bd2e841b/attachment.bin>

From mtilstra at redhat.com  Tue Feb 15 20:14:52 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Tue, 15 Feb 2005 14:14:52 -0600
Subject: [Linux-cluster] GFS 6.0 Questions
In-Reply-To: <421254EA.6010103@redhat.com>
References: <DF33F4DAC09B3048AA095F1B5C368915A57A5D@XCH-VN02.sph.ad.jhsph.edu>
	<421254EA.6010103@redhat.com>
Message-ID: <4212583C.4020808@redhat.com>

Michael Conrad Tadpol Tilstra wrote:
> Gerald G. Gilyeat wrote:
> 
>> Unless I'm mis-reading this, the lt_high_locks directive didn't do
>> anything,
> 
> 
> you read that right.  damn.  a bug.  I'll look into it.
> 
> Just to make sure I'm on the right page, which version are you running?

oh, and adding the lt_high_locks value will require gulmd to be 
restarted before it notices.  just so you know.


-- 
michael conrad tadpol tilstra
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 256 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050215/03fbcff6/attachment.sig>

From ggilyeat at jhsph.edu  Tue Feb 15 20:15:44 2005
From: ggilyeat at jhsph.edu (Gerald G. Gilyeat)
Date: Tue, 15 Feb 2005 15:15:44 -0500
Subject: [Linux-cluster] GFS 6.0 Questions
Message-ID: <DF33F4DAC09B3048AA095F1B5C368915A57A64@XCH-VN02.sph.ad.jhsph.edu>


One would think that rebooting the machines would do that...
:)

--
Jerry Gilyeat, RHCE
Systems Administrator
Molecular Microbiology and Immunology
Johns Hopkins Bloomberg School of Public Health


-----Original Message-----
From: linux-cluster-bounces at redhat.com on behalf of Michael Conrad Tadpol Tilstra
Sent: Tue 2/15/2005 3:14 PM
To: linux clistering
Subject: Re: [Linux-cluster] GFS 6.0 Questions
 
Michael Conrad Tadpol Tilstra wrote:
> Gerald G. Gilyeat wrote:
> 
>> Unless I'm mis-reading this, the lt_high_locks directive didn't do
>> anything,
> 
> 
> you read that right.  damn.  a bug.  I'll look into it.
> 
> Just to make sure I'm on the right page, which version are you running?

oh, and adding the lt_high_locks value will require gulmd to be 
restarted before it notices.  just so you know.


-- 
michael conrad tadpol tilstra
-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 2938 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050215/e19858bf/attachment.bin>

From mtilstra at redhat.com  Tue Feb 15 20:25:16 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Tue, 15 Feb 2005 14:25:16 -0600
Subject: [Linux-cluster] GFS 6.0 Questions
In-Reply-To: <DF33F4DAC09B3048AA095F1B5C368915A57A64@XCH-VN02.sph.ad.jhsph.edu>
References: <DF33F4DAC09B3048AA095F1B5C368915A57A64@XCH-VN02.sph.ad.jhsph.edu>
Message-ID: <42125AAC.4020008@redhat.com>

Gerald G. Gilyeat wrote:
> One would think that rebooting the machines would do that...
> :)
> 

right, sent that before I saw your next msg.  k.
i know this is a rather dumb question, but I'm gonna ask any ways.
you rebuilt the cca device after changin cluster.ccs, right?

-- 
michael conrad tadpol tilstra
only dumber if not asked.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 256 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050215/5c7124cf/attachment.sig>

From jhahm at yahoo.com  Wed Feb 16 00:22:44 2005
From: jhahm at yahoo.com (Jiho Hahm)
Date: Tue, 15 Feb 2005 16:22:44 -0800 (PST)
Subject: [Linux-cluster] Specifying start/stop order of resources in a
	<resourcegroup>
Message-ID: <20050216002244.22618.qmail@web50903.mail.yahoo.com>

Hi,

I'm having some trouble with configuring start/stop order
of resources in a resource group.  When I specify start and
stop level values in resource elements, they are ignored. 
Resources are always started and stopped according to the
type-specific level specified in
cluster/rgmanager/src/resources/resourcegroup.sh (or
/usr/share/cluster/resourcegroup.sh).

What I basically want to do during startup of an RG is
mount a couple of ext3 filesystems in a certain order, run
a custom application, and finally bring up an IP address. 
During shutdown I want to do exactly the opposite: bring
down IP address, stop application, and unmount volumes in
reverse order.

Here's what I have in cluster.conf:

<cluster ...>
  <...>
  <rm>
    <failoverdomains>...</failoverdomains>
    <resources/>
    <resourcegroup name="rg1" domain="fd1">
      <fs name="foo" fstype="ext3"
          device="/dev/sdb1" mountpoint="/foo"
          start="1" stop="4"/>
      <fs name="foobar" fstype="ext3"
          device="/dev/sdb2" mountpoint="/foo/bar"
          start="2" stop="3"/>
      <script name="myapp" file="..."
          start="3" stop="2"/>
      <ip address="..." monitor_link="yes"
          start="4" stop="1"/>
    </resourcegroup>
  </rm>
</cluster>

The intention is to start the resource top-down, and stop
them bottom-up.  Notice the foobar volume mounts as
subdirectory of foo volume.  foo must be mounted first, and
unmounted last.

But the actual start order with the above configuration
turns out to be fs-foo, fs-foobar, ip and script.  The
order was determined by type-specific default start level
in resourcegroups.sh (fs=2, ip=3, script=5), and then top
to bottom.

The stop sequence was apparently the same as start
sequence.  When I ran "clusvcadm -s rg1" to stop the
resource group, the first thing tried was unmounting foo,
which failed because foobar wasn't unmounted first.  ip or
script was not tried before fs, which leads me to guess
stop sequence is determined by default _start_ level rather
than stop level.

Judging by the stop behavior I think there is a bug
somewhere.  But did I specify start/stop levels in my
cluster.conf incorrectly?

Regards,

-Jiho


__________________________________ 
Do you Yahoo!? 
Yahoo! Mail - 250MB free storage. Do more. Manage less. 
http://info.mail.yahoo.com/mail_250


From bastian at waldi.eu.org  Wed Feb 16 14:02:18 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Wed, 16 Feb 2005 15:02:18 +0100
Subject: [Linux-cluster] Re: cluster/fence/fence_tool fence_tool.c
In-Reply-To: <20050215170146.GB17487@redhat.com>
References: <20050215035441.19043.qmail@sourceware.org>
	<20050215164656.GB5824@wavehammer.waldi.eu.org>
	<20050215170146.GB17487@redhat.com>
Message-ID: <20050216140218.GA30095@wavehammer.waldi.eu.org>

On Wed, Feb 16, 2005 at 01:01:46AM +0800, David Teigland wrote:
> No, the join (the ioctl performed by fenced) is asynchronous.  The only
> way to really tell that it's complete is to monitor the proc file.

And what are the events for? fenced gets the join start and finish
event.

Bastian

-- 
Every living thing wants to survive.
		-- Spock, "The Ultimate Computer", stardate 4731.3
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050216/e6fb008a/attachment.sig>

From bastian at waldi.eu.org  Wed Feb 16 14:06:22 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Wed, 16 Feb 2005 15:06:22 +0100
Subject: [Linux-cluster] fence - convert manpages to the man macro package
In-Reply-To: <20050213215630.GA9873@wavehammer.waldi.eu.org>
References: <20050213215630.GA9873@wavehammer.waldi.eu.org>
Message-ID: <20050216140622.GB30095@wavehammer.waldi.eu.org>

I forgot one man page.

The attached patch converts them.

Bastian

-- 
	"What terrible way to die."
	"There are no good ways."
		-- Sulu and Kirk, "That Which Survives", stardate unknown
-------------- next part --------------
=== fenced.8
==================================================================
--- fenced.8  (revision 317)
+++ fenced.8  (revision 318)
@@ -5,20 +5,17 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'fenced(8)''fenced(8)'
+.TH fenced 8
 
-\fBNAME\fP
-.in +7
-fenced- the I/O Fencing daemon
+.SH NAME
+fenced - the I/O Fencing daemon
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfenced [options]\fP
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH SYNOPSIS
+.B
+fenced
+[\fIOPTION\fR]...
+
+.SH DESCRIPTION
 The fencing daemon, \fBfenced\fP, should be run on every node that will
 use CLVM or GFS.  It should be started after the node has joined the CMAN
 cluster (fenced is only used with CMAN; it is not used with GULM/SLM/RLM.)
@@ -38,7 +35,7 @@
 it depends on CCS to provide cluster.conf information.  The fencing daemon
 calls fencing agents according to cluster.conf information.
 
-Node failure
+.SS Node failure
 
 When a domain member fails, the actual fencing must be completed before
 GFS recovery can begin.  This means any delay in carrying out the fencing
@@ -54,7 +51,7 @@
 node is not fenced and all recovery must wait until the failed node
 rejoins the cluster.
 
-Domain startup
+.SS Domain startup
 
 When the domain is first created in the cluster (by the first node to join
 it) and subsequently enabled (by the cluster gaining quorum) any nodes
@@ -89,13 +86,7 @@
 are fenced by power cycling.  If nodes are fenced by disabling their SAN
 access, then unnecessarily fencing a node is usually less disruptive.
 
-
-.sp
-.in
-\fBCONFIGURATION FILE\fP
-.in +7
-.sp
-
+.SH CONFIGURATION FILE
 Fencing daemon behavior can be controlled by setting options in the
 cluster.conf file under the section <fence_daemon> </fence_daemon>.  See
 above for complete descriptions of these values.  The delay values are in
@@ -121,59 +112,30 @@
   <fence_daemon clean_start="0">
   </fence_daemon>
 
-
-.sp
-.in
-\fBOPTIONS\fP
-.in +7
-.sp
-
+.SH OPTIONS
 Command line options override corresonding values in cluster.conf.
-
+.TP
 \fB-j\fP \fIsecs\fP
-.in +7
 Post-join fencing delay
-.in
-.sp
-
+.TP
 \fB-f\fP \fIsecs\fP
-.in +7
 Post-fail fencing delay
-.in
-.sp
-
+.TP
 \fB-c\fP 
-.in +7
 All nodes are in a clean state to start.
-.in
-.sp
-
+.TP
 \fB-D\fP
-.in +7
 Enable debugging code and don't fork into the background.
-.in
-.sp
-
+.TP
 \fB-n\fP \fIname\fP
-.in +7
 Name of the fence domain, "default" if none.
-.in
-.sp
-
+.TP
 \fB-V\fP
-.in +7
 Print the version information and exit.
-.in
-.sp
-
+.TP
 \fB-h\fP 
-.in +7
 Print out a help message describing available options, then exit.
-.in
-.sp
 
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 gfs(8), fence(8)
 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050216/3588d9b6/attachment.sig>

From bastian at waldi.eu.org  Wed Feb 16 14:17:37 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Wed, 16 Feb 2005 15:17:37 +0100
Subject: [Linux-cluster] ccs - convert manpages to the man package
Message-ID: <20050216141737.GC30095@wavehammer.waldi.eu.org>

Hi folks

I've converted the ccs manpages to the man package.

Bastian

-- 
Murder is contrary to the laws of man and God.
		-- M-5 Computer, "The Ultimate Computer", stardate 4731.3
-------------- next part --------------
=== man/ccs.7
==================================================================
--- man/ccs.7  (revision 321)
+++ man/ccs.7  (local)
@@ -4,16 +4,12 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'ccs(7)''ccs(7)'
-'\"     View with 'groff -t -e -mandoc -Tlatin1 ccs.7 | less'
+.TH ccs 7
 
-\fBNAME\fP
-.in +7
+.SH NAME
 ccs - Cluster Configuration System
 
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 A cluster environment that shares resources has information that is
 essential to correct operation which must be available to
 every node in the cluster.  This information may include:
@@ -21,31 +17,24 @@
 more.  \fICCS\fP is the system that makes it possible for the nodes in a
 cluster to retrieve the information they need.
 
-.sp
-.in
-\fBOVERVIEW\fP
-.in +7
+.SH OVERVIEW
 The following is a generic description of the steps one should take to produce
 a working CCS environment.
 
-\fBStep 1)\fP
-.sp
+.SS Step 1
 Choose a cluster name.  It is important to determine
 a name for the cluster before starting.  The cluster name is what
 binds a machine to specific resources that can only be shared by
 machines that are members of the same cluster name.
 
-\fBStep 2)\fP
-.sp
+.SS Step 2
 Create the directory \fI/etc/cluster\fP.
 
-\fBStep 3)\fP
-.sp
+.SS Step 3
 Create the \fI/etc/cluster/cluster.conf\fP file, according to the
 \fBcluster.conf(5)\fP man page, on one node in your cluster.
 
-\fBStep 4)\fP
-.sp
+.SS Step 4
 Start \fBccsd\fP and test the cluster.conf file by using \fBccs_test\fP.
 If you haven't started a cluster manager yet, you should use the 'force'
 option to \fBccs_test\fP - see the \fBccs_test(8)\fP man page for more info.
@@ -53,16 +42,9 @@
 If a failure occurs while parsing the config file, \fBccs_test\fP should
 report "ccs_connect failed: No data available" and /var/log/messages
 should report "Unable to parse /etc/cluster/cluster.conf".
-.sp
-.in
 
-\fBFORMAT OF THE CCS FILE\fP
-.in +7
-.sp
+.SH FORMAT OF THE CCS FILE
 See \fBcluster.conf(5)\fP
-.sp
-.in
 
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 ccsd(8), ccs_tool(8), ccs_test(8), cluster.conf(5)
=== man/ccs_test.8
==================================================================
--- man/ccs_test.8  (revision 321)
+++ man/ccs_test.8  (local)
@@ -4,33 +4,24 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'ccs_test(8)''ccs_test(8)'
-'\"     View with 'groff -t -e -mandoc -Tlatin1 ccs_test.8 | less'
+.TH ccs_test 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 ccs_test - The diagnostic tool for a running Cluster Configuration System.
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBccs_test\fP <\fBcommand\fP>
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH SYNOPSIS
+.B ccs_test
+<\fBcommand\fP>
+
+.SH DESCRIPTION
 \fBccs_test\fP is part of the Cluster Configuration System (CCS).  It
 is a diagnostic tool designed to validate the correct operation of a
 running CCS system.  It will communicate with the CCS daemon - \fBccsd\fP -
 to obtain any information stored in the system.
 
-.sp
-.in
-
-\fBCOMMANDS\fP
-.in +7
+.SH COMMANDS
+.TP
 \fBconnect\fP \fI[force]\fP \fI[block]\fP \fI[cluster name]\fP
-.in +7
 This command creates a connection to ccsd.  It returns a descriptor, which
 is used as an parameter to other commands.
 
@@ -44,10 +35,8 @@
 only configuration files containing the given cluster name are valid
 possibilities.
 
-.sp
-.in
+.TP
 \fBget\fP \fI<desc>\fP \fI<request>\fP
-.in +7
 Get the results of a given request.  The 'desc' is the number returned
 from the \fBconnect\fP command.  The 'request' is a valid Xpath request.
 
@@ -55,89 +44,73 @@
 Subsequent calls with the same 'request' will result in the subsequent
 matches.  Once all the matches have been returned, a subsequent call
 will begin again with the first result.
-.sp
-.in
 
+.TP
 \fBget_list\fP \fI<desc>\fP \fI<request>\fP
-.in +7
 Similar to the \fBget\fP command.  However, issuing subsequent calls
 with the same 'request' will result in all matches being returned (one
 at a time), then null, then starting over with the first result.
-.sp
-.in
 
+.TP
 \fBset\fP \fI<desc>\fP \fI<path>\fP \fI<value>\fP
-.in +7
 Sets a particular 'path' to the given 'value'.  Not yet implemented.
-.sp
-.in
 
+.TP
 \fBget_state\fP \fI<desc>\fP
-.in +7
 Get the state associated with a given connection.
-.sp
-.in
 
+.TP
 \fBset_state\fP \fI<desc>\fP \fI<ncwp>\fP
-.in +7
 Set the current working path (cwp) to 'ncwp' for a given connection.
-.sp
-.in
 
-.in -7
-\fBEXAMPLES\fP
-.in +7
-\fBTo connect to ccsd:\fP
-.sp
+.SH EXAMPLES
+.SS To connect to ccsd:
+
 > ccs_test connect
 
 Connect successful.
  Connection descriptor = 0
-.sp
+
 Or, if the cluster is not yet quorate and the name of the cluster is 'mycluster':
-.sp
+
 > ccs_test connect force block mycluster
 
 Connect successful.
  Connection descriptor = 0
-.sp
 
-\fBTo get the cluster name from ccsd:\fP
-.sp
+.SS To get the cluster name from ccsd:
+
 > ccs_test get 0 //cluster/@name
 
 Get successful.
  Value = <mycluster>
-.sp
 
-\fBTo get the connection state:\fP
-.sp
+.SS To get the connection state:
+
 > ccs_test get_state 0
 
 Get state successful.
  Current working path:
  Previous query      : //cluster/@name
-.sp
 
-\fBTo set the connection state:\fP
-.sp
+
+.SS To set the connection state:
+
 > ccs_test set_state 0 //cluster
 
 Set state successful.
-.sp
 
-\fBAfter setting the connection state, note the change:\fP
-.sp
+
+.SS After setting the connection state, note the change:
+
 > ccs_test get_state 0
 
 Get state successful.
  Current working path: //cluster
  Previous query      : //cluster/@name
-.sp
 
-\fBAfter setting the connection state, you can now query with an absolute
-or relative path:\fP
-.sp
+.SS After setting the connection state, you can now query with an absolute or relative path:
+
 > ccs_test get 0 @name
 
 Get successful.
@@ -147,16 +120,12 @@
 
 Get successful.
  Value = <brassow>
-.sp
 
-\fBTo disconnect:\fP
-.sp
+.SS To disconnect:
+
 > ccs_test disconnect 0
 
 Disconnect successful.
-.sp
 
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 ccs(7), ccsd(8), cluster.conf(5)
=== man/ccs_tool.8
==================================================================
--- man/ccs_tool.8  (revision 321)
+++ man/ccs_tool.8  (local)
@@ -4,48 +4,32 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'ccs_tool(8)''ccs_tool(8)'
-'\"     View with 'groff -t -e -mandoc -Tlatin1 ccs_tool.8 | less'
+.TH ccs_tool 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 ccs_tool - The tool used to make online updates of CCS config files.
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBccs_tool\fP [\fBoptions\fP] <\fBcommand\fP>
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH SYNOPSIS
+.B ccs_tool
+[\fIOPTION\fR].. <\fBcommand\fP>
+
+.SH DESCRIPTION
 \fBccs_tool\fP is part of the Cluster Configuration System (CCS).  It
 is used to make online updates of CCS config files.  Additionally, it
 can be used to upgrade old style (GFS <= 6.0) CCS archives to the new
 xml format.
 
-.sp
-.in
-\fBOPTIONS\fP
-.in +7
-
+.SH OPTIONS
+.TP
 \fB-h\fP
-.in +7
 Help.  Print out the usage.
-.sp
-.in
-
+.TP
 \fB-V\fP
-.in +7
 Print the version information.
-.sp
-.in
 
-.in -7
-\fBCOMMANDS\fP
-.in +7
+.SH COMMANDS
+.TP
 \fBupdate\fP \fI<xml file>\fP
-.in +7
 This command is used to update the config file that ccsd is working with
 while the cluster is operational (i.e. online).  Run this on a single
 machine to update all instances of ccsd across the cluster.
@@ -55,20 +39,12 @@
 complete.  Failure to do so will result in new nodes (or nodes rejoining
 after a failure) not being allowed
 to join the working set due to version number mismatches.
-
-.sp
-.in
+.TP
 \fBupgrade\fP \fI<location>\fP
-.in +7
 This command is used to upgrade an old CCS format archive to the new
 xml format.  \fI<location>\fP is the location of the old archive,
 which can be either a block device archive or a file archive.  The
 converted configuration will be printed to stdout.
 
-.sp
-.in
-
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 ccs(7), ccsd(8), cluster.conf(5)
=== man/ccsd.8
==================================================================
--- man/ccsd.8  (revision 321)
+++ man/ccsd.8  (local)
@@ -5,67 +5,42 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'ccsd(8)''ccsd(8)'
-'\"     View with 'groff -t -e -mandoc -Tlatin1 ccsd.8 | less'
+.TH ccsd 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 ccsd - The daemon used to access CCS cluster configuration files.
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBccsd\fP [\fBoptions\fP]
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH SYNOPSIS
+.B ccsd
+[\fIOPTION\fR]..
+
+.SH DESCRIPTION
 \fBccsd\fP is part of the Cluster Configuration System (CCS).  It is the
 daemon which accesses cluster the configuration file for other cluster
 applications.  It must be run on each node that wishes to join a cluster.
 
-.sp
-.in
-
-\fBOPTIONS\fP
-.in +7
-
+.SH OPTIONS
+.TP
 \fB-4\fP
-.in +7
 Use IPv4 for all communication.  By default, IPv6 is tried, then IPv4.
-.sp
-.in
-
+.TP
 \fB-6\fP
-.in +7
 Use IPv6 for all communication.  By default, IPv6 is tried, then IPv4.
-.sp
-.in
-
+.TP
 \fB-h\fP
-.in +7
 Help.  Print out the usage syntax.
-.sp
-.in
-
+.TP
 \fB-m <multicast address>\fP
-.in +7
 Used to specify the multicast address.  The keyword "default" can be used,
 in which case "ff02::3:1" is used for IPv6 and "224.0.2.5" is used for IPv4.
 
 If you are using IPv4, the default action is to use broadcast.  Specifying
 this option will cause multicast to be used in that instance.
-.sp
-.in
-
+.TP
 \fB-n\fP
-.in +7
 No daemon.  Run in the foreground.
-.sp
-.in
-
+.TP
 \fB-P <port identifier>:<port number>\fP
-.in +7
 You have the option of specifying the port numbers used by ccsd.  The port
 identifier is either: b, c, or f.  "b" is the port which ccsd attempts to
 communicate with ccsd processes on other machines, via broadcast/multicast, to
@@ -76,22 +51,12 @@
 from the CCS library (or programs using it).  This is known as the frontend port.
 
 So, to change the frontend port one might specify \fI-P f:60000\fP.
-.sp
-.in
-
+.TP
 \fB-t <ttl>\fP
-.in +7
 Set the multicast threshold (aka time to live).
-.sp
-.in
-
+.TP
 \fB-V\fP
-.in +7
 Print the version information.
-.sp
-.in
 
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 ccs(7), ccs_tool(8), ccs_test(8), cluster.conf(5)
=== man/cluster.conf.5
==================================================================
--- man/cluster.conf.5  (revision 321)
+++ man/cluster.conf.5  (local)
@@ -2,19 +2,12 @@
 .\"  Copyright 2001-2003 Sistina Software, Inc.
 .\"  Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
 
-.\" groff -t -e -mandoc -Tlatin1 cluster.conf.5 | less
+.TH cluster.conf 5
 
-.tl 'cluster.conf(5)' 'cluster.conf(5)'
-
-\fBNAME\fP
-.in +7
+.SH NAME
 cluster.conf - The configuration file for cluster products
-.sp
-.in -7
 
-
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 The \fBcluster.conf\fP file is located in the /etc/cluster directory.  It
 is the source of information used by the cluster products - accessed
 indirectly through CCS (see \fBccs(7)\fP).  This file contains all the
@@ -186,10 +179,6 @@
   </fencedevices>
 </cluster>
 
-
-.sp
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 ccs(7), ccs_tool(8), lock_gulmd(5), cman(5)
 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050216/fbf8bf06/attachment.sig>

From ptr at poczta.fm  Wed Feb 16 16:40:50 2005
From: ptr at poczta.fm (ptr at poczta.fm)
Date: 16 Feb 2005 17:40:50 +0100
Subject: [Linux-cluster] 2.6.10 and latest CVS
Message-ID: <20050216164050.99E74A41A5@poczta.interia.pl>


    Hello.

I'm trying to upgrade 2-node cluster raising the kernel
version as well. After patching 2.6.10 sources and building 
cluster modules/userland, I am unable to run it. It seems
like magma has been installed in a wrong dir, but reinstallation
from a scratch did not help:

"Failed to connect to cluster manager.
Hint: Magma plugins are not in the right spot"

And magma_tool says:
gentoo02 # magma_tool config plugindir
/install/GFS/cluster/build/usr/lib/magma

It's my installation dir, why?
How can I force the right directories?
    Thank you in advance for any hints, regards

Piotr

--------------------------------------------------------------------
3 minuty z Niemcami i USA za zlotowke? Tylko z Halo! 0708-108-107
>>> http://link.interia.pl/f184a <<<


From ed.mann at choicepoint.com  Wed Feb 16 22:46:38 2005
From: ed.mann at choicepoint.com (Edward Mann)
Date: Wed, 16 Feb 2005 16:46:38 -0600
Subject: [Linux-cluster] lock_gulm heartbeat
Message-ID: <1108593999.26259.40.camel@storm.cp-direct.com>

Everyone,


I have a gfs cluster that has been running fine for about 3 months now.
I am only using 2 machines and the storage is a firewire drive. Over the
weekend i started to get:
lock_gulmd_core[1100]: Failed to receive a timely heartbeat reply from
Master. (t:1108583425506998 mb:1)

and after 2, which is what i allowed for the missed heart beats, the gfs
slave would die. I moved the missed heartbeat up to 5 and have seen it
miss as many as 4 in a row. The only thing that has changed on the
machine is that i add new clients to process files once they are placed
on the machine. I am using FAM to notify my app that a new file is
present.

Any ideas on what i should look at? How can i diagnose this problem. The
communication between the two machines seems fine. I can ping both
hosts. I am really at a loss at to what to look for.


Thanks for the help.


From daniel at osdl.org  Wed Feb 16 23:39:37 2005
From: daniel at osdl.org (Daniel McNeil)
Date: Wed, 16 Feb 2005 15:39:37 -0800
Subject: [Linux-cluster] mount hang in kcl_join_service
Message-ID: <1108597177.9361.23.camel@ibm-c.pdx.osdl.net>

I have not been able to get my tests to run for more than
1 day for the last several tries.  This time my test hung
during mount in kcl_join_service().  My test does mount and umount 
several times for each test run.  This time it hung on the
22nd test run.  It looks like it was starting a 3node test
where a gfs file system is mounted on all 3 nodes and then
does a umount/mount 1 node at a time.  So this should have
done an umount on cl031 and then hung on a mount on cl031
with cl030 and cl032 having the gfs file system still mounted.

The mount stack trace is:
mount         D C170EF9C     0  1557      1         12111  3932 (NOTLB)
f09b9c30 00000086 f1f4c580 c170ef9c 00002ca3 c2c39ae0 00000008 00000000
       e947e548 7b01b78b 00002ca3 f09b9c10 f1f4c580 00000000 c170f8c0 c170ef60
       00000000 00003ba8 7b01b9ea 00002ca3 c2c39ae0 c2c39c4c 00000000 00002ca3
Call Trace:
 [<c03ce814>] wait_for_completion+0xa4/0xe0
 [<f8ab6164>] kcl_join_service+0x154/0x180 [cman]
 [<f8890fff>] init_mountgroup+0x6f/0xc0 [lock_dlm]
 [<f88934b1>] lm_dlm_mount+0xa1/0xf0 [lock_dlm]
 [<f8812300>] lm_mount+0x140/0x230 [lock_harness]
 [<f9017f4d>] gfs_lm_mount+0x1fd/0x390 [gfs]
 [<f9024276>] fill_super+0x596/0x14c0 [gfs]
 [<f902533f>] gfs_get_sb+0x15f/0x1b0 [gfs]
 [<c0166ae8>] do_kern_mount+0x58/0xe0
 [<c017ce08>] do_new_mount+0x98/0xe0
 [<c017d4b5>] do_mount+0x165/0x1b0
 [<c017d8c7>] sys_mount+0x97/0x100
 [<c010323d>] sysenter_past_esp+0x52/0x75

A bunch of info is available here:
http://developer.osdl.org/daniel/GFS/test.11feb2005/

The bad news is that taking a stack trace to a serial console
causes nodes to be kicked out of the cluster, so some of the
info has the nodes being kicked out.

Any ideas on how to figure this out?

Daniel


From teigland at redhat.com  Thu Feb 17 06:26:05 2005
From: teigland at redhat.com (David Teigland)
Date: Thu, 17 Feb 2005 14:26:05 +0800
Subject: [Linux-cluster] mount hang in kcl_join_service
In-Reply-To: <1108597177.9361.23.camel@ibm-c.pdx.osdl.net>
References: <1108597177.9361.23.camel@ibm-c.pdx.osdl.net>
Message-ID: <20050217062605.GA6389@redhat.com>

On Wed, Feb 16, 2005 at 03:39:37PM -0800, Daniel McNeil wrote:
> I have not been able to get my tests to run for more than
> 1 day for the last several tries.  This time my test hung
> during mount in kcl_join_service().  My test does mount and umount 
> several times for each test run.  This time it hung on the
> 22nd test run.  It looks like it was starting a 3node test
> where a gfs file system is mounted on all 3 nodes and then
> does a umount/mount 1 node at a time.  So this should have
> done an umount on cl031 and then hung on a mount on cl031
> with cl030 and cl032 having the gfs file system still mounted.

> A bunch of info is available here:
> http://developer.osdl.org/daniel/GFS/test.11feb2005/

I've looked through it and can't pinpoint the problem.  Next
time could you also collect /proc/cluster/lock_dlm/debug and
/proc/cluster/dlm_debug ?

I've set up a similar but simplified test on both of my test
clusters (a 2-node and a 7-node).  I can't dedicate these
machines for a full 1-2 day stretch this until the weekend,
though.  My test is a loop around:

- on each node sequentially: unmount/mount gfs
- on each node sequentially: run some load for a couple minutes

-- 
Dave Teigland  <teigland at redhat.com>


From jrajiv at hclinsys.com  Thu Feb 17 07:12:57 2005
From: jrajiv at hclinsys.com (Rajiv)
Date: Thu, 17 Feb 2005 12:42:57 +0530
Subject: [Linux-cluster] GFS and ORACLE-9i
Message-ID: <006a01c514c0$1c5c3b10$0f120897@PMORND>

Dear All,
    Can gfs file system be used for shared storage when configuring Oracle 
RAC. If I can do so will the oracle partition be OCFS - Oracle Cluster File 
System or GFS. Can I install GFS and OCFS on the same storage. Will there be 
any compatibility issues. Can I use OpenGFS for this purpose. What is the 
difference between GFS and Open GFS.

Regards,
Rajiv 


From lars.larsen at edb.com  Thu Feb 17 12:59:44 2005
From: lars.larsen at edb.com (=?iso-8859-1?Q?Larsen_Lars_Asbj=F8rn?=)
Date: Thu, 17 Feb 2005 13:59:44 +0100
Subject: [Linux-cluster] GFS and ORACLE-9i
Message-ID: <E9DC52DB6AF7C342B8C722A818DDF62248BC47@EDBCL01CN011.EDB.local>

You can use GFS as shared storage file system for Oracle RAC. On Red Hats documentation site you will find a setup guide for this under Cluster Suite/GFS documentation.
But it is a bit unclear if Oracle officially have this setup certified yet. In such a setup the Oracle database files will reside on the GFS file system. OCFS will not be used at all. OCFS will need to be installed if you will like to run it. RAC is supported on OCFS. But OCFS has a number of limitations. You can't grow the file system among other things. It is not an general file system and can only be used for Oracle database files, so you can't ha a shared Oracle home on it. 
Read OCFS documentation carefully and decide if it is OK for your use. It is not to difficult to set up and works.

With 10g it should not bee need for any shared file system if you use ASM, if I understand it right.


Regards
Lars Larsen


-----Original Message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Rajiv
Sent: 17. februar 2005 08:13
To: linux-cluster at redhat.com
Subject: [Linux-cluster] GFS and ORACLE-9i


Dear All,
    Can gfs file system be used for shared storage when configuring Oracle 
RAC. If I can do so will the oracle partition be OCFS - Oracle Cluster File 
System or GFS. Can I install GFS and OCFS on the same storage. Will there be 
any compatibility issues. Can I use OpenGFS for this purpose. What is the 
difference between GFS and Open GFS.

Regards,
Rajiv 

--
Linux-cluster mailing list
Linux-cluster at redhat.com http://www.redhat.com/mailman/listinfo/linux-cluster


From bastian at waldi.eu.org  Thu Feb 17 14:00:55 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Thu, 17 Feb 2005 15:00:55 +0100
Subject: [Linux-cluster] gfs - convert manpages to the man package
Message-ID: <20050217140055.GA923@wavehammer.waldi.eu.org>

Hi folks

I've converted the gfs manpages to the man package.

Bastian

-- 
Vulcans worship peace above all.
		-- McCoy, "Return to Tomorrow", stardate 4768.3
-------------- next part --------------
=== man/ccs.7
==================================================================
--- man/ccs.7  (revision 321)
+++ man/ccs.7  (local)
@@ -4,16 +4,12 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'ccs(7)''ccs(7)'
-'\"     View with 'groff -t -e -mandoc -Tlatin1 ccs.7 | less'
+.TH ccs 7
 
-\fBNAME\fP
-.in +7
+.SH NAME
 ccs - Cluster Configuration System
 
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 A cluster environment that shares resources has information that is
 essential to correct operation which must be available to
 every node in the cluster.  This information may include:
@@ -21,31 +17,24 @@
 more.  \fICCS\fP is the system that makes it possible for the nodes in a
 cluster to retrieve the information they need.
 
-.sp
-.in
-\fBOVERVIEW\fP
-.in +7
+.SH OVERVIEW
 The following is a generic description of the steps one should take to produce
 a working CCS environment.
 
-\fBStep 1)\fP
-.sp
+.SS Step 1
 Choose a cluster name.  It is important to determine
 a name for the cluster before starting.  The cluster name is what
 binds a machine to specific resources that can only be shared by
 machines that are members of the same cluster name.
 
-\fBStep 2)\fP
-.sp
+.SS Step 2
 Create the directory \fI/etc/cluster\fP.
 
-\fBStep 3)\fP
-.sp
+.SS Step 3
 Create the \fI/etc/cluster/cluster.conf\fP file, according to the
 \fBcluster.conf(5)\fP man page, on one node in your cluster.
 
-\fBStep 4)\fP
-.sp
+.SS Step 4
 Start \fBccsd\fP and test the cluster.conf file by using \fBccs_test\fP.
 If you haven't started a cluster manager yet, you should use the 'force'
 option to \fBccs_test\fP - see the \fBccs_test(8)\fP man page for more info.
@@ -53,16 +42,9 @@
 If a failure occurs while parsing the config file, \fBccs_test\fP should
 report "ccs_connect failed: No data available" and /var/log/messages
 should report "Unable to parse /etc/cluster/cluster.conf".
-.sp
-.in
 
-\fBFORMAT OF THE CCS FILE\fP
-.in +7
-.sp
+.SH FORMAT OF THE CCS FILE
 See \fBcluster.conf(5)\fP
-.sp
-.in
 
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 ccsd(8), ccs_tool(8), ccs_test(8), cluster.conf(5)
=== man/ccs_test.8
==================================================================
--- man/ccs_test.8  (revision 321)
+++ man/ccs_test.8  (local)
@@ -4,33 +4,24 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'ccs_test(8)''ccs_test(8)'
-'\"     View with 'groff -t -e -mandoc -Tlatin1 ccs_test.8 | less'
+.TH ccs_test 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 ccs_test - The diagnostic tool for a running Cluster Configuration System.
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBccs_test\fP <\fBcommand\fP>
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH SYNOPSIS
+.B ccs_test
+<\fBcommand\fP>
+
+.SH DESCRIPTION
 \fBccs_test\fP is part of the Cluster Configuration System (CCS).  It
 is a diagnostic tool designed to validate the correct operation of a
 running CCS system.  It will communicate with the CCS daemon - \fBccsd\fP -
 to obtain any information stored in the system.
 
-.sp
-.in
-
-\fBCOMMANDS\fP
-.in +7
+.SH COMMANDS
+.TP
 \fBconnect\fP \fI[force]\fP \fI[block]\fP \fI[cluster name]\fP
-.in +7
 This command creates a connection to ccsd.  It returns a descriptor, which
 is used as an parameter to other commands.
 
@@ -44,10 +35,8 @@
 only configuration files containing the given cluster name are valid
 possibilities.
 
-.sp
-.in
+.TP
 \fBget\fP \fI<desc>\fP \fI<request>\fP
-.in +7
 Get the results of a given request.  The 'desc' is the number returned
 from the \fBconnect\fP command.  The 'request' is a valid Xpath request.
 
@@ -55,89 +44,73 @@
 Subsequent calls with the same 'request' will result in the subsequent
 matches.  Once all the matches have been returned, a subsequent call
 will begin again with the first result.
-.sp
-.in
 
+.TP
 \fBget_list\fP \fI<desc>\fP \fI<request>\fP
-.in +7
 Similar to the \fBget\fP command.  However, issuing subsequent calls
 with the same 'request' will result in all matches being returned (one
 at a time), then null, then starting over with the first result.
-.sp
-.in
 
+.TP
 \fBset\fP \fI<desc>\fP \fI<path>\fP \fI<value>\fP
-.in +7
 Sets a particular 'path' to the given 'value'.  Not yet implemented.
-.sp
-.in
 
+.TP
 \fBget_state\fP \fI<desc>\fP
-.in +7
 Get the state associated with a given connection.
-.sp
-.in
 
+.TP
 \fBset_state\fP \fI<desc>\fP \fI<ncwp>\fP
-.in +7
 Set the current working path (cwp) to 'ncwp' for a given connection.
-.sp
-.in
 
-.in -7
-\fBEXAMPLES\fP
-.in +7
-\fBTo connect to ccsd:\fP
-.sp
+.SH EXAMPLES
+.SS To connect to ccsd:
+
 > ccs_test connect
 
 Connect successful.
  Connection descriptor = 0
-.sp
+
 Or, if the cluster is not yet quorate and the name of the cluster is 'mycluster':
-.sp
+
 > ccs_test connect force block mycluster
 
 Connect successful.
  Connection descriptor = 0
-.sp
 
-\fBTo get the cluster name from ccsd:\fP
-.sp
+.SS To get the cluster name from ccsd:
+
 > ccs_test get 0 //cluster/@name
 
 Get successful.
  Value = <mycluster>
-.sp
 
-\fBTo get the connection state:\fP
-.sp
+.SS To get the connection state:
+
 > ccs_test get_state 0
 
 Get state successful.
  Current working path:
  Previous query      : //cluster/@name
-.sp
 
-\fBTo set the connection state:\fP
-.sp
+
+.SS To set the connection state:
+
 > ccs_test set_state 0 //cluster
 
 Set state successful.
-.sp
 
-\fBAfter setting the connection state, note the change:\fP
-.sp
+
+.SS After setting the connection state, note the change:
+
 > ccs_test get_state 0
 
 Get state successful.
  Current working path: //cluster
  Previous query      : //cluster/@name
-.sp
 
-\fBAfter setting the connection state, you can now query with an absolute
-or relative path:\fP
-.sp
+.SS After setting the connection state, you can now query with an absolute or relative path:
+
 > ccs_test get 0 @name
 
 Get successful.
@@ -147,16 +120,12 @@
 
 Get successful.
  Value = <brassow>
-.sp
 
-\fBTo disconnect:\fP
-.sp
+.SS To disconnect:
+
 > ccs_test disconnect 0
 
 Disconnect successful.
-.sp
 
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 ccs(7), ccsd(8), cluster.conf(5)
=== man/ccs_tool.8
==================================================================
--- man/ccs_tool.8  (revision 321)
+++ man/ccs_tool.8  (local)
@@ -4,48 +4,32 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'ccs_tool(8)''ccs_tool(8)'
-'\"     View with 'groff -t -e -mandoc -Tlatin1 ccs_tool.8 | less'
+.TH ccs_tool 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 ccs_tool - The tool used to make online updates of CCS config files.
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBccs_tool\fP [\fBoptions\fP] <\fBcommand\fP>
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH SYNOPSIS
+.B ccs_tool
+[\fIOPTION\fR].. <\fBcommand\fP>
+
+.SH DESCRIPTION
 \fBccs_tool\fP is part of the Cluster Configuration System (CCS).  It
 is used to make online updates of CCS config files.  Additionally, it
 can be used to upgrade old style (GFS <= 6.0) CCS archives to the new
 xml format.
 
-.sp
-.in
-\fBOPTIONS\fP
-.in +7
-
+.SH OPTIONS
+.TP
 \fB-h\fP
-.in +7
 Help.  Print out the usage.
-.sp
-.in
-
+.TP
 \fB-V\fP
-.in +7
 Print the version information.
-.sp
-.in
 
-.in -7
-\fBCOMMANDS\fP
-.in +7
+.SH COMMANDS
+.TP
 \fBupdate\fP \fI<xml file>\fP
-.in +7
 This command is used to update the config file that ccsd is working with
 while the cluster is operational (i.e. online).  Run this on a single
 machine to update all instances of ccsd across the cluster.
@@ -55,20 +39,12 @@
 complete.  Failure to do so will result in new nodes (or nodes rejoining
 after a failure) not being allowed
 to join the working set due to version number mismatches.
-
-.sp
-.in
+.TP
 \fBupgrade\fP \fI<location>\fP
-.in +7
 This command is used to upgrade an old CCS format archive to the new
 xml format.  \fI<location>\fP is the location of the old archive,
 which can be either a block device archive or a file archive.  The
 converted configuration will be printed to stdout.
 
-.sp
-.in
-
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 ccs(7), ccsd(8), cluster.conf(5)
=== man/ccsd.8
==================================================================
--- man/ccsd.8  (revision 321)
+++ man/ccsd.8  (local)
@@ -5,67 +5,42 @@
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
 
-.tl 'ccsd(8)''ccsd(8)'
-'\"     View with 'groff -t -e -mandoc -Tlatin1 ccsd.8 | less'
+.TH ccsd 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 ccsd - The daemon used to access CCS cluster configuration files.
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBccsd\fP [\fBoptions\fP]
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH SYNOPSIS
+.B ccsd
+[\fIOPTION\fR]..
+
+.SH DESCRIPTION
 \fBccsd\fP is part of the Cluster Configuration System (CCS).  It is the
 daemon which accesses cluster the configuration file for other cluster
 applications.  It must be run on each node that wishes to join a cluster.
 
-.sp
-.in
-
-\fBOPTIONS\fP
-.in +7
-
+.SH OPTIONS
+.TP
 \fB-4\fP
-.in +7
 Use IPv4 for all communication.  By default, IPv6 is tried, then IPv4.
-.sp
-.in
-
+.TP
 \fB-6\fP
-.in +7
 Use IPv6 for all communication.  By default, IPv6 is tried, then IPv4.
-.sp
-.in
-
+.TP
 \fB-h\fP
-.in +7
 Help.  Print out the usage syntax.
-.sp
-.in
-
+.TP
 \fB-m <multicast address>\fP
-.in +7
 Used to specify the multicast address.  The keyword "default" can be used,
 in which case "ff02::3:1" is used for IPv6 and "224.0.2.5" is used for IPv4.
 
 If you are using IPv4, the default action is to use broadcast.  Specifying
 this option will cause multicast to be used in that instance.
-.sp
-.in
-
+.TP
 \fB-n\fP
-.in +7
 No daemon.  Run in the foreground.
-.sp
-.in
-
+.TP
 \fB-P <port identifier>:<port number>\fP
-.in +7
 You have the option of specifying the port numbers used by ccsd.  The port
 identifier is either: b, c, or f.  "b" is the port which ccsd attempts to
 communicate with ccsd processes on other machines, via broadcast/multicast, to
@@ -76,22 +51,12 @@
 from the CCS library (or programs using it).  This is known as the frontend port.
 
 So, to change the frontend port one might specify \fI-P f:60000\fP.
-.sp
-.in
-
+.TP
 \fB-t <ttl>\fP
-.in +7
 Set the multicast threshold (aka time to live).
-.sp
-.in
-
+.TP
 \fB-V\fP
-.in +7
 Print the version information.
-.sp
-.in
 
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 ccs(7), ccs_tool(8), ccs_test(8), cluster.conf(5)
=== man/cluster.conf.5
==================================================================
--- man/cluster.conf.5  (revision 321)
+++ man/cluster.conf.5  (local)
@@ -2,19 +2,12 @@
 .\"  Copyright 2001-2003 Sistina Software, Inc.
 .\"  Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
 
-.\" groff -t -e -mandoc -Tlatin1 cluster.conf.5 | less
+.TH cluster.conf 5
 
-.tl 'cluster.conf(5)' 'cluster.conf(5)'
-
-\fBNAME\fP
-.in +7
+.SH NAME
 cluster.conf - The configuration file for cluster products
-.sp
-.in -7
 
-
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 The \fBcluster.conf\fP file is located in the /etc/cluster directory.  It
 is the source of information used by the cluster products - accessed
 indirectly through CCS (see \fBccs(7)\fP).  This file contains all the
@@ -186,10 +179,6 @@
   </fencedevices>
 </cluster>
 
-
-.sp
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 ccs(7), ccs_tool(8), lock_gulmd(5), cman(5)
 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/2023b6ed/attachment.sig>

From bastian at waldi.eu.org  Thu Feb 17 14:08:55 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Thu, 17 Feb 2005 15:08:55 +0100
Subject: [Linux-cluster] gfs - convert manpages to the man package
In-Reply-To: <20050217140055.GA923@wavehammer.waldi.eu.org>
References: <20050217140055.GA923@wavehammer.waldi.eu.org>
Message-ID: <20050217140855.GA15353@wavehammer.waldi.eu.org>

On Thu, Feb 17, 2005 at 03:00:55PM +0100, Bastian Blank wrote:
> I've converted the gfs manpages to the man package.

I attached the wrong diff.

Bastian

-- 
Vulcans do not approve of violence.
		-- Spock, "Journey to Babel", stardate 3842.4
-------------- next part --------------
=== man/gfs.8
==================================================================
--- man/gfs.8  (revision 310)
+++ man/gfs.8  (local)
@@ -1,32 +1,40 @@
 .\"  Copyright (C) Sistina Software, Inc.  1997-2003  All rights reserved.
 .\"  Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
 
-.tl 'gfs(8)''gfs(8)'
+.TH gfs 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 GFS reference guide
 
-.in
-\fBSYNOPSIS\fP
-.in +7
+.SH SYNOPSIS
 Overview of manpages and their locations
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+
+.SH DESCRIPTION
 The GFS documentation has been split into a number of sections.  Please
 refer to the table below to determine which man page coincides with the
 command/feature you are looking for.
-.sp
- gfs                 GFS overview (this man page)
- gfs_mount           Mounting a GFS file system
- gfs_fsck            The GFS file system checker
- gfs_grow            Growing a GFS file system
- gfs_jadd            Adding a journal to a GFS file system
- gfs_mkfs            Make a GFS file system
- gfs_quota           Manipulate GFS disk quotas 
- gfs_tool            Tool to manipulate a GFS file system
-.sp
-.in
+.TP 16
+gfs
+GFS overview (this man page)
+.TP
+gfs_mount
+Mounting a GFS file system
+.TP
+gfs_fsck
+The GFS file system checker
+.TP
+gfs_grow
+Growing a GFS file system
+.TP
+gfs_jadd
+Adding a journal to a GFS file system
+.TP
+gfs_mkfs
+Make a GFS file system
+.TP
+gfs_quota
+Manipulate GFS disk quotas 
+.TP
+gfs_tool
+Tool to manipulate a GFS file system
 
=== man/gfs_fsck.8
==================================================================
--- man/gfs_fsck.8  (revision 310)
+++ man/gfs_fsck.8  (local)
@@ -1,26 +1,20 @@
 .\"  Copyright (C) Sistina Software, Inc.  1997-2003  All rights reserved.
 .\"  Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
 
-.tl 'gfs_fsck(8)''gfs_fsck(8)'
+.TH gfs_fsck 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 gfs_fsck - Offline GFS file system checker
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBgfs_fsck\fP [\fBoptions\fP] \fIdevice\fR
-.sp
-.in
-\fBWARNING\fP
-.in +7
+.SH SYNOPSIS
+.B gfs_fsck
+[\fIOPTION\fR]... \fIDEVICE\fR
+
+.SH WARNING
 All GFS nodes \fImust\fP have the GFS filesystem unmounted before running
 gfs_fsck.  Failure to unmount all nodes may result in filesystem corruption.
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+
+.SH DESCRIPTION
 gfs_fsck will check that the GFS file system on a device is structurally valid.
 It should not be run on a mounted file system.  If file system corruption is
 detected, it will attempt to repair the file system.  There is a limit to what
@@ -34,44 +28,29 @@
 fix.  The first step to ensuring a healthy file system is the selection of
 reliable hardware (i.e. storage systems that will write complete blocks - even
 in the event of power failure).
-.sp
-.in
 
-\fBOPTIONS\fP
-.in +7
-
+.SH OPTIONS
+.TP
 \fB-h\fP
-.in +7
 Help.
-.sp
-This prints out the proper command line usage syntax.
-.sp
-.in
 
+This prints out the proper command line usage syntax.
+.TP
 \fB-q\fP
-.in +7
 Quiet.
-.sp
-.in
-
+.TP
 \fB-V\fP
-.in +7
 Version.
-.sp
-Print out the current version name.
-.sp
-.in
 
+Print out the current version name.
+.TP
 \fB-v\fP
-.in +7
 Verbose operation.
-.sp
-Print more information while running.
-.in
 
+Print more information while running.
+.TP
 \fB-y\fP
-.in +7
 Yes to all questions.
-.sp
+
 By specifying this option, gfs_fsck will not prompt before making
 changes.
=== man/gfs_grow.8
==================================================================
--- man/gfs_grow.8  (revision 310)
+++ man/gfs_grow.8  (local)
@@ -1,27 +1,16 @@
 .\"  Copyright (C) Sistina Software, Inc.  1997-2003  All rights reserved.
 .\"  Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
 
-.tl 'gfs_grow(8)''gfs_grow(8)'
-'\"
-'\"    This file is maintained by:
-'\"      Steven Whitehouse <steve at sistina.com>
-'\"    
-'\"	View with 'groff -t -e -mandoc -Tlatin1 gfs_grow.8 | less'  
-'\"
+.TH gfs_grow 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 gfs_grow - Expand a GFS filesystem
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBgfs_grow\fP\ [options]\ \fIdevice\ |\ mount_point\fR\ [\fIdevice\ |
-\ mount_point\fR...]
-.in
+.SH SYNOPSIS
+.B gfs_grow
+[\fIOPTION\fR]... <\fIDEVICE\fR|\fIMOINTPOINT\fR>...
 
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 gfs_grow is used to expand a GFS filesystem after the device
 upon which the filesystem resides has also been expanded.  By
 running gfs_grow on a GFS filesystem, you are requesting that
@@ -30,61 +19,48 @@
 filesystem extension.  When this operation is complete, the resource
 index for the filesystem is updated so that all nodes in the
 cluster can use the extra storage space which has been added.
-.sp
+
 You may only run gfs_grow on a mounted filesystem; expansion of 
 unmounted filesystems is not supported.  You only need to
 run gfs_grow on one node in the cluster.  All the other nodes will
 see the expansion has occurred and automatically start to use the
 newly available space.
-.sp
+
 You must be superuser to execute \fBgfs_grow\fP.  The gfs_grow
 tool tries to prevent you from corrupting your filesystem by checking as
 many of the likely problems as it can.  When expanding a filesystem,
 only the last step of updating the resource index affects the currently
 mounted filesystem and so failure part way through the expansion process
 should leave your filesystem in its original unexpanded state.
-.sp
+
 You can run gfs_grow with the \fB-Tv\fP flags to get a display
 of the current state of a mounted GFS filesystem.  This can be useful
 to do after the expansion process to see if the changes have been 
 successful.
-.sp
+
 \fBgfs_grow\fP will consume all the remaining space in a device and add
 it to the filesystem.  If you want to add journals too, you need to add
 the journals first using \fBgfs_jadd\fP.
-.in
-.sp
-\fBOPTIONS\fP
-.in 
+
+.SH OPTIONS
+.TP 
 \fB-h\fP
-.in +7
 Prints out a short usage message and exits.
-.in
-.sp
+.TP
 \fB-q\fP
-.in +7
 Quiet. Turns down the verbosity level.
-.sp
-.in
+.TP
 \fB-T\fP
-.in +7
 Test. Do all calculations, but do not write any data to the disk and do
 not expand the filesystem. This is used to discover what the tool would
 have done were it run without this flag. You probably want to turn the
 verbosity level up in order to gain most information from this option.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Version. Print out version information, then exit.
-.sp
-.in
+.TP
 \fB-v\fP
-.in +7
 Verbose. Turn up verbosity of messages.
-.in
-.sp
-.in -7
-\fBSEE ALSO\fP
-.in +7
+
+.SH SEE ALSO
 gfs_mkfs(8) gfs_jadd(8)
=== man/gfs_jadd.8
==================================================================
--- man/gfs_jadd.8  (revision 310)
+++ man/gfs_jadd.8  (local)
@@ -1,27 +1,16 @@
 .\"  Copyright (C) Sistina Software, Inc.  1997-2003  All rights reserved.
 .\"  Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
 
-.tl 'gfs_jadd(8)''gfs_jadd(8)'
-'\"
-'\"    This file is maintained by:
-'\"      Steven Whitehouse <steve at sistina.com>
-'\"    
-'\"	View with 'groff -t -e -mandoc -Tlatin1 gfs_jadd.8 | less'  
-'\"
+.TH gfs_jadd 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 gfs_jadd \- Add journals to a GFS filesystem
 
-.in
-\fBSYNOPSIS\fP
-.in +8
-\fBgfs_jadd\fP\ [options]\ [\fB-j\fP\ \fI<num>\fR]\ [\fB-J\fP\ \fI\
-<size>\fR]\ \fBdevice\fP\ |\fB mount_point\fP\ [\fBdevice\fP\ |\fBmount_point\fP...]
+.SH SYNOPSIS
+.B gfs_jadd
+[\fIOPTION\fR]... <\fIDEVICE\fR|\fIMOINTPOINT\fR>...
 
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 \fIgfs_jadd\fR is used to add journals to a GFS filesystem after
 the device upon which the filesystem resides has been grown.
 By running \fIgfs_jadd\fR
@@ -54,54 +43,36 @@
 This can be useful to do after the journal addition process to see if the
 changes have been successful.
 
-.in
-\fBOPTIONS\fR
-.in +7
+.SH OPTIONS
+.TP
 \fB-j num\fP
-.in +7
 The number of new journals to add. This defaults to 1.
-.sp
-.in
+.TP
 \fB-J size\fP
-.in +7
 The size of the new journals in megabytes. The defaults to 128MB (the
 minimum size allowed is 32MB). If you want to add journals of different
 sizes to the filesystem, you'll need to run gfs_jadd once for each
 different size of journal. The size you specify here will be rounded
 down so that it is a multiple of the journal segment size which was
 specified at filesystem creation time.
-.sp
-.in
+.TP
 \fB-h\fP
-.in +7
 Help. Prints out a short usage message and exits.
-.sp
-.in
+.TP
 \fB-q\fP
-.in +7
 Quiet. Turns down the verbosity level.
-.sp
-.in
+.TP
 \fB-T\fP
-.in +7
 Test. Do all calculations, but do not write any data to the disk and do
 not add journals. This is used to discover what the tool would
 have done were it run without this flag. You probably want to turn the
 verbosity level up in order to gain most information from this option.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Version. Print version information, then exit.
-.sp
-.in
+.TP
 \fB-v\fP
-.in +7
 Verbose. Turn up verbosity of messages.
-.sp
-.in
-.in -7
-\fBSEE ALSO\fP
-.in +7
-\fIgfs_mkfs(8) gfs_grow(8)\fR
-.in -7
+
+.SP SEE ALSO
+gfs_mkfs(8) gfs_grow(8)
=== man/gfs_mkfs.8
==================================================================
--- man/gfs_mkfs.8  (revision 310)
+++ man/gfs_mkfs.8  (local)
@@ -1,112 +1,79 @@
 .\"  Copyright (C) Sistina Software, Inc.  1997-2003  All rights reserved.
 .\"  Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
 
-.tl 'gfs_mkfs(8)''gfs_mkfs(8)'
+.TH gfs_mkfs 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 gfs_mkfs - Make a GFS filesystem
-.sp
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBgfs_mkfs\fP [options] \fIBlockDevice\fR
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
-gfs_mkfs is used to create a Global File System on the
-block device \fIBlockDevice\fR.
-.sp
-.in
-\fBOPTIONS\fP
-.in +7
+
+.SH SYNOPSIS
+.B gfs_mkfs
+[\fIOPTION\fR]... \fIDEVICE\fR
+
+.SH DESCRIPTION
+gfs_mkfs is used to create a Global File System.
+
+.SH OPTIONS
+.TP
 \fB-b\fP \fIBlockSize\fR 
-.in +7
 Set the filesystem block size to \fIBlockSize\fR (must be a power of
 two).  The minimum block size is 512.  The FS block size cannot exceed
 the machine's memory page size.  On the most architectures (i386,
 x86_64, s390, s390x), the memory page size is 4096 bytes.  On other
 architectures it may be bigger.  The default block size is 4096 bytes.
 In general, GFS filesystems should not deviate from the default value.
-.in
-.sp
+.TP
 \fB-D\fP
-.in +7
 Enable debugging output.
-.in
-.sp
+.TP
 \fB-h\fP
-.in +7
 Print  out  a  help  message  describing  available
 options, then exit.
-.in
-.sp
+.TP
 \fB-J\fP \fIMegaBytes\fR 
-.in +7
 The size of the journals in Megabytes. The default journal size is 
 128 megabytes.  The minimum size is 32 megabytes.
-.in
-.sp
+.TP
 \fB-j\fP \fINumber\fR 
-.in +7
 The number of journals for gfs_mkfs to create.  You need at least one
 journal per machine that will mount the filesystem.
-.in
-.sp
+.TP
 \fB-O\fP
-.in +7
 This option prevents gfs_mkfs from asking for confirmation before writing
 the filesystem.
-.in
-.sp
+.TP
 \fB-p\fP \fILockProtoName\fR 
-.in +7
 LockProtoName is the name of the  locking  protocol to use.  Acceptable
 locking protocols are \fIlock_gulm\fR or if you are using GFS
 as a local filesystem (\fB1 node only\fP), you can specify the
 \fIlock_nolock\fR protocol.
-.in
-.sp
+.TP
 \fB-q\fP
-.in +7
 Be quiet.  Don't print anything.
-.in
-.sp
+.TP
 \fB-r\fP \fIMegaBytes\fR
-.in +7
 gfs_mkfs will try to make Resource Groups about this big.
 The default is 256 MB.
-.in
-.sp
+.TP
 \fB-s\fP \fIBlocks\fR 
-.in +7
 Journal segment size in filesystem blocks.  This value must be at
 least two and not large enough to produce a segment size greater than
 4MB.
-.in
-.sp
+.TP
 \fB-t\fP \fILockTableName\fR 
-.in +7
 The lock table field appropriate to the lock module you're using.
 For lock_gulm, it is \fIclustername:fsname\fR.
 Clustername is the cluster.ccs:cluster/name string for the cluster which
 will use this filesystem (1 to 16 characters).  Fsname is a unique
 file system name used to distinguish this GFS file system from others
 created (1 to 16 characters).  Lock_nolock doesn't use this field.
-.in
-.sp
+.TP
 \fB-V\fP
-.in +7
 Print program version information, then exit.
-.in
-.sp
-.in -7
-\fBEXAMPLE\fP
-.in +7
+
+.SH EXAMPLE
+.TP
 gfs_mkfs -t mycluster:mygfs -p lock_gulm -j 2 /dev/pool/mygfs
-.sp
-.in +7
 This will make a Global File System on the block device
 "/dev/pool/mygfs".  It will belong to "mycluster" and register itself
 as wanting locking for "mygfs".  It will use GULM for locking and make
=== man/gfs_mount.8
==================================================================
--- man/gfs_mount.8  (revision 310)
+++ man/gfs_mount.8  (local)
@@ -4,21 +4,17 @@
 .\"  This copyrighted material is made available to anyone wishing to use,
 .\"  modify, copy, or redistribute it subject to the terms and conditions
 .\"  of the GNU General Public License v.2.
-.\"
-.tl 'gfs_mount(8)''gfs_mount(8)'
 
-\fBNAME\fP
-.in +7
+.TH gfs_mount 8
+
+.SH NAME
 gfs_mount - GFS mount options
-.in
-.sp
-\fBSYNOPSIS\fP
-.in +7
-\fBmount\fP [StandardMountOptions] \fB-t\fP gfs \fIdevice\fR \fImountpoint\fR \fB-o\fP [GFSOption1,GFSOption2,GFSOptionX...]
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+
+.SH SYNOPSIS
+.B mount
+[\fIStandardMountOptions\fR] \fB-t\fP gfs \fIDEVICE\fR \fIMOUNTPOINT\fR \fB-o\fP [GFSOption1,GFSOption2,GFSOptionX...]
+
+.SH DESCRIPTION
 GFS may be used as a local (single computer) filesystem, but its real purpose
 is in clusters, where multiple computers (nodes) share a common storage device.
 
@@ -68,13 +64,10 @@
 
 If you have trouble mounting GFS, check the syslog (e.g. /var/log/messages)
 for specific error messages.
-.sp
-.in
-\fBOPTIONS\fP
-.in +7
 
+.SH OPTIONS
+.TP
 \fBlockproto=\fP\fILockModuleName\fR
-.in +5
 This specifies which inter-node lock protocol is used by the GFS filesystem
 for this mount, overriding the default lock protocol name stored in the
 filesystem's on-disk superblock.
@@ -91,10 +84,8 @@
 The \fBlockproto\fP mount option should be used only under special
 circumstances in which you want to temporarily use a different lock protocol
 without changing the on-disk default.
-.in
-.sp
+.TP
 \fBlocktable=\fP\fILockTableName\fR
-.in +5
 This specifies the identity of the cluster and of the filesystem for this
 mount, overriding the default cluster/filesystem identify stored in the
 filesystem's on-disk superblock.  The cluster/filesystem name is recognized
@@ -114,10 +105,8 @@
 circumstances in which you want to mount the filesystem in a different cluster,
 or mount it as a different filesystem name, without changing the on-disk
 default.
-.in
-.sp
+.TP
 \fBhostdata=\fP\fIHostIDInfo\fR
-.in +5
 This field sends host (the computer on which the filesystem is being mounted)
 identity information to the lock module.
 
@@ -126,20 +115,16 @@
 used as default by lock_gulm.
 
 This field is ignored by \fIlock_dlm\fR and \fIlock_nolock\fR.
-.in
-.sp
+.TP
 \fBlocalcaching\fP
-.in +5
 This flag tells GFS that it is running as a local (not clustered) filesystem,
 so it can turn on some block caching optimizations that can't be used when
 running in cluster mode.
 
 This is turned on automatically by the lock_nolock module,
 but can be overridden by using the \fBignore_local_fs\fP option.
-.in
-.sp
+.TP
 \fBlocalflocks\fP
-.in +5
 This flag tells GFS that it is running as a local (not clustered) filesystem,
 so it can allow the kernel VFS layer to do all flock and fcntl file locking.
 When running in cluster mode, these file locks require inter-node locks,
@@ -148,10 +133,8 @@
 
 This is turned on automatically by the lock_nolock module,
 but can be overridden by using the \fBignore_local_fs\fP option.
-.in
-.sp
+.TP
 \fBoopses_ok\fP
-.in +5
 Normally, GFS automatically turns on the "kernel.panic_on_oops"
 sysctl to cause the machine to panic if an oops (an in-kernel
 segfault or GFS assertion failure) happens.  An oops on one machine of
@@ -166,29 +149,23 @@
 
 This is turned on automatically by the lock_nolock module,
 but can be overridden by using the \fBignore_local_fs\fP option.
-.in
-.sp
+.TP
 \fBignore_local_fs\fP
-.in +5
 By default, using the nolock lock module automatically turns on the
 \fBlocalcaching\fP and \fBlocalflocks\fP optimizations.  \fBignore_local_fs\fP
 forces GFS to treat the filesystem as if it were a multihost (clustered)
 filesystem, with \fBlocalcaching\fP and \fBlocalflocks\fP optimizations
 turned off.
-.in
-.sp
+.TP
 \fBupgrade\fP
-.in +5
 This flag tells GFS to upgrade the filesystem's on-disk format to the version
 supported by the current GFS software installation on this computer.
 If you try to mount an old-version disk image, GFS will notify you via a syslog
 message that you need to upgrade.  Try mounting again, using the
 \fB-o upgrade\fP option.  When upgrading, only one node may mount the GFS
 filesystem.
-.in
-.sp
+.TP
 \fBnum_glockd\fP
-.in +5
 Tunes GFS to alleviate memory pressure when rapidly aquiring many locks (e.g.
 several processes scanning through huge directory trees).  GFS' glockd kernel
 daemon cleans up memory for no-longer-needed glocks.  Multiple instances
@@ -196,40 +173,28 @@
 one daemon, with a maximum of 32.  Since this option was introduced, other
 methods of rapid cleanup have been developed within GFS, so this option may go
 away in the future.
-.in
-.sp
+.TP
 \fBacl\fP
-.in +5
 Enables POSIX Access Control List \fBacl\fP(5) support within GFS.
-.in
-.sp
+.TP
 \fBsuiddir\fP
-.in +5
 Sets owner of any newly created file or directory to be that of parent
 directory, if parent directory has S_ISUID permission attribute bit set.
 Sets S_ISUID in any new directory, if its parent directory's S_ISUID is set.
 Strips all execution bits on a new file, if parent directory owner is different
 from owner of process creating the file.  Set this option only if you know
 why you are setting it.
-.in
-.sp
-.in -7
-\fBLINKS\fP
-.in +7
+
+.SH LINKS
+.TP 30
 http://sources.redhat.com/cluster
-.in +7
 -- home site of GFS
-.in
-.sp
+.TP
 http://www.suse.de/~agruen/acl/linux-acls/
-.in +7
 -- good writeup on ACL support in Linux
-.in
-.in -7
-.sp
-\fBSEE ALSO\fP
-.in +7
 
+.SH SEE ALSO
+
 \fBgfs\fP(8), 
 \fBmount\fP(8) for general mount options,
 \fBchmod\fP(1) and \fBchmod\fP(2) for access permission flags,
@@ -239,6 +204,4 @@
 \fBlock_gulmd\fP(8),
 \fBumount\fP(8),
 \fBinitrd\fP(4).
-.sp
-.in
 
=== man/gfs_quota.8
==================================================================
--- man/gfs_quota.8  (revision 310)
+++ man/gfs_quota.8  (local)
@@ -1,151 +1,104 @@
 .\"  Copyright (C) Sistina Software, Inc.  1997-2003  All rights reserved.
 .\"  Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
 
-.tl 'gfs_quota(8)''gfs_quota(8)'
+.TH gfs_quota 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 gfs_quota - Manipulate GFS disk quotas
-.sp
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBgfs_quota\fP <list|sync|get|limit|warn|check|init> [options]
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+
+.SH SYNOPSIS
+.B gfs_quota
+<list|sync|get|limit|warn|check|init> [\fIOPTION\fR]...
+
+.SH DESCRIPTION
 gfs_quota is used to examine and change quota values in a GFS filesystem.
 This command has a number of different actions.
-.sp
-.in
-\fBACTIONS\fP
-.in +7
+
+.SH ACTIONS
+.TP
 \fBlist\fP
-.in +7
 List the contents of the quota file.  Only IDs that have a non-zero hard limit,
 warn limit, or value are printed.
-.in
-.sp
+.TP
 \fBsync\fP
-.in +7
 Sync any local quota changes to the quota file.
-.in
-.sp
+.TP
 \fBget\fP
-.in +7
 Get the current data for the ID specified by the -u or -g argument.
-.in
-.sp
+.TP
 \fBlimit\fP
-.in +7
 Set the current hard limit for the ID specified by the -u or -g argument to 
 the value specified by the -l argument on the specified filesystem.
 The filesystem won't let the user or group use more than this much space.
 A value of zero here means that no limit is enforced.
-.in
-.sp
+.TP
 \fBwarn\fP
-.in +7
 Set the current warn limit for the ID specified by the -u or -g argument to 
 the value specified by the -l argument on the specified filesystem.
 The filesystem will start complaining to the user or group when more
 than this much space is used.  A value of zero here means that the
 user won't ever be warned.
-.in
-.sp
+.TP
 \fBcheck\fP
-.in +7
 Scan a filesystem and make sure that what's out there on the disk matches
 what's in the quota file.  This is only accurate if the filesystem is
 idle when this is running.  If there is a mismatch, it is printed to
 stdout.  Note: GFS quotas are transactional and a quota check is \fBnot\fP
 needed every time there is a system crash.
-.in
-.sp
+.TP
 \fBinit\fP
-.in +7
 Scan a filesystem and initialize the quota file with the values obtained
 from the scan.  The filesystem should be idle when this is run.  You should
 only need to do this if you upgrade a pre-quota GFS filesystem (pre-GFS 5.1).
-.in
-.sp
-.in -7
-\fBOPTIONS\fP
-.in +7
+
+.SH OPTIONS
 \fB-b\fP
-.in +7
 The units for disk space are filesystem blocks.
-.in
-.sp
+.TP
 \fB-d\fP
-.in +7
 Don't include the space allocated to GFS' hidden files in
 what's reported for the root UID and GID values.  This is useful
 if you're trying to get the numbers reported by gfs_quota to match
 up with the numbers reported by du.
-.in
-.sp
+.TP
 \fB-f\fP \fIDirectory\fR 
-.in +7
 Specifies which filesystem to perform the action on.
-.in
-.sp
+.TP
 \fB-g\fP \fIGID\fR 
-.in +7
 Specifies the group ID for get, limit, or warn.  It can be either
 the group name from the group file, or the GID number.
-.in
-.sp
+.TP
 \fB-h\fP
-.in +7
 Print  out  a  help  message  describing  available
 options, then exit.
-.in
-.sp
+.TP
 \fB-k\fP
-.in +7
 The units for disk space are kilobytes.
-.in
-.sp
+.TP
 \fB-l\fP \fISize\fR 
-.in +7
 Specifies the new value for the limit or warn actions.
 The value is assumed to be in the units specified by the
 -m, -k, -s, -b arguments.  The default is megabytes.
-.in
-.sp
+.TP
 \fB-m\fP
-.in +7
 The units for disk space are megabytes.  This is the default.
-.in
-.sp
+.TP
 \fB-n\fP
-.in +7
 Don't try to resolve UIDs and GIDs into user and group names.
-.in
-.sp
+.TP
 \fB-s\fP
-.in +7
 The units for disk space are sectors (512-byte blocks).
-.in
-.sp
+.TP
 \fB-u\fP \fIUID\fR 
-.in +7
 Specifies the user ID for get, limit, or warn.  It can be either
 the username from the password file, or the UID number.
-.in
-.sp
+.TP
 \fB-V\fP
-.in +7
 Print program version information, then exit.
-.in
-.sp
-.in -7
-\fBEXAMPLE\fP
-.in +7
+
+.SH EXAMPLE
 To set the hard limit for user "nobody" to
 1048576 kilobytes on filesystem /gfs0
-.in +7
+
 gfs_quota limit -l 1048576 -k -u nobody -f /gfs0
 
=== man/gfs_tool.8
==================================================================
--- man/gfs_tool.8  (revision 310)
+++ man/gfs_tool.8  (local)
@@ -1,147 +1,103 @@
 .\"  Copyright (C) Sistina Software, Inc.  1997-2003  All rights reserved.
 .\"  Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
 
-.tl 'gfs_tool(8)''gfs_tool(8)'
+.TH gfs_tool 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 gfs_tool - interface to gfs ioctl calls
-.in
-.sp
-\fBSYNOPSIS\fP
-.in +7
-\fBgfs_tool\fP command [options]
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+
+.SH SYNOPSIS
+.B gfs_tool
+\fICOMMAND\fR [\fIOPTION\fR]...
+
+.SH DESCRIPTION
 gfs_tool is an interface to a variety of the GFS ioctl calls.
-.sp
-.in
-\fBCOMMANDS\fP
-.in +7
+
+.SH COMMANDS
+.TP
 \fBclearflag\fP \fIFlag\fR \fIFile1\fR \fIFile2\fR \fI...\fR 
-.in +7
 Clear an attribute flag on a file.
-.in
-.sp
+.TP
 \fBcounters\fP \fIMountPoint\fR [-c]
-.in +7
 Print out statistics about a filesystem.  If -c is used, gfs_tool continues
 to run printing out the stats once a second.
-.in
-.sp
+.TP
 \fBdf\fP \fIMountPoint\fR 
-.in +7
 Print out a space usage summary of a given filesystem.  The information
 printed is more detailed than a standard "df".
-.in
-.sp
+.TP
 \fBfreeze\fP \fIMountPoint\fR
-.in +7
 Freeze (quiesce) a GFS cluster.
-.in
-.sp
+.TP
 \fBgetsb\fP \fIMountPoint\fR
-.in +7
 Print out the superblock of a mounted filesystem.
-.in
-.sp
+.TP
 \fBgettune\fP \fIMountPoint\fR
-.in +7
 Print out the current values of the tuning parameters in a running
 filesystem.
-.in
-.sp
+.TP
 \fBjindex\fP \fIMountPoint\fR
-.in +7
 Print out the journal index of a mounted filesystem.
-.in
-.sp
+.TP
 \fBlayout\fP \fIFile\fR \fI[buffersize]\fR
-.in +7
 Print out on-disk layout information about a file or directory.
 Buffersize is the size of the buffer (in bytes) that gfs_tool allocates
 to store the file's metadata during processing.  It defaults to 4194304
 bytes.  If you are printing a very big directory you may need to specify
 a bigger size.
-.in
-.sp
+.TP
 \fBlist\fP
-.in +7
 List the currently mounted GFS filesystems.  Each line represents
 a filesystem.  The columns represent (in order): 1) A number that
 is a cookie that represents the mounted filesystem. 2) The name of the
 device that holds the filesystem (well, the name as the Linux
 kernel knows it). 3) The lock table field that the filesystem was
 mounted with.
-.in
-.sp
+.TP
 \fBlockdump\fP \fIMountPoint\fR \fI[buffersize]\fR
-.in +7
 Print out information about the locks this machine holds for a given
 filesystem. Buffersize is the size of the buffer (in bytes) that gfs_tool
 allocates to store the lock data during processing.  It defaults to 4194304
 bytes.
-.in
-.sp
+.TP
 \fBmargs\fP \fIarguments\fR
-.in +7
 This loads arguments into the module what will override the mount
 options passed with the -o field on the next mount.  See gfs_mount(8).
-.in
-.sp
+.TP
 \fBreclaim\fP \fIFile\fR
-.in +7
 Returns unused on-disk metadata blocks to free blocks.
-.in
-.sp
+.TP
 \fBrindex\fP \fIMountPoint\fR
-.in +7
 Print out the resource group index of a mounted filesystem.
-.in
-.sp
+.TP
 \fBquota\fP \fIMountPoint\fR
-.in +7
 Print out the quota file of a mounted filesystem.  Also see
 the "gfs_quota list" command.
-.in
-.sp
+.TP
 \fBsb\fP \fIdevice\fR \fBproto\fP \fI[newvalue]\fR
-.in +7
 View (and possibly replace) the name of the locking protocol in the
 file system superblock.  The file system shouldn't be mounted by any
 client when you do this.
-.in
-.sp
+.TP
 \fBsb\fP \fIdevice\fR \fBtable\fP \fI[newvalue]\fR
-.in +7
 View (and possibly replace) the name of the locking table in the
 file system superblock.  The file system shouldn't be mounted by any
 client when you do this.
-.in
-.sp
+.TP
 \fBsb\fP \fIdevice\fR \fBondisk\fP \fI[newvalue]\fR
-.in +7
 View (and possibly replace) the ondisk format number in the
 file system superblock.  The file system shouldn't be mounted by any
 client when you do this.  No one should have to use this.
-.in
-.sp
+.TP
 \fBsb\fP \fIdevice\fR \fBmultihost\fP \fI[newvalue]\fR
-.in +7
 View (and possibly replace) the multihost format number in the
 file system superblock.  The file system shouldn't be mounted by any
 client when you do this.  No one should have to use this.
-.in
-.sp
+.TP
 \fBsb\fP \fIdevice\fR \fBall\fP
-.in +7
 Print out the superblock.
-.in
-.sp
+.TP
 \fBsetflag\fP \fIFlag\fR \fIFile1\fR \fIFile2\fR \fI...\fR 
-.in +7
 Set an attribute flag on a file.  There are four currently
 supported flags.  They are jdata, directio, inherit_jdata, and
 inherit_directio.
@@ -162,35 +118,23 @@
 regular files created in that directory automatically inherit the
 \fIdirectio\fR flag.  The \fIinherit_directio\fR is also inherited by
 any new subdirectories created in that directory.
-.in
-.sp
+.TP
 \fBsettune\fP \fIMountPoint\fR \fIparameter\fR \fInewvalue\fR
-.in +7
 Set the value of tuning parameter.  Use \fBgettune\fP for a listing of 
 tunable parameters.
-.in
-.sp
+.TP
 \fBshrink\fP \fIMountPoint\fR
-.in +7
 Causes any unused inodes to be thrown out of memory.
-.in
-.sp
+.TP
 \fBstat\fP \fIFile\fR
-.in +7
 Print out extended stat information about a file.
-.in
-.sp
+.TP
 \fBunfreeze\fP \fIMountPoint\fR
-.in +7
 Unfreeze a GFS cluster.
-.in
-.sp
+.TP
 \fBversion\fP
-.in +7
 Print out the version of GFS that this program goes with.
-.in
-.sp
+.TP
 \fBwithdraw\fP \fIMountPoint\fR
-.in +7
 Cause GFS to abnormally shutdown a given filesystem on this node.
 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/dca6e201/attachment.sig>

From bastian at waldi.eu.org  Thu Feb 17 14:10:11 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Thu, 17 Feb 2005 15:10:11 +0100
Subject: [Linux-cluster] cman - convert manpage to the man package
Message-ID: <20050217141011.GB15353@wavehammer.waldi.eu.org>

Hi folks

I've converted one cman manpage to the man package.

Bastian

-- 
Vulcans worship peace above all.
		-- McCoy, "Return to Tomorrow", stardate 4768.3
-------------- next part --------------
=== man/cman.5
==================================================================
--- man/cman.5  (revision 316)
+++ man/cman.5  (local)
@@ -3,36 +3,25 @@
 
 .\" groff -t -e -mandoc -Tlatin1 cman.5 | less
 
-.tl 'cman(5)' 'cman(5)'
+.TH cman 5
 
-\fBNAME\fP
-.in +7
+.SH NAME
 cman - cluster.conf cman configuration section
-.sp
-.in -7
 
-
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 Cman configuration values are placed in the <cman> </cman> section of
 \fBcluster.conf\fP.  Per-node configuration related to cman is placed
 in the standard <clusternode> </clusternode> sections.  All cman
 configuration settings are optional; usually none are used.
-.in -7
 
-
-\fIUDP port\fR
-.in +7
+.SS UDP port
 By default, cman will use UDP port 6809 for internode communication.  This can
 be changed by setting a port number as follows:
 
   <cman port="6809">
   </cman>
-.in -7
 
-
-\fIExpected votes\fR
-.in +7
+.SS Expected votes
 The expected votes value is used by cman to determine quorum.  The cluster is
 quorate if the sum of votes of existing members is over half of the expected
 votes value.  By default, cman_tool sets the expected votes value to be the sum
@@ -45,11 +34,8 @@
 If the cluster becomes partitioned, improper use of this option can result
 in more than one partition gaining quorum.  In that event, nodes in each
 partition will enable cluster services.
-.in -7
 
-
-\fITwo node clusters\fR
-.in +7
+.SS Two node clusters
 Ordinarily, the loss of quorum after one out of two nodes fails will prevent
 the remaining node from continuing (if both nodes have one vote.)  Special
 configuration options can be set to allow the one remaining node to continue
@@ -59,23 +45,16 @@
 
   <cman two_node="1" expected_votes="1">
   </cman>
-.in -7
 
-
-\fINode votes\fR
-.in +7
+.SS Node votes
 By default, a node is given one vote toward the calculation of quorum.
 This can be changed by giving a node a specific number of votes as
 follows:
 
   <clusternode name="nd1" votes="2">
   </clusternode>
-.in -7
 
-
-\fINode ID\fR
-.in +7
-
+.SS Node ID
 By default, a node is assigned a nodeid by the cluster manager (cman) when
 it joins the cluster.  This can be overriden by specifying a nodeid here.
 Using this option will ensure that a given node always has the same ID.
@@ -84,11 +63,8 @@
 
   <clusternode name="nd1" nodeid="1">
   </clusternode>
-.in -7
 
-
-\fIMultihome network configuration\fR
-.in +7
+.SS Multihome network configuration
 Cman can be configured to use multiple network interfaces.  If one fails it
 should be able to continue running with the one remaining.  A node's name in
 cluster.conf is always associated with the IP address on one network interface;
@@ -104,11 +80,8 @@
   <clusternode name="nd1">
       <altname name="nd1-e1"/>
   </clusternode>
-.in -7
 
-
-\fIMulticast network configuration\fR
-.in +7
+.SS Multicast network configuration
 Cman can be configured to use multicast instead of broadcast (broadcast is used
 by default if no multicast parameters are given.)  To configure multicast when
 one network interface is used add one line under the <cman> section and another
@@ -137,12 +110,7 @@
       <multicast addr="224.0.0.1" interface="eth0"/>
       <multicast addr="224.0.0.9" interface="eth1"/>
   </clusternode>
-.in -7
 
-
-.sp
-.in -7
-\fBSEE ALSO\fP
-.in +7
+.SH SEE ALSO
 cluster.conf(5), ccs(7), cman_tool(8)
 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/61b1d9c1/attachment.sig>

From bastian at waldi.eu.org  Thu Feb 17 15:20:25 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Thu, 17 Feb 2005 16:20:25 +0100
Subject: [Linux-cluster] gnbd - convert manpages to the man package
Message-ID: <20050217152025.GA30311@wavehammer.waldi.eu.org>

Hi folks

I've converted the gnbd manpages to the man package.

Bastian

-- 
Vulcans worship peace above all.
		-- McCoy, "Return to Tomorrow", stardate 4768.3
-------------- next part --------------
=== man/fence_gnbd.8
==================================================================
--- man/fence_gnbd.8  (revision 328)
+++ man/fence_gnbd.8  (local)
@@ -1,116 +1,87 @@
 .\"  Copyright (C) Sistina Software, Inc.  1997-2003  All rights reserved.
 .\"  Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
 
-.tl 'fence_gnbd(8)''fence_gnbd(8)'
+.TH fence_gnbd 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 fence_gnbd - I/O Fencing agent for GNBD-based GFS clusters
 
-.in
-\fBSYNOPSIS\fP
-.in +7
-\fBfence_gnbd -s\fP \fIclient_ip\fP \fB-t\fP \fIserver_ip\fP
+.SH SYNOPSIS
+.B fence_gnbd
+[\fIOPTION\fR]...
 
-.in
-.sp
-\fBDESCRIPTION\fP
-.in +7
+.SH DESCRIPTION
 fence_gnbd is an I/O Fencing agent used when GFS is accessed through GNBD.  It
 uses the gnbd_import program and tells a GNBD server to ignore I/O from a 
 specified machine. 
-.sp
+
 fence_gnbd accepts options on the command line as well as from stdin.
 fence_node sends the options through stdin when it execs the agent.
 fence_gnbd can be run by itself with command line options which is useful
 for testing.
-.sp
-.in
-\fBOPTIONS\fP
-.in
+
+.SH OPTIONS
+.TP
 \fB-h\fP
-.in +7
 Print out a help message describing available options, then exit.
-.sp
-.in
+.TP
 \fB-m\fP
-.in +7
 Selects multipath style fencing. With multipath style fencing, if fence_gnbd
 cannot contact the kgnbd_portd process on the gnbd server node, it will fence
 that server node. This is necessary to insure that there is no way for the
 fenced client to access the storage through that server.
 \fBWARNING:\fP Multipath style fencing must be used on a node if it is using
 pool multipathing with GNBD devices.
-.in
+.TP
 \fB-q\fP
-.in +7
 quiet mode, no output.
-.sp
-.in
+.TP
 \fB-s\fP \fInode\fP
-.in +7
 gnbd client machine to fence.
-.sp
-.in
+.TP
 \fB-t\fP \fInode\fP
-.in +7
 server machine to fence the gnbd client from.  If this option is
 not given, the specified gnbd client node will be fenced from all gnbd server
 nodes that have GNBDs imported by the machine running fence_gnbd.
 Using the -t option is strongly recommended.  The -t option may be used
 multiple times to fence a client from multiple servers.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Print out a version message, then exit.
-.sp
-.in
-.in -7
-\fBSTDIN PARAMETERS\fP
-.in +7
 
+.SH STDIN PARAMETERS
+.TP
 \fIagent = < param >\fP
-.sp
 This option is used by fence_node(8) and is ignored by fence_gnbd.
-.sp
-
+.TP
 \fIipaddr = < clustername >\fP
-.sp
 The cluster name of the node to be fenced (required) \fBNOTE:\fP This parameter
 no longer allows the IP address of the node to be used. This parameter is
 deprecated. Please use \fInodename\fP instead.
-.sp
+.TP
 \fInodename = < clustername >\fP
-.sp
 The cluster name of the node to be fenced (required)
-.sp
+.TP
 \fIservers = < hostname [ hostname ... ] >\fP
-.sp
 A whitespace seperated list of the servers to fence the client from, in
 either IP address or hostname form.
-.sp
+.TP
 \fIoption = multipath\fP
-.sp
 Select multipath style fencing. \fBWARNING:\fP When multipath style fencing is
 used, if the kgnbd_portd process of a gnbd server node cannot be contacted, it
 is fenced as well, using its appropriate fencig method.  This means that when
 a client is fenced, any node listed as its server that does not have the
 gnbd_serv module loaded (which starts kgnbd_portd) will also be fenced.
-.sp
+.TP
 \fIretrys = < param >\fP
-.sp
 Number of times to retry connecting to the server after a failed attempt,
 before the server is fenced.  This parameter is only valid when used
 with multipath style fencing (see above).  The default is 3.
-.sp
+.TP
 \fIwait_time = < param >\fP
-.sp
 length of time, in seconds, to wait between retrys. This parameter
 is only valid when used with multipath sytle fencing (see above). The default
 is 2.
-.sp
-.in -7
-\fBSEE ALSO\fP
-.in +7
+
+.SH SEE ALSO
 fence(8), fenced(8), gnbd_import(8)
=== man/gnbd.8
==================================================================
--- man/gnbd.8  (revision 328)
+++ man/gnbd.8  (local)
@@ -1,21 +1,15 @@
 .\"  Copyright (C) Sistina Software, Inc.  1997-2003  All rights reserved.
 .\"  Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
 
-.tl 'GNBD(8)''GNBD(8)'
-'\"     View with 'groff -t -e -mandoc -Tlatin1 gnbd.8 | less'
+.TH gnbd 8
 
-\fBNAME\fP
-.in +7
+.SH NAME
 GNBD reference guide
 
-.in
-\fBSYNOPSIS\fP
-.in +7
+.SH SYNOPSIS
 Overview of manpages and their locations
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+
+.SH DESCRIPTION
 The GFS Network Block Device has two parts, the client (which includes the
 \fBgnbd.ko\fP module and  \fBgnbd_import\fP, \fBgnbd_recvd\fP, and
 \fBgnbd_monitor\fP programs) and the server
@@ -27,17 +21,23 @@
 device distributed with the Linux kernel) is GNBD's ability to have multiple
 clients all accessing the same served file or block device at the same
 time.   
-.sp
+
 The GNBD documentation has been split into a number of sections.  Please
 refer to the table below to determine which man page coincides with the
 command/feature you are looking for.
-.sp
- \fBgnbd\fP                 GNBD overview (this man page)
- \fBgnbd_import\fP          Configure a GNBD Client
- \fBgnbd_export\fP          Configure a GNBD Server
- \fBfence_gnbd\fP           Fence method for GNBD
-.sp
-.in
-\fBSEE ALSO\fP
-.in +7
+
+.TP 16
+gnbd
+GNBD overview (this man page)
+.TP
+gnbd_import
+Configure a GNBD Client
+.TP
+gnbd_export
+Configure a GNBD Server
+.TP
+fence_gnbd
+Fence method for GNBD
+
+.SH SEE ALSO
 gfs(8), fence(8), fence_gnbd(8)
=== man/gnbd_export.8
==================================================================
--- man/gnbd_export.8  (revision 328)
+++ man/gnbd_export.8  (local)
@@ -1,37 +1,29 @@
 .\"  Copyright (C) Sistina Software, Inc.  1997-2003  All rights reserved.
 .\"  Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
 
-.tl 'gnbd_export(8)''gnbd_export(8)'
+.TH gnbd_export 8
 
-\fBNAME\fP 
-.in +7
+.SH NAME
 gnbd_export - the interface to export GNBDs
 
-.in
-\fBSYNOPSIS\fP 
-.in +7
-\fBgnbd_export\fP [options]
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
+.SH SYNOPSIS
+.B gnbd_export
+[\fIOPTION\fR]...
+
+.SH DESCRIPTION
 gnbd_export exports local block devices or files as GNBDs.
-.sp
-.in
-\fBOPTIONS\fP
-.in +7
+
+.SH OPTIONS
+.TP
 \fB-a\fP
-.in +7
 Validate.
-.sp
+
 This option forces all server processes to send a ping message to the clients
 they are connected to. This forces servers with faulty connections to quit.
-.sp
-.in
+.TP
 \fB-c\fP
-.in +7
 Enable caching.
-.sp
+
 Reads from the exported GNBD will take advantage of the linux page cache.
 This option is used with \fB-e\fP. \fBNOTE:\fP If this option is not specified,
 gnbd will run with a noticeable performance decrease.  Also, if this option
@@ -50,48 +42,38 @@
 the \fB-c\fP option can also be used locally, but you must access the device
 directly. You MUST NOT use gnbd_import to import devices exported from the
 same machine.
-.sp
-.in
+.TP
 \fB-d\fI pathname\fR
-.in +7
 Device.
-.sp
+
 Specify the device to export as a GNBD.  This option is used with \fB-e\fP. 
 \fIpathname\fR may be either a block device or a regular file.  Usually block 
 devices are used, because this increases GNBD performance.
-.sp
-.in
+.TP
 \fB-e\fI gnbdname\fR
-.in +7
 Export.
-.sp
+
 Export a device as a GNBD with the Device name \fIgnbdname\fR. You must also
 specify the pathname of the device with the \fB-d\fP option.  Once a GNBD 
 has been exported, clients can import it with \fBgnbd_import\fP.
-.sp
-.in
+.TP
 \fB-h\fP
-.in +7
 Help.
-.sp
+
 Print the usage information.
-.sp
-.in
+.TP
 \fB-l\fP 
-.in +7
 List.
-.sp
+
 List all exported GNBDs and kgnbd_portd server information.  The listing 
 contains each server's number (which is only for internal use), its Device 
 name, the pathname of the device that is being exported, it's size in
 512 byte sectors, and information on whether or not it is cached, and if not,
 what it's timeout is.
-.sp
-.in
+.TP
 \fB-O\fP
-.in +7
 Override
-.sp
+
 This option allows you to unexport gnbd devices, even if they are still in
 use.  When an agent other than fence_gnbd is used to fence gnbd client nodes,
 occasionally gnbd server threads are not correctly cleaned up.  This causes
@@ -99,40 +81,30 @@
 In this case, using the \fB-O\fP option with either \fB-r\fP or \fB-R\fP will
 allow you to unexport the GNBD devices.  \fBWARNING:\fP Make sure
 that no clients have the GNBD imported before using this option.
-.sp
-.in
+.TP
 \fB-o\fP
-.in +7
 Readonly
-.sp
+
 export the server in readonly mode.
-.sp
-.in
+.TP
 \fB-q\fP 
-.in +7
 Quiet.
-.sp
+
 Only prints out error messages.
-.sp
-.in
+.TP
 \fB-R\fP
-.in +7
 Remove All.
-.sp
+
 Remove all exported GNBDs.
-.sp
-.in
+.TP
 \fB-r\fP [\fIGNBD(s)\fR]
-.in +7
 Remove.
-.sp
+
 Remove named GNBD(s).
-.sp
-.in
+.TP
 \fB-t\fP [\fIseconds\fR]
-.in +7
 Timeout.
-.sp
+
 Set the exported GNBD to timeout mode  This option is used with \fB-p\fP.
 This is the default for uncached GNBDs. For cached GNBDs, the default is wait
 mode (For GFS versions up through 5.2, all GNBDs were in wait mode).  In wait
@@ -144,25 +116,18 @@
 pending and future requests as failures until the imported GNBD is closed. The
 default timeout period is 60 seconds. Timeout mode is necessary for failover to
 work with dm multipathing over gnbd.
-.sp
-.in
+.TP
 \fB-v\fP
-.in +7
 Verbose.
-.sp
+
 Increase the verbosity of the output.  This option is the most useful with
 \fB-l\fP.  If it is used along with \fB-l\fP, an extended list of information
 on each exported device will be printed.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Version information.
-.sp
+
 Print out version information.
-.sp
-.in
-.in -7
-\fBSEE ALSO\fP
-.in +7
+
+.SH SEE ALSO
 gnbd_import(8), gnbd(8)
=== man/gnbd_import.8
==================================================================
--- man/gnbd_import.8  (revision 328)
+++ man/gnbd_import.8  (local)
@@ -1,78 +1,61 @@
 .\"  Copyright (C) Sistina Software, Inc.  1997-2003  All rights reserved.
 .\"  Copyright (C) 2004 Red Hat, Inc.  All rights reserved.
 
-.tl 'gnbd_import(8)''gnbd_import(8)'
+.TH gnbd_import 8
 
-\fBNAME\fP 
-.in +7
+.SH NAME
 gnbd_import - manipulate GNBD block devices on a client
 
-.in
-\fBSYNOPSIS\fP 
-.in +7
-\fBgnbd_import\fP [options]
-.sp
-.in
-\fBDESCRIPTION\fP
-.in +7
-gnbd_import imports, lists, and removes GNBDs from the system. GNBD is simila
- to
+.SH SYNOPSIS
+.B gnbd_import
+[\fIOPTION\fR]...
+
+.SH DESCRIPTION
+gnbd_import imports, lists, and removes GNBDs from the system. GNBD is similar to
 the Network Block Device (nbd) in the Linux kernel, except that it allows
 multiple clients to connect at once and has a built-in fence command.
-.sp
-.in
-\fBOPTIONS\fP
-.in +7
+
+.SH OPTIONS
+.TP
 \fB-a\fP
-.in +7
 Validate.
-.sp
+
 Restart failed gnbd_recvd processes. Usually, if a gnbd becomes
 disconnected, the \fBgnbd_recvd\fP process for that device will automatically
 try to reconnect. If that process is killed, \fBgnbd_import -a\fP will
 restart it.
-.sp
-.in
+.TP
 \fB-c \fIserver\fR
-.in +7
 Check fenced.
-.sp
+
 List all the IP addresses currently IO fenced from the specified \fIserver\fR.
 If the sepecified server does not have any IP addresses fenced, nothing will
 be returned.  If the server machine is not running gnbd_serv, an error will
 be returned.
-.sp
-.in
+.TP
 \fB-e \fIserver\fR
-.in +7
 List exported.
-.sp
+
 List GNBDs exported by the specified \fIserver\fR, along with the port at 
 which they can be accessed. If the specified server is not exporting any GNBDs, 
 nothing will be returned. If the server machine is not running gnbd_serv, an
 error will be returned.
-.sp
-.in
+.TP
 \fB-h\fP
-.in +7
 Help.
-.sp
+
 Print the usage information.
-.sp
-.in
+.TP
 \fB-i \fIserver\fR
-.in +7
 Import.
-.sp
+
 Import all GNBDs which the specified \fIserver\fR has exported. This will not 
 allow a GNBD to be imported if another one with the same name has already been
 imported.
-.sp
-.in
+.TP
 \fB-l\fP
-.in +7
 List.
-.sp
+
 List all imported GNBDs. If no options are specified, this is the default 
 action. There are eight fields for each device: Device name, Minor #, 
 Proc name, Server, Port, State, Readonly and Sectors. The Device name is the
@@ -94,59 +77,47 @@
 to the server, but which haven't been completed yet. Readonly tells whether
 the gnbd server exported this device as readonly. Sectors is the device size
 in 512 bytes sectors.
-.sp
-.in
+.TP
 \fB-O\fP
-.in +7
 Override
-.sp
+
 This makes gnbd_import run in non-interactive mode.  It will no longer prompt
 the user before attempting unsafe actions.  It is recommended that you do
 not use this option.
-.sp
-.in
+.TP
 \fB-p\fP
-.in +7
 Port.
-.sp
+
 Change the port to connect to on the server.  This option is used with 
 \fB-c\fP, \fB-e\fP and \fB-i\fP.  If the port option is not set, gnbd_import
 will try to connect to port 14567 on the server machine to find the gnbd_serv 
 daemon. You should only need to use this if you have changed the gnbd_serv
 port from its default.
-.sp
-.in
+.TP
 \fB-q\fP
-.in +7
 Quiet mode.
-.sp
+
 Only print out errors or questions.
-.sp
-.in
+.TP
 \fB-R\fP
-.in +7
 Remove All.
-.sp
+
 Remove all of the imported GNBDs from the system. Only GNBDs that are in the 
 \fBClosed Disconnected Clear\fP state can be removed (See the \fB-l\fP 
 option), unless \fB-O\fP is used.  Remove All stops after the first failed
 remove.
-.sp
-.in
+.TP
 \fB-r\fP [\fIGNBD\fR | \fILIST\fR]
-.in +7
 Remove.
-.sp
+
 Remove named GNBD(s) from system.  Only GNBDs that are in the \fBClosed 
 Disconnected Clear\fP state can be removed (See the \fB-l\fP option), unless
 the \fB-O\fP option is used.
 Remove stops after the first failed remove. 
-.sp
-.in
+.TP
 \fB-s\fP \fIhost\fR
-.in +7
 Fence.
-.sp
+
 IO fence the specified host. This command is generally invoked by \fBfenced\fP.
 \fBWARNING\fP It is
 not always possible to seamlessly reconnect a client that has had its
@@ -154,45 +125,33 @@
 what you are doing. See the \fB-t\fP option for more information. Once a host
 is fenced from a server, it will not be able to access any GNBDs on that server
 until it is unfenced (see the \fB-u\fP option).
-.sp
-.in
+.TP
 \fB-t\fP \fIserver\fR
-.in +7
 Fence from Server.
-.sp
+
 Specify a server for the IO fence (only used with the \fB-s\fP option).
-.sp
-.in
+.TP
 \fB-u\fP \fIhost\fR
-.in +7
 Unfence.
-.sp
+
 Unfence the specified host. \fBWARNING\fP: Unfencing a client at the incorrect
 time can result in data corruption.  In normal operation, it should never be
 necessary to run this comman. See the \fB-t\fP option.
-.sp
-.in
+.TP
 \fB-V\fP
-.in +7
 Version information.
-.sp
+
 Print out version information.
-.sp
-.in
+.TP
 \fB-v\fP
-.in +7
 Verbose output.
-.sp
+
 Print additional messages during the operation of gnbd_import.
-.sp
-.in -14
-\fBSEE ALSO\fP
-.in +7
+
+.SH SEE ALSO
 gnbd_export(8)
-.sp
-.in
-\fBBUGS\fP
-.in +7
+
+.SH BUGS
 A computer should not import a GNBD device that it exports.  Any 
 significant amount of IO on that device will cause a kernel deadlock. This is 
 a problem common to most NBDs. Instead, the underlying device should be
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/d386b4ec/attachment.sig>

From bujan at isqsolutions.com  Thu Feb 17 15:47:55 2005
From: bujan at isqsolutions.com (Manuel Bujan)
Date: Thu, 17 Feb 2005 10:47:55 -0500
Subject: [Linux-cluster] Performance tunning
Message-ID: <000501c51508$10f23e80$5001a8c0@pcbujan>

Hello,

Is there anyway to improve the performance of a cman based gfs cluster 
installation tunning some of its parameters ?

We are using the dlm style locking mechanism in a two node 100 GB shared gfs 
cluster with a DELL powervault 220s storage.
We already disabled the atime updates and  the quotas_account but the gain 
in performance is not too significant.

Any other hints ?

Regards
Bujan 


From mtilstra at redhat.com  Thu Feb 17 16:20:44 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Thu, 17 Feb 2005 10:20:44 -0600
Subject: [Linux-cluster] lock_gulm heartbeat
In-Reply-To: <1108593999.26259.40.camel@storm.cp-direct.com>
References: <1108593999.26259.40.camel@storm.cp-direct.com>
Message-ID: <4214C45C.6060408@redhat.com>

Edward Mann wrote:
> Everyone,
> 
> 
> I have a gfs cluster that has been running fine for about 3 months now.
> I am only using 2 machines and the storage is a firewire drive. Over the
> weekend i started to get:
> lock_gulmd_core[1100]: Failed to receive a timely heartbeat reply from
> Master. (t:1108583425506998 mb:1)
> 
> and after 2, which is what i allowed for the missed heart beats, the gfs
> slave would die. I moved the missed heartbeat up to 5 and have seen it
> miss as many as 4 in a row. The only thing that has changed on the
> machine is that i add new clients to process files once they are placed
> on the machine. I am using FAM to notify my app that a new file is
> present.
> 
> Any ideas on what i should look at? How can i diagnose this problem. The
> communication between the two machines seems fine. I can ping both
> hosts. I am really at a loss at to what to look for.

It sounds like it might be a load problem.  The node is busy doing other 
work and it doesn't give enough time to the gulm core process to do 
heartbeats.  You should try increacing the heartbeat_rate some.  And 
maybe nicing the lock_gulmd processes some.

If you want to try tacking all of the heartbeat messages, set the 
'heartbeat' verbosity flag.

-- 
Michael Conrad Tadpol Tilstra
AH! Get off my leg! You're Not my type!!!!
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 256 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/bc02ec9f/attachment.sig>

From teigland at redhat.com  Thu Feb 17 16:45:32 2005
From: teigland at redhat.com (David Teigland)
Date: Fri, 18 Feb 2005 00:45:32 +0800
Subject: [Linux-cluster] Performance tunning
In-Reply-To: <000501c51508$10f23e80$5001a8c0@pcbujan>
References: <000501c51508$10f23e80$5001a8c0@pcbujan>
Message-ID: <20050217164532.GB6389@redhat.com>

On Thu, Feb 17, 2005 at 10:47:55AM -0500, Manuel Bujan wrote:
> Hello,
> 
> Is there anyway to improve the performance of a cman based gfs cluster 
> installation tunning some of its parameters ?
> 
> We are using the dlm style locking mechanism in a two node 100 GB shared 
> gfs cluster with a DELL powervault 220s storage.
> We already disabled the atime updates and  the quotas_account but the gain 
> in performance is not too significant.
> 
> Any other hints ?

Look at /proc/cluster/lock_dlm/drop_count

It's 50000 by default and increasing it could possibly improve things,
or disable it altogether by setting to zero.
e.g.  echo "0" >> /proc/cluster/lock_dlm/drop_count

You need to change this prior to mounting gfs on each node.  When
non-zero, GFS/lock_dlm tries to keep the number of locks held locally
below this level which limits the caching gfs can do.  When zero, no
limiting is attempted.

-- 
Dave Teigland  <teigland at redhat.com>


From bastian at waldi.eu.org  Thu Feb 17 16:44:50 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Thu, 17 Feb 2005 17:44:50 +0100
Subject: [Linux-cluster] expected vote update is weird
Message-ID: <20050217164450.GA23870@wavehammer.waldi.eu.org>

Hi folks

Why can't I set the expected votes count to a higher count than current?

| # cman_tool status
| Nodes: 3
| Expected_votes: 8
| Total_votes: 3
| Quorum: 5  Activity blocked
| # cman_tool expected -e 3
| # cman_tool status
| Nodes: 2
| Expected_votes: 3
| Total_votes: 2
| Quorum: 2
| # cman_tool expected -e 8
| cman_tool: can't set expected votes

But after joining another node, it is reset to 8.

| # cman_tool status
| Nodes: 3
| Expected_votes: 8
| Total_votes: 3
| Quorum: 5  Activity blocked

This is realy weird, I can't set the value by hand, but the joining node
is able to do that.

Bastian

-- 
Intuition, however illogical, is recognized as a command prerogative.
		-- Kirk, "Obsession", stardate 3620.7
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/985d3e33/attachment.sig>

From bastian at waldi.eu.org  Thu Feb 17 18:35:34 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Thu, 17 Feb 2005 19:35:34 +0100
Subject: [Linux-cluster] [PATCH 0/4] fence
Message-ID: <20050217183534.GB23870@wavehammer.waldi.eu.org>

Patches:
1. Use bools instead of crude defines.
2. Remove GNU-isms.
3. Add option to fence_tool to not wait for quorum.
4. Wait for the join complete event in fenced.

Bastian

-- 
We do not colonize.  We conquer.  We rule.  There is no other way for us.
		-- Rojan, "By Any Other Name", stardate 4657.5
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/99baca48/attachment.sig>

From bastian at waldi.eu.org  Thu Feb 17 18:35:45 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Thu, 17 Feb 2005 19:35:45 +0100
Subject: [Linux-cluster] [PATCH 2/4] fence - Remove GNU-isms
Message-ID: <20050217183545.GD23870@wavehammer.waldi.eu.org>

This patch removes the GNU-isms in the die defines and replaces them
with inline functions.

Bastian

-- 
She won' go Warp 7, Cap'n!  The batteries are dead!
-------------- next part --------------
diff -urN -x CVS cvs-patch01-bool/fenced/fd.h cvs-patch02-gnuism/fenced/fd.h
--- cvs-patch01-bool/fenced/fd.h	2005-02-17 18:43:21.000000000 +0100
+++ cvs-patch02-gnuism/fenced/fd.h	2005-02-17 18:43:29.000000000 +0100
@@ -15,6 +15,7 @@
 #define __FD_DOT_H__
 
 #include <pthread.h>
+#include <stdarg.h>
 #include <stdbool.h>
 #include <stdio.h>
 #include <stdlib.h>
@@ -57,24 +58,30 @@
 #define MAX_NAME_LEN	33
 
 /* use this one before we fork into the background */
-#define die1(fmt, args...) \
-do \
-{ \
-  fprintf(stderr, "%s: ", prog_name); \
-  fprintf(stderr, fmt "\n", ##args); \
-  exit(EXIT_FAILURE); \
-} \
-while (0)
+static inline void die1(const char *fmt, ...)
+{
+	va_list ap;
+	fputs(prog_name, stderr);
+	fputs(": ", stderr);
+	va_start(ap, fmt);
+	vfprintf(stderr, fmt, ap);
+	va_end(ap);
+	fputs("\n", stderr);
+	exit(EXIT_FAILURE);
+}
 
-#define die(fmt, args...) \
-do \
-{ \
-  fprintf(stderr, "%s: ", prog_name); \
-  fprintf(stderr, fmt "\n", ##args); \
-  syslog(LOG_ERR, fmt, ##args); \
-  exit(EXIT_FAILURE); \
-} \
-while (0)
+static inline void die(const char *fmt, ...)
+{
+	va_list ap;
+	fputs(prog_name, stderr);
+	fputs(": ", stderr);
+	va_start(ap, fmt);
+	vfprintf(stderr, fmt, ap);
+	fputs("\n", stderr);
+	vsyslog(LOG_ERR, fmt, ap);
+	va_end(ap);
+	exit(EXIT_FAILURE);
+}
 
 #define FENCE_ASSERT(x, todo) \
 do \
diff -urN -x CVS cvs-patch01-bool/fence_node/fence_node.c cvs-patch02-gnuism/fence_node/fence_node.c
--- cvs-patch01-bool/fence_node/fence_node.c	2005-02-10 06:33:28.000000000 +0100
+++ cvs-patch02-gnuism/fence_node/fence_node.c	2005-02-17 18:25:38.000000000 +0100
@@ -11,6 +11,7 @@
 *******************************************************************************
 ******************************************************************************/
 
+#include <stdarg.h>
 #include <stdio.h>
 #include <stdlib.h>
 #include <unistd.h>
@@ -22,21 +23,24 @@
 
 #define OPTION_STRING           ("hOuV")
 
-#define die(fmt, args...) \
-do \
-{ \
-  fprintf(stderr, "%s: ", prog_name); \
-  fprintf(stderr, fmt "\n", ##args); \
-  exit(EXIT_FAILURE); \
-} \
-while (0)
-
 static char *prog_name;
 static int unfence;
 static int force;
 
 int dispatch_fence_agent(int cd, char *victim, int in);
 
+static inline void die(const char *fmt, ...)
+{
+	va_list ap;
+	fputs(prog_name, stderr);
+	fputs(": ", stderr);
+	va_start(ap, fmt);
+	vfprintf(stderr, fmt, ap);
+	va_end(ap);
+	fputs("\n", stderr);
+	exit(EXIT_FAILURE);
+}
+
 static void print_usage(void)
 {
 	printf("Usage:\n");
diff -urN -x CVS cvs-patch01-bool/fence_tool/fence_tool.c cvs-patch02-gnuism/fence_tool/fence_tool.c
--- cvs-patch01-bool/fence_tool/fence_tool.c	2005-02-17 18:09:30.000000000 +0100
+++ cvs-patch02-gnuism/fence_tool/fence_tool.c	2005-02-17 18:19:19.000000000 +0100
@@ -17,6 +17,7 @@
 #include <stddef.h>
 #include <signal.h>
 #include <string.h>
+#include <stdarg.h>
 #include <stdbool.h>
 #include <stdint.h>
 #include <sys/ioctl.h>
@@ -42,16 +43,6 @@
 #define OP_MONITOR			3
 #define OP_WAIT				4
 
-
-#define die(fmt, args...) \
-do \
-{ \
-  fprintf(stderr, "%s: ", prog_name); \
-  fprintf(stderr, fmt "\n", ##args); \
-  exit(EXIT_FAILURE); \
-} \
-while (0)
-
 char *prog_name;
 bool debug = false;
 int operation;
@@ -62,6 +53,17 @@
 
 int dispatch_fence_agent(int cd, char *victim, int in);
 
+static inline void die(const char *fmt, ...)
+{
+	va_list ap;
+	fputs(prog_name, stderr);
+	fputs(": ", stderr);
+	va_start(ap, fmt);
+	vfprintf(stderr, fmt, ap);
+	va_end(ap);
+	fputs("\n", stderr);
+	exit(EXIT_FAILURE);
+}
 
 static int check_mounted(void)
 {
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/b835bf86/attachment.sig>

From bastian at waldi.eu.org  Thu Feb 17 18:35:40 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Thu, 17 Feb 2005 19:35:40 +0100
Subject: [Linux-cluster] [PATCH 1/4] fence - Use bools instead of crude
	defines
Message-ID: <20050217183540.GC23870@wavehammer.waldi.eu.org>

This patch uses C99 bools instead of crude defines.

Bastian

-- 
Virtue is a relative term.
		-- Spock, "Friday's Child", stardate 3499.1
-------------- next part --------------
diff -urN -x CVS cvs/fenced/fd.h cvs-patch01-bool/fenced/fd.h
--- cvs/fenced/fd.h	2005-02-15 04:58:00.000000000 +0100
+++ cvs-patch01-bool/fenced/fd.h	2005-02-17 18:43:21.000000000 +0100
@@ -14,12 +14,8 @@
 #ifndef __FD_DOT_H__
 #define __FD_DOT_H__
 
-#ifndef TRUE
-#define TRUE 1
-#define FALSE 0
-#endif
-
 #include <pthread.h>
+#include <stdbool.h>
 #include <stdio.h>
 #include <stdlib.h>
 #include <stddef.h>
@@ -44,7 +40,7 @@
 
 
 extern char *             prog_name;
-extern int                fenced_debug;
+extern bool               fenced_debug;
 extern int                debug_sock;
 extern char               debug_buf[256];
 extern struct sockaddr_un debug_addr;
@@ -129,10 +125,10 @@
 	int debug;
 	int post_join_delay;
 	int post_fail_delay;
-	int8_t clean_start;
-	int8_t post_join_delay_opt;
-	int8_t post_fail_delay_opt;
-	int8_t clean_start_opt;
+	int clean_start;
+	bool post_join_delay_opt;
+	bool post_fail_delay_opt;
+	bool clean_start_opt;
 };
 
 #define FDFL_RUN        (0)
@@ -150,7 +146,7 @@
 	int 			last_start;
 	int 			last_finish;
 
-	int			first_recovery;
+	bool			first_recovery;
 	int 			prev_count;
 	struct list_head 	prev;
 	struct list_head 	victims;
diff -urN -x CVS cvs/fenced/main.c cvs-patch01-bool/fenced/main.c
--- cvs/fenced/main.c	2005-02-15 04:56:15.000000000 +0100
+++ cvs-patch01-bool/fenced/main.c	2005-02-17 18:13:07.000000000 +0100
@@ -391,7 +391,7 @@
 
 	/* If an option was set on the command line, don't set it from ccs. */
 
-	if (fd->comline->clean_start_opt == FALSE) {
+	if (fd->comline->clean_start_opt == false) {
 		str = NULL;
 	        memset(path, 0, 256);
 	        sprintf(path, "/cluster/fence_daemon/@clean_start");
@@ -405,7 +405,7 @@
 			free(str);
 	}
 
-	if (fd->comline->post_join_delay_opt == FALSE) {
+	if (fd->comline->post_join_delay_opt == false) {
 		str = NULL;
 	        memset(path, 0, 256);
 	        sprintf(path, "/cluster/fence_daemon/@post_join_delay");
@@ -419,7 +419,7 @@
 			free(str);
 	}
 
-	if (fd->comline->post_fail_delay_opt == FALSE) {
+	if (fd->comline->post_fail_delay_opt == false) {
 		str = NULL;
 	        memset(path, 0, 256);
 	        sprintf(path, "/cluster/fence_daemon/@post_fail_delay");
@@ -508,7 +508,7 @@
 	fd->namelen = namelen;
 
 	fd->comline = comline;
-	fd->first_recovery = FALSE;
+	fd->first_recovery = false;
 	fd->last_stop = 0;
 	fd->last_start = 0;
 	fd->last_finish = 0;
@@ -523,12 +523,12 @@
 
 static void decode_arguments(int argc, char **argv, commandline_t *comline)
 {
-	int cont = TRUE;
+	bool cont = true;
 	int optchar;
 
-	comline->post_join_delay_opt = FALSE;
-	comline->post_fail_delay_opt = FALSE;
-	comline->clean_start_opt = FALSE;
+	comline->post_join_delay_opt = false;
+	comline->post_fail_delay_opt = false;
+	comline->clean_start_opt = false;
 
 	while (cont) {
 		optchar = getopt(argc, argv, OPTION_STRING);
@@ -537,22 +537,22 @@
 
 		case 'c':
 			comline->clean_start = 1;
-			comline->clean_start_opt = TRUE;
+			comline->clean_start_opt = true;
 			break;
 
 		case 'j':
 			comline->post_join_delay = atoi(optarg);
-			comline->post_join_delay_opt = TRUE;
+			comline->post_join_delay_opt = true;
 			break;
 
 		case 'f':
 			comline->post_fail_delay = atoi(optarg);
-			comline->post_fail_delay_opt = TRUE;
+			comline->post_fail_delay_opt = true;
 			break;
 
 		case 'D':
-			comline->debug = TRUE;
-			fenced_debug = TRUE;
+			comline->debug = true;
+			fenced_debug = true;
 			break;
 
 		case 'n':
@@ -584,7 +584,7 @@
 			break;
 
 		case EOF:
-			cont = FALSE;
+			cont = false;
 			break;
 
 		default:
@@ -658,7 +658,7 @@
 }
 
 char *prog_name;
-int fenced_debug;
+bool fenced_debug = false;
 int debug_sock;
 char debug_buf[256];
 struct sockaddr_un debug_addr;
diff -urN -x CVS cvs/fenced/recover.c cvs-patch01-bool/fenced/recover.c
--- cvs/fenced/recover.c	2005-02-10 06:33:29.000000000 +0100
+++ cvs-patch01-bool/fenced/recover.c	2005-02-17 18:15:56.000000000 +0100
@@ -69,14 +69,14 @@
 	return node;
 }
 
-static int name_equal(fd_node_t *node1, struct cl_cluster_node *node2)
+static bool name_equal(fd_node_t *node1, struct cl_cluster_node *node2)
 {
 	char name1[64], name2[64];
 	int i, len1, len2;
 
 	if ((node1->namelen == strlen(node2->name) &&
 	     !strncmp(node1->name, node2->name, node1->namelen)))
-		return TRUE;
+		return true;
 
 	memset(name1, 0, 64);
 	memset(name2, 0, 64);
@@ -98,9 +98,9 @@
 	}
 
 	if (!strncmp(name1, name2, strlen(name1)))
-		return TRUE;
+		return true;
 
-	return FALSE;
+	return false;
 }
 
 static uint32_t next_complete_nodeid(fd_t *fd, uint32_t gt)
@@ -155,7 +155,7 @@
 	return low;
 }
 
-static int can_avert_fence(fd_t *fd, fd_node_t *victim)
+static bool can_avert_fence(fd_t *fd, fd_node_t *victim)
 {
 	struct cl_cluster_node cl_node;
 	int error;
@@ -166,15 +166,15 @@
 
 	error = ioctl(fd->cl_sock, SIOCCLUSTER_GETNODE, &cl_node);
 	if (error < 0)
-		return FALSE;
+		return false;
 
 	log_debug("state of node %s is %d", victim->name, cl_node.state);
 
 	if (cl_node.state == NODESTATE_MEMBER ||
 	    cl_node.state == NODESTATE_JOINING)
-		return TRUE;
+		return true;
 
-        return FALSE;
+        return false;
 }
 
 static void free_node_list(struct list_head *head)
@@ -230,7 +230,7 @@
 	fd->prev_count = ev->node_count;
 }
 
-static int in_cl_nodes(struct cl_cluster_node *cl_nodes, fd_node_t *node,
+static bool in_cl_nodes(struct cl_cluster_node *cl_nodes, fd_node_t *node,
 		       int num_nodes)
 {
 	struct cl_cluster_node *cl_node = cl_nodes;
@@ -238,10 +238,10 @@
 
 	for (i = 0; i < num_nodes; i++) {
 		if (name_equal(node, cl_node))
-			return TRUE;
+			return true;
 		cl_node++;
 	}
-	return FALSE;
+	return false;
 }
 
 static int get_members(fd_t *fd, struct cl_cluster_node **cl_nodes)
@@ -308,7 +308,7 @@
 	free(cl_nodes);
 }
 
-static int id_in_cl_nodes(struct cl_cluster_node *cl_nodes, uint32_t nodeid,
+static bool id_in_cl_nodes(struct cl_cluster_node *cl_nodes, uint32_t nodeid,
 			  int num_nodes)
 {
 	struct cl_cluster_node *cl_node = cl_nodes;
@@ -316,10 +316,10 @@
 
 	for (i = 0; i < num_nodes; i++) {
 		if (nodeid == cl_node->node_id)
-			return TRUE;
+			return true;
 		cl_node++;
 	}
-	return FALSE;
+	return false;
 }
 
 static int list_count(struct list_head *head)
@@ -522,7 +522,7 @@
 	if (!fd->last_finish && fd->last_stop) {
 		log_debug("revert aborted first start");
 		fd->last_stop = 0;
-		fd->first_recovery = FALSE;
+		fd->first_recovery = false;
 		free_prev(fd);
 		free_victims(fd);
 		free_leaving(fd);
@@ -532,7 +532,7 @@
 		  fd->last_stop, fd->last_start, fd->last_finish);
 
 	if (!fd->first_recovery) {
-		fd->first_recovery = TRUE;
+		fd->first_recovery = true;
 		add_first_victims(fd);
 	} else
 		add_victims(fd, ev, cl_nodes);
diff -urN -x CVS cvs/fence_tool/fence_tool.c cvs-patch01-bool/fence_tool/fence_tool.c
--- cvs/fence_tool/fence_tool.c	2005-02-15 04:54:40.000000000 +0100
+++ cvs-patch01-bool/fence_tool/fence_tool.c	2005-02-17 18:09:30.000000000 +0100
@@ -17,6 +17,7 @@
 #include <stddef.h>
 #include <signal.h>
 #include <string.h>
+#include <stdbool.h>
 #include <stdint.h>
 #include <sys/ioctl.h>
 #include <sys/types.h>
@@ -32,11 +33,6 @@
 #include "ccs.h"
 #include "copyright.cf"
 
-#ifndef TRUE
-#define TRUE 1
-#define FALSE 0
-#endif
-
 #define OPTION_STRING			("VhScj:f:Dw")
 #define LOCKFILE_NAME                   "/var/run/fenced.pid"
 #define FENCED_SOCK_PATH                "fenced_socket"
@@ -57,10 +53,10 @@
 while (0)
 
 char *prog_name;
-int debug;
+bool debug = false;
 int operation;
-int skip_unfence;
-int child_wait;
+bool skip_unfence = false;
+bool child_wait = false;
 int cl_sock;
 char our_name[MAX_CLUSTER_MEMBER_NAME_LEN+1];
 
@@ -380,7 +376,7 @@
 
 static void decode_arguments(int argc, char *argv[])
 {
-	int cont = TRUE;
+	bool cont = true;
 	int optchar;
 
 	while (cont) {
@@ -401,15 +397,15 @@
 			break;
 
 		case 'S':
-			skip_unfence = TRUE;
+			skip_unfence = true;
 			break;
 
 		case 'D':
-			debug = TRUE;
+			debug = true;
 			break;
 
 		case 'w':
-			child_wait = TRUE;
+			child_wait = true;
 			break;
 
 		case ':':
@@ -419,7 +415,7 @@
 			break;
 
 		case EOF:
-			cont = FALSE;
+			cont = false;
 			break;
 
 		case 'c':
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/a25b8465/attachment.sig>

From bastian at waldi.eu.org  Thu Feb 17 18:35:54 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Thu, 17 Feb 2005 19:35:54 +0100
Subject: [Linux-cluster] [PATCH 4/4] fence - Wait for the join complete
	event in fenced
Message-ID: <20050217183554.GF23870@wavehammer.waldi.eu.org>

This patch changes the -w option to wait for the join finish event in
fenced. It uses a pipe and wait for its hangup event. It currently lacks
a timeout to make it usable in init scripts without a startup stale.

Bastian

-- 
Wait!  You have not been prepared!
		-- Mr. Atoz, "Tomorrow is Yesterday", stardate 3113.2
-------------- next part --------------
diff -urN -x CVS -x debian cvs-patch03-quorum/fenced/fd.h cvs-patch04-wait/fenced/fd.h
--- cvs-patch03-quorum/fenced/fd.h	2005-02-17 18:43:36.000000000 +0100
+++ cvs-patch04-wait/fenced/fd.h	2005-02-17 19:05:48.000000000 +0100
@@ -136,6 +136,7 @@
 	bool post_join_delay_opt;
 	bool post_fail_delay_opt;
 	bool clean_start_opt;
+	bool wait_opt;
 };
 
 #define FDFL_RUN        (0)
@@ -145,6 +146,7 @@
 struct fd {
 	struct commandline	*comline;
 	int			cl_sock;
+	int			wait_fds[2];
 	uint32_t 		our_nodeid;
 	uint32_t 		local_id;	/* local unique fd ID */
 	uint32_t 		global_id;	/* global unique fd ID */
@@ -152,6 +154,7 @@
 	int 			last_stop;
 	int 			last_start;
 	int 			last_finish;
+	int 			last_start_type;
 
 	bool			first_recovery;
 	int 			prev_count;
diff -urN -x CVS -x debian cvs-patch03-quorum/fenced/main.c cvs-patch04-wait/fenced/main.c
--- cvs-patch03-quorum/fenced/main.c	2005-02-17 18:34:17.000000000 +0100
+++ cvs-patch04-wait/fenced/main.c	2005-02-17 19:05:04.000000000 +0100
@@ -15,6 +15,7 @@
 #include "ccs.h"
 #include "copyright.cf"
 
+#include <sys/poll.h>
 
 /* static pthread_t recv_thread; */
 static int quit;
@@ -239,6 +240,7 @@
 
 	if (ev->type == SERVICE_EVENT_START) {
 		fd->last_start = ev->event_id;
+		fd->last_start_type = ev->start_type;
 
 		/* space for two extra to be sure it's not too small */
 		n = ev->node_count + 2;
@@ -276,6 +278,11 @@
 	else if (ev->type == SERVICE_EVENT_FINISH) {
 		fd->last_finish = ev->event_id;
 		do_recovery_done(fd);
+		/* Report successfull join to parent */
+		if (fd->wait_fds[1] != -1 && fd->last_start_type == SERVICE_START_JOIN) {
+			close(fd->wait_fds[1]);
+			fd->wait_fds[1] = -1;
+		}
 	}
 }
 
@@ -518,6 +525,11 @@
 	INIT_LIST_HEAD(&fd->leaving);
 	INIT_LIST_HEAD(&fd->complete);
 
+	if (!comline->wait_opt)
+		fd->wait_fds[0] = fd->wait_fds[1] = -1;
+	else if (pipe(fd->wait_fds) == -1)
+		die("can't allocate pipe");
+
 	return fd;
 }
 
@@ -559,6 +571,10 @@
 			strncpy(comline->name, optarg, MAX_NAME_LEN);
 			break;
 
+		case 'w':
+			comline->wait_opt = true;
+			break;
+
 		case 'h':
 			print_usage();
 			exit(EXIT_SUCCESS);
@@ -572,7 +588,6 @@
 			break;
 
 		case 'S':
-		case 'w':
 		case 'Q':
 			/* do nothing, this is a fence_tool option that
 			   we ignore when fence_tool starts us */
@@ -639,8 +654,22 @@
 			perror("main: cannot fork");
 			exit(EXIT_FAILURE);
 		}
-		if (pid)
-			exit(EXIT_SUCCESS);
+		else if (pid)
+		{
+			if (comline.wait_opt)
+			{
+				close(fd->wait_fds[1]);
+				struct pollfd fds[] = {
+					{ fd->wait_fds[0], POLLIN, 0 }
+				};
+				int ret = poll(fds, 1, -1);
+				if (ret < 0)
+					return EXIT_FAILURE;
+			}
+			return EXIT_SUCCESS;
+		}
+		if (comline.wait_opt)
+			close(fd->wait_fds[0]);
 		setsid();
 		chdir("/");
 		umask(0);
diff -urN -x CVS -x debian cvs-patch03-quorum/fence_tool/fence_tool.c cvs-patch04-wait/fence_tool/fence_tool.c
--- cvs-patch03-quorum/fence_tool/fence_tool.c	2005-02-17 18:39:30.000000000 +0100
+++ cvs-patch04-wait/fence_tool/fence_tool.c	2005-02-17 18:41:39.000000000 +0100
@@ -47,7 +47,6 @@
 bool debug = false;
 int operation;
 bool skip_unfence = false;
-bool child_wait = false;
 bool wait_for_quorum = true;
 int cl_sock;
 char our_name[MAX_CLUSTER_MEMBER_NAME_LEN+1];
@@ -280,19 +279,6 @@
 	if (debug)
 		printf("%s: start fenced\n", prog_name);
 
-	if (!debug && child_wait) {
-		int status;
-		pid_t pid = fork();
-		/* parent waits for fenced to join */
-		if (pid > 0) {
-			waitpid(pid, &status, 0);
-			if (WIFEXITED(status) && !WEXITSTATUS(status))
-				do_wait();
-			exit(EXIT_SUCCESS);
-		}
-		/* child execs fenced */
-	}
-
 	strcpy(argv[0], "fenced");
 	argv[argc - 1] = NULL;
 
@@ -376,7 +362,6 @@
 	printf("  wait             Wait for node to be member of default fence domain\n");
 	printf("\n");
 	printf("Options:\n");
-	printf("  -w               Wait for join to complete\n");
 	printf("  -V               Print program version information, then exit\n");
 	printf("  -h               Print this help, then exit\n");
 	printf("  -S               Skip self unfencing on join\n");
@@ -388,6 +373,7 @@
 	printf("  -c               All nodes are in a clean state to start\n");
 	printf("  -j <secs>        Post-join fencing delay\n");
 	printf("  -f <secs>        Post-fail fencing delay\n");
+	printf("  -w               Wait for join to complete\n");
 	printf("\n");
 }
 
@@ -421,10 +407,6 @@
 			debug = true;
 			break;
 
-		case 'w':
-			child_wait = true;
-			break;
-
 		case 'Q':
 			wait_for_quorum = false;
 			break;
@@ -440,8 +422,9 @@
 			break;
 
 		case 'c':
-		case 'j':
 		case 'f':
+		case 'j':
+		case 'w':
 			/* Do nothing, just pass these options on to fenced */
 			break;
 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/94f9479d/attachment.sig>

From bastian at waldi.eu.org  Thu Feb 17 18:35:49 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Thu, 17 Feb 2005 19:35:49 +0100
Subject: [Linux-cluster] [PATCH 3/4] fence - Add option to fence_tool to not
	wait for quorum
Message-ID: <20050217183549.GE23870@wavehammer.waldi.eu.org>

The attached patch makes it possible to ask fence_tool to not wait for
quorum and just die in this case. This makes it easier to call
fence_tool join in init scripts without the problem of a blocked
startup.

Bastian

-- 
Bones: "The man's DEAD, Jim!"
-------------- next part --------------
diff -urN -x CVS -x debian cvs-patch02-gnuism/fenced/main.c cvs-patch03-quorum/fenced/main.c
--- cvs-patch02-gnuism/fenced/main.c	2005-02-17 18:13:07.000000000 +0100
+++ cvs-patch03-quorum/fenced/main.c	2005-02-17 18:34:17.000000000 +0100
@@ -23,7 +23,7 @@
 char our_name[MAX_CLUSTER_MEMBER_NAME_LEN+1];
 
 
-#define OPTION_STRING			("cj:f:Dn:hVSw")
+#define OPTION_STRING			("cj:f:Dn:hVSwQ")
 #define LOCKFILE_NAME			"/var/run/fenced.pid"
 
 
@@ -573,6 +573,7 @@
 
 		case 'S':
 		case 'w':
+		case 'Q':
 			/* do nothing, this is a fence_tool option that
 			   we ignore when fence_tool starts us */
 			break;
diff -urN -x CVS -x debian cvs-patch02-gnuism/fence_tool/fence_tool.c cvs-patch03-quorum/fence_tool/fence_tool.c
--- cvs-patch02-gnuism/fence_tool/fence_tool.c	2005-02-17 18:19:19.000000000 +0100
+++ cvs-patch03-quorum/fence_tool/fence_tool.c	2005-02-17 18:39:30.000000000 +0100
@@ -34,7 +34,7 @@
 #include "ccs.h"
 #include "copyright.cf"
 
-#define OPTION_STRING			("VhScj:f:Dw")
+#define OPTION_STRING			("VhScj:f:DwQ")
 #define LOCKFILE_NAME                   "/var/run/fenced.pid"
 #define FENCED_SOCK_PATH                "fenced_socket"
 
@@ -48,6 +48,7 @@
 int operation;
 bool skip_unfence = false;
 bool child_wait = false;
+bool wait_for_quorum = true;
 int cl_sock;
 char our_name[MAX_CLUSTER_MEMBER_NAME_LEN+1];
 
@@ -188,7 +189,7 @@
  * join/leave process.)
  */
 
-static int wait_quorum(void)
+static bool check_quorum(void)
 {
 	int rv, i = 0;
 
@@ -202,15 +203,15 @@
 
 		rv = ioctl(cl_sock, SIOCCLUSTER_ISQUORATE, NULL);
 		if (rv)
-			break;
+			return true;
+		else if (!wait_for_quorum)
+			return false;
 
 		sleep(1);
 
 		if (++i > 9 && !(i % 10))
 			printf("%s: waiting for cluster quorum\n", prog_name);
 	}
-
-	return 0;
 }
 
 /*
@@ -246,18 +247,22 @@
 		sleep(1);
 		rewind(file);
 	}
-                        
+
  out:
 	fclose(file);
-	return 0;
+
+	return EXIT_SUCCESS;
 }
 
-static void do_join(int argc, char *argv[])
+static int do_join(int argc, char *argv[])
 {
 	int cd;
 
 	setup_sock();
-	wait_quorum();
+
+	if (!check_quorum())
+		return EXIT_FAILURE;
+
 	get_our_name();
 	close(cl_sock);
 	cd = check_ccs();
@@ -293,9 +298,11 @@
 
 	execvp("fenced", argv);
 	die("starting fenced failed");
+
+	return EXIT_FAILURE;
 }
 
-static void do_leave(void)
+static int do_leave(void)
 {
 	FILE *f;
 	char buf[33] = "";
@@ -314,13 +321,18 @@
 
 	check_mounted();
 	setup_sock();
-	wait_quorum();
+
+	if (!check_quorum())
+		return EXIT_FAILURE;
+
 	close(cl_sock);
 
 	kill(pid, SIGTERM);
+
+	return EXIT_SUCCESS;
 }
 
-static void do_monitor(void)
+static int do_monitor(void)
 {
 	int sfd, error, rv;
 	struct sockaddr_un addr;
@@ -348,6 +360,8 @@
 
 		printf("%s", buf);
 	}
+
+	return EXIT_SUCCESS;
 }
 
 static void print_usage(void)
@@ -367,6 +381,7 @@
 	printf("  -h               Print this help, then exit\n");
 	printf("  -S               Skip self unfencing on join\n");
 	printf("  -D               Enable debugging, don't fork (also passed to fenced)\n");
+        printf("  -Q               Don't wait if cluster is not quorate\n");
 	printf("\n");
 	printf("Fenced options:\n");
 	printf("  these are passed on to fenced when it's started\n");
@@ -410,6 +425,10 @@
 			child_wait = true;
 			break;
 
+		case 'Q':
+			wait_for_quorum = false;
+			break;
+
 		case ':':
 		case '?':
 			fprintf(stderr, "Please use '-h' for usage.\n");
@@ -458,18 +477,14 @@
 
 	switch (operation) {
 	case OP_JOIN:
-		do_join(argc, argv);
-		break;
+		return do_join(argc, argv);
 	case OP_LEAVE:
-		do_leave();
-		break;
+		return do_leave();
 	case OP_MONITOR:
-		do_monitor();
-		break;
+		return do_monitor();
 	case OP_WAIT:
-		do_wait();
-		break;
+		return do_wait();
 	}
 
-	exit(EXIT_SUCCESS);
+	return EXIT_FAILURE;
 }
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/6e5d9866/attachment.sig>

From daniel at osdl.org  Thu Feb 17 19:11:04 2005
From: daniel at osdl.org (Daniel McNeil)
Date: Thu, 17 Feb 2005 11:11:04 -0800
Subject: [Linux-cluster] mount hang in kcl_join_service
In-Reply-To: <20050217062605.GA6389@redhat.com>
References: <1108597177.9361.23.camel@ibm-c.pdx.osdl.net>
	<20050217062605.GA6389@redhat.com>
Message-ID: <1108667464.9361.35.camel@ibm-c.pdx.osdl.net>

On Wed, 2005-02-16 at 22:26, David Teigland wrote:
> On Wed, Feb 16, 2005 at 03:39:37PM -0800, Daniel McNeil wrote:
> > I have not been able to get my tests to run for more than
> > 1 day for the last several tries.  This time my test hung
> > during mount in kcl_join_service().  My test does mount and umount 
> > several times for each test run.  This time it hung on the
> > 22nd test run.  It looks like it was starting a 3node test
> > where a gfs file system is mounted on all 3 nodes and then
> > does a umount/mount 1 node at a time.  So this should have
> > done an umount on cl031 and then hung on a mount on cl031
> > with cl030 and cl032 having the gfs file system still mounted.
> 
> > A bunch of info is available here:
> > http://developer.osdl.org/daniel/GFS/test.11feb2005/
> 
> I've looked through it and can't pinpoint the problem.  Next
> time could you also collect /proc/cluster/lock_dlm/debug and
> /proc/cluster/dlm_debug ?
> 
> I've set up a similar but simplified test on both of my test
> clusters (a 2-node and a 7-node).  I can't dedicate these
> machines for a full 1-2 day stretch this until the weekend,
> though.  My test is a loop around:
> 
> - on each node sequentially: unmount/mount gfs
> - on each node sequentially: run some load for a couple minutes

I started in running again yesterday afternoon.  I'll collect
all the info when I hit a problem.  I still have not
made it past 52 hours running this test.

Thanks for taking a look,

Daniel


From bujan at isqsolutions.com  Thu Feb 17 19:43:59 2005
From: bujan at isqsolutions.com (Manuel Bujan)
Date: Thu, 17 Feb 2005 14:43:59 -0500
Subject: [Linux-cluster] Node hang
Message-ID: <00be01c51529$06c4a440$5001a8c0@pcbujan>

Hello guys,

After 3 days of a heavy read/write test load one of our nodes crash with the 
following error:

Feb 17 12:06:28 atmail-1 kernel: GFS: fsid=ISQCLUSTER:gfs001.0: fatal: 
invalid metadata block
Feb 17 12:06:28 atmail-1 kernel: GFS: fsid=ISQCLUSTER:gfs001.0:   bh = 
13156295 (magic)
Feb 17 12:06:28 atmail-1 kernel: GFS: fsid=ISQCLUSTER:gfs001.0:   function = 
gfs_get_data_buffer
Feb 17 12:06:28 atmail-1 kernel: GFS: fsid=ISQCLUSTER:gfs001.0:   file = 
/usr/src/cluster/gfs-kernel/src/gfs/dio.c, line = 1328
Feb 17 12:06:28 atmail-1 kernel: GFS: fsid=ISQCLUSTER:gfs001.0:   time = 
1108659988
Feb 17 12:06:28 atmail-1 kernel: GFS: fsid=ISQCLUSTER:gfs001.0: about to 
withdraw from the cluster
Feb 17 12:06:28 atmail-1 kernel: GFS: fsid=ISQCLUSTER:gfs001.0: waiting for 
outstanding I/O
Feb 17 12:06:28 atmail-1 kernel: GFS: fsid=ISQCLUSTER:gfs001.0: telling LM 
to withdraw
Feb 17 12:06:35 atmail-1 kernel: lock_dlm: withdraw abandoned memory

We are mounting our GFS partition using the noatime option, and quotas has 
been disabled in order to improve performance. The aplications currently 
running are "postfix, apache, and Courier/Imap".

We are using the CVS version available on Feb 14 around 5:00 PM.

Any light with this matter ?
Is there any way to know which file exactly was trying to read or write the 
server when it crash based on the log ?

Regards
Bujan 


From bastian at waldi.eu.org  Thu Feb 17 20:55:30 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Thu, 17 Feb 2005 21:55:30 +0100
Subject: [Linux-cluster] [PATCH 4.1/4] fence - Wait for the join complete
	event in fenced
Message-ID: <20050217205530.GA20320@wavehammer.waldi.eu.org>

This patch changes -w to get a timeout value and fixes the return value
in case of timeout.

Bastian

-- 
Wait!  You have not been prepared!
		-- Mr. Atoz, "Tomorrow is Yesterday", stardate 3113.2
-------------- next part --------------
diff -urN -x CVS -x debian cvs-patch04-wait/fenced/fd.h cvs-patch041-wait-timeout/fenced/fd.h
--- cvs-patch04-wait/fenced/fd.h	2005-02-17 19:05:48.000000000 +0100
+++ cvs-patch041-wait-timeout/fenced/fd.h	2005-02-17 21:24:37.000000000 +0100
@@ -133,6 +133,7 @@
 	int post_join_delay;
 	int post_fail_delay;
 	int clean_start;
+	int wait;
 	bool post_join_delay_opt;
 	bool post_fail_delay_opt;
 	bool clean_start_opt;
diff -urN -x CVS -x debian cvs-patch04-wait/fenced/main.c cvs-patch041-wait-timeout/fenced/main.c
--- cvs-patch04-wait/fenced/main.c	2005-02-17 19:05:04.000000000 +0100
+++ cvs-patch041-wait-timeout/fenced/main.c	2005-02-17 21:52:08.000000000 +0100
@@ -24,7 +24,7 @@
 char our_name[MAX_CLUSTER_MEMBER_NAME_LEN+1];
 
 
-#define OPTION_STRING			("cj:f:Dn:hVSwQ")
+#define OPTION_STRING			("cj:f:Dn:hVSw:Q")
 #define LOCKFILE_NAME			"/var/run/fenced.pid"
 
 
@@ -537,6 +537,7 @@
 {
 	bool cont = true;
 	int optchar;
+        char *temp;
 
 	comline->post_join_delay_opt = false;
 	comline->post_fail_delay_opt = false;
@@ -558,7 +559,7 @@
 			break;
 
 		case 'f':
-			comline->post_fail_delay = atoi(optarg);
+			comline->post_fail_delay = strtol(optarg, &temp, 0);
 			comline->post_fail_delay_opt = true;
 			break;
 
@@ -572,6 +573,7 @@
 			break;
 
 		case 'w':
+			comline->wait = atoi(optarg);
 			comline->wait_opt = true;
 			break;
 
@@ -662,8 +664,8 @@
 				struct pollfd fds[] = {
 					{ fd->wait_fds[0], POLLIN, 0 }
 				};
-				int ret = poll(fds, 1, -1);
-				if (ret < 0)
+				int ret = poll(fds, 1, comline.wait * 1000);
+				if (ret <= 0)
 					return EXIT_FAILURE;
 			}
 			return EXIT_SUCCESS;
diff -urN -x CVS -x debian cvs-patch04-wait/fence_tool/fence_tool.c cvs-patch041-wait-timeout/fence_tool/fence_tool.c
--- cvs-patch04-wait/fence_tool/fence_tool.c	2005-02-17 18:41:39.000000000 +0100
+++ cvs-patch041-wait-timeout/fence_tool/fence_tool.c	2005-02-17 21:23:06.000000000 +0100
@@ -34,7 +34,7 @@
 #include "ccs.h"
 #include "copyright.cf"
 
-#define OPTION_STRING			("VhScj:f:DwQ")
+#define OPTION_STRING			("VhScj:f:Dw:Q")
 #define LOCKFILE_NAME                   "/var/run/fenced.pid"
 #define FENCED_SOCK_PATH                "fenced_socket"
 
@@ -373,7 +373,7 @@
 	printf("  -c               All nodes are in a clean state to start\n");
 	printf("  -j <secs>        Post-join fencing delay\n");
 	printf("  -f <secs>        Post-fail fencing delay\n");
-	printf("  -w               Wait for join to complete\n");
+	printf("  -w <secs>        Wait for join to complete\n");
 	printf("\n");
 }
 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/87fd5b04/attachment.sig>

From pbruna at linuxcenterla.com  Fri Feb 18 00:23:40 2005
From: pbruna at linuxcenterla.com (Patricio Bruna V)
Date: Thu, 17 Feb 2005 21:23:40 -0300
Subject: [Linux-cluster] test
Message-ID: <1108686221.2691.1.camel@p.linuxcenter.cl>


-- 
Patricio Bruna                                  http://www.linuxcenterla.com
Ingeniero de Proyectos                  Mariano S?nchez Fontecilla 310
Red Hat Certified Engineer              Las Condes, Santiago - CHILE
Linux Center Latinoamerica              Fono: 4834041
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/c71a9843/attachment.sig>

From pbruna at linuxcenterla.com  Fri Feb 18 00:27:20 2005
From: pbruna at linuxcenterla.com (Patricio Bruna V)
Date: Thu, 17 Feb 2005 21:27:20 -0300
Subject: [Linux-cluster] howto?
Message-ID: <1108686440.2691.3.camel@p.linuxcenter.cl>

is there any howto implement a cluster with the redhat tools?
i meant cman lvm2-cluster gfs, etc.

thx
-- 
Patricio Bruna                                  http://www.linuxcenterla.com
Ingeniero de Proyectos                  Mariano S?nchez Fontecilla 310
Red Hat Certified Engineer              Las Condes, Santiago - CHILE
Linux Center Latinoamerica              Fono: 4834041
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/eef0edde/attachment.sig>

From teigland at redhat.com  Fri Feb 18 04:01:35 2005
From: teigland at redhat.com (David Teigland)
Date: Fri, 18 Feb 2005 12:01:35 +0800
Subject: [Linux-cluster] [PATCH 0/4] fence
In-Reply-To: <20050217183534.GB23870@wavehammer.waldi.eu.org>
References: <20050217183534.GB23870@wavehammer.waldi.eu.org>
Message-ID: <20050218040135.GB6493@redhat.com>

On Thu, Feb 17, 2005 at 07:35:34PM +0100, Bastian Blank wrote:
> Patches:
> 1. Use bools instead of crude defines.

Interesting, but I think I still prefer using int's and defines.  I like
the option of doing other things (-1/0/1) even if we're only doing T/F
now.  Also, there's no /usr/include/stdbool.h on my machine.

> 2. Remove GNU-isms.

You'll have to pardon my ignorance about GNU-isms; could you explain the
shortcoming of the current method, or point me to a discussion?  The
existing code looks nicer to me, but I'm happy to learn something new.

For reference, existing:

#define die(fmt, args...) \
do \
{ \
	fprintf(stderr, "%s: ", prog_name); \
	fprintf(stderr, fmt "\n", ##args); \
	syslog(LOG_ERR, fmt, ##args); \
	exit(EXIT_FAILURE); \
} \
while (0)

patch does:

static inline void die(const char *fmt, ...)
{
	va_list ap;
	fputs(prog_name, stderr);
	fputs(": ", stderr);
	va_start(ap, fmt);
	vfprintf(stderr, fmt, ap);
	fputs("\n", stderr);
	vsyslog(LOG_ERR, fmt, ap);
	va_end(ap);
	exit(EXIT_FAILURE);
}


> 3. Add option to fence_tool to not wait for quorum.

ok

> 4. Wait for the join complete event in fenced.

I'd really like to keep all the "usability-oriented" stuff in fence_tool
and out of fenced.  It would probably be better if the poll-end of the new
pipe was in fence_tool.

Or, I just thought of another method.  fence_tool's -w handling could
could read fenced's unix socket and wait until it sees "finish:".  See
fence_tool.c:do_monitor().  do_monitor("finish:") would return when it
sees a line matching "finish:".

We could also use this method to allow "fence_tool leave -w".

-- 
Dave Teigland  <teigland at redhat.com>


From teigland at redhat.com  Fri Feb 18 04:09:31 2005
From: teigland at redhat.com (David Teigland)
Date: Fri, 18 Feb 2005 12:09:31 +0800
Subject: [Linux-cluster] howto?
In-Reply-To: <1108686440.2691.3.camel@p.linuxcenter.cl>
References: <1108686440.2691.3.camel@p.linuxcenter.cl>
Message-ID: <20050218040931.GC6493@redhat.com>

On Thu, Feb 17, 2005 at 09:27:20PM -0300, Patricio Bruna V wrote:
> is there any howto implement a cluster with the redhat tools?
> i meant cman lvm2-cluster gfs, etc.

http://sources.redhat.com/cluster/doc/usage.txt
http://sources.redhat.com/cgi-bin/cvsweb.cgi/cluster/doc/?cvsroot=cluster

-- 
Dave Teigland  <teigland at redhat.com>


From teigland at redhat.com  Fri Feb 18 06:18:02 2005
From: teigland at redhat.com (David Teigland)
Date: Fri, 18 Feb 2005 14:18:02 +0800
Subject: [Linux-cluster] Node hang
In-Reply-To: <00be01c51529$06c4a440$5001a8c0@pcbujan>
References: <00be01c51529$06c4a440$5001a8c0@pcbujan>
Message-ID: <20050218061802.GD6493@redhat.com>

On Thu, Feb 17, 2005 at 02:43:59PM -0500, Manuel Bujan wrote:

> We are mounting our GFS partition using the noatime option, and quotas has 
> been disabled in order to improve performance. The aplications currently 
> running are "postfix, apache, and Courier/Imap".

Do these applications make heavy use of fcntl() locks?  If so, that can
cause a big performance hit and you might see if they can use flock()
instead which is faster on gfs.

-- 
Dave Teigland  <teigland at redhat.com>


From fajar at telkom.co.id  Fri Feb 18 06:26:42 2005
From: fajar at telkom.co.id (Fajar A. Nugraha)
Date: Fri, 18 Feb 2005 13:26:42 +0700
Subject: [Linux-cluster] cluster latest cvs does not work?
Message-ID: <42158AA2.7030901@telkom.co.id>

Hi,

Building from latest cvs get me this while trying to run ccsd :

[root at lincluster1 root]# ccsd -n -4
Starting ccsd DEVEL.1108701944:
 Built: Feb 18 2005 11:46:07
 Copyright (C) Red Hat, Inc.  2004  All rights reserved.
  No Daemon:: SET
  IP Protocol:: IPv4 only

[root at lincluster1 root]# ps -ef | grep ccs
root      3020  2809  0 13:18 pts/0    00:00:00 grep ccs
[root at lincluster1 root]# ccsd -4
Failed to connect to cluster manager.
Hint: Magma plugins are not in the right spot.

Earlier cvs version (DEVEL.1108707607) works fine on the same server.
Any idea what's wrong?

PS : what happened to verbose/debug flag in ccsd?
There used to be an option to run ccsd with -v in much earlier versions
(at least DEVEL.1103006512 has it)

Regards,

Fajar


From bastian at waldi.eu.org  Fri Feb 18 09:07:27 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Fri, 18 Feb 2005 10:07:27 +0100
Subject: [Linux-cluster] [PATCH 0/4] fence
In-Reply-To: <20050218040135.GB6493@redhat.com>
References: <20050217183534.GB23870@wavehammer.waldi.eu.org>
	<20050218040135.GB6493@redhat.com>
Message-ID: <20050218090727.GA2250@wavehammer.waldi.eu.org>

On Fri, Feb 18, 2005 at 12:01:35PM +0800, David Teigland wrote:
> On Thu, Feb 17, 2005 at 07:35:34PM +0100, Bastian Blank wrote:
> > Patches:
> > 1. Use bools instead of crude defines.
> Interesting, but I think I still prefer using int's and defines.  I like
> the option of doing other things (-1/0/1) even if we're only doing T/F
> now.  Also, there's no /usr/include/stdbool.h on my machine.

Try to find it in the include dir of your compiler, stdbool.h is not
implementable by the libc, which puts its includes into /susr/include.

If you want to use ints, please wrap them into an emun.

> > 2. Remove GNU-isms.
> You'll have to pardon my ignorance about GNU-isms; could you explain the
> shortcoming of the current method, or point me to a discussion?  The
> existing code looks nicer to me, but I'm happy to learn something new.
> 
> For reference, existing:
> #define die(fmt, args...) \
                   ^^^^^^^
> do \
> { \
> 	fprintf(stderr, "%s: ", prog_name); \
> 	fprintf(stderr, fmt "\n", ##args); \
> 	syslog(LOG_ERR, fmt, ##args); \
                             ^^^^^^
> 	exit(EXIT_FAILURE); \
> } \
> while (0)

The variable argument list is a GNU-ism.

> > 4. Wait for the join complete event in fenced.
> Or, I just thought of another method.  fence_tool's -w handling could
> could read fenced's unix socket and wait until it sees "finish:".  See
> fence_tool.c:do_monitor().  do_monitor("finish:") would return when it
> sees a line matching "finish:".
> We could also use this method to allow "fence_tool leave -w".

Hmm, lets think about it.

Bastian

-- 
If there are self-made purgatories, then we all have to live in them.
		-- Spock, "This Side of Paradise", stardate 3417.7
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050218/8b03f1bb/attachment.sig>

From bastian at waldi.eu.org  Fri Feb 18 10:23:09 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Fri, 18 Feb 2005 11:23:09 +0100
Subject: [Linux-cluster] [PATCH 0/4] fence
In-Reply-To: <20050218090727.GA2250@wavehammer.waldi.eu.org>
References: <20050217183534.GB23870@wavehammer.waldi.eu.org>
	<20050218040135.GB6493@redhat.com>
	<20050218090727.GA2250@wavehammer.waldi.eu.org>
Message-ID: <20050218102309.GC2250@wavehammer.waldi.eu.org>

On Fri, Feb 18, 2005 at 10:07:27AM +0100, Bastian Blank wrote:
> > Or, I just thought of another method.  fence_tool's -w handling could
> > could read fenced's unix socket and wait until it sees "finish:".  See
> > fence_tool.c:do_monitor().  do_monitor("finish:") would return when it
> > sees a line matching "finish:".
> > We could also use this method to allow "fence_tool leave -w".
> Hmm, lets think about it.

Okay first part of this: Change print_ev to report the complete event
type in one message.

It look like:

| fenced: 1108721677 event: join:start
| fenced: 1108721677   event_id    = 5
| fenced: 1108721677   last_stop   = 0
| fenced: 1108721677   last_start  = 5
| fenced: 1108721677   last_finish = 0
| fenced: 1108721677   node_count  = 1

and

| fenced: 1108721677 event: join:finish
| fenced: 1108721677   event_id    = 5
| fenced: 1108721677   last_stop   = 0
| fenced: 1108721677   last_start  = 5
| fenced: 1108721677   last_finish = 5
| fenced: 1108721677   node_count  = 0

This resolves a possible race while checking the type of the event via
the socket.

Bastian

-- 
Warp 7 -- It's a law we can live with.
-------------- next part --------------
diff -urN -x CVS -x debian cvs-patch03-quorum/fenced/fd.h cvs-patch05-event/fenced/fd.h
--- cvs-patch03-quorum/fenced/fd.h	2005-02-17 18:43:36.000000000 +0100
+++ cvs-patch05-event/fenced/fd.h	2005-02-18 11:07:26.000000000 +0100
@@ -152,6 +152,7 @@
 	int 			last_stop;
 	int 			last_start;
 	int 			last_finish;
+	int 			last_start_type;
 
 	bool			first_recovery;
 	int 			prev_count;
diff -urN -x CVS -x debian cvs-patch03-quorum/fenced/main.c cvs-patch05-event/fenced/main.c
--- cvs-patch03-quorum/fenced/main.c	2005-02-17 18:34:17.000000000 +0100
+++ cvs-patch05-event/fenced/main.c	2005-02-18 11:13:48.000000000 +0100
@@ -180,42 +180,45 @@
 }
 #endif
 
-static void print_ev(struct cl_service_event *ev)
+static void print_ev(fd_t *fd, struct cl_service_event *ev)
 {
+	char *type = "unknown", *start_type = "unknown";
+
 	switch (ev->type) {
 	case SERVICE_EVENT_STOP:
-		log_debug("stop:");
+		type = "stop";
 		break;
 	case SERVICE_EVENT_START:
-		log_debug("start:");
+		type = "start";
 		break;
 	case SERVICE_EVENT_FINISH:
-		log_debug("finish:");
+		type = "finish";
 		break;
 	case SERVICE_EVENT_LEAVEDONE:
-		log_debug("leavedone:");
+		type = "leavedone";
 		break;
 	}
+
+	if (ev->event_id == fd->last_start)
+		switch (fd->last_start_type) {
+		case SERVICE_START_FAILED:
+			start_type = "failed";
+			break;
+		case SERVICE_START_JOIN:
+			start_type = "join";
+			break;
+		case SERVICE_START_LEAVE:
+			start_type = "leave";
+			break;
+	}
+
+	log_debug("event: %s:%s", start_type, type);
+
 	log_debug("  event_id    = %u", ev->event_id);
 	log_debug("  last_stop   = %u", ev->last_stop);
 	log_debug("  last_start  = %u", ev->last_start);
 	log_debug("  last_finish = %u", ev->last_finish);
 	log_debug("  node_count  = %u", ev->node_count);
-
-	if (ev->type != SERVICE_EVENT_START)
-		return;
-
-	switch (ev->start_type) {
-	case SERVICE_START_FAILED:
-		log_debug("  start_type  = %s", "failed");
-		break;
-	case SERVICE_START_JOIN:
-		log_debug("  start_type  = %s", "join");
-		break;
-	case SERVICE_START_LEAVE:
-		log_debug("  start_type  = %s", "leave");
-		break;
-	}
 }
 
 static void print_members(int count, struct cl_cluster_node *nodes)
@@ -235,11 +238,25 @@
 	struct cl_cluster_node *nodes;
 	int error = 0, n;
 
-	print_ev(ev);
-
-	if (ev->type == SERVICE_EVENT_START) {
+	switch (ev->type)
+	{
+	case SERVICE_EVENT_START:
 		fd->last_start = ev->event_id;
+		fd->last_start_type = ev->start_type;
+		break;
+	case SERVICE_EVENT_STOP:
+		fd->last_stop = fd->last_start;
+		break;
+	case SERVICE_EVENT_FINISH:
+		fd->last_finish = ev->event_id;
+		break;
+	}
 
+	print_ev(fd, ev);
+
+	switch (ev->type)
+	{
+	case SERVICE_EVENT_START:
 		/* space for two extra to be sure it's not too small */
 		n = ev->node_count + 2;
 
@@ -265,17 +282,15 @@
 			log_debug("process_event: start done error");
 
 		free(nodes);
-	}
+		break;
 
-	else if (ev->type == SERVICE_EVENT_LEAVEDONE)
+	case SERVICE_EVENT_LEAVEDONE:
 		leave_finished = 1;
+		break;
 
-	else if (ev->type == SERVICE_EVENT_STOP)
-		fd->last_stop = fd->last_start;
-
-	else if (ev->type == SERVICE_EVENT_FINISH) {
-		fd->last_finish = ev->event_id;
+	case SERVICE_EVENT_FINISH:
 		do_recovery_done(fd);
+		break;
 	}
 }
 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050218/d84587dd/attachment.sig>

From bastian at waldi.eu.org  Fri Feb 18 12:23:31 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Fri, 18 Feb 2005 13:23:31 +0100
Subject: [Linux-cluster] [PATCH 0/4] fence
In-Reply-To: <20050218090727.GA2250@wavehammer.waldi.eu.org>
References: <20050217183534.GB23870@wavehammer.waldi.eu.org>
	<20050218040135.GB6493@redhat.com>
	<20050218090727.GA2250@wavehammer.waldi.eu.org>
Message-ID: <20050218122331.GD2250@wavehammer.waldi.eu.org>

On Fri, Feb 18, 2005 at 10:07:27AM +0100, Bastian Blank wrote:
> > Or, I just thought of another method.  fence_tool's -w handling could
> > could read fenced's unix socket and wait until it sees "finish:".  See
> > fence_tool.c:do_monitor().  do_monitor("finish:") would return when it
> > sees a line matching "finish:".
> > We could also use this method to allow "fence_tool leave -w".
> Hmm, lets think about it.

This is the implementation. It looks a bit fancy as it uses several
callbacks to not duplicate code.

Bastian

-- 
Our missions are peaceful -- not for conquest.  When we do battle, it
is only because we have no choice.
		-- Kirk, "The Squire of Gothos", stardate 2124.5
-------------- next part --------------
diff -urN -x CVS -x debian cvs-patch05-event/fenced/main.c cvs-patch06-wait/fenced/main.c
--- cvs-patch05-event/fenced/main.c	2005-02-18 11:13:48.000000000 +0100
+++ cvs-patch06-wait/fenced/main.c	2005-02-18 12:28:59.000000000 +0100
@@ -23,7 +23,7 @@
 char our_name[MAX_CLUSTER_MEMBER_NAME_LEN+1];
 
 
-#define OPTION_STRING			("cj:f:Dn:hVSwQ")
+#define OPTION_STRING			("cj:f:Dn:hVSw:Q")
 #define LOCKFILE_NAME			"/var/run/fenced.pid"
 
 
diff -urN -x CVS -x debian cvs-patch05-event/fence_tool/fence_tool.c cvs-patch06-wait/fence_tool/fence_tool.c
--- cvs-patch05-event/fence_tool/fence_tool.c	2005-02-17 18:39:30.000000000 +0100
+++ cvs-patch06-wait/fence_tool/fence_tool.c	2005-02-18 13:20:36.000000000 +0100
@@ -11,7 +11,8 @@
 *******************************************************************************
 ******************************************************************************/
 
-#include <unistd.h>
+#include <ctype.h>
+#include <errno.h>
 #include <stdio.h>
 #include <stdlib.h>
 #include <stddef.h>
@@ -21,12 +22,15 @@
 #include <stdbool.h>
 #include <stdint.h>
 #include <sys/ioctl.h>
-#include <sys/types.h>
-#include <sys/wait.h>
+#include <sys/poll.h>
 #include <sys/socket.h>
+#include <sys/time.h>
+#include <sys/types.h>
 #include <sys/un.h>
+#include <sys/wait.h>
+#include <time.h>
+#include <unistd.h>
 #include <fcntl.h>
-#include <errno.h>
 #include <mntent.h>
 #include <libgen.h>
 
@@ -34,7 +38,7 @@
 #include "ccs.h"
 #include "copyright.cf"
 
-#define OPTION_STRING			("VhScj:f:DwQ")
+#define OPTION_STRING			("VhScj:f:Dw:Q")
 #define LOCKFILE_NAME                   "/var/run/fenced.pid"
 #define FENCED_SOCK_PATH                "fenced_socket"
 
@@ -47,7 +51,7 @@
 bool debug = false;
 int operation;
 bool skip_unfence = false;
-bool child_wait = false;
+int event_wait_timeout = 0;
 bool wait_for_quorum = true;
 int cl_sock;
 char our_name[MAX_CLUSTER_MEMBER_NAME_LEN+1];
@@ -215,6 +219,136 @@
 }
 
 /*
+ * Callback types for fenced_socket.
+ * fenced_socket_receive_callback:
+ *   Called for each received event.
+ *   Return:
+ *     true: Break of the loop.
+ *     false: Go further.
+ * fenced_socket_setup_callback:
+ *   Called after the socket setup.
+ *   Return:
+ *     true: Go further.
+ *     false: Signal an error to the caller.
+ */
+typedef bool fenced_socket_receive_callback(const char *buf, void *user_data);
+typedef bool fenced_socket_setup_callback(int fd, void *user_data);
+
+enum fenced_socket_ret {
+	FENCED_SOCKET_ERROR,
+	FENCED_SOCKET_FINISH,
+	FENCED_SOCKET_TIMEOUT,
+	FENCED_SOCKET_SHUTDOWN,
+};
+
+static enum fenced_socket_ret fenced_socket(int timeout, fenced_socket_receive_callback receive_callback, void *receive_user_data, fenced_socket_setup_callback setup_callback, void *setup_user_data)
+{
+	int sfd, error, rv;
+	struct sockaddr_un addr;
+	socklen_t addrlen;
+	char buf[256];
+	struct timeval act, end;
+	struct pollfd fds[] = {
+		{ -1, POLLIN, 0 },
+	};
+	enum fenced_socket_ret ret = FENCED_SOCKET_ERROR;
+
+	sfd = socket(AF_LOCAL, SOCK_DGRAM, 0);
+	if (sfd < 0)
+		die("cannot create local socket");
+
+	fds[0].fd = sfd;
+
+	fcntl(sfd, F_SETFD, FD_CLOEXEC);
+	fcntl(sfd, F_SETFL, O_NONBLOCK);
+
+	memset(&addr, 0, sizeof(addr));
+	addr.sun_family = AF_LOCAL;
+	strcpy(&addr.sun_path[1], FENCED_SOCK_PATH);
+	addrlen = sizeof(sa_family_t) + strlen(addr.sun_path+1) + 1;
+
+	error = bind(sfd, (struct sockaddr *) &addr, addrlen);
+	if (error < 0)
+		die("cannot bind to local socket");
+
+	if (setup_callback)
+		if (!setup_callback(sfd, setup_user_data))
+			goto out;
+
+	if (timeout > 0) {
+		gettimeofday(&end, NULL);
+		end.tv_sec += timeout;
+	}
+
+	while (1) {
+		int t = -1;
+		if (timeout > 0) {
+			gettimeofday(&act, NULL);
+			t = (end.tv_sec - act.tv_sec) * 1000;
+			if (t < 0) {
+				ret = FENCED_SOCKET_TIMEOUT;
+				break;
+			}
+		}
+		rv = poll(fds, 1, t);
+		if (rv < 0)
+			die("poll failed");
+		else if (rv == 0) {
+			ret = FENCED_SOCKET_TIMEOUT;
+			break;
+		}
+		rv = recvfrom(sfd, buf, sizeof(buf) - 1, 0, (struct sockaddr *)&addr, &addrlen);
+		if (rv < 0)
+			die("recv failed");
+		else if (rv == 0) {
+			ret = FENCED_SOCKET_SHUTDOWN;
+			break;
+		}
+		buf[rv] = 0;
+
+		if (receive_callback(buf, receive_user_data)) {
+			ret = FENCED_SOCKET_FINISH;
+			break;
+		}
+	}
+
+out:
+	close(sfd);
+	return ret;
+}
+
+/*
+ * Checks if the received event matches the given one.
+ */
+static fenced_socket_receive_callback event_wait_callback;
+static bool event_wait_callback(const char *buf, void *user_data)
+{
+	const char *event = user_data;
+	while (*buf && isdigit(*buf)) buf++;
+	if (*buf++ != ' ')
+		return false;
+	if (strncmp(buf, "event:", strlen("event:")) == 0)
+		if (strncmp(buf + strlen("event:") + 1, event, strlen (event)) == 0)
+			return true;
+	return false;
+}
+
+/*
+ * Wrapper for fenced_socket, produces correct return values for main.
+ */
+static int event_wait(char *event, fenced_socket_setup_callback setup_callback, void *setup_user_data)
+{
+	enum fenced_socket_ret ret = fenced_socket(event_wait_timeout, event_wait_callback, event, setup_callback, setup_user_data);
+	switch (ret)
+	{
+	case FENCED_SOCKET_FINISH:
+		return EXIT_SUCCESS;
+	default:
+		return EXIT_FAILURE;
+	}
+}
+
+/*
  * This is a really lousy way of waiting, which is why I took so long to add
  * it.  I guess it's better than nothing for a lot of people.  The state may
  * not be "run" if we've joined but other nodes are joining/leaving.
@@ -254,8 +388,45 @@
 	return EXIT_SUCCESS;
 }
 
+struct do_join_callback_data
+{
+	int argc;
+	char **argv;
+};
+
+static void do_join_real(struct do_join_callback_data *data)
+{
+	strcpy(data->argv[0], "fenced");
+	data->argv[data->argc - 1] = NULL;
+
+	execvp("fenced", data->argv);
+	die("starting fenced failed");
+}
+
+static fenced_socket_setup_callback do_join_callback;
+static bool do_join_callback(int fd, void *user_data)
+{
+	struct do_join_callback_data *data = user_data;
+
+	pid_t pid = fork();
+	/* parent waits for fenced to join */
+	if (pid > 0) {
+		int status;
+		waitpid(pid, &status, 0);
+		if (WIFEXITED(status) && !WEXITSTATUS(status))
+			return true;
+		return false;
+	}
+
+	do_join_real(data);
+	return false;
+}
+
 static int do_join(int argc, char *argv[])
 {
+	struct do_join_callback_data data = {
+		argc, argv
+	};
 	int cd;
 
 	setup_sock();
@@ -280,33 +451,36 @@
 	if (debug)
 		printf("%s: start fenced\n", prog_name);
 
-	if (!debug && child_wait) {
-		int status;
-		pid_t pid = fork();
-		/* parent waits for fenced to join */
-		if (pid > 0) {
-			waitpid(pid, &status, 0);
-			if (WIFEXITED(status) && !WEXITSTATUS(status))
-				do_wait();
-			exit(EXIT_SUCCESS);
-		}
-		/* child execs fenced */
-	}
+	if (!debug && event_wait_timeout)
+		return event_wait("join:finish", do_join_callback, &data);
 
-	strcpy(argv[0], "fenced");
-	argv[argc - 1] = NULL;
-
-	execvp("fenced", argv);
-	die("starting fenced failed");
+	do_join_real(&data);
 
 	return EXIT_FAILURE;
 }
 
+struct do_leave_callback_data
+{
+	pid_t pid;
+};
+
+static bool do_leave_real(struct do_leave_callback_data *data)
+{
+	return kill(data->pid, SIGTERM) == 0;
+}
+
+static fenced_socket_setup_callback do_leave_callback;
+static bool do_leave_callback(int fd, void *user_data)
+{
+	struct do_leave_callback_data *data = user_data;
+	return do_leave_real(data);
+}
+
 static int do_leave(void)
 {
 	FILE *f;
 	char buf[33] = "";
-	int pid = 0;
+	struct do_leave_callback_data data = { 0 };
 
 	lockfile();
 
@@ -316,7 +490,7 @@
 	if (!f)
 		die("fenced not running - no file %s", LOCKFILE_NAME);
 	fgets(buf, 33, f);
-	sscanf(buf, "%d", &pid);
+	sscanf(buf, "%d", &data.pid);
 	fclose(f);
 
 	check_mounted();
@@ -327,41 +501,28 @@
 
 	close(cl_sock);
 
-	kill(pid, SIGTERM);
+	if (event_wait_timeout)
+		return event_wait("unknown:leavedone", do_leave_callback, &data);
+	return do_leave_real(&data);
+}
 
-	return EXIT_SUCCESS;
+static fenced_socket_receive_callback do_monitor_callback;
+static bool do_monitor_callback(const char *buf, void *user_data)
+{
+	fputs(buf, stdout);
+	return false;
 }
 
 static int do_monitor(void)
 {
-	int sfd, error, rv;
-	struct sockaddr_un addr;
-	socklen_t addrlen;
-	char buf[256];
-
-	sfd = socket(AF_LOCAL, SOCK_DGRAM, 0);
-	if (sfd < 0)
-		die("cannot create local socket");
-
-	memset(&addr, 0, sizeof(addr));
-	addr.sun_family = AF_LOCAL;
-	strcpy(&addr.sun_path[1], FENCED_SOCK_PATH);
-	addrlen = sizeof(sa_family_t) + strlen(addr.sun_path+1) + 1;
-
-	error = bind(sfd, (struct sockaddr *) &addr, addrlen);
-	if (error < 0)
-		die("cannot bind to local socket");
-
-	for (;;) {
-		memset(buf, 0, 256);
-
-		rv = recvfrom(sfd, buf, 256, 0, (struct sockaddr *)&addr,
-			      &addrlen);
-
-		printf("%s", buf);
+	enum fenced_socket_ret ret = fenced_socket(-1, do_monitor_callback, 0, 0, 0);
+	switch (ret)
+	{
+	case FENCED_SOCKET_SHUTDOWN:
+		return EXIT_SUCCESS;
+	default:
+		return EXIT_FAILURE;
 	}
-
-	return EXIT_SUCCESS;
 }
 
 static void print_usage(void)
@@ -376,7 +537,7 @@
 	printf("  wait             Wait for node to be member of default fence domain\n");
 	printf("\n");
 	printf("Options:\n");
-	printf("  -w               Wait for join to complete\n");
+	printf("  -w <secs>        Wait for join or leave to complete\n");
 	printf("  -V               Print program version information, then exit\n");
 	printf("  -h               Print this help, then exit\n");
 	printf("  -S               Skip self unfencing on join\n");
@@ -422,7 +583,7 @@
 			break;
 
 		case 'w':
-			child_wait = true;
+			event_wait_timeout = atoi(optarg);
 			break;
 
 		case 'Q':
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050218/caed4227/attachment.sig>

From ptr at poczta.fm  Fri Feb 18 15:53:14 2005
From: ptr at poczta.fm (ptr at poczta.fm)
Date: 18 Feb 2005 16:53:14 +0100
Subject: [Linux-cluster] lock_dlm: fence domain not found (?)
Message-ID: <20050218155314.B54A73B21E4@poczta.interia.pl>


    Hello.

I succesfully compiled recent (17.02) CVS code with vanilla 2.6.10 kernel
on Gentoo distro. As it's upgrade from old version, I shut down GFS on both nodes
and I'm trying to start only one node for a test. Everything seems to be fine, 
until I mount the GFS file system:

share2 root # mount -t gfs /dev/sdb1 /eva/
mount: permission denied

Dmesg says after that:

GFS: Trying to join cluster "lock_dlm", "cluster1:eva"
lock_dlm: fence domain not found; check fenced
GFS: can't mount proto = lock_dlm, table = cluster1:eva, hostdata =

What might be wrong? Here's additional info:

share2 root # cat /proc/cluster/status 
Protocol version: 5.0.1
Config version: 2
Cluster name: cluster1
Cluster ID: 26777
Membership state: Cluster-Member
Nodes: 1
Expected_votes: 1
Total_votes: 1
Quorum: 1   
Active subsystems: 3
Node name: share2
Node addresses: 10.1.20.29

share2 root # cat /proc/cluster/nodes  
Node  Votes Exp Sts  Name
   1    1    1   M   share2

share2 root # cat /proc/cluster/services 
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           1   2 join      S-6,20,1
[1]
DLM Lock Space:  "clvmd"                             2   3 run       -
[1]

I have no clue why canot mount the fs. Do I have to start both nodes
simultaneously? Than you in advance for any hints!
    Piotr


My cluster.conf:

<?xml version="1.0"?>
<cluster name="cluster1" config_version="2">
 
<cman two_node="1" expected_votes="1">
</cman>
 
<clusternodes>
        <clusternode name="share1" votes="1">
                <fence>
                        <method name="single">
                                <device name="human" ipaddr="10.1.20.30"/>
                        </method>
                </fence>
        </clusternode>
         <clusternode name="share2" votes="1">
                <fence>
                        <method name="single">
                                <device name="human" ipaddr="10.1.20.29"/>
                        </method>
                </fence>
        </clusternode>
</clusternodes>
 
<fencedevices>
        <fencedevice name="human" agent="fence_manual"/>
</fencedevices>
 
</cluster>


----------------------------------------------------------------------
INTERIA.PL >>> http://link.interia.pl/f1853


From teigland at redhat.com  Fri Feb 18 16:03:46 2005
From: teigland at redhat.com (David Teigland)
Date: Sat, 19 Feb 2005 00:03:46 +0800
Subject: [Linux-cluster] lock_dlm: fence domain not found (?)
In-Reply-To: <20050218155314.B54A73B21E4@poczta.interia.pl>
References: <20050218155314.B54A73B21E4@poczta.interia.pl>
Message-ID: <20050218160346.GG6493@redhat.com>

On Fri, Feb 18, 2005 at 04:53:14PM +0100, ptr at poczta.fm wrote:

> GFS: Trying to join cluster "lock_dlm", "cluster1:eva"
> lock_dlm: fence domain not found; check fenced

> share2 root # cat /proc/cluster/nodes  
> Node  Votes Exp Sts  Name
>    1    1    1   M   share2
> 
> share2 root # cat /proc/cluster/services 
> Service          Name                              GID LID State     Code
> Fence Domain:    "default"                           1   2 join      S-6,20,1
> [1]

share2 is fencing your other node.  look in /var/log/messages for
fence_manual's instruction on what to do next

-- 
Dave Teigland  <teigland at redhat.com>


From ptr at poczta.fm  Fri Feb 18 16:40:30 2005
From: ptr at poczta.fm (ptr at poczta.fm)
Date: 18 Feb 2005 17:40:30 +0100
Subject: [Linux-cluster] lock_dlm: fence domain not found (?)
Message-ID: <20050218164030.04AC0B9F6B@poczta.interia.pl>


   Thank you Dave, although /var/log/messages says only the same as I mentioned above 
(after cluster startup and on mount command):

Feb 18 16:47:00 share2 kernel: GFS: Trying to join cluster "lock_dlm", "cluster1:eva"
Feb 18 16:47:00 share2 kernel: lock_dlm: fence domain not found; check fenced
Feb 18 16:47:00 share2 kernel: GFS: can't mount proto = lock_dlm, table = cluster1:eva, hostdata =

No more helpful info. I'll try to cycle both nodes together and see if it helps fencing. 
BTW: I used "fence_tool join" with "-t 120" option before. Looks like it's been deprecated in recent
GFS versions?
    Regards,

Piotr

----------------------------------------------------------------------
INTERIA.PL >>> http://link.interia.pl/f1853


From ialberdi at histor.fr  Fri Feb 18 16:54:07 2005
From: ialberdi at histor.fr (Ion Alberdi)
Date: Fri, 18 Feb 2005 17:54:07 +0100
Subject: [Linux-cluster] LVM over a gnbd exported device
Message-ID: <42161DAF.1010901@histor.fr>

Hello.
I'm currently seting up a three node cluster.
I want to have a shared logical partition between the three nodes i.e 
that the device
/dev/vg1/lv1 appears on the three nodes.

There is what I tryed
after initialing the cluster:

(ccsd 
 cman_tool join                   
 fence_tool join                  
 clvmd                            
 vgchange -aly 
 worked well)                

export /dev/hdb from the node debian to buba and gump.
debian has /dev/hdb
buba has /dev/gnbd/dd
gump has /dev/gnbd/dd

 From gump, I tried to create a logical volume from the shared device, 
everything goes well until lvcreate:

[root at gump gnbd]#pvcreate /dev/gnbd/dd
  Physical volume "/dev/gnbd/dd" successfully created
[root at gump gnbd]# vgcreate vg1 /dev/gnbd/dd
  Volume group "vg1" successfully created
[root at gump gnbd]# lvcreate -l 1500 -n lv1 vg1
  Error locking on node gump: Internal lvm error, check syslog
  Error locking on node buba: Internal lvm error, check syslog

The syslog from buba end gump were the same:

Feb 18 17:27:10 gump lvm[6634]: Volume group for uuid not found: 
6fz5zUSDvMbKnau8HdhjawXiduq1wIYoRRj2kR4Mf8rM9YE1bisYu671jD26sijR

For the debian node everything went well, because it had  /dev/vg1/lv1 
and because lvscan gave:

17:27 root at debian ~# lvscan
  ACTIVE            '/dev/vg1/lv1' [5,86 GB] inherit


On the two other nodes I had
[root at buba dev]# lvscan
  inactive          '/dev/vg1/lv1' [5,86 GB] inherit.

It seems that all worked well, except  the activation of the logical 
volume for buba and gump because they didn't find a volume groupe for 
the uuid of the shared physical device.......

Has someone any idea about fixing it?

Thanks in advance.

There is my cluster.conf:

<?xml version="1.0"?>
<cluster name="cluster1" config_version="1">
 
  <clusternodes>
    <clusternode name="buba" votes="1">
      <fence>
        <method name="single">
          <device name="human" ipaddr="200.0.0.10"/>
        </method>
      </fence>
    </clusternode>
   
    <clusternode name="gump" votes="1">
      <fence>
        <method name="single">
          <device name="human" ipaddr="200.0.0.97"/>
        </method>
      </fence>
    </clusternode>
 
    <clusternode name="debian" votes="1">
      <fence>
        <method name="single">
          <device name="human" ipaddr="200.0.0.102"/>
        </method>
      </fence>
    </clusternode>
  </clusternodes>

  <fence_devices>
    <device name="human" agent="fence_manual"/>
  </fence_devices>

</cluster>


From lhh at redhat.com  Fri Feb 18 17:47:34 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 18 Feb 2005 12:47:34 -0500
Subject: [Linux-cluster] cluster latest cvs does not work?
In-Reply-To: <42158AA2.7030901@telkom.co.id>
References: <42158AA2.7030901@telkom.co.id>
Message-ID: <1108748854.10064.39.camel@ayanami.boston.redhat.com>

On Fri, 2005-02-18 at 13:26 +0700, Fajar A. Nugraha wrote:

> [root at lincluster1 root]# ccsd -4
> Failed to connect to cluster manager.
> Hint: Magma plugins are not in the right spot.

Reinstall magma and magma-plugins and it should work fine.

-- Lon


From lhh at redhat.com  Fri Feb 18 17:50:40 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 18 Feb 2005 12:50:40 -0500
Subject: [Linux-cluster] raw device
In-Reply-To: <011701c512c3$ba1a84f0$69050364@yazanz>
References: <011701c512c3$ba1a84f0$69050364@yazanz>
Message-ID: <1108749040.10064.41.camel@ayanami.boston.redhat.com>

On Mon, 2005-02-14 at 20:33 +0200, Yazan Al-Sheyyab wrote:
> hi all,
> 
>   maybe i asked this question before So please execuse me about this.
> 
>  can i put more than one rawdevice in the same partition, i mean can i put
> in the /etc/sysconfig/rawdevices file more than one raw related with same
> partition as :
> 
>   /dev/raw/raw5 /dev/cciss/c0d0p3
>   /dev/raw/raw6 /dev/cciss/c0d0p3
>  .......... etc.

Yes, but that's a bad idea.

Furthermore, O_DIRECT has replaced raw devices.

-- Lon


From pbruna at linuxcenterla.com  Thu Feb 17 12:20:21 2005
From: pbruna at linuxcenterla.com (Patricio Bruna V)
Date: Thu, 17 Feb 2005 09:20:21 -0300
Subject: [Linux-cluster] info
Message-ID: <1108642821.2791.0.camel@p.linuxcenter.cl>

where i can read a howto for configure a cluster with the redhat tools,
cman,ccc,gfs, etc.?
-- 
Patricio Bruna                                  http://www.linuxcenterla.com
Ingeniero de Proyectos                  Mariano S?nchez Fontecilla 310
Red Hat Certified Engineer              Las Condes, Santiago - CHILE
Linux Center Latinoamerica              Fono: 4834041
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050217/8b20d4f8/attachment.sig>

From bkorb at veritas.com  Thu Feb 17 18:48:04 2005
From: bkorb at veritas.com (Bruce Korb)
Date: Thu, 17 Feb 2005 10:48:04 -0800
Subject: [Linux-cluster] [PATCH 1/4] fence - Use bools instead of
	crudedefines
References: <20050217183540.GC23870@wavehammer.waldi.eu.org>
Message-ID: <4214E6E4.F5C4C370@veritas.com>

Bastian Blank wrote:
> 
> This patch uses C99 bools instead of crude defines.

Please use crude defines.  stdbool.h is still too bleeding edge
for many systems.


From bkorb at veritas.com  Thu Feb 17 18:55:03 2005
From: bkorb at veritas.com (Bruce Korb)
Date: Thu, 17 Feb 2005 10:55:03 -0800
Subject: [Linux-cluster] [PATCH 2/4] fence - Remove GNU-isms
References: <20050217183545.GD23870@wavehammer.waldi.eu.org>
Message-ID: <4214E887.72E0F83F@veritas.com>

Bastian Blank wrote:
> 
> This patch removes the GNU-isms in the die defines and replaces them
> with inline functions.

The problem with this is that you have to go above -O2 optimization
to get inline functions.  The macros are always inline.  Also,
If you're going to optimize printf's, then replace:

  fputs("\n", stdout);

with:

  fputc('\n', stdout);

:)


From lhh at redhat.com  Fri Feb 18 18:28:47 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 18 Feb 2005 13:28:47 -0500
Subject: [Linux-cluster] [PATCH 0/4] fence
In-Reply-To: <20050218040135.GB6493@redhat.com>
References: <20050217183534.GB23870@wavehammer.waldi.eu.org>
	<20050218040135.GB6493@redhat.com>
Message-ID: <1108751327.5665.12.camel@ayanami.boston.redhat.com>

On Fri, 2005-02-18 at 12:01 +0800, David Teigland wrote:

> For reference, existing:
> 
> #define die(fmt, args...) \
> do \
> { \
> 	fprintf(stderr, "%s: ", prog_name); \
> 	fprintf(stderr, fmt "\n", ##args); \
> 	syslog(LOG_ERR, fmt, ##args); \
> 	exit(EXIT_FAILURE); \
> } \
> while (0)
> 
> patch does:
> 
> static inline void die(const char *fmt, ...)
> {
> 	va_list ap;
> 	fputs(prog_name, stderr);
> 	fputs(": ", stderr);
> 	va_start(ap, fmt);
> 	vfprintf(stderr, fmt, ap);
> 	fputs("\n", stderr);
> 	vsyslog(LOG_ERR, fmt, ap);
> 	va_end(ap);
> 	exit(EXIT_FAILURE);
> }

Note: Specifying inline doesn't always guarantee that inlining will
occur.  In fact, you can't inline a function with variable length
arguments with GCC...

http://gcc.gnu.org/onlinedocs/gcc/Inline.html

-- Lon


From lhh at redhat.com  Fri Feb 18 19:11:21 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 18 Feb 2005 14:11:21 -0500
Subject: [Linux-cluster] Performance tunning
In-Reply-To: <20050217164532.GB6389@redhat.com>
References: <000501c51508$10f23e80$5001a8c0@pcbujan>
	<20050217164532.GB6389@redhat.com>
Message-ID: <1108753881.6077.34.camel@ayanami.boston.redhat.com>

On Fri, 2005-02-18 at 00:45 +0800, David Teigland wrote:
> On Thu, Feb 17, 2005 at 10:47:55AM -0500, Manuel Bujan wrote:
> > Hello,
> > 
> > Is there anyway to improve the performance of a cman based gfs cluster 
> > installation tunning some of its parameters ?
> > 
> > We are using the dlm style locking mechanism in a two node 100 GB shared 
> > gfs cluster with a DELL powervault 220s storage.
> > We already disabled the atime updates and  the quotas_account but the gain 
> > in performance is not too significant.
> > 
> > Any other hints ?
> 
> Look at /proc/cluster/lock_dlm/drop_count
> 
> It's 50000 by default and increasing it could possibly improve things,
> or disable it altogether by setting to zero.
> e.g.  echo "0" >> /proc/cluster/lock_dlm/drop_count
> 
> You need to change this prior to mounting gfs on each node.  When
> non-zero, GFS/lock_dlm tries to keep the number of locks held locally
> below this level which limits the caching gfs can do.  When zero, no
> limiting is attempted.

Additionally, there are performance implications when using JBODs +
host-RAID controllers.  Basically, each controller's caching has to be
disabled (aka "put in cluster mode") in order for most host-RAID
controllers to operate properly in a clustered environment.  Naturally,
disabling the RAID controllers' cache can hurt I/O performance.

It might be interesting to test a one node configuration with and
without cache disabled and see if there is a significant change in
performance with your tests (still using DLM+GFS, of course).

Out of curiosity, what tests/benchmarks are you using?

-- Lon


From bastian at waldi.eu.org  Fri Feb 18 20:15:41 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Fri, 18 Feb 2005 21:15:41 +0100
Subject: [Linux-cluster] [PATCH 1/4] fence - Use bools instead of
	crudedefines
In-Reply-To: <4214E6E4.F5C4C370@veritas.com>
References: <20050217183540.GC23870@wavehammer.waldi.eu.org>
	<4214E6E4.F5C4C370@veritas.com>
Message-ID: <20050218201541.GA5136@wavehammer.waldi.eu.org>

On Thu, Feb 17, 2005 at 10:48:04AM -0800, Bruce Korb wrote:
> Bastian Blank wrote:
> > This patch uses C99 bools instead of crude defines.
> Please use crude defines.  stdbool.h is still too bleeding edge
> for many systems.

Than you have to remove the usage of stdint.h as well, both are defined
at the same place.

Bastian

-- 
Another Armenia, Belgium ... the weak innocents who always seem to be
located on a natural invasion route.
		-- Kirk, "Errand of Mercy", stardate 3198.4
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050218/2f49bd68/attachment.sig>

From lhh at redhat.com  Fri Feb 18 21:23:06 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 18 Feb 2005 16:23:06 -0500
Subject: [Linux-cluster] [PATCH 0/4] fence
In-Reply-To: <1108751327.5665.12.camel@ayanami.boston.redhat.com>
References: <20050217183534.GB23870@wavehammer.waldi.eu.org>
	<20050218040135.GB6493@redhat.com>
	<1108751327.5665.12.camel@ayanami.boston.redhat.com>
Message-ID: <1108761786.6077.116.camel@ayanami.boston.redhat.com>

On Fri, 2005-02-18 at 13:28 -0500, Lon Hohberger wrote:
> On Fri, 2005-02-18 at 12:01 +0800, David Teigland wrote:
> 
> > For reference, existing:
> > 
> > #define die(fmt, args...) \
> > do \
> > { \
> > 	fprintf(stderr, "%s: ", prog_name); \
> > 	fprintf(stderr, fmt "\n", ##args); \
> > 	syslog(LOG_ERR, fmt, ##args); \
> > 	exit(EXIT_FAILURE); \
> > } \
> > while (0)

Side note:

#define die(fmt, ...) \
do \
{ \
	fprintf(stderr, "%s: ", prog_name); \
	fprintf(stderr, fmt "\n", __VA_ARGS__); \
	syslog(LOG_ERR, fmt, __VA_ARGS__); \
	exit(EXIT_FAILURE); \
}\
while (0)

...is functionally equivalent, is ISO-C99 compliant (as to opposed to a
"GNUism"), and does not rely on the compiler actually successfully
inlining anything.

-- Lon


From pbruna at linuxcenterla.com  Fri Feb 18 21:41:05 2005
From: pbruna at linuxcenterla.com (Patricio Bruna V)
Date: Fri, 18 Feb 2005 18:41:05 -0300
Subject: [Linux-cluster] stnoith
Message-ID: <1108762866.2857.6.camel@p.linuxcenter.cl>

its stnoith implemented in the redhat cluster project
-- 
Patricio Bruna                                  http://www.linuxcenterla.com
Ingeniero de Proyectos                  Mariano S?nchez Fontecilla 310
Red Hat Certified Engineer              Las Condes, Santiago - CHILE
Linux Center Latinoamerica              Fono: 4834041
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050218/c7f63110/attachment.sig>

From lhh at redhat.com  Fri Feb 18 21:39:20 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 18 Feb 2005 16:39:20 -0500
Subject: [Linux-cluster] stnoith
In-Reply-To: <1108762866.2857.6.camel@p.linuxcenter.cl>
References: <1108762866.2857.6.camel@p.linuxcenter.cl>
Message-ID: <1108762760.6077.119.camel@ayanami.boston.redhat.com>

 On Fri, 2005-02-18 at 18:41 -0300, Patricio Bruna V wrote:
> its stnoith implemented in the redhat cluster project

Yes, but it's called "Fencing".  Same concept: Issuing I/O barriers to
prevent data corruption when a node dies.

-- Lon


From pbruna at linuxcenterla.com  Sat Feb 19 03:27:47 2005
From: pbruna at linuxcenterla.com (Patricio Bruna V)
Date: Sat, 19 Feb 2005 00:27:47 -0300
Subject: [Linux-cluster] gnbd
Message-ID: <1108783667.2956.2.camel@p.linuxcenter.cl>

correct me if im wrong, with gnbd i can export a block device
("/dev/blockdevice") to other machine, node, and the other node can
mount it?

i hope im right
-- 
Patricio Bruna                                  http://www.linuxcenterla.com
Ingeniero de Proyectos                  Mariano S?nchez Fontecilla 310
Red Hat Certified Engineer              Las Condes, Santiago - CHILE
Linux Center Latinoamerica              Fono: 4834041
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050219/01924692/attachment.sig>

From jhahm at yahoo.com  Sat Feb 19 04:46:35 2005
From: jhahm at yahoo.com (Jiho Hahm)
Date: Fri, 18 Feb 2005 20:46:35 -0800 (PST)
Subject: Fwd: [Linux-cluster] Specifying start/stop order of resources in a
	<resourcegroup>
Message-ID: <20050219044635.94909.qmail@web50901.mail.yahoo.com>

Hi, does anyone know the answer to my question below? 
Haven't heard back since I posted it...  Thanks!

-Jiho

--- Jiho Hahm <jhahm at yahoo.com> wrote:

> Date: Tue, 15 Feb 2005 16:22:44 -0800 (PST)
> From: Jiho Hahm <jhahm at yahoo.com>
> To: linux-cluster at redhat.com
> Subject: [Linux-cluster] Specifying start/stop order of
> resources in a
> 	<resourcegroup>
> 
> Hi,
> 
> I'm having some trouble with configuring start/stop order
> of resources in a resource group.  When I specify start
> and
> stop level values in resource elements, they are ignored.
> 
> Resources are always started and stopped according to the
> type-specific level specified in
> cluster/rgmanager/src/resources/resourcegroup.sh (or
> /usr/share/cluster/resourcegroup.sh).
> 
> What I basically want to do during startup of an RG is
> mount a couple of ext3 filesystems in a certain order,
> run
> a custom application, and finally bring up an IP address.
> 
> During shutdown I want to do exactly the opposite: bring
> down IP address, stop application, and unmount volumes in
> reverse order.
> 
> Here's what I have in cluster.conf:
> 
> <cluster ...>
>   <...>
>   <rm>
>     <failoverdomains>...</failoverdomains>
>     <resources/>
>     <resourcegroup name="rg1" domain="fd1">
>       <fs name="foo" fstype="ext3"
>           device="/dev/sdb1" mountpoint="/foo"
>           start="1" stop="4"/>
>       <fs name="foobar" fstype="ext3"
>           device="/dev/sdb2" mountpoint="/foo/bar"
>           start="2" stop="3"/>
>       <script name="myapp" file="..."
>           start="3" stop="2"/>
>       <ip address="..." monitor_link="yes"
>           start="4" stop="1"/>
>     </resourcegroup>
>   </rm>
> </cluster>
> 
> The intention is to start the resource top-down, and stop
> them bottom-up.  Notice the foobar volume mounts as
> subdirectory of foo volume.  foo must be mounted first,
> and
> unmounted last.
> 
> But the actual start order with the above configuration
> turns out to be fs-foo, fs-foobar, ip and script.  The
> order was determined by type-specific default start level
> in resourcegroups.sh (fs=2, ip=3, script=5), and then top
> to bottom.
> 
> The stop sequence was apparently the same as start
> sequence.  When I ran "clusvcadm -s rg1" to stop the
> resource group, the first thing tried was unmounting foo,
> which failed because foobar wasn't unmounted first.  ip
> or
> script was not tried before fs, which leads me to guess
> stop sequence is determined by default _start_ level
> rather
> than stop level.
> 
> Judging by the stop behavior I think there is a bug
> somewhere.  But did I specify start/stop levels in my
> cluster.conf incorrectly?
> 
> Regards,
> 
> -Jiho
> 
> 
> 		
> __________________________________ 
> Do you Yahoo!? 
> Yahoo! Mail - 250MB free storage. Do more. Manage less. 
> http://info.mail.yahoo.com/mail_250
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster
> 


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From yazan at ccs.com.jo  Sat Feb 19 12:06:23 2005
From: yazan at ccs.com.jo (Yazan Al-Sheyyab)
Date: Sat, 19 Feb 2005 14:06:23 +0200
Subject: [Linux-cluster] raw device
References: <011701c512c3$ba1a84f0$69050364@yazanz>
	<1108749040.10064.41.camel@ayanami.boston.redhat.com>
Message-ID: <000401c5167b$6e615c30$69050364@yazanz>

Thanks Alot

  So i will use the lvm manager and made about 15 partition to use for each
RawDevice.

   is that OK ??? or i will make a mistake ?

 Regards

----- Original Message ----- 
From: "Lon Hohberger" <lhh at redhat.com>
To: "linux clistering" <linux-cluster at redhat.com>
Sent: Friday, February 18, 2005 7:50 PM
Subject: Re: [Linux-cluster] raw device


> On Mon, 2005-02-14 at 20:33 +0200, Yazan Al-Sheyyab wrote:
> > hi all,
> >
> >   maybe i asked this question before So please execuse me about this.
> >
> >  can i put more than one rawdevice in the same partition, i mean can i
put
> > in the /etc/sysconfig/rawdevices file more than one raw related with
same
> > partition as :
> >
> >   /dev/raw/raw5 /dev/cciss/c0d0p3
> >   /dev/raw/raw6 /dev/cciss/c0d0p3
> >  .......... etc.
>
> Yes, but that's a bad idea.
>
> Furthermore, O_DIRECT has replaced raw devices.
>
> -- Lon
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster


From jhahm at yahoo.com  Sat Feb 19 17:51:15 2005
From: jhahm at yahoo.com (Jiho Hahm)
Date: Sat, 19 Feb 2005 09:51:15 -0800 (PST)
Subject: [Linux-cluster] Configuring CMAN timer/timeout values
Message-ID: <20050219175115.55220.qmail@web50908.mail.yahoo.com>

Hi,

Is it possible to configure the various timer/timeout
values in config_info struct (defined in
cman-kernel/src/config.h)?

The Symmetric Cluster Architecture document (sca.pdf) by
David Teigland suggests these values are configurable, but
they appear to be hard coded to the #define's in
cman-kernel/src/config.c file.

Specifically, I'm trying to modify hello_timer and
deadnode_timeout values.

-Jiho

__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From laza at yu.net  Sat Feb 19 18:30:18 2005
From: laza at yu.net (Lazar Obradovic)
Date: Sat, 19 Feb 2005 19:30:18 +0100
Subject: [Linux-cluster] lock_dlm: fence domain not found (?)
In-Reply-To: <20050218164030.04AC0B9F6B@poczta.interia.pl>
References: <20050218164030.04AC0B9F6B@poczta.interia.pl>
Message-ID: <1108837819.16581.102.camel@laza.eunet.yu>

> No more helpful info. I'll try to cycle both nodes together and see if it helps fencing. 
> BTW: I used "fence_tool join" with "-t 120" option before. Looks like it's been deprecated in recent
> GFS versions?

If you are sure that both nodes are clean (and you are, since you're
booting them both from ground zero), then you might want to start
fence_tool with "-c" option. 

-- 
Lazar Obradovic <laza at yu.net>
YUnet International, NOC


From laza at yu.net  Sat Feb 19 20:55:42 2005
From: laza at yu.net (Lazar Obradovic)
Date: Sat, 19 Feb 2005 21:55:42 +0100
Subject: [Linux-cluster] Configuring CMAN timer/timeout values
In-Reply-To: <20050219175115.55220.qmail@web50908.mail.yahoo.com>
References: <20050219175115.55220.qmail@web50908.mail.yahoo.com>
Message-ID: <1108846542.16581.107.camel@laza.eunet.yu>

> Is it possible to configure the various timer/timeout
> values in config_info struct (defined in
> cman-kernel/src/config.h)?

check /proc/cluster/config/cman/ directory... 

eg. 

echo 1 > /proc/cluster/config/cman/hello_timer

-- 
Lazar Obradovic <laza at yu.net>
YUnet International, NOC


From teigland at redhat.com  Sun Feb 20 05:35:53 2005
From: teigland at redhat.com (David Teigland)
Date: Sun, 20 Feb 2005 13:35:53 +0800
Subject: [Linux-cluster] lock_dlm: fence domain not found (?)
In-Reply-To: <1108837819.16581.102.camel@laza.eunet.yu>
References: <20050218164030.04AC0B9F6B@poczta.interia.pl>
	<1108837819.16581.102.camel@laza.eunet.yu>
Message-ID: <20050220053553.GA8814@redhat.com>

On Sat, Feb 19, 2005 at 07:30:18PM +0100, Lazar Obradovic wrote:

> > No more helpful info. I'll try to cycle both nodes together and see if
> > it helps fencing.  BTW: I used "fence_tool join" with "-t 120" option
> > before. Looks like it's been deprecated in recent GFS versions?
> 
> If you are sure that both nodes are clean (and you are, since you're
> booting them both from ground zero), then you might want to start
> fence_tool with "-c" option. 

fence_manual now detects when a victim node has rejoined the cluster and
takes that as an ack.  So, you shouldn't even notice spurious fences
during startup if you're using a recent fence_manual.

-- 
Dave Teigland  <teigland at redhat.com>


From bastian at waldi.eu.org  Sun Feb 20 13:18:48 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Sun, 20 Feb 2005 14:18:48 +0100
Subject: [Linux-cluster] [PATCH] cman-kernel - ioctl32 support
Message-ID: <20050220131848.GA3293@wavehammer.waldi.eu.org>

The attached patch adds ioctl32 support to cman.

Bastian

-- 
... The prejudices people feel about each other disappear when they get
to know each other.
		-- Kirk, "Elaan of Troyius", stardate 4372.5
-------------- next part --------------
=== src/Makefile
==================================================================
--- src/Makefile  (revision 334)
+++ src/Makefile  (local)
@@ -25,7 +25,7 @@
 PWD := $(shell pwd)
 
 obj-m := cman.o
-cman-objs := cnxman.o config.o membership.o proc.o \
+cman-objs := cnxman.o cnxman-ioctl32.o config.o membership.o proc.o \
 	sm_barrier.o sm_control.o sm_daemon.o sm_joinleave.o\
 	sm_membership.o sm_message.o sm_misc.o sm_recover.o sm_services.o \
 	sm_user.o
=== src/cnxman-private.h
==================================================================
--- src/cnxman-private.h  (revision 334)
+++ src/cnxman-private.h  (local)
@@ -435,4 +435,7 @@
 
 #endif				/* __KERNEL */
 
+extern void cnxman_ioctl32_init(void);
+extern void cnxman_ioctl32_exit(void);
+
 #endif
=== src/cnxman.c
==================================================================
--- src/cnxman.c  (revision 334)
+++ src/cnxman.c  (local)
@@ -4200,6 +4200,9 @@
 
 	atomic_set(&cnxman_running, 0);
 
+#ifdef CONFIG_COMPAT
+	cnxman_ioctl32_init();
+#endif
 	sm_init();
 
 	return 0;
@@ -4211,6 +4214,9 @@
 	cleanup_proc_entries();
 #endif
 
+#ifdef CONFIG_COMPAT
+	cnxman_ioctl32_exit();
+#endif
 	sock_unregister(AF_CLUSTER);
 	kmem_cache_destroy(cluster_sk_cachep);
 }
=== src/cnxman-ioctl32.c
==================================================================
--- src/cnxman-ioctl32.c  (revision 334)
+++ src/cnxman-ioctl32.c  (local)
@@ -0,0 +1,152 @@
+/******************************************************************************
+*******************************************************************************
+**
+**  Copyright (C) 2005 Bastian Blank <waldi at debian.org>
+**
+**  This copyrighted material is made available to anyone wishing to use,
+**  modify, copy, or redistribute it subject to the terms and conditions
+**  of the GNU General Public License v.2.
+**
+*******************************************************************************
+******************************************************************************/
+
+#include <linux/module.h>
+#include <linux/compat.h>
+#include <linux/ioctl32.h>
+#include <linux/syscalls.h>
+#include <net/sock.h>
+
+#include <cluster/cnxman.h>
+#include <cluster/service.h>
+
+#include "cnxman-private.h"
+
+#ifdef CONFIG_COMPAT
+
+static int do_ioctl32_pointer(unsigned int fd, unsigned int cmd, unsigned long arg, struct file *f)
+{
+	return sys_ioctl(fd, cmd, (unsigned long)compat_ptr(arg));
+}
+
+static int do_ioctl32_ulong(unsigned int fd, unsigned int cmd, unsigned long arg, struct file *f)
+{
+	return sys_ioctl(fd, cmd, arg);
+}
+
+struct ioctl32_cl_cluster_nodelist {
+	uint32_t max_members;
+	uint32_t nodes;
+};
+
+#define IOCTL32_SIOCCLUSTER_GETMEMBERS		_IOR('x', 0x03, struct ioctl32_cl_cluster_nodelist)
+#define IOCTL32_SIOCCLUSTER_GETALLMEMBERS	_IOR('x', 0x07, struct ioctl32_cl_cluster_nodelist)
+#define IOCTL32_SIOCCLUSTER_SERVICE_GETMEMBERS	_IOR('x', 0x60, struct ioctl32_cl_cluster_nodelist)
+
+static int do_ioctl32_cl_cluster_nodelist(unsigned int fd, unsigned int cmd, unsigned long arg, struct file *f)
+{
+	struct ioctl32_cl_cluster_nodelist i32;
+	struct cl_cluster_nodelist i64, __user *p64 = NULL;
+	unsigned int ncmd, r;
+
+	if (arg)
+	{
+		if (copy_from_user(&i32, compat_ptr(arg), sizeof(struct ioctl32_cl_cluster_nodelist)))
+			return -EFAULT;
+		r = copy_from_user(&i32, compat_ptr(arg), sizeof(struct ioctl32_cl_cluster_nodelist));
+
+		i64.max_members = i32.max_members;
+		i64.nodes = compat_ptr(i32.nodes);
+
+		p64 = compat_alloc_user_space(sizeof(struct cl_cluster_nodelist));
+		if (copy_to_user(p64, &i64, sizeof(struct cl_cluster_nodelist)))
+			return -EFAULT;
+	}
+
+	switch(cmd)
+	{
+		case IOCTL32_SIOCCLUSTER_GETMEMBERS:
+			ncmd = SIOCCLUSTER_GETMEMBERS;
+			break;
+		case IOCTL32_SIOCCLUSTER_GETALLMEMBERS:
+			ncmd = SIOCCLUSTER_GETALLMEMBERS;
+			break;
+		case IOCTL32_SIOCCLUSTER_SERVICE_GETMEMBERS:
+			ncmd = SIOCCLUSTER_SERVICE_GETMEMBERS;
+			break;
+		default:
+			return -EINVAL;
+	}
+
+	return sys_ioctl(fd, ncmd, (unsigned long)p64);
+}
+
+void __init cnxman_ioctl32_init(void)
+{
+	register_ioctl32_conversion(SIOCCLUSTER_NOTIFY, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_REMOVENOTIFY, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_GET_VERSION, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_SET_VERSION, do_ioctl32_pointer);
+	register_ioctl32_conversion(IOCTL32_SIOCCLUSTER_GETMEMBERS, do_ioctl32_cl_cluster_nodelist);
+	register_ioctl32_conversion(IOCTL32_SIOCCLUSTER_GETALLMEMBERS, do_ioctl32_cl_cluster_nodelist);
+	register_ioctl32_conversion(SIOCCLUSTER_GETNODE, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_GETCLUSTER, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_ISQUORATE, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_ISACTIVE, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SETEXPECTED_VOTES, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SET_VOTES, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_ISLISTENING, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_KILLNODE, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_GET_JOINCOUNT, do_ioctl32_ulong);
+//	register_ioctl32_conversion(SIOCCLUSTER_BARRIER
+	register_ioctl32_conversion(SIOCCLUSTER_PASS_SOCKET, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_SET_NODENAME, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_SET_NODEID, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_JOIN_CLUSTER, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_LEAVE_CLUSTER, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_REGISTER, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_UNREGISTER, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_JOIN, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_LEAVE, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_SETSIGNAL, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_STARTDONE, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_GETEVENT, do_ioctl32_pointer);
+	register_ioctl32_conversion(IOCTL32_SIOCCLUSTER_SERVICE_GETMEMBERS, do_ioctl32_cl_cluster_nodelist);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_GLOBALID, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_SETLEVEL, do_ioctl32_ulong);
+}
+
+void __exit cnxman_ioctl32_exit(void)
+{
+	unregister_ioctl32_conversion(SIOCCLUSTER_NOTIFY);
+	unregister_ioctl32_conversion(SIOCCLUSTER_REMOVENOTIFY);
+	unregister_ioctl32_conversion(SIOCCLUSTER_GET_VERSION);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SET_VERSION);
+	unregister_ioctl32_conversion(IOCTL32_SIOCCLUSTER_GETMEMBERS);
+	unregister_ioctl32_conversion(IOCTL32_SIOCCLUSTER_GETALLMEMBERS);
+	unregister_ioctl32_conversion(SIOCCLUSTER_GETNODE);
+	unregister_ioctl32_conversion(SIOCCLUSTER_GETCLUSTER);
+	unregister_ioctl32_conversion(SIOCCLUSTER_ISQUORATE);
+	unregister_ioctl32_conversion(SIOCCLUSTER_ISACTIVE);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SETEXPECTED_VOTES);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SET_VOTES);
+	unregister_ioctl32_conversion(SIOCCLUSTER_ISLISTENING);
+	unregister_ioctl32_conversion(SIOCCLUSTER_KILLNODE);
+	unregister_ioctl32_conversion(SIOCCLUSTER_GET_JOINCOUNT);
+	unregister_ioctl32_conversion(SIOCCLUSTER_BARRIER);
+	unregister_ioctl32_conversion(SIOCCLUSTER_PASS_SOCKET);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SET_NODENAME);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SET_NODEID);
+	unregister_ioctl32_conversion(SIOCCLUSTER_JOIN_CLUSTER);
+	unregister_ioctl32_conversion(SIOCCLUSTER_LEAVE_CLUSTER);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_REGISTER);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_UNREGISTER);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_JOIN);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_LEAVE);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_SETSIGNAL);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_STARTDONE);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_GETEVENT);
+	unregister_ioctl32_conversion(IOCTL32_SIOCCLUSTER_SERVICE_GETMEMBERS);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_GLOBALID);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_SETLEVEL);
+}
+#endif /* CONFIG_COMPAT */
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050220/dc688256/attachment.sig>

From bastian at waldi.eu.org  Sun Feb 20 14:54:10 2005
From: bastian at waldi.eu.org (Bastian Blank)
Date: Sun, 20 Feb 2005 15:54:10 +0100
Subject: [Linux-cluster] [PATCH] cman-kernel - ioctl32 support
In-Reply-To: <20050220131848.GA3293@wavehammer.waldi.eu.org>
References: <20050220131848.GA3293@wavehammer.waldi.eu.org>
Message-ID: <20050220145410.GA13293@wavehammer.waldi.eu.org>

On Sun, Feb 20, 2005 at 02:18:48PM +0100, Bastian Blank wrote:
> The attached patch adds ioctl32 support to cman.

The updated patch fixes SIOCCLUSTER_SET_NODENAME, which needs
conversation.

Bastian

-- 
Well, Jim, I'm not much of an actor either.
-------------- next part --------------
=== cnxman.c
==================================================================
--- cnxman.c  (revision 346)
+++ cnxman.c  (revision 351)
@@ -4200,6 +4200,9 @@
 
 	atomic_set(&cnxman_running, 0);
 
+#ifdef CONFIG_COMPAT
+	cnxman_ioctl32_init();
+#endif
 	sm_init();
 
 	return 0;
@@ -4211,6 +4214,9 @@
 	cleanup_proc_entries();
 #endif
 
+#ifdef CONFIG_COMPAT
+	cnxman_ioctl32_exit();
+#endif
 	sock_unregister(AF_CLUSTER);
 	kmem_cache_destroy(cluster_sk_cachep);
 }
=== Makefile
==================================================================
--- Makefile  (revision 346)
+++ Makefile  (revision 351)
@@ -29,6 +29,10 @@
 	sm_barrier.o sm_control.o sm_daemon.o sm_joinleave.o\
 	sm_membership.o sm_message.o sm_misc.o sm_recover.o sm_services.o \
 	sm_user.o
+
+ifeq ($(CONFIG_COMPAT),y)
+cman-objs += cnxman-ioctl32.o
+endif
  
 EXTRA_CFLAGS += -I$(obj)
 
=== cnxman-ioctl32.c
==================================================================
--- cnxman-ioctl32.c  (revision 346)
+++ cnxman-ioctl32.c  (revision 351)
@@ -0,0 +1,168 @@
+/******************************************************************************
+*******************************************************************************
+**
+**  Copyright (C) 2005 Bastian Blank <waldi at debian.org>
+**
+**  This copyrighted material is made available to anyone wishing to use,
+**  modify, copy, or redistribute it subject to the terms and conditions
+**  of the GNU General Public License v.2.
+**
+*******************************************************************************
+******************************************************************************/
+
+#include <linux/module.h>
+#include <linux/compat.h>
+#include <linux/ioctl32.h>
+#include <linux/syscalls.h>
+#include <net/sock.h>
+
+#include <cluster/cnxman.h>
+#include <cluster/service.h>
+
+#include "cnxman-private.h"
+
+static int do_ioctl32_pointer(unsigned int fd, unsigned int cmd, unsigned long arg, struct file *f)
+{
+	return sys_ioctl(fd, cmd, (unsigned long)compat_ptr(arg));
+}
+
+static int do_ioctl32_ulong(unsigned int fd, unsigned int cmd, unsigned long arg, struct file *f)
+{
+	return sys_ioctl(fd, cmd, arg);
+}
+
+struct ioctl32_cl_cluster_nodelist {
+	uint32_t max_members;
+	uint32_t nodes;
+};
+
+#define IOCTL32_SIOCCLUSTER_GETMEMBERS		_IOR('x', 0x03, struct ioctl32_cl_cluster_nodelist)
+#define IOCTL32_SIOCCLUSTER_GETALLMEMBERS	_IOR('x', 0x07, struct ioctl32_cl_cluster_nodelist)
+#define IOCTL32_SIOCCLUSTER_SERVICE_GETMEMBERS	_IOR('x', 0x60, struct ioctl32_cl_cluster_nodelist)
+
+static int do_ioctl32_cl_cluster_nodelist(unsigned int fd, unsigned int cmd, unsigned long arg, struct file *f)
+{
+	struct ioctl32_cl_cluster_nodelist i32;
+	struct cl_cluster_nodelist i64, __user *p64 = NULL;
+	unsigned int ncmd, r;
+
+	if (arg)
+	{
+		if (copy_from_user(&i32, compat_ptr(arg), sizeof(struct ioctl32_cl_cluster_nodelist)))
+			return -EFAULT;
+		r = copy_from_user(&i32, compat_ptr(arg), sizeof(struct ioctl32_cl_cluster_nodelist));
+
+		i64.max_members = i32.max_members;
+		i64.nodes = compat_ptr(i32.nodes);
+
+		p64 = compat_alloc_user_space(sizeof(struct cl_cluster_nodelist));
+		if (copy_to_user(p64, &i64, sizeof(struct cl_cluster_nodelist)))
+			return -EFAULT;
+	}
+
+	switch(cmd)
+	{
+		case IOCTL32_SIOCCLUSTER_GETMEMBERS:
+			ncmd = SIOCCLUSTER_GETMEMBERS;
+			break;
+		case IOCTL32_SIOCCLUSTER_GETALLMEMBERS:
+			ncmd = SIOCCLUSTER_GETALLMEMBERS;
+			break;
+		case IOCTL32_SIOCCLUSTER_SERVICE_GETMEMBERS:
+			ncmd = SIOCCLUSTER_SERVICE_GETMEMBERS;
+			break;
+		default:
+			return -EINVAL;
+	}
+
+	return sys_ioctl(fd, ncmd, (unsigned long)p64);
+}
+
+#define IOCTL32_SIOCCLUSTER_SET_NODENAME	_IOW('x', 0xb1, uint32_t)
+
+static int do_ioctl32_char_p(unsigned int fd, unsigned int cmd, unsigned long arg, struct file *f)
+{
+	unsigned int ncmd;
+
+	switch(cmd)
+	{
+		case IOCTL32_SIOCCLUSTER_SET_NODENAME:
+			ncmd = SIOCCLUSTER_SET_NODENAME;
+			break;
+		default:
+			return -EINVAL;
+	}
+
+	return sys_ioctl(fd, ncmd, (unsigned long)compat_ptr(arg));
+}
+
+void __init cnxman_ioctl32_init(void)
+{
+	register_ioctl32_conversion(SIOCCLUSTER_NOTIFY, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_REMOVENOTIFY, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_GET_VERSION, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_SET_VERSION, do_ioctl32_pointer);
+	register_ioctl32_conversion(IOCTL32_SIOCCLUSTER_GETMEMBERS, do_ioctl32_cl_cluster_nodelist);
+	register_ioctl32_conversion(IOCTL32_SIOCCLUSTER_GETALLMEMBERS, do_ioctl32_cl_cluster_nodelist);
+	register_ioctl32_conversion(SIOCCLUSTER_GETNODE, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_GETCLUSTER, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_ISQUORATE, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_ISACTIVE, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SETEXPECTED_VOTES, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SET_VOTES, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_ISLISTENING, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_KILLNODE, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_GET_JOINCOUNT, do_ioctl32_ulong);
+//	register_ioctl32_conversion(SIOCCLUSTER_BARRIER
+	register_ioctl32_conversion(SIOCCLUSTER_PASS_SOCKET, do_ioctl32_pointer);
+	register_ioctl32_conversion(IOCTL32_SIOCCLUSTER_SET_NODENAME, do_ioctl32_char_p);
+	register_ioctl32_conversion(SIOCCLUSTER_SET_NODEID, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_JOIN_CLUSTER, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_LEAVE_CLUSTER, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_REGISTER, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_UNREGISTER, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_JOIN, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_LEAVE, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_SETSIGNAL, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_STARTDONE, do_ioctl32_ulong);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_GETEVENT, do_ioctl32_pointer);
+	register_ioctl32_conversion(IOCTL32_SIOCCLUSTER_SERVICE_GETMEMBERS, do_ioctl32_cl_cluster_nodelist);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_GLOBALID, do_ioctl32_pointer);
+	register_ioctl32_conversion(SIOCCLUSTER_SERVICE_SETLEVEL, do_ioctl32_ulong);
+}
+
+void __exit cnxman_ioctl32_exit(void)
+{
+	unregister_ioctl32_conversion(SIOCCLUSTER_NOTIFY);
+	unregister_ioctl32_conversion(SIOCCLUSTER_REMOVENOTIFY);
+	unregister_ioctl32_conversion(SIOCCLUSTER_GET_VERSION);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SET_VERSION);
+	unregister_ioctl32_conversion(IOCTL32_SIOCCLUSTER_GETMEMBERS);
+	unregister_ioctl32_conversion(IOCTL32_SIOCCLUSTER_GETALLMEMBERS);
+	unregister_ioctl32_conversion(SIOCCLUSTER_GETNODE);
+	unregister_ioctl32_conversion(SIOCCLUSTER_GETCLUSTER);
+	unregister_ioctl32_conversion(SIOCCLUSTER_ISQUORATE);
+	unregister_ioctl32_conversion(SIOCCLUSTER_ISACTIVE);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SETEXPECTED_VOTES);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SET_VOTES);
+	unregister_ioctl32_conversion(SIOCCLUSTER_ISLISTENING);
+	unregister_ioctl32_conversion(SIOCCLUSTER_KILLNODE);
+	unregister_ioctl32_conversion(SIOCCLUSTER_GET_JOINCOUNT);
+//	unregister_ioctl32_conversion(SIOCCLUSTER_BARRIER);
+	unregister_ioctl32_conversion(SIOCCLUSTER_PASS_SOCKET);
+	unregister_ioctl32_conversion(IOCTL32_SIOCCLUSTER_SET_NODENAME);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SET_NODEID);
+	unregister_ioctl32_conversion(SIOCCLUSTER_JOIN_CLUSTER);
+	unregister_ioctl32_conversion(SIOCCLUSTER_LEAVE_CLUSTER);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_REGISTER);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_UNREGISTER);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_JOIN);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_LEAVE);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_SETSIGNAL);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_STARTDONE);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_GETEVENT);
+	unregister_ioctl32_conversion(IOCTL32_SIOCCLUSTER_SERVICE_GETMEMBERS);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_GLOBALID);
+	unregister_ioctl32_conversion(SIOCCLUSTER_SERVICE_SETLEVEL);
+}
+
=== cnxman-private.h
==================================================================
--- cnxman-private.h  (revision 346)
+++ cnxman-private.h  (revision 351)
@@ -433,6 +433,11 @@
 #define C_MEMB(fmt, args...)
 #endif
 
+#ifdef CONFIG_COMPAT
+extern void cnxman_ioctl32_init(void);
+extern void cnxman_ioctl32_exit(void);
+#endif
+
 #endif				/* __KERNEL */
 
 #endif
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050220/1d6b15f4/attachment.sig>

From pcaulfie at redhat.com  Sun Feb 20 19:30:17 2005
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Sun, 20 Feb 2005 19:30:17 +0000
Subject: [Linux-cluster] Configuring CMAN timer/timeout values
In-Reply-To: <1108846542.16581.107.camel@laza.eunet.yu>
References: <20050219175115.55220.qmail@web50908.mail.yahoo.com>
	<1108846542.16581.107.camel@laza.eunet.yu>
Message-ID: <20050220193017.GB20210@tykepenguin.com>

On Sat, Feb 19, 2005 at 09:55:42PM +0100, Lazar Obradovic wrote:
> > Is it possible to configure the various timer/timeout
> > values in config_info struct (defined in
> > cman-kernel/src/config.h)?
> 
> check /proc/cluster/config/cman/ directory... 
> 
> eg. 
> 
> echo 1 > /proc/cluster/config/cman/hello_timer
> 

It's important to note that most of these should only be changed before the node
joins a cluster.

-- 

patrick


From fajar at telkom.co.id  Mon Feb 21 00:57:36 2005
From: fajar at telkom.co.id (Fajar A. Nugraha)
Date: Mon, 21 Feb 2005 07:57:36 +0700
Subject: [Linux-cluster] cluster latest cvs does not work?
In-Reply-To: <1108748854.10064.39.camel@ayanami.boston.redhat.com>
References: <42158AA2.7030901@telkom.co.id>
	<1108748854.10064.39.camel@ayanami.boston.redhat.com>
Message-ID: <42193200.4070403@telkom.co.id>

Lon Hohberger wrote:

>On Fri, 2005-02-18 at 13:26 +0700, Fajar A. Nugraha wrote:
>
>  
>
>>[root at lincluster1 root]# ccsd -4
>>Failed to connect to cluster manager.
>>Hint: Magma plugins are not in the right spot.
>>    
>>
>
>Reinstall magma and magma-plugins and it should work fine.
>
>  
>
Isn't magma and magma plugins installed when you run "make install" on 
cluster source directory?


From pcaulfie at redhat.com  Mon Feb 21 08:21:23 2005
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Mon, 21 Feb 2005 08:21:23 +0000
Subject: [Linux-cluster] LVM over a gnbd exported device
In-Reply-To: <42161DAF.1010901@histor.fr>
References: <42161DAF.1010901@histor.fr>
Message-ID: <20050221082123.GA829@tykepenguin.com>

On Fri, Feb 18, 2005 at 05:54:07PM +0100, Ion Alberdi wrote:
> Hello.
> I'm currently seting up a three node cluster.
> I want to have a shared logical partition between the three nodes i.e 
> that the device
> /dev/vg1/lv1 appears on the three nodes.
> 
> There is what I tryed
> after initialing the cluster:
> 
> (ccsd 
> cman_tool join                   
> fence_tool join                  
> clvmd                            
> vgchange -aly 
> worked well)                
> 
> export /dev/hdb from the node debian to buba and gump.
> debian has /dev/hdb
> buba has /dev/gnbd/dd
> gump has /dev/gnbd/dd
> 

If you change the available block devices to a node them clvmd must (currently)
be restarted on all nodes. This includes adding new partitions or gnbd imports.
-- 

patrick


From pcaulfie at redhat.com  Mon Feb 21 08:23:15 2005
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Mon, 21 Feb 2005 08:23:15 +0000
Subject: [Linux-cluster] [PATCH] cman-kernel - ioctl32 support
In-Reply-To: <20050220145410.GA13293@wavehammer.waldi.eu.org>
References: <20050220131848.GA3293@wavehammer.waldi.eu.org>
	<20050220145410.GA13293@wavehammer.waldi.eu.org>
Message-ID: <20050221082315.GB829@tykepenguin.com>

On Sun, Feb 20, 2005 at 03:54:10PM +0100, Bastian Blank wrote:
> On Sun, Feb 20, 2005 at 02:18:48PM +0100, Bastian Blank wrote:
> > The attached patch adds ioctl32 support to cman.
> 
> The updated patch fixes SIOCCLUSTER_SET_NODENAME, which needs
> conversation.
> 

Great work, thanks.
-- 

patrick


From ialberdi at histor.fr  Mon Feb 21 09:02:47 2005
From: ialberdi at histor.fr (Ion Alberdi)
Date: Mon, 21 Feb 2005 10:02:47 +0100
Subject: [Linux-cluster] LVM over a gnbd exported device
In-Reply-To: <20050221082123.GA829@tykepenguin.com>
References: <42161DAF.1010901@histor.fr> <20050221082123.GA829@tykepenguin.com>
Message-ID: <4219A3B7.7070100@histor.fr>

Patrick Caulfield wrote:

>On Fri, Feb 18, 2005 at 05:54:07PM +0100, Ion Alberdi wrote:
>  
>
>>Hello.
>>I'm currently seting up a three node cluster.
>>I want to have a shared logical partition between the three nodes i.e 
>>that the device
>>/dev/vg1/lv1 appears on the three nodes.
>>
>>There is what I tryed
>>after initialing the cluster:
>>
>>(ccsd 
>>cman_tool join                   
>>fence_tool join                  
>>clvmd                            
>>vgchange -aly 
>>worked well)                
>>
>>export /dev/hdb from the node debian to buba and gump.
>>debian has /dev/hdb
>>buba has /dev/gnbd/dd
>>gump has /dev/gnbd/dd
>>
>>    
>>
>
>If you change the available block devices to a node them clvmd must (currently)
>be restarted on all nodes. This includes adding new partitions or gnbd imports.
>  
>
I restarted  clvmd after exporting/importing the devices on all nodes, 
and it worked.
Thanks!


From mswanson at sonitrol.net  Mon Feb 21 14:21:13 2005
From: mswanson at sonitrol.net (Marc Swanson)
Date: Mon, 21 Feb 2005 09:21:13 -0500
Subject: [Linux-cluster] gfs problem during shutdown of master lock server
Message-ID: <1108995673.3355.167.camel@wsmis3.sonitrol.net>

Hi,

We've been using redhat GFS 6.0.0-1.2 for a few months now with
generally very good reliability.  We have the system configured with 5
lock servers (2 of which are the servers that physically mount the
shared storage which is connected via shared scsi on an hp dl380
packaged cluster).  We've taken down slave lock servers before without
incident but recently we went to reboot the master lock server and the
filesystem became inaccessible from the other server.  The logs indicate
what _seems_ to be a successful change of the master lock server over to
one of the other nodes but then there are messages indicating that a
request for a lock on the filesystem results in a 'Busy' message.

We then took all the servers down and booted up one of the filesystem
mounting nodes and all was well, no need for fsck.

The 5 nodes are as follows:
cluster1 (lock + fs node, was master, shutdown initiated)
cluster2 (lock +fs node, becomes master on cluster1 shutdown)
lvs1 (lock node)
lvs2 (lock node)
intra4 (lock node)

the locking is done with pool on two volumes, pool_home and
pool_shared.  ccsd uses the shared storage for the cluster config info
on cluster1 and cluster2, and local files for the rest of the lock
servers.  If I need to provide more information I'd be happy to post
full config details.


Here are the messages in the logs on cluster2 around the time of the
shutdown.  Any help on how to prevent this lockup from happening (or
pointers on what I'm doing wrong) would be appreciated!

Thanks!

-------------------------
Feb 18 17:02:24 cluster2 kernel: lock_gulm: Checking for journals for
node "cluster1.sonitrol.net"
Feb 18 17:02:24 cluster2 lock_gulmd_core[1345]: Master Node has logged
out.
Feb 18 17:02:24 cluster2 kernel: lock_gulm: Checking for journals for
node "cluster1.sonitrol.net"
Feb 18 17:02:24 cluster2 lock_gulmd_core[1345]: ERROR [core_io.c:1029]
Got error from reply: (cluster1.sonitrol.net:172.16.6.131) 1:Unknown
GULM Err
Feb 18 17:02:24 cluster2 lock_gulmd_core[1345]: ERROR [core_io.c:1034]
Errors on xdr: (cluster1.sonitrol.net:172.16.6.131) -104:104:Connection
reset by peer
Feb 18 17:02:33 cluster2 lock_gulmd_core[1345]: I see no Masters, So I
am Arbitrating until enough Slaves talk to me.
Feb 18 17:02:33 cluster2 lock_gulmd_LTPX[1351]: New Master at
cluster2.sonitrol.net:172.16.6.132
Feb 18 17:02:48 cluster2 lock_gulmd_core[1345]: lvs1.sonitrol.net missed
a heartbeat (time:1108764168893978 mb:1)
Feb 18 17:02:48 cluster2 lock_gulmd_core[1345]: lvs2.sonitrol.net missed
a heartbeat (time:1108764168893978 mb:1)
Feb 18 17:02:48 cluster2 lock_gulmd_core[1345]: intra4 missed a
heartbeat (time:1108764168893978 mb:1)
Feb 18 17:02:49 cluster2 lock_gulmd_core[1345]: Still in Arbitrating:
Have 2, need 3 for quorum.
Feb 18 17:02:49 cluster2 lock_gulmd_core[1345]: New Client: idx:5 fd:10
from (172.16.6.150:intra4)
Feb 18 17:02:49 cluster2 lock_gulmd_core[1345]: Member update message
Logged in about intra4 to lvs1.sonitrol.net is lost because node is in
OM
Feb 18 17:02:49 cluster2 lock_gulmd_core[1345]: Member update message
Logged in about intra4 to lvs2.sonitrol.net is lost because node is in
OM
Feb 18 17:02:49 cluster2 lock_gulmd_core[1345]: Now have Slave quorum,
going full Master.
Feb 18 17:02:49 cluster2 lock_gulmd_core[1345]: New Client: idx:6 fd:11
from (172.16.6.231:lvs2.sonitrol.net)
Feb 18 17:02:49 cluster2 lock_gulmd_core[1345]: Member update message
Logged in about lvs2.sonitrol.net to lvs1.sonitrol.net is lost because
node is in OM
Feb 18 17:02:49 cluster2 lock_gulmd_LTPX[1351]: Logged into LT000 at
cluster2.sonitrol.net:172.16.6.132
Feb 18 17:02:49 cluster2 lock_gulmd_LTPX[1351]: Finished resending to
LT000
Feb 18 17:02:50 cluster2 lock_gulmd_LT000[1348]: New Client: idx 2 fd 7
from (172.16.6.132:cluster2.sonitrol.net)
Feb 18 17:02:50 cluster2 lock_gulmd_LT000[1348]: New Client: idx 3 fd 8
from (172.16.6.231:lvs2.sonitrol.net)
Feb 18 17:02:50 cluster2 lock_gulmd_core[1345]: New Client: idx:7 fd:12
from (172.16.6.230:lvs1.sonitrol.net)
Feb 18 17:02:50 cluster2 lock_gulmd_core[1345]: Timeout (15000000) on
fd:5 (cluster1.sonitrol.net:172.16.6.131)
Feb 18 17:02:52 cluster2 lock_gulmd_LT000[1348]: Attached slave
lvs2.sonitrol.net:172.16.6.231 idx:4 fd:9 (soff:3 connected:0x8)
Feb 18 17:02:52 cluster2 lock_gulmd_LT000[1348]: New Client: idx 5 fd 10
from (172.16.6.230:lvs1.sonitrol.net)
Feb 18 17:02:54 cluster2 lock_gulmd_LT000[1348]: Attached slave
lvs1.sonitrol.net:172.16.6.230 idx:6 fd:11 (soff:2 connected:0xc)
Feb 18 17:02:54 cluster2 lock_gulmd_LT000[1348]: New Client: idx 7 fd 12
from (172.16.6.150:intra4)
Feb 18 17:03:04 cluster2 lock_gulmd_LT000[1348]: Attached slave
intra4:172.16.6.150 idx:8 fd:13 (soff:1 connected:0xe)
Feb 18 17:03:04 cluster2 kernel: GFS: fsid=alpha:shared.1: jid=0: Trying
to acquire journal lock...
Feb 18 17:03:04 cluster2 kernel: GFS: fsid=alpha:shared.1: jid=0: Busy
Feb 18 17:03:04 cluster2 kernel: GFS: fsid=alpha:home.1: jid=0: Trying
to acquire journal lock...
Feb 18 17:03:04 cluster2 kernel: GFS: fsid=alpha:home.1: jid=0: Busy
Feb 18 17:03:04 cluster2 kernel: GFS: fsid=alpha:home.1: jid=0: Trying
to acquire journal lock...
Feb 18 17:03:04 cluster2 kernel: GFS: fsid=alpha:home.1: jid=0: Busy
Feb 18 17:03:04 cluster2 kernel: GFS: fsid=alpha:shared.1: jid=0: Trying
to acquire journal lock...
Feb 18 17:03:04 cluster2 kernel: GFS: fsid=alpha:shared.1: jid=0: Busy
Feb 18 17:05:03 cluster2 lock_gulmd_core[1345]: New Client: idx:1 fd:5
from (172.16.6.131:cluster1.sonitrol.net)
Feb 18 17:05:05 cluster2 lock_gulmd_LT000[1348]: Attached slave
cluster1.sonitrol.net:172.16.6.131 idx:9 fd:14 (soff:0 connected:0xf)
Feb 18 17:05:05 cluster2 lock_gulmd_LT000[1348]: New Client: idx 10 fd
15 from (172.16.6.131:cluster1.sonitrol.net)
Feb 18 17:05:06 cluster2 ypserv[1395]: refused connect from
172.16.6.131:753 to procedure ypproc_all (LTSP,auto.master;-4)
Feb 18 17:05:55 cluster2 login(pam_unix)[1828]: session opened for user
root by LOGIN(uid=0)
Feb 18 17:05:55 cluster2  -- root[1828]: ROOT LOGIN ON tty2
Feb 18 17:06:02 cluster2 lock_gulmd_core[1345]: "cluster1.sonitrol.net"
is logged out. fd:5
Feb 18 17:06:02 cluster2 kernel: lock_gulm: Checking for journals for
node "cluster1.sonitrol.net"
Feb 18 17:06:02 cluster2 lock_gulmd_LT000[1348]: EOF on xdr
(cluster1.sonitrol.net:172.16.6.131 idx:10 fd:15)
Feb 18 17:06:02 cluster2 kernel: GFS: fsid=alpha:shared.1: jid=0: Trying
to acquire journal lock...
Feb 18 17:06:02 cluster2 kernel: GFS: fsid=alpha:shared.1: jid=0: Busy
Feb 18 17:06:02 cluster2 kernel: GFS: fsid=alpha:home.1: jid=0: Trying
to acquire journal lock...
Feb 18 17:06:02 cluster2 kernel: GFS: fsid=alpha:home.1: jid=0: Busy


-----------------------------


-- 
----------------------------------
Marc Swanson, Software Engineer
Sonitrol Communications Corp.
Hartford, CT

Email: mswanson at sonitrol.net
Phone: (860) 616-7036
Pager: (860) 948-6713
 Cell: (603) 512-1267
  Fax: (860) 616-7589
----------------------------------


From pbruna at linuxcenterla.com  Mon Feb 21 16:10:13 2005
From: pbruna at linuxcenterla.com (Patricio Bruna V)
Date: Mon, 21 Feb 2005 13:10:13 -0300
Subject: [Linux-cluster] ccs problem
Message-ID: <1109002213.2858.10.camel@cluster2>

when i do fence_tool join i get: fence_tool: waiting for ccs connection
-111
so i try : 
ccs_test connect
ccs_connect failed: Connection refused

but whit ccs_test connect force
Force is set.
Connect successful.
 Connection descriptor = 0

so i think i have a problem with ccs, any ideas?
-- 
Patricio Bruna                                  http://www.linuxcenterla.com
Ingeniero de Proyectos                  Mariano S?nchez Fontecilla 310
Red Hat Certified Engineer              Las Condes, Santiago - CHILE
Linux Center Latinoamerica              Fono: 4834041
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050221/c65596f9/attachment.sig>

From teigland at redhat.com  Mon Feb 21 16:17:47 2005
From: teigland at redhat.com (David Teigland)
Date: Tue, 22 Feb 2005 00:17:47 +0800
Subject: [Linux-cluster] ccs problem
In-Reply-To: <1109002213.2858.10.camel@cluster2>
References: <1109002213.2858.10.camel@cluster2>
Message-ID: <20050221161747.GA11143@redhat.com>

On Mon, Feb 21, 2005 at 01:10:13PM -0300, Patricio Bruna V wrote:
> when i do fence_tool join i get: fence_tool: waiting for ccs connection
> -111

Did you run cman_tool join ?   If so what does cman_tool status say?

-- 
Dave Teigland  <teigland at redhat.com>


From lhh at redhat.com  Mon Feb 21 16:34:38 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Mon, 21 Feb 2005 11:34:38 -0500
Subject: [Linux-cluster] ccs problem
In-Reply-To: <1109002213.2858.10.camel@cluster2>
References: <1109002213.2858.10.camel@cluster2>
Message-ID: <1109003678.20325.21.camel@ayanami.boston.redhat.com>

On Mon, 2005-02-21 at 13:10 -0300, Patricio Bruna V wrote:
> when i do fence_tool join i get: fence_tool: waiting for ccs connection
> -111
> so i try : 
> ccs_test connect
> ccs_connect failed: Connection refused
>
> but whit ccs_test connect force
> Force is set.
> Connect successful.
>  Connection descriptor = 0

The node has to be quorate in order for CCS to accept non-forced
connections.

-- Lon


From rajkum2002 at rediffmail.com  Mon Feb 21 17:12:36 2005
From: rajkum2002 at rediffmail.com (Raj  Kumar)
Date: 21 Feb 2005 17:12:36 -0000
Subject: [Linux-cluster] lock_gulm ERROR
Message-ID: <20050221171236.19549.qmail@webmail46.rediffmail.com>

Hello,

We have been running GFS for sometime now and it was very unreliable until we found a problem today. The GFS mount points are not accessible (cd /mnt/gfs just hangup) at all. This error message was available in the logs:


Feb 19 16:06:27 hpdl3801 kernel: lock_gulm: ERROR gulm_LT_recver err -110

How do I debug this problem and restore the filesystem to a clean state. Any tips on avoiding this type of errors would be very helpful.

Thanks,
Raj
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050221/7d5de003/attachment.htm>

From mtilstra at redhat.com  Mon Feb 21 17:18:21 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Mon, 21 Feb 2005 11:18:21 -0600
Subject: [Linux-cluster] lock_gulm ERROR
In-Reply-To: <20050221171236.19549.qmail@webmail46.rediffmail.com>
References: <20050221171236.19549.qmail@webmail46.rediffmail.com>
Message-ID: <421A17DD.4060209@redhat.com>

Raj Kumar wrote:
> We have been running GFS for sometime now and it was very unreliable 
> until we found a problem today. The GFS mount points are not accessible 
> (cd /mnt/gfs just hangup) at all. This error message was available in 
> the logs:
> 
> 
> Feb 19 16:06:27 hpdl3801 kernel: lock_gulm: ERROR gulm_LT_recver err -110
> 
> How do I debug this problem and restore the filesystem to a clean state. 
> Any tips on avoiding this type of errors would be very helpful.

what did gulm print to the logs before that message?

-- 
Michael Conrad Tadpol Tilstra
Sometimes you can't hear me, because I'm in ()
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 256 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050221/9e62616f/attachment.sig>

From rajkum2002 at rediffmail.com  Mon Feb 21 17:50:24 2005
From: rajkum2002 at rediffmail.com (Raj  Kumar)
Date: 21 Feb 2005 17:50:24 -0000
Subject: [Linux-cluster] lock_gulm ERROR
Message-ID: <20050221175024.10589.qmail@webmail18.rediffmail.com>

There were no messages!! 

/var/log> grep -i gulm me*
messages:Feb 21 10:56:29 server2 lock_gulmd: In main_main.c:254 (v6.0.0) death by: Extra command line parameters. Try again. 
messages.1:Feb 19 16:06:27 server2 kernel: lock_gulm: ERROR gulm_LT_recver err -110

I issued lock_gulmd status this morning to check the status of lock servers... The first error message must be from this command since status is not a valid parameter. I was actually trying to 

server2 is the client lock server and server1 is the master lock server. There are no error messages on the master lock server either.

Looks like the error messages were not lost: 

/var/log> ls -l mes*
-rw-------    1 root     root       219201 Feb 21 11:42 messages
-rw-------    1 root     root      1174142 Feb 20 04:02 messages.1
-rw-------    1 root     root      1496714 Feb 13 04:02 messages.2
-rw-------    1 root     root       873816 Feb  6 04:02 messages.3
-rw-------    1 root     root        31158 Jan 30 04:02 messages.4


Thank you,
Raj


On Mon, 21 Feb 2005 Michael Conrad Tadpol Tilstra wrote :
>Raj Kumar wrote:
>>We have been running GFS for sometime now and it was very unreliable until we found a problem today. The GFS mount points are not accessible (cd /mnt/gfs just hangup) at all. This error message was available in the logs:
>>
>>
>>Feb 19 16:06:27 hpdl3801 kernel: lock_gulm: ERROR gulm_LT_recver err -110
>>
>>How do I debug this problem and restore the filesystem to a clean state. Any tips on avoiding this type of errors would be very helpful.
>
>what did gulm print to the logs before that message?
>
>-- Michael Conrad Tadpol Tilstra
>Sometimes you can't hear me, because I'm in ()
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>http://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050221/1210bff6/attachment.htm>

From mtilstra at redhat.com  Mon Feb 21 18:23:28 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Mon, 21 Feb 2005 12:23:28 -0600
Subject: [Linux-cluster] lock_gulm ERROR
In-Reply-To: <20050221175024.10589.qmail@webmail18.rediffmail.com>
References: <20050221175024.10589.qmail@webmail18.rediffmail.com>
Message-ID: <421A2720.8020607@redhat.com>

Raj Kumar wrote:
> There were no messages!!
> 
> /var/log> grep -i gulm me*
> messages:Feb 21 10:56:29 server2 lock_gulmd: In main_main.c:254 (v6.0.0) 
> death by: Extra command line parameters. Try again.
> messages.1:Feb 19 16:06:27 server2 kernel: lock_gulm: ERROR 
> gulm_LT_recver err -110
> 
> I issued lock_gulmd status this morning to check the status of lock 
> servers... The first error message must be from this command since 
> status is not a valid parameter. I was actually trying to

Try `gulm_tool getstats <server>`

-- 
Michael Conrad Tadpol Tilstra

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 256 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050221/a9a3623a/attachment.sig>

From pbruna at linuxcenterla.com  Mon Feb 21 19:01:48 2005
From: pbruna at linuxcenterla.com (Patricio Bruna V)
Date: Mon, 21 Feb 2005 16:01:48 -0300
Subject: [Linux-cluster] redhat cluster, or cluster project questions?
Message-ID: <1109012508.2891.4.camel@cluster2>

what are the diferences between linux-ha and the redhat-cluster project?
or they complement each other?

how openssi play here?
-- 
Patricio Bruna                                  http://www.linuxcenterla.com
Ingeniero de Proyectos                  Mariano S?nchez Fontecilla 310
Red Hat Certified Engineer              Las Condes, Santiago - CHILE
Linux Center Latinoamerica              Fono: 4834041
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050221/11aa6953/attachment.sig>

From lmb at suse.de  Mon Feb 21 19:16:02 2005
From: lmb at suse.de (Lars Marowsky-Bree)
Date: Mon, 21 Feb 2005 20:16:02 +0100
Subject: [Linux-cluster] redhat cluster, or cluster project questions?
In-Reply-To: <1109012508.2891.4.camel@cluster2>
References: <1109012508.2891.4.camel@cluster2>
Message-ID: <20050221191602.GC16294@marowsky-bree.de>

On 2005-02-21T16:01:48, Patricio Bruna V <pbruna at linuxcenterla.com> wrote:

> what are the diferences between linux-ha and the redhat-cluster project?
> or they complement each other?

RedHat's cluster project focuses mainly on the clustered filesystem and
the clustered LVM, plus their DLM.

Linux HA is mostly dealing with complex and powerful cluster resource
management (dependency based switch- and fail-over, etc).

(Both statements are a gross over-simplification.)

In theory they ought to complement eachother, in practice they are based
on rather different infrastructure and also have some conceptual
differences.


Sincerely,
    Lars Marowsky-Br?e <lmb at suse.de>

-- 
High Availability & Clustering
SUSE Labs, Research and Development
SUSE LINUX Products GmbH - A Novell Business


From bruce.walker at hp.com  Mon Feb 21 19:30:18 2005
From: bruce.walker at hp.com (Walker, Bruce J)
Date: Mon, 21 Feb 2005 11:30:18 -0800
Subject: [Linux-cluster] redhat cluster, or cluster project questions?
Message-ID: <3689AF909D816446BA505D21F1461AE402AF3E73@cacexc04.americas.cpqcorp.net>

Lars,
  You forgot OpenSSI (which Particio asked about).  OpenSSI's goal is to provide a very easy to manage cluster, for all forms of clustering (parallel programming, load balancing, web serving, storage, database and high availability), in part by integrating other open source project technology.  OpenSSI did support GFS and later OGFS.  It will again support GFS on 2.6 in a couple of months.   The HA provided in OpenSSI is much simpler than that in ha-linux.

Bruce


> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Lars 
> Marowsky-Bree
> Sent: Monday, February 21, 2005 11:16 AM
> To: linux clistering
> Subject: Re: [Linux-cluster] redhat cluster, or cluster 
> project questions?
> 
> 
> On 2005-02-21T16:01:48, Patricio Bruna V 
> <pbruna at linuxcenterla.com> wrote:
> 
> > what are the diferences between linux-ha and the 
> redhat-cluster project?
> > or they complement each other?
> 
> RedHat's cluster project focuses mainly on the clustered 
> filesystem and
> the clustered LVM, plus their DLM.
> 
> Linux HA is mostly dealing with complex and powerful cluster resource
> management (dependency based switch- and fail-over, etc).
> 
> (Both statements are a gross over-simplification.)
> 
> In theory they ought to complement eachother, in practice 
> they are based
> on rather different infrastructure and also have some conceptual
> differences.
> 
> 
> Sincerely,
>     Lars Marowsky-Br?e <lmb at suse.de>
> 
> -- 
> High Availability & Clustering
> SUSE Labs, Research and Development
> SUSE LINUX Products GmbH - A Novell Business
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster
> 


From rajkum2002 at rediffmail.com  Mon Feb 21 19:54:14 2005
From: rajkum2002 at rediffmail.com (Raj  Kumar)
Date: 21 Feb 2005 19:54:14 -0000
Subject: [Linux-cluster] lock_gulm ERROR
Message-ID: <20050221195414.21469.qmail@webmail49.rediffmail.com>

Here is the output of gulm_tool getstats <server>
 ?
I_am = Master
quorum_has = 1
quorum_needs = 1
rank = 0
quorate = true
GenerationID = 1106237330010385
run time = 2778015
pid = 14115
verbosity = Default
failover = disabled
locked = 1

I_am = Client
Master = server2
rank = -1
quorate = true
GenerationID = 1106237330010385
run time = 2770611
pid = 20313
verbosity = Default
failover = disabled
locked = 1


What does "locked = 1" in the output mean? Is the problem apparent in the above output?

Thank you,
Raj

On Mon, 21 Feb 2005 Michael Conrad Tadpol Tilstra wrote :
>Raj Kumar wrote:
>>There were no messages!!
>>
>>/var/log> grep -i gulm me*
>>messages:Feb 21 10:56:29 server2 lock_gulmd: In main_main.c:254 (v6.0.0) death by: Extra command line parameters. Try again.
>>messages.1:Feb 19 16:06:27 server2 kernel: lock_gulm: ERROR gulm_LT_recver err -110
>>
>>I issued lock_gulmd status this morning to check the status of lock servers... The first error message must be from this command since status is not a valid parameter. I was actually trying to
>
>Try `gulm_tool getstats <server>`
>
>-- Michael Conrad Tadpol Tilstra
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>http://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050221/8990921c/attachment.htm>

From mtilstra at redhat.com  Mon Feb 21 20:04:19 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Mon, 21 Feb 2005 14:04:19 -0600
Subject: [Linux-cluster] lock_gulm ERROR
In-Reply-To: <20050221195414.21469.qmail@webmail49.rediffmail.com>
References: <20050221195414.21469.qmail@webmail49.rediffmail.com>
Message-ID: <421A3EC3.9080205@redhat.com>

Raj Kumar wrote:
> What does "locked = 1" in the output mean? Is the problem apparent in 
> the above output?

The locked line is a count of services using gulm that cannot handle 
gulm stopping while they are connected. (Like GFS).  If you try to 
`gulm_tool shutdown <localhost>` it will give you an error because the 
gulm_core is locked.


-- 
Michael Conrad Tadpol Tilstra
I haven't lost my mind -- it's backed up on tape somewhere.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 256 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050221/8436d40e/attachment.sig>

From mtilstra at redhat.com  Mon Feb 21 20:21:08 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Mon, 21 Feb 2005 14:21:08 -0600
Subject: [Linux-cluster] gfs problem during shutdown of master lock server
In-Reply-To: <1108995673.3355.167.camel@wsmis3.sonitrol.net>
References: <1108995673.3355.167.camel@wsmis3.sonitrol.net>
Message-ID: <421A42B4.7030909@redhat.com>

Marc Swanson wrote:
> Here are the messages in the logs on cluster2 around the time of the
> shutdown.  Any help on how to prevent this lockup from happening (or
> pointers on what I'm doing wrong) would be appreciated!
> 

actually, this looks similar to a bug that I'm fixing right now. 
(Rebooting the Master lock server when also having gfs mounts on the 
same node is causing sockets to get mixed up. 
https://bugzilla.redhat.com/beta/show_bug.cgi?id=148029 )

Just looking at the code, it seems to be present in 6.0 as well. 
bummer.  I'll get that fixed.

thanks,
-- 
Michael Conrad Tadpol Tilstra
Like a cool breeze through a freshly napalmed forest.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 256 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050221/1884b07b/attachment.sig>

From rajkum2002 at rediffmail.com  Mon Feb 21 20:32:12 2005
From: rajkum2002 at rediffmail.com (Raj  Kumar)
Date: 21 Feb 2005 20:32:12 -0000
Subject: [Linux-cluster] lock_gulm ERROR
Message-ID: <20050221203212.14020.qmail@webmail18.rediffmail.com>

Thanks for the information.
What could be the problem in my case? I didn't restart the servers yet with the concern that it might screw up the filesystem. I want to figure out what is the cause of the actual problem and correct it!!!

Thanks,
Raj


On Tue, 22 Feb 2005 Michael Conrad Tadpol Tilstra wrote :
>Raj Kumar wrote:
>>What does "locked = 1" in the output mean? Is the problem apparent in the above output?
>
>The locked line is a count of services using gulm that cannot handle gulm stopping while they are connected. (Like GFS).  If you try to `gulm_tool shutdown <localhost>` it will give you an error because the gulm_core is locked.
>
>
>-- Michael Conrad Tadpol Tilstra
>I haven't lost my mind -- it's backed up on tape somewhere.
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>http://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050221/9bf8fcac/attachment.htm>

From mtilstra at redhat.com  Mon Feb 21 20:38:24 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Mon, 21 Feb 2005 14:38:24 -0600
Subject: [Linux-cluster] lock_gulm ERROR
In-Reply-To: <20050221203212.14020.qmail@webmail18.rediffmail.com>
References: <20050221203212.14020.qmail@webmail18.rediffmail.com>
Message-ID: <421A46C0.1070605@redhat.com>

Raj Kumar wrote:
> Thanks for the information.
> What could be the problem in my case? I didn't restart the servers yet 
> with the concern that it might screw up the filesystem. I want to figure 
> out what is the cause of the actual problem and correct it!!!
> 

I don't know.  I've never seen a timeout error in the gulm_LT_recver 
before.  The only clues would have come from other gulm messages in the 
logs, and since there aren't any, I cannot say much.
What version of the code are you using?


Restarting the servers at this point should not make things any worse.


-- 
Michael Conrad Tadpol Tilstra

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 256 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050221/6d618142/attachment.sig>

From rajkum2002 at rediffmail.com  Mon Feb 21 21:08:11 2005
From: rajkum2002 at rediffmail.com (Raj  Kumar)
Date: 21 Feb 2005 21:08:11 -0000
Subject: [Linux-cluster] lock_gulm ERROR
Message-ID: <20050221210811.21685.qmail@webmail18.rediffmail.com>


GFS-modules-smp-6.0.2-24
GFS-6.0.2-24

The system was pretty stable until now and we considered moving the storage to production this week. But this problem scared us!

Thanks,
Raj

On Tue, 22 Feb 2005 Michael Conrad Tadpol Tilstra wrote :
>Raj Kumar wrote:
>>Thanks for the information.
>>What could be the problem in my case? I didn't restart the servers yet with the concern that it might screw up the filesystem. I want to figure out what is the cause of the actual problem and correct it!!!
>>
>
>I don't know.  I've never seen a timeout error in the gulm_LT_recver before.  The only clues would have come from other gulm messages in the logs, and since there aren't any, I cannot say much.
>What version of the code are you using?
>
>
>Restarting the servers at this point should not make things any worse.
>
>
>-- Michael Conrad Tadpol Tilstra
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>http://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050221/00b8137c/attachment.htm>

From lmb at suse.de  Mon Feb 21 21:20:38 2005
From: lmb at suse.de (Lars Marowsky-Bree)
Date: Mon, 21 Feb 2005 22:20:38 +0100
Subject: [Linux-cluster] redhat cluster, or cluster project questions?
In-Reply-To: <3689AF909D816446BA505D21F1461AE402AF3E73@cacexc04.americas.cpqcorp.net>
References: <3689AF909D816446BA505D21F1461AE402AF3E73@cacexc04.americas.cpqcorp.net>
Message-ID: <20050221212038.GF16294@marowsky-bree.de>

On 2005-02-21T11:30:18, "Walker, Bruce J" <bruce.walker at hp.com> wrote:

> Lars, You forgot OpenSSI (which Particio asked about). 

I didn't forget it, I knew you'd chime in as soon as OpenSSI is
mentioned anywhere on a Linux list ;-)


Sincerely,
    Lars Marowsky-Br?e <lmb at suse.de>

-- 
High Availability & Clustering
SUSE Labs, Research and Development
SUSE LINUX Products GmbH - A Novell Business


From pbruna at linuxcenterla.com  Mon Feb 21 21:31:00 2005
From: pbruna at linuxcenterla.com (Patricio Bruna V)
Date: Mon, 21 Feb 2005 18:31:00 -0300
Subject: [Linux-cluster] redhat cluster, or cluster project questions?
In-Reply-To: <3689AF909D816446BA505D21F1461AE402AF3E73@cacexc04.americas.cpqcorp.net>
References: <3689AF909D816446BA505D21F1461AE402AF3E73@cacexc04.americas.cpqcorp.net>
Message-ID: <1109021460.2891.15.camel@cluster2>

El lun, 21-02-2005 a las 11:30 -0800, Walker, Bruce J escribi?:
>Lars,
>  You forgot OpenSSI (which Particio asked about).  OpenSSI's goal is to provide a very easy to manage cluster, for all forms of clustering (parallel programming, load balancing, web serving, storage, database and high availability), in part by integrating other open source project technology.  OpenSSI did support GFS and later OGFS.  It will again support GFS on 2.6 in a couple of months.   The HA provided in OpenSSI is much simpler than that in ha-linux.
>
>Bruce
>
what about gnbd?
-- 
Patricio Bruna                                  http://www.linuxcenterla.com
Ingeniero de Proyectos                  Mariano S?nchez Fontecilla 310
Red Hat Certified Engineer              Las Condes, Santiago - CHILE
Linux Center Latinoamerica              Fono: 4834041
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050221/e9618602/attachment.sig>

From bruce.walker at hp.com  Mon Feb 21 22:13:21 2005
From: bruce.walker at hp.com (Walker, Bruce J)
Date: Mon, 21 Feb 2005 14:13:21 -0800
Subject: [Linux-cluster] redhat cluster, or cluster project questions?
Message-ID: <3689AF909D816446BA505D21F1461AE402AF3E76@cacexc04.americas.cpqcorp.net>

What about it.  When we have 2.6 GFS integrated with OpenSSI, we will support it.  If you mean what about gndb instead of DRBD, i'm not aware that gndb provides replication;  I thought it was just a way to access a remote block device.  I believe RH is working on enhancements to CLVM to provide cross-node replication, however.

Bruce


> -----Original Message-----
> From: linux-cluster-bounces at redhat.com 
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of 
> Patricio Bruna V
> Sent: Monday, February 21, 2005 1:31 PM
> To: linux clistering
> Subject: RE: [Linux-cluster] redhat cluster, or cluster 
> project questions?
> 
> 
> El lun, 21-02-2005 a las 11:30 -0800, Walker, Bruce J escribi?:
> >Lars,
> >  You forgot OpenSSI (which Particio asked about).  
> OpenSSI's goal is to provide a very easy to manage cluster, 
> for all forms of clustering (parallel programming, load 
> balancing, web serving, storage, database and high 
> availability), in part by integrating other open source 
> project technology.  OpenSSI did support GFS and later OGFS.  
> It will again support GFS on 2.6 in a couple of months.   The 
> HA provided in OpenSSI is much simpler than that in ha-linux.
> >
> >Bruce
> >
> what about gnbd?
> -- 
> Patricio Bruna                                  
> http://www.linuxcenterla.com
> Ingeniero de Proyectos           
>        Mariano S?nchez Fontecilla 310
> Red Hat Certified Engineer              Las Condes, Santiago - CHILE
> Linux Center Latinoamerica              Fono: 4834041
> 


From greg.freemyer at gmail.com  Mon Feb 21 22:24:37 2005
From: greg.freemyer at gmail.com (Greg Freemyer)
Date: Mon, 21 Feb 2005 17:24:37 -0500
Subject: [Linux-cluster] redhat cluster, or cluster project questions?
In-Reply-To: <3689AF909D816446BA505D21F1461AE402AF3E76@cacexc04.americas.cpqcorp.net>
References: <3689AF909D816446BA505D21F1461AE402AF3E76@cacexc04.americas.cpqcorp.net>
Message-ID: <87f94c370502211424496ee1ea@mail.gmail.com>

On Mon, 21 Feb 2005 14:13:21 -0800, Walker, Bruce J <bruce.walker at hp.com> wrote:
> What about it.  When we have 2.6 GFS integrated with OpenSSI, we will support it.  If you mean what about gndb instead of DRBD, i'm not aware that gndb provides replication;  I thought it was just a way to access a remote block device.  I believe RH is working on enhancements to CLVM to provide cross-node replication, however.
> 
> Bruce

I believe with the 2.4 kernel, Redhat has recommended layering MD
(mirror disk) above a local disk and a remote GNBD disk for some time.
 I have not tried it, but the goal is similar to drbd.  I don't know
if this is still recommended in the 2.6 kernel series.

Greg
-- 
Greg Freemyer


From daniel at osdl.org  Tue Feb 22 01:34:23 2005
From: daniel at osdl.org (Daniel McNeil)
Date: Mon, 21 Feb 2005 17:34:23 -0800
Subject: [Linux-cluster] node kicked out of cluster
Message-ID: <1109036063.24124.7.camel@ibm-c.pdx.osdl.net>

My latest test ran 49 hours before a node got kicked out.


cl030:
Feb 18 18:07:40 cl030 kernel: CMAN: node cl030a has been removed from the cluster : No response to messages
Feb 18 18:07:40 cl030 kernel: CMAN: killed by NODEDOWN message
Feb 18 18:07:40 cl030 kernel: CMAN: we are leaving the cluster.
Feb 18 18:07:41 cl030 kernel: dlm: stripefs: recoverd_kick after exit
Feb 18 18:07:41 cl030 kernel:
Feb 18 18:07:41 cl030 kernel: SM: send_nodeid_message error -107 to 2
Feb 18 18:07:42 cl030 kernel: SM: 00000001 sm_stop: SG still joined
Feb 18 18:07:42 cl030 kernel: SM: 01000430 sm_stop: SG still joined
Feb 18 18:07:42 cl030 kernel: SM: 02000431 sm_stop: SG still joined
Feb 18 18:07:42 cl030 ccsd[3766]: [cluster_mgr.c:387] Cluster manager shutdown.

cl031:
Feb 18 18:07:40 cl031 kernel: CMAN: removing node cl030a from the cluster : No response to messages
Feb 18 18:07:41 cl031 fenced[4127]: cl030a not a cluster member after 0 sec post_fail_delay
Feb 18 18:07:41 cl031 fenced[4127]: fencing node "cl030a"
Feb 18 18:07:41 cl031 fence_manual: Node cl030a needs to be reset before recovery can procede.  Waiting for cl030a to rejoin the cluster or for manual acknowledgement that it has been reset (i.e. fence_ack_manual -n cl030a)

cl032:
Feb 18 18:07:40 cl032 kernel: CMAN: node cl030a has been removed from the cluster : No response to messages
Feb 18 18:07:41 cl032 fenced[4262]: fencing deferred to cl031a
Feb 19 04:02:06 cl032 su(pam_unix)[29639]: session opened for user cyrus by (uid=0)

Does this mean heartbeats got lost so cl030 was kicked out?
Full info here:
http://developer.osdl.org/daniel/GFS/test.16feb2005/

Thanks,

Daniel


From pcaulfie at redhat.com  Tue Feb 22 08:35:50 2005
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Tue, 22 Feb 2005 08:35:50 +0000
Subject: [Linux-cluster] node kicked out of cluster
In-Reply-To: <1109036063.24124.7.camel@ibm-c.pdx.osdl.net>
References: <1109036063.24124.7.camel@ibm-c.pdx.osdl.net>
Message-ID: <20050222083550.GB31659@tykepenguin.com>

On Mon, Feb 21, 2005 at 05:34:23PM -0800, Daniel McNeil wrote:
> My latest test ran 49 hours before a node got kicked out.
> 
> 
> cl030:
> Feb 18 18:07:40 cl030 kernel: CMAN: node cl030a has been removed from the cluster : No response to messages
> Feb 18 18:07:40 cl030 kernel: CMAN: killed by NODEDOWN message
> Feb 18 18:07:40 cl030 kernel: CMAN: we are leaving the cluster.
> Feb 18 18:07:41 cl030 kernel: dlm: stripefs: recoverd_kick after exit
> Feb 18 18:07:41 cl030 kernel:
> Feb 18 18:07:41 cl030 kernel: SM: send_nodeid_message error -107 to 2
> Feb 18 18:07:42 cl030 kernel: SM: 00000001 sm_stop: SG still joined
> Feb 18 18:07:42 cl030 kernel: SM: 01000430 sm_stop: SG still joined
> Feb 18 18:07:42 cl030 kernel: SM: 02000431 sm_stop: SG still joined
> Feb 18 18:07:42 cl030 ccsd[3766]: [cluster_mgr.c:387] Cluster manager shutdown.
> 
> cl031:
> Feb 18 18:07:40 cl031 kernel: CMAN: removing node cl030a from the cluster : No response to messages
> Feb 18 18:07:41 cl031 fenced[4127]: cl030a not a cluster member after 0 sec post_fail_delay
> Feb 18 18:07:41 cl031 fenced[4127]: fencing node "cl030a"
> Feb 18 18:07:41 cl031 fence_manual: Node cl030a needs to be reset before recovery can procede.  Waiting for cl030a to rejoin the cluster or for manual acknowledgement that it has been reset (i.e. fence_ack_manual -n cl030a)
> 
> cl032:
> Feb 18 18:07:40 cl032 kernel: CMAN: node cl030a has been removed from the cluster : No response to messages
> Feb 18 18:07:41 cl032 fenced[4262]: fencing deferred to cl031a
> Feb 19 04:02:06 cl032 su(pam_unix)[29639]: session opened for user cyrus by (uid=0)
> 
> Does this mean heartbeats got lost so cl030 was kicked out?

No. "No response to messages" can only happen during a state transition or
services join/leave. Current thinking is that the DLM can hog the CPU when
recovering huge numbers of locks, so we a re looking into placing some strategic
"schedule()" calls in the recovery process.
-- 

patrick


From nathan at valuecommerce.co.jp  Tue Feb 22 09:14:04 2005
From: nathan at valuecommerce.co.jp (Nathan Ollerenshaw)
Date: Tue, 22 Feb 2005 18:14:04 +0900
Subject: [Linux-cluster] GFS -> NFS, Locking and quotas
Message-ID: <2a5b11a870bfe22794ec127f5e34cf22@valuecommerce.co.jp>

All,

I'm currently investigating the feasibility of building a couple of 
Linux based NFS servers that use GFS on a back end EMC CX700 array. I 
had a few requirements for the system.

1. It should support quotas.

It seems that GFS supports quotas just fine, using some commands on the 
machines mounting the GFS filesystems. While this is fine, ideally I'd 
like to be able to query and set the quota information using 
rpc.rquotad and the associated commands 'edquota' and 'quota'.

Is this possible?

2. It should handle locking from the NFS clients.

I wouldn't expect any problems here, but maybe someone here who has 
built something like this has some personal experience they could 
share. :)

3. It would be nice if we could load balance the NFS with sticky load 
balancing of some kind (probably a hardware loadbalancer like a Foundry 
ServerIron XL).

Not sure if this is possible at all, its a nice to have really. But it 
doesn't look impossible, what with NFSv3 and v4 ... ? Any gotchas that 
anyone can think of? NFS is supposed to be stateless, right, so if a 
machine starts talking to a different NFS server, it shouldn't make a 
difference ...

Anyone tried anything like this, or has some advice? Would love some 
feedback.

I will be building out some kind of test system in the following weeks 
with any luck so hopefully I'll be able to answer my own questions, but 
I kinda wanted to know if I've been smoking too much ...

Thanks,

Nathan.

-- 
Nathan Ollerenshaw / Systems Engineer
Systems Engineering
ValueCommerce Co., Ltd.

Tokyo Bldg 4F 3-32-7 Hongo Bunkyo-ku Tokyo 113-0033 Japan
Tel. +81.3.3817.8995?? Fax. +81.3.3812.4051
mailto:nathan at valuecommerce.co.jp


From Uli.Schroeder at ing.boehringer-ingelheim.com  Tue Feb 22 10:00:57 2005
From: Uli.Schroeder at ing.boehringer-ingelheim.com (Uli.Schroeder at ing.boehringer-ingelheim.com)
Date: Tue, 22 Feb 2005 11:00:57 +0100
Subject: [Linux-cluster] system-config-cluster not working
Message-ID: <1127C4CCEF18BA4DA6CFE4E8BE88CCB001CAFB3B@ingexch2.eu.boehringer.com>

Hello everyone,

we are curently evaluating RHEL 4 and wanted to test the new cluster
capabilities. We installed the alpha RPMs as of 2005-02-28. Unfortunately
the package system-config-cluster doesn't seem to work with the official
RHEL4 system-config-lvm.

In the beginning it failed to start due to Python files (i.e.
CommandHandler) that belong to system-config-lvm. I included the necessary
path to the Python search path. At least the cluster manager started after
that. But now I get the error message "AttributeError: CommandHandler
instance has no attribute 'isClusterMember'". Is there a new package for LVM
configuration or for cluster configuration that resolves that problem?

I couldn't find a roadmap for a cluster suite compatible with RHEL4. Anyone
knows of the planned date for the official release?

Best regards,
Uli
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050222/43e6413f/attachment.htm>

From ialberdi at histor.fr  Tue Feb 22 12:03:36 2005
From: ialberdi at histor.fr (Ion Alberdi)
Date: Tue, 22 Feb 2005 13:03:36 +0100
Subject: [Linux-cluster] Configuring rgmanager
Message-ID: <421B1F98.1040708@histor.fr>

Hello.

I would like to know first if someone has sucessfully used rgmanager in 
this ML.
If so, how did you configure your cluster.conf, by hand or did you use a 
tool?
I searched over the archives and I only saw Jiho Hahm posting (twice) 
(16.02.2005) a question about rgmanager, and nobody answered.

Thanks in advance


From mswanson at sonitrol.net  Tue Feb 22 13:26:22 2005
From: mswanson at sonitrol.net (Marc Swanson)
Date: Tue, 22 Feb 2005 08:26:22 -0500
Subject: [Linux-cluster] gfs problem during shutdown of master lock server
In-Reply-To: <20050221221344.99B5B738A4@hormel.redhat.com>
References: <20050221221344.99B5B738A4@hormel.redhat.com>
Message-ID: <1109078781.3346.224.camel@wsmis3.sonitrol.net>


> > Here are the messages in the logs on cluster2 around the time of the
> > shutdown.  Any help on how to prevent this lockup from happening (or
> > pointers on what I'm doing wrong) would be appreciated!
> > 
> 
> actually, this looks similar to a bug that I'm fixing right now. 
> (Rebooting the Master lock server when also having gfs mounts on the 
> same node is causing sockets to get mixed up. 
> https://bugzilla.redhat.com/beta/show_bug.cgi?id=148029 )
> 
> Just looking at the code, it seems to be present in 6.0 as well. 
> bummer.  I'll get that fixed.


Thanks Michael!

So in the meantime, I could probably avoid this problem by electing one
of the non-mounters as the master lock server?


-- 
----------------------------------
Marc Swanson, Software Engineer
Sonitrol Communications Corp.
Hartford, CT

Email: mswanson at sonitrol.net
Phone: (860) 616-7036
Pager: (860) 948-6713
 Cell: (603) 512-1267
  Fax: (860) 616-7589
----------------------------------


From mtilstra at redhat.com  Tue Feb 22 14:11:41 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Tue, 22 Feb 2005 08:11:41 -0600
Subject: [Linux-cluster] gfs problem during shutdown of master lock server
In-Reply-To: <1109078781.3346.224.camel@wsmis3.sonitrol.net>
References: <20050221221344.99B5B738A4@hormel.redhat.com>
	<1109078781.3346.224.camel@wsmis3.sonitrol.net>
Message-ID: <421B3D9D.9090605@redhat.com>

Marc Swanson wrote:
>>>Here are the messages in the logs on cluster2 around the time of the
>>>shutdown.  Any help on how to prevent this lockup from happening (or
>>>pointers on what I'm doing wrong) would be appreciated!
>>>
>>
>>actually, this looks similar to a bug that I'm fixing right now. 
>>(Rebooting the Master lock server when also having gfs mounts on the 
>>same node is causing sockets to get mixed up. 
>>https://bugzilla.redhat.com/beta/show_bug.cgi?id=148029 )
>>
>>Just looking at the code, it seems to be present in 6.0 as well. 
>>bummer.  I'll get that fixed.
> 
> 
> 
> Thanks Michael!
> 
> So in the meantime, I could probably avoid this problem by electing one
> of the non-mounters as the master lock server?

Yeah, that should work.

-- 
Michael Conrad Tadpol Tilstra
Like a cool breeze through a freshly napalmed forest.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 256 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050222/936a9cad/attachment.sig>

From alain at tessiot.info  Mon Feb 21 21:50:52 2005
From: alain at tessiot.info (Alain TESSIOT)
Date: Mon, 21 Feb 2005 22:50:52 +0100
Subject: [Linux-cluster] Cluster Suite an network interfaces
References: <03ee01c50cf2$f90cc5f0$1b00a8c0@owtessiotalain>
	<1107788670.9794.2.camel@ayanami.boston.redhat.com>
Message-ID: <002301c5185f$6b406830$1b00a8c0@owtessiotalain>

Many thanks,

But how can I do to solve my trouble ? as the package
clumanager-1.2.24-0.1.i386.rpm doesn't solve my problem ...

----- Original Message ----- 
From: "Lon Hohberger" <lhh at redhat.com>
To: "linux clustering" <linux-cluster at redhat.com>
Sent: Monday, February 07, 2005 4:04 PM
Subject: Re: [Linux-cluster] Cluster Suite an network interfaces


> On Mon, 2005-02-07 at 09:56 +0100, Alain TESSIOT wrote:
>
> > But, if httpd runs on the the Server 1and I unplug the 2 network
> > interfaces (222.10.3.1), nothing happens ....
> > Did I do something wrong ?
>
> See:
>
> https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=144488
>
> -- Lon
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster
>


From sdake at mvista.com  Tue Feb 22 20:14:29 2005
From: sdake at mvista.com (Steven Dake)
Date: Tue, 22 Feb 2005 13:14:29 -0700
Subject: [Linux-cluster] redhat cluster, or cluster project questions?
In-Reply-To: <1109012508.2891.4.camel@cluster2>
References: <1109012508.2891.4.camel@cluster2>
Message-ID: <1109103268.21827.40.camel@persist.az.mvista.com>

Patricio

Also have a look at http://developer.osdl.org/dev/openais.  This is an
implementation of the SA Forum APIs for high availability and also
virtual synchrony group messaging exported through an EVS API.

I'd also suggest having a look at lists.osdl.org.  The clustersig
mailing list has some discussion of various clustering technologies
taking place.

Those archives list more projects which you might find of interest.

Thanks
-steve

On Mon, 2005-02-21 at 12:01, Patricio Bruna V wrote:
> what are the diferences between linux-ha and the redhat-cluster project?
> or they complement each other?
> 
> how openssi play here?


From merlin at studiobz.it  Wed Feb 23 00:46:29 2005
From: merlin at studiobz.it (Christian Zoffoli)
Date: Wed, 23 Feb 2005 01:46:29 +0100
Subject: [Linux-cluster] Maxtor Onetouch II works?
Message-ID: <421BD265.5090505@studiobz.it>

Hi to all.

Does anyone tried the Maxtor Onetouch II with GFS and a couple of servers?

In the past I've seen that the "old" Onetouch works on this setup, now 
need to know if also the series II works fine.

Regards,
Christian


From robert.schmoelzer at heicare.com  Wed Feb 23 14:09:15 2005
From: robert.schmoelzer at heicare.com (Robert Schmoelzer)
Date: Wed, 23 Feb 2005 15:09:15 +0100
Subject: [Linux-cluster] upgrading GFS 5.2 --> GFS 6
Message-ID: <009301c519b1$423ee0d0$0101000a@rschmoelz>

Hi all,

We are running GFS 5.2 on 3 nodes.
Each of our 4 filesystems is ~900GB.
We are now planning to upgrade to GSF 6.*

My questions are:
Can we just mount the filesystems after the
Upgrade or is there some "migration" needed?

What is the advantage of GULM over DLM, 
which one should we consider?


Regards
Robert
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/x-pkcs7-signature
Size: 2826 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050223/d596e3a0/attachment.bin>

From lhh at redhat.com  Wed Feb 23 18:42:06 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Wed, 23 Feb 2005 13:42:06 -0500
Subject: [Linux-cluster] Configuring rgmanager
In-Reply-To: <421B1F98.1040708@histor.fr>
References: <421B1F98.1040708@histor.fr>
Message-ID: <1109184126.12299.11.camel@ayanami.boston.redhat.com>

On Tue, 2005-02-22 at 13:03 +0100, Ion Alberdi wrote:
> Hello.
> 
> I would like to know first if someone has sucessfully used rgmanager in 
> this ML.
> If so, how did you configure your cluster.conf, by hand or did you use a 
> tool?
> I searched over the archives and I only saw Jiho Hahm posting (twice) 
> (16.02.2005) a question about rgmanager, and nobody answered.

I have configured it successfully, by hand.

Right now, there's a bug I'm working on which prevents it from working
properly in some configurations (namely: CMAN >2 nodes).  I should have
a permanent fix tomorrow or the day after.

There are some example resource configurations in
rgmanager/src/daemons/tests.

-- Lon


From danderso at redhat.com  Wed Feb 23 20:00:45 2005
From: danderso at redhat.com (Derek Anderson)
Date: Wed, 23 Feb 2005 14:00:45 -0600
Subject: [Linux-cluster] upgrading GFS 5.2 --> GFS 6
In-Reply-To: <009301c519b1$423ee0d0$0101000a@rschmoelz>
References: <009301c519b1$423ee0d0$0101000a@rschmoelz>
Message-ID: <421CE0ED.8090200@redhat.com>

Robert Schmoelzer wrote:

>Hi all,
>
>We are running GFS 5.2 on 3 nodes.
>Each of our 4 filesystems is ~900GB.
>We are now planning to upgrade to GSF 6.*
>  
>
GFS 5.2.1, right?  I assume you want to go to GFS 6.1 since you mention 
DLM below.

>My questions are:
>Can we just mount the filesystems after the
>Upgrade or is there some "migration" needed?
>  
>
Some migration involved.  Don't forget to backup first.

Basically, you need to use 'ccs_tool upgrade' to change your 5.2.1 ccs 
archive into the cluster.conf XML for 6.1.  Then start your 6.1 
cluster.  If you choose to use the DLM you need to use 'gfs_tool sb 
/dev/example/example proto lock_dlm' to change the lock protocol.  Then 
the first time you mount you need to specify the extra mount option 
'upgrade', like 'mount -t gfs -o upgrade /dev/example/example 
/mnt/gfs1'.  The upgrade mount option only needs to be specified once 
per filesystem.  Before mounting you can also choose (or not) to upgrade 
the pool volumes to LVM volumes; use the 'vgconvert -M2 volname' to do this.

None of this is documented yet, but that's the basic jist of it.  I've 
done a few upgrades of both 5.2.1 and 6.0 to 6.1 and it worksforme.

Good Luck,
Derek

>What is the advantage of GULM over DLM, 
>which one should we consider?
>
>
>Regards
>Robert
>  
>
>------------------------------------------------------------------------
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>http://www.redhat.com/mailman/listinfo/linux-cluster
>


From daniel at osdl.org  Wed Feb 23 23:12:22 2005
From: daniel at osdl.org (Daniel McNeil)
Date: Wed, 23 Feb 2005 15:12:22 -0800
Subject: [Linux-cluster] node kicked out of cluster
In-Reply-To: <20050222083550.GB31659@tykepenguin.com>
References: <1109036063.24124.7.camel@ibm-c.pdx.osdl.net>
	<20050222083550.GB31659@tykepenguin.com>
Message-ID: <1109200342.24124.37.camel@ibm-c.pdx.osdl.net>

On Tue, 2005-02-22 at 00:35, Patrick Caulfield wrote:
> On Mon, Feb 21, 2005 at 05:34:23PM -0800, Daniel McNeil wrote:
> > My latest test ran 49 hours before a node got kicked out.
> > 
> > 
> > cl030:
> > Feb 18 18:07:40 cl030 kernel: CMAN: node cl030a has been removed from the cluster : No response to messages
> > Feb 18 18:07:40 cl030 kernel: CMAN: killed by NODEDOWN message
> > Feb 18 18:07:40 cl030 kernel: CMAN: we are leaving the cluster.
> > Feb 18 18:07:41 cl030 kernel: dlm: stripefs: recoverd_kick after exit
> > Feb 18 18:07:41 cl030 kernel:
> > Feb 18 18:07:41 cl030 kernel: SM: send_nodeid_message error -107 to 2
> > Feb 18 18:07:42 cl030 kernel: SM: 00000001 sm_stop: SG still joined
> > Feb 18 18:07:42 cl030 kernel: SM: 01000430 sm_stop: SG still joined
> > Feb 18 18:07:42 cl030 kernel: SM: 02000431 sm_stop: SG still joined
> > Feb 18 18:07:42 cl030 ccsd[3766]: [cluster_mgr.c:387] Cluster manager shutdown.
> > 
> > cl031:
> > Feb 18 18:07:40 cl031 kernel: CMAN: removing node cl030a from the cluster : No response to messages
> > Feb 18 18:07:41 cl031 fenced[4127]: cl030a not a cluster member after 0 sec post_fail_delay
> > Feb 18 18:07:41 cl031 fenced[4127]: fencing node "cl030a"
> > Feb 18 18:07:41 cl031 fence_manual: Node cl030a needs to be reset before recovery can procede.  Waiting for cl030a to rejoin the cluster or for manual acknowledgement that it has been reset (i.e. fence_ack_manual -n cl030a)
> > 
> > cl032:
> > Feb 18 18:07:40 cl032 kernel: CMAN: node cl030a has been removed from the cluster : No response to messages
> > Feb 18 18:07:41 cl032 fenced[4262]: fencing deferred to cl031a
> > Feb 19 04:02:06 cl032 su(pam_unix)[29639]: session opened for user cyrus by (uid=0)
> > 
> > Does this mean heartbeats got lost so cl030 was kicked out?
> 
> No. "No response to messages" can only happen during a state transition or
> services join/leave. Current thinking is that the DLM can hog the CPU when
> recovering huge numbers of locks, so we a re looking into placing some strategic
> "schedule()" calls in the recovery process.

My test is doing a bunch of mount/umount's, so that is causing
the service join/leave for the DLM lock space and file system
mount group.  

Are you saying leaving a DLM lock space causes a DLM recovery and
 that is what is leading to the 'No response to messages' ?

How does DLM hogging a cpu lead to 'no response'?
BTW, this is running on 2 proc machines, so hogging one cpu
still leaves on available.  

Who is not responding to what message?

Thanks,

Daniel


From pcaulfie at redhat.com  Thu Feb 24 08:35:32 2005
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Thu, 24 Feb 2005 08:35:32 +0000
Subject: [Linux-cluster] node kicked out of cluster
In-Reply-To: <1109200342.24124.37.camel@ibm-c.pdx.osdl.net>
References: <1109036063.24124.7.camel@ibm-c.pdx.osdl.net>
	<20050222083550.GB31659@tykepenguin.com>
	<1109200342.24124.37.camel@ibm-c.pdx.osdl.net>
Message-ID: <20050224083532.GB4315@tykepenguin.com>

On Wed, Feb 23, 2005 at 03:12:22PM -0800, Daniel McNeil wrote:
> On Tue, 2005-02-22 at 00:35, Patrick Caulfield wrote:
> > On Mon, Feb 21, 2005 at 05:34:23PM -0800, Daniel McNeil wrote:
> > > My latest test ran 49 hours before a node got kicked out.
> 
> My test is doing a bunch of mount/umount's, so that is causing
> the service join/leave for the DLM lock space and file system
> mount group.  
> 
> Are you saying leaving a DLM lock space causes a DLM recovery and
>  that is what is leading to the 'No response to messages' ?

Indirectly, we think.
 
> How does DLM hogging a cpu lead to 'no response'?
> BTW, this is running on 2 proc machines, so hogging one cpu
> still leaves on available.  

Hmm that makes it more, er, interesting.
 
> Who is not responding to what message?

It's cman that has sent a message to another node, to process the join. It
hasn't got back an ACK for that message after max_retries (default 3) attempts.

-- 

patrick


From ialberdi at histor.fr  Thu Feb 24 13:00:01 2005
From: ialberdi at histor.fr (Ion Alberdi)
Date: Thu, 24 Feb 2005 14:00:01 +0100
Subject: [Linux-cluster] Configuring rgmanager
In-Reply-To: <1109184126.12299.11.camel@ayanami.boston.redhat.com>
References: <421B1F98.1040708@histor.fr>
	<1109184126.12299.11.camel@ayanami.boston.redhat.com>
Message-ID: <421DCFD1.5010606@histor.fr>

Lon Hohberger wrote:

>On Tue, 2005-02-22 at 13:03 +0100, Ion Alberdi wrote:
>  
>
>>Hello.
>>
>>I would like to know first if someone has sucessfully used rgmanager in 
>>this ML.
>>If so, how did you configure your cluster.conf, by hand or did you use a 
>>tool?
>>I searched over the archives and I only saw Jiho Hahm posting (twice) 
>>(16.02.2005) a question about rgmanager, and nobody answered.
>>    
>>
>
>I have configured it successfully, by hand.
>
>Right now, there's a bug I'm working on which prevents it from working
>properly in some configurations (namely: CMAN >2 nodes).  I should have
>a permanent fix tomorrow or the day after.
>
>  
>
Ok, because what I'm trying doesn't work:
I have three nodes debian, gump,buba.
debian exports devices to gump and buba over gnbd, and finally the three 
nodes have the same /dev/vg1/lv1 thanks to clvmd.
I created a script which mounts this gfs logical volume on /mnt/gfs and 
appends the name of the node in nodes_name.txt. (and stops writting and 
unmounting to stop), and I put this script as a resource.
I launch rgmanager and with clusvcadm I  start, stop realocate the 
resource, this works very well!
But when I unplug the ethernet cable of the node running the resource,  
the resource  isn't relocated  to
 another node, I suppose that's what you mean by it doesnt work with 
CMAN>2nodes.

If you need another tester to test your fixes here am I :)
 

From Vincent.Aniello at PipelineTrading.com  Thu Feb 24 14:39:08 2005
From: Vincent.Aniello at PipelineTrading.com (Vincent Aniello)
Date: Thu, 24 Feb 2005 09:39:08 -0500
Subject: [Linux-cluster] GFS 6.1 and RedHat Enterprise 4.0
Message-ID: <834F55E6F1BE3B488AD3AFC927A09700317897@EMAILSRV1.exad.net>

Is GFS 6.1 included in the distribution for RedHat Enterprise 4.0?
 
--Vincent
 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050224/ddb7f70f/attachment.htm>

From fajar at telkom.co.id  Thu Feb 24 15:00:55 2005
From: fajar at telkom.co.id (Fajar A. Nugraha)
Date: Thu, 24 Feb 2005 22:00:55 +0700
Subject: [Linux-cluster] GFS 6.1 and RedHat Enterprise 4.0
In-Reply-To: <834F55E6F1BE3B488AD3AFC927A09700317897@EMAILSRV1.exad.net>
References: <834F55E6F1BE3B488AD3AFC927A09700317897@EMAILSRV1.exad.net>
Message-ID: <421DEC27.4070209@telkom.co.id>

Vincent Aniello wrote:

> Is GFS 6.1 included in the distribution for RedHat Enterprise 4.0?
>
To be more specific, I would also like to know what is the estimated 
release dates (e.g the date they become fully supported redhat product) 
of the new GFS, clustering, and RHE4.

Regards,

Fajar


From rajkum2002 at rediffmail.com  Thu Feb 24 16:45:27 2005
From: rajkum2002 at rediffmail.com (Raj  Kumar)
Date: 24 Feb 2005 16:45:27 -0000
Subject: [Linux-cluster] client doesnt start when lock master is not ready
Message-ID: <20050224164527.6384.qmail@webmail29.rediffmail.com>

Hi All,

We have a two node system using GFS. One of them is the lock server and other is just client. We restarted our servers recently and brought the lock client before bringing up the lock master. lock_gulmd is set to restart at levels 3, 4 and 5. The lock client system just hungup with the message "Starting lock_gulmd..." in the boot process. It's clear that this situation happened since lock master server wasn't available then. When the lock master server started the lock client system started successfully.

I noticed before client system started even when lock master was not available and the status of the lock_gulmd on client was set to "pending". But now the system doesnt start until the master server is also started. Has this changed recently? It is possible that other administrators in the group may have to restart the system at times. If they start the client before master (or worse they dont start master at all) then the system will not complete its boot process and other services remain unavailable. I like the system to complete its boot process and have the lock_gulmd stay in pending state until master comes back. Is there any trick to achieve this behavior?

Thanks much!
Raj
 ?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050224/cedc5eca/attachment.htm>

From amanthei at redhat.com  Thu Feb 24 17:38:19 2005
From: amanthei at redhat.com (Adam Manthei)
Date: Thu, 24 Feb 2005 11:38:19 -0600
Subject: [Linux-cluster] client doesnt start when lock master is not ready
In-Reply-To: <20050224164527.6384.qmail@webmail29.rediffmail.com>
References: <20050224164527.6384.qmail@webmail29.rediffmail.com>
Message-ID: <20050224173819.GB26952@redhat.com>

On Thu, Feb 24, 2005 at 04:45:27PM -0000, Raj  Kumar wrote:
> Hi All,
> 
> We have a two node system using GFS. One of them is the lock server and 
> other is just client. We restarted our servers recently and brought the 
> lock client before bringing up the lock master. lock_gulmd is set to 
> restart at levels 3, 4 and 5. The lock client system just hungup with the 
> message "Starting lock_gulmd..." in the boot process. It's clear that this 
> situation happened since lock master server wasn't available then. When 
> the lock master server started the lock client system started successfully.

This is the desired behavior.  Adjust the following value in
/etc/sysconfig/gfs if you don't like it's behavior.

# GULM_QUORUM_TIMEOUT -- amount of time to wait for there to be a master
#     before giving up.  If GULM_QUORUM_TIMEOUT is positive, then we will
#     wait GULM_QUORUM_TIMEOUT seconds before giving up and failing when
#     a master server is not found.  If GULM_QUORUM_TIMEOUT is zero, then
#     wait indefinately for a master server.  If GULM_QUORUM_TIMEOUT is
#     negative, just start lock_gulmd and not worry about whether it is
#     quorate.
GULM_QUORUM_TIMEOUT=300

> I noticed before client system started even when lock master was not 
> available and the status of the lock_gulmd on client was set to "pending". 
> But now the system doesnt start until the master server is also started. 

Did you have the system mounting GFS automatically?  Apparently not since it
would have "hung" there too.  The client node should have eventually timed
out after 5 minutes without a master server to log into.

> Has this changed recently? 

Define recently... sort of need the version information you are using :)

My guess is that since you are complaining about this behavior, you just
upgraded from GFS-6.0.0-15 to GFS-6.0.2-24.  From the rpm change log:

* Mon Nov 15 2004 Chris Feist <cfeist at redhat.com> 6.0.2-0
- init.d/lock_gulmd will not start if quorum is not established after
  a specified time (rbz135732).
- init.d/lock_gulmd will not stop if GFS is mounted (rbz135730).
- pool init.d scripts no longer hang on startup until console input
  is provided (rbz137382).


> It is possible that other administrators in the 
> group may have to restart the system at times. If they start the client 
> before master (or worse they dont start master at all) then the system will
> not complete its boot process and other services remain unavailable. 

Your nodes won't be able to mount GFS if there cluster the gulm servers
aren't quorate, so what's the problem?

> I like 
> the system to complete its boot process and have the lock_gulmd stay in 
> pending state until master comes back. Is there any trick to achieve this 
> behavior?

GULM_QUORUM_TIMEOUT=-1 

One other suggestion.  I usually start sshd immediately after networking on
my machines so that I can get into them as soon as possible.  This often
helps when dealing with complaints of this nature.
-- 
Adam Manthei  <amanthei at redhat.com>


From lhh at redhat.com  Thu Feb 24 19:29:12 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Thu, 24 Feb 2005 14:29:12 -0500
Subject: [Linux-cluster] Configuring rgmanager
In-Reply-To: <421DCFD1.5010606@histor.fr>
References: <421B1F98.1040708@histor.fr>
	<1109184126.12299.11.camel@ayanami.boston.redhat.com>
	<421DCFD1.5010606@histor.fr>
Message-ID: <1109273352.12299.50.camel@ayanami.boston.redhat.com>

On Thu, 2005-02-24 at 14:00 +0100, Ion Alberdi wrote:
> Ok, because what I'm trying doesn't work:
> I have three nodes debian, gump,buba.
> debian exports devices to gump and buba over gnbd, and finally the three 
> nodes have the same /dev/vg1/lv1 thanks to clvmd.
> I created a script which mounts this gfs logical volume on /mnt/gfs and 
> appends the name of the node in nodes_name.txt. (and stops writting and 
> unmounting to stop), and I put this script as a resource.
> I launch rgmanager and with clusvcadm I  start, stop realocate the 
> resource, this works very well!
> But when I unplug the ethernet cable of the node running the resource,  
> the resource  isn't relocated  to
>  another node, I suppose that's what you mean by it doesnt work with 
> CMAN>2nodes.
> 
> If you need another tester to test your fixes here am I :)

Failover will not occur until after CMAN (or gulm) says the node is dead
and has been fenced.  When using the kernel Service Manager (provided by
CMAN), recovery is in the following order:

(1) Fencing
(2) Locking
(3) GFS
(4) User services (e.g. rgmanager)

How long did you wait? :)

-- Lon


From jhahm at yahoo.com  Thu Feb 24 20:01:45 2005
From: jhahm at yahoo.com (Jiho Hahm)
Date: Thu, 24 Feb 2005 12:01:45 -0800 (PST)
Subject: [Linux-cluster] Triggering failover at Resource Manager level
Message-ID: <20050224200145.71324.qmail@web50905.mail.yahoo.com>

Hi,

Is it possible for a custom <script> resource to trigger
failover when there is no heartbeat problem at CMAN level? 
When status command returns non-zero, RM keeps calling
recover command but won't initiate failover when recover
fails repeatedly.

I'm getting around this by having the recover command
reboot the box (and thus triggering CMAN-level fence and
failover), but I'm wondering if there is a more proper way
of doing this, basically a way to ask clurgmgrd to talk to
CMAN/fence layer and initiate failover.

-Jiho

__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From lhh at redhat.com  Fri Feb 25 14:55:20 2005
From: lhh at redhat.com (Lon Hohberger)
Date: Fri, 25 Feb 2005 09:55:20 -0500
Subject: [Linux-cluster] Triggering failover at Resource Manager level
In-Reply-To: <20050224200145.71324.qmail@web50905.mail.yahoo.com>
References: <20050224200145.71324.qmail@web50905.mail.yahoo.com>
Message-ID: <1109343320.12299.86.camel@ayanami.boston.redhat.com>

On Thu, 2005-02-24 at 12:01 -0800, Jiho Hahm wrote:
> Hi,
> 
> Is it possible for a custom <script> resource to trigger
> failover when there is no heartbeat problem at CMAN level? 
> When status command returns non-zero, RM keeps calling
> recover command but won't initiate failover when recover
> fails repeatedly.

Not yet...

For now, you want the script to flag itself so that a 'start' operation
fails.

Could you bugzilla this?  It should be a really easy fix.

I guess we don't have cluster bugzilla on sources.redhat.com - so use
Red Hat bugzilla, product = "Red Hat Cluster Suite", package =
rgmanager.

-- Lon


From pcaulfie at redhat.com  Fri Feb 25 15:10:05 2005
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Fri, 25 Feb 2005 15:10:05 +0000
Subject: [Linux-cluster] node kicked out of cluster
In-Reply-To: <20050224083532.GB4315@tykepenguin.com>
References: <1109036063.24124.7.camel@ibm-c.pdx.osdl.net>
	<20050222083550.GB31659@tykepenguin.com>
	<1109200342.24124.37.camel@ibm-c.pdx.osdl.net>
	<20050224083532.GB4315@tykepenguin.com>
Message-ID: <20050225151005.GF31610@tykepenguin.com>

You might like to try this patch. It seems to reduce the number of cman retries
considerably in my testing.

-- 

patrick


Index: cman-kernel/src/cnxman.c
===================================================================
RCS file: /cvs/cluster/cluster/cman-kernel/src/cnxman.c,v
retrieving revision 1.51
diff -u -p -r1.51 cnxman.c
--- cman-kernel/src/cnxman.c    25 Feb 2005 10:16:36 -0000      1.51
+++ cman-kernel/src/cnxman.c    25 Feb 2005 14:48:44 -0000
@@ -340,10 +340,6 @@ static int cluster_kthread(void *unused)
                if (quit_threads)
                        break;
 
-               if (test_and_clear_bit(ACK_TIMEOUT, &mainloop_flags)) {
-                       check_for_unacked_nodes();
-               }
-
                /* Now receive any messages waiting for us */
                spin_lock_irq(&active_socket_lock);
                list_for_each_safe(socklist, temp, &active_socket_list) {
@@ -368,6 +364,10 @@ static int cluster_kthread(void *unused)
                }
                spin_unlock_irq(&active_socket_lock);
 
+               if (test_and_clear_bit(ACK_TIMEOUT, &mainloop_flags)) {
+                       check_for_unacked_nodes();
+               }
+
                /* Resend any unacked messages */
                if (test_and_clear_bit(RESEND_NEEDED, &mainloop_flags)
                    && acks_expected) {


From ialberdi at histor.fr  Fri Feb 25 15:22:41 2005
From: ialberdi at histor.fr (Ion Alberdi)
Date: Fri, 25 Feb 2005 16:22:41 +0100
Subject: [Linux-cluster] Configuring rgmanager
In-Reply-To: <1109273352.12299.50.camel@ayanami.boston.redhat.com>
References: <421B1F98.1040708@histor.fr>	<1109184126.12299.11.camel@ayanami.boston.redhat.com>	<421DCFD1.5010606@histor.fr>
	<1109273352.12299.50.camel@ayanami.boston.redhat.com>
Message-ID: <421F42C1.8010506@histor.fr>

Lon Hohberger wrote:

>On Thu, 2005-02-24 at 14:00 +0100, Ion Alberdi wrote:
>  
>
>>Ok, because what I'm trying doesn't work:
>>I have three nodes debian, gump,buba.
>>debian exports devices to gump and buba over gnbd, and finally the three 
>>nodes have the same /dev/vg1/lv1 thanks to clvmd.
>>I created a script which mounts this gfs logical volume on /mnt/gfs and 
>>appends the name of the node in nodes_name.txt. (and stops writting and 
>>unmounting to stop), and I put this script as a resource.
>>I launch rgmanager and with clusvcadm I  start, stop realocate the 
>>resource, this works very well!
>>But when I unplug the ethernet cable of the node running the resource,  
>>the resource  isn't relocated  to
>> another node, I suppose that's what you mean by it doesnt work with 
>>CMAN>2nodes.
>>
>>If you need another tester to test your fixes here am I :)
>>    
>>
>
>Failover will not occur until after CMAN (or gulm) says the node is dead
>and has been fenced.  When using the kernel Service Manager (provided by
>CMAN), recovery is in the following order:
>
>(1) Fencing
>(2) Locking
>(3) GFS
>(4) User services (e.g. rgmanager)
>
>How long did you wait? :)
>
>  
>
Quite long, I fence_ack_manual the node and i wait but nothing happens,
I tried rgmanager with two nodes, and a basic script (no gfs, no logical 
volume, no gnbd) and it works well, I'm going to try with another script 
(which will use, gnbd,gfs, and logical volumes) on two nodes


>-- Lon
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>http://www.redhat.com/mailman/listinfo/linux-cluster
>
>  
>


From jhahm at yahoo.com  Fri Feb 25 21:09:25 2005
From: jhahm at yahoo.com (Jiho Hahm)
Date: Fri, 25 Feb 2005 13:09:25 -0800 (PST)
Subject: [Linux-cluster] Triggering failover at Resource Manager level
In-Reply-To: <1109343320.12299.86.camel@ayanami.boston.redhat.com>
Message-ID: <20050225210925.37778.qmail@web50905.mail.yahoo.com>

Lon, I logged it as bug 149735.  I didn't know if it should
be marked as bug or enhancement, so I just took the
defaults (bug).

-Jiho

--- Lon Hohberger <lhh at redhat.com> wrote:

> On Thu, 2005-02-24 at 12:01 -0800, Jiho Hahm wrote:
> > Hi,
> > 
> > Is it possible for a custom <script> resource to
> trigger
> > failover when there is no heartbeat problem at CMAN
> level? 
> > When status command returns non-zero, RM keeps calling
> > recover command but won't initiate failover when
> recover
> > fails repeatedly.
> 
> Not yet...
> 
> For now, you want the script to flag itself so that a
> 'start' operation
> fails.
> 
> Could you bugzilla this?  It should be a really easy fix.
> 
> I guess we don't have cluster bugzilla on
> sources.redhat.com - so use
> Red Hat bugzilla, product = "Red Hat Cluster Suite",
> package =
> rgmanager.
> 
> -- Lon
> 
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster
> 


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From jhahm at yahoo.com  Sat Feb 26 00:44:04 2005
From: jhahm at yahoo.com (Jiho Hahm)
Date: Fri, 25 Feb 2005 16:44:04 -0800 (PST)
Subject: [Linux-cluster] <script> and <fs> as child resource of <fs> resource
Message-ID: <20050226004405.24238.qmail@web50903.mail.yahoo.com>

This is related to my earlier post where I asked about
start/stop ordering.

I have a setup where my application code is installed on a
SAN volume.  Furthermore, the data directory appears as a
subdirectory of install root but is actually on another SAN
volume.  Taking Apache as an example, let's say it's
installed under /opt/apache and web pages are under
/opt/apache/htdocs.

The directories correspond to devices like this:

/dev/sda1 - /opt/apache
/dev/sdb1 - /opt/apache/htdocs

To start Apache I need to first mount /dev/sda1, then mount
/dev/sdb1, and finally invoke /opt/apache/bin/apachectl
start.  To stop I would do things in reverse. 
Particularly, I can't unmount /opt/apache before unmounting
/opt/apache/htdocs.  (rgmanager doesn't deal with failed
umounts well.)

In cluster.conf, this dependency can be expressed like
this:

    <resources>
        <fs name="/opt/apache" device="/dev/sda1" ... />
        <fs name="/opt/apache/htdocs" device="dev/sdb1" ...
/>
        <script name="apachectl"
                file="/opt/apache/bin/apachectl"/>
    </resources>
    <resourcegroup name="rg1">
        <fs ref="/opt/apache">
            <fs ref="/opt/apache/htdocs">
                <script ref="apachectl"/>
            </fs>
        </fs>
    </resourcegroup>

This is currently not possible because the only child
resource type allowed inside <fs> is <nfsexport>.  But the
rule can be easily changed by adding:

    <child type="fs"/>
    <child type="script"/>

to /usr/share/cluster/fs.sh file.  After the change things
work as expected.  But is that a safe change?  What are the
ramifications?

It seems we can allow any root resource to contain children
of any other root type.  Not every case will have a
practical use case but many will.  For example, <ip> can be
configured as a child of <script>, to bring up the IP
address only when the application is ready, and to remove
the IP address and therefore stop any more incoming clients
before shutting down the application.

What do folks think?  Are there reasons we shouldn't do
this?

-Jiho


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From cjkovacs at verizon.net  Sat Feb 26 16:57:47 2005
From: cjkovacs at verizon.net (Corey Kovacs)
Date: Sat, 26 Feb 2005 11:57:47 -0500
Subject: [Linux-cluster] GFS 6.0.2-24 + NFS
Message-ID: <200502261157.48143.cjkovacs@verizon.net>

I have a 5 node cluster running GFS 6.0.2-24 with kernel 2.4.21-27.0.1 on 
RHASu4. I have three GFS filesystems 20GB, 40GB and ~1.8TB mounted from an 
MSA1000 SAN. The large partion is being re-exported via NFS. When copying a 
large file (~450GB) to the nfs re-exported GFS filesystem, the filesystem 
system hangs across all nodes. When the offending node is shutdown (never 
gets fenced and I am using fence_ilo) the system "wakes up". The nodes are 
DL360's with 2GB of ram each. The are using qlogic 2340 fibre cards and 
redhat branded drivers. Three of the 5 nodes are configured as lock managers. 
I've seen messages about lock_gulm not freeing mem. Are there issues with NFS 
and GFS together? What things should be done to tune such a configuration?  
Any help would be greatly appreicated.


Corey


From sunjw at onewaveinc.com  Sun Feb 27 14:54:01 2005
From: sunjw at onewaveinc.com (=?gb2312?B?y++/oc6w?=)
Date: Sun, 27 Feb 2005 22:54:01 +0800
Subject: [Linux-cluster] GFS umount hang after heavy read/write from NFS
	client
Message-ID: <SERVER3q1vblaLJPv0000001931@mail.onewaveinc.com>

Hello,all

I got messages as follows:
	
GFS: fsid=MyTest:vgtest.2: Unmount seems to be stalled. Dumping lock state...
Glock (2, 26)
  gl_flags =
  gl_count = 2
  gl_state = 0
  req_gh = no
  req_bh = no
  lvb_count = 0
  object = yes
  new_le = no
  incore_le = no
  reclaim = no
  aspace = 0
  ail_bufs = no
  Inode:
    num = 26/26
    type = 2
    i_count = 1
    i_flags =
    vnode = yes
Glock (5, 26)
  gl_flags =
  gl_count = 2
  gl_state = 3
  req_gh = no
  req_bh = no
  lvb_count = 0
  object = yes
  new_le = no
  incore_le = no
  reclaim = no
  aspace = no
  ail_bufs = no
  Holder
    owner = -1
    gh_state = 3
    gh_flags = 5 7
    error = 0
    gh_iflags = 1 6 7

What's the meaning ? GFS lock out,or NFS lock out?

And the "umount" process occupied 99.9% of CPU.
The status of the "umount" process was 
"29121 ?        R     74:08 umount /mnt/gfs/10".

I use the kernel 2.6.9.

Thanks for any reply!
Best regards!
Luckey


From cjkovacs at verizon.net  Sun Feb 27 16:10:55 2005
From: cjkovacs at verizon.net (Corey Kovacs)
Date: Sun, 27 Feb 2005 11:10:55 -0500
Subject: [Linux-cluster] GFS 6.0.2-24 + NFS (ALSO)
In-Reply-To: <200502261157.48143.cjkovacs@verizon.net>
References: <200502261157.48143.cjkovacs@verizon.net>
Message-ID: <200502271110.56310.cjkovacs@verizon.net>

Also, on this same cluster, when using the "gulm stonith" fencing module in 
clumanager, I get errors generated by ...

log_err("Protocol Mismatch: We're %#x and They're %#x\n",   GIO_WIREPROT_VERS, 
x_proto); which, by looking at the surrounding code seems to indicate the the 
fence device login is failing. I am trying to fence using fence_ilo against 
DL360's with iLO firmaware 1.64. My config looks something like this...

fence_devidces {
	iLO_1 {
		agent="fence_ilo"
		hostame=1.2.3.4
		login=admin_user
		passwd=somepassword
		action=off		
	}

}


and in the nodes file I reference the fence like this..


nodes{

	somenode{
		ip_interfaces{
			eth0="2.3.4.5"
		}
		fence {
			iLO {
				iLO_1{}
			}
		}
}
}

I pass no options since the only option I use is "off" and it is defined in 
the fence.ccs file.


Any ideas as to what might be causing this?


Corey


On Saturday 26 February 2005 11:57, Corey Kovacs wrote:
> I have a 5 node cluster running GFS 6.0.2-24 with kernel 2.4.21-27.0.1 on
> RHASu4. I have three GFS filesystems 20GB, 40GB and ~1.8TB mounted from an
> MSA1000 SAN. The large partion is being re-exported via NFS. When copying a
> large file (~450GB) to the nfs re-exported GFS filesystem, the filesystem
> system hangs across all nodes. When the offending node is shutdown (never
> gets fenced and I am using fence_ilo) the system "wakes up". The nodes are
> DL360's with 2GB of ram each. The are using qlogic 2340 fibre cards and
> redhat branded drivers. Three of the 5 nodes are configured as lock
> managers. I've seen messages about lock_gulm not freeing mem. Are there
> issues with NFS and GFS together? What things should be done to tune such a
> configuration? Any help would be greatly appreicated.
>
>
> Corey
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> http://www.redhat.com/mailman/listinfo/linux-cluster


From teigland at redhat.com  Mon Feb 28 03:59:35 2005
From: teigland at redhat.com (David Teigland)
Date: Mon, 28 Feb 2005 11:59:35 +0800
Subject: [Linux-cluster] GFS umount hang after heavy read/write from NFS
	client
In-Reply-To: <SERVER3q1vblaLJPv0000001931@mail.onewaveinc.com>
References: <SERVER3q1vblaLJPv0000001931@mail.onewaveinc.com>
Message-ID: <20050228035935.GB5682@redhat.com>

On Sun, Feb 27, 2005 at 10:54:01PM +0800, ?????? wrote:
> Hello,all
> 
> I got messages as follows:
> 	
> GFS: fsid=MyTest:vgtest.2: Unmount seems to be stalled. Dumping lock state...

> What's the meaning ? GFS lock out,or NFS lock out?

GFS can't unlock those two locks for some reason; it's probably a dlm bug.
We saw this in the past, but not recently.  Maybe we're not running enough
GFS/NFS tests.

https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=142993

> And the "umount" process occupied 99.9% of CPU.
> The status of the "umount" process was 
> "29121 ?        R     74:08 umount /mnt/gfs/10".
> 
> I use the kernel 2.6.9.

It would be helpful to know if you see this with the current code in cvs
(which needs 2.6.10) and if it happens often.

-- 
Dave Teigland  <teigland at redhat.com>


From sunjw at onewaveinc.com  Mon Feb 28 06:50:10 2005
From: sunjw at onewaveinc.com (=?gb2312?B?y++/oc6w?=)
Date: Mon, 28 Feb 2005 14:50:10 +0800
Subject: [Linux-cluster] GFS umount hang after heavy read/write from
	NFS client
Message-ID: <SERVERAOeIiGXix4A2D00000051@mail.onewaveinc.com>

>On Sun, Feb 27, 2005 at 10:54:01PM +0800, ?????? wrote:
>> Hello,all
>> 
>> I got messages as follows:
>> 	
>> GFS: fsid=MyTest:vgtest.2: Unmount seems to be stalled. Dumping lock state...
>
>> What's the meaning ? GFS lock out,or NFS lock out?
>
>GFS can't unlock those two locks for some reason; it's probably a dlm bug.
>We saw this in the past, but not recently.  Maybe we're not running enough
>GFS/NFS tests.
>
>https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=142993
>
>> And the "umount" process occupied 99.9% of CPU.
>> The status of the "umount" process was 
>> "29121 ?        R     74:08 umount /mnt/gfs/10".
>> 
>> I use the kernel 2.6.9.
>
>It would be helpful to know if you see this with the current code in cvs
>(which needs 2.6.10) and if it happens often.
I checked out the cluster's code on 2005-02-25 with tag RHEL4.
My program opened many files(such as 300) and readed/writed them concurrently,
Sometimes I got error messages as "Stale NFS file handle ".
to reproduce it, I just do:
	.mount GFS
	.start NFS service
	.mount NFS client
	.read from NFS
	.stop NFS service
	.umount GFS
	.mount the same GFS
	.start NFS service
	.read the same files from NFS
	.umount NFS
	.stop NFS service
	.umount GFS 
Then, after some minutes, I got things above.
The process status of "umount" is "R",particular as:
29053 ?        S<     0:00  \_ [lock_dlm1]
   11 ?        S<     0:00 [events/1]
 4051 ?        S<     0:00  \_ [cman_serviced]
29052 ?        S<     0:00  \_ [dlm_recoverd]
29054 ?        S<     0:00  \_ [lock_dlm2]
   12 ?        S<     0:00 [events/2]
 4066 ?        S<     0:01  \_ [dlm_astd]
 4068 ?        S<     0:00  \_ [dlm_sendd]
29543 ?        S<     0:00  \_ [dlm_recoverd]
29545 ?        S<     0:00  \_ [lock_dlm2]
   13 ?        S<     0:00 [events/3]
 4067 ?        S<     0:01  \_ [dlm_recvd]
 4069 ?        S<     0:00  \_ [dlm_recoverd]
29544 ?        S<     0:00  \_ [lock_dlm1]
29611 ?        R    1017:42 umount /mnt/gfs/11


From teigland at redhat.com  Mon Feb 28 07:17:04 2005
From: teigland at redhat.com (David Teigland)
Date: Mon, 28 Feb 2005 15:17:04 +0800
Subject: [Linux-cluster] GFS umount hang after heavy read/write from
	NFS client
In-Reply-To: <SERVERAOeIiGXix4A2D00000051@mail.onewaveinc.com>
References: <SERVERAOeIiGXix4A2D00000051@mail.onewaveinc.com>
Message-ID: <20050228071704.GC5682@redhat.com>

> I checked out the cluster's code on 2005-02-25 with tag RHEL4.
> My program opened many files(such as 300) and readed/writed them concurrently,
> Sometimes I got error messages as "Stale NFS file handle ".
> to reproduce it, I just do:
> 	.mount GFS
> 	.start NFS service
> 	.mount NFS client
> 	.read from NFS
> 	.stop NFS service
> 	.umount GFS
> 	.mount the same GFS
> 	.start NFS service
> 	.read the same files from NFS
> 	.umount NFS
> 	.stop NFS service
> 	.umount GFS 
> Then, after some minutes, I got things above.

Thanks, that's a big help, we'll work on it.

-- 
Dave Teigland  <teigland at redhat.com>


From sunjw at onewaveinc.com  Mon Feb 28 08:08:01 2005
From: sunjw at onewaveinc.com (=?gb2312?B?y++/oc6w?=)
Date: Mon, 28 Feb 2005 16:08:01 +0800
Subject: [Linux-cluster] GFS umount hang after heavy read/write
	from NFS client
Message-ID: <SERVERRKLpgTFur2iz20000009f@mail.onewaveinc.com>

>
>Thanks, that's a big help, we'll work on it.
I updated the code from cvs just now(with tag RHEL4), 
and rebuilded it, and I got the same result.
I did as follows:
	.mount GFS
	.start NFS service
	.mount NFS client
	.concurrently read from NFS
	.umount NFS client
	.stop NFS service
	.umount GFS
Then I blocked there. I got the message "Stale NFS file handle" again.
the process status of "umount" was "R+".


From teigland at redhat.com  Mon Feb 28 08:17:57 2005
From: teigland at redhat.com (David Teigland)
Date: Mon, 28 Feb 2005 16:17:57 +0800
Subject: [Linux-cluster] GFS umount hang after heavy read/write
	from NFS client
In-Reply-To: <SERVERRKLpgTFur2iz20000009f@mail.onewaveinc.com>
References: <SERVERRKLpgTFur2iz20000009f@mail.onewaveinc.com>
Message-ID: <20050228081757.GE5682@redhat.com>

On Mon, Feb 28, 2005 at 04:08:01PM +0800, ?????? wrote:

> 	.concurrently read from NFS

What do you mean by this?

-- 
Dave Teigland  <teigland at redhat.com>


From sunjw at onewaveinc.com  Mon Feb 28 08:37:03 2005
From: sunjw at onewaveinc.com (=?gb2312?B?y++/oc6w?=)
Date: Mon, 28 Feb 2005 16:37:03 +0800
Subject: [Linux-cluster] GFS umount hang after heavy
	read/write from NFS client
Message-ID: <SERVERKd6QR7j1HaKtK000000ba@mail.onewaveinc.com>

>> 	.concurrently read from NFS
>
>What do you mean by this?
My test program is something like iozone, but is multithreading.
On each thread, it reads one file with speed limit about 1MB/s.
And all threads start reading at the same time, so I refer it as concurrent read.

By the way, in spite of how I get it,
every time the umount hangs, I get the same message as:
Glock (2, 26) <<the same
  gl_flags =
  gl_count = 2
  gl_state = 0
  req_gh = no
  req_bh = no
  lvb_count = 0
  object = yes
  new_le = no
  incore_le = no
  reclaim = no
  aspace = 0
  ail_bufs = no
  Inode:
    num = 26/26
    type = 2
    i_count = 1
    i_flags =
    vnode = yes
Glock (5, 26) <<the same
  gl_flags =
  gl_count = 2
  gl_state = 3
  req_gh = no
  req_bh = no
  lvb_count = 0
  object = yes
  new_le = no
  incore_le = no
  reclaim = no
  aspace = no
  ail_bufs = no
  Holder
    owner = -1
    gh_state = 3
    gh_flags = 5 7
    error = 0
    gh_iflags = 1 6 7


From ialberdi at histor.fr  Mon Feb 28 13:05:21 2005
From: ialberdi at histor.fr (Ion Alberdi)
Date: Mon, 28 Feb 2005 14:05:21 +0100
Subject: [Linux-cluster] Configuring rgmanager
In-Reply-To: <421F42C1.8010506@histor.fr>
References: <421B1F98.1040708@histor.fr>	<1109184126.12299.11.camel@ayanami.boston.redhat.com>	<421DCFD1.5010606@histor.fr>	<1109273352.12299.50.camel@ayanami.boston.redhat.com>
	<421F42C1.8010506@histor.fr>
Message-ID: <42231711.3050603@histor.fr>


>>
>> Failover will not occur until after CMAN (or gulm) says the node is dead
>> and has been fenced.  When using the kernel Service Manager (provided by
>> CMAN), recovery is in the following order:
>>
>> (1) Fencing
>> (2) Locking
>> (3) GFS
>> (4) User services (e.g. rgmanager)
>>
>> How long did you wait? :)
>>
>>  
>>
Results from my tests with two nodes(buba and gump)(and latest 
cvs(update done today)):
I tried to put a basic script in failover on two nodes.
Initialization:
-ccsd, cman_tool join fence_tool join on the two nodes
Then I start the rgmanager on the two nodes:
 the script coucou (echo `uname -n` >> bla.txt) is launched on one of 
the two nodes.
With clusvcadm I made this script ran on gump, and I rebooted gump:
There is the syslog on buba:
Feb 28 13:20:17 buba kernel: CMAN: removing node gump from the cluster : 
Missed too many heartbeats
Feb 28 13:20:17 buba fenced[7573]: gump not a cluster member after 0 sec 
post_fail_delay
Feb 28 13:20:17 buba fenced[7573]: fencing node "gump"
Feb 28 13:20:20 buba fence_manual: Node 200.0.0.102 needs to be reset 
before recovery can procede.  Waiting for 200.0.0.102 to rejoin the 
cluster or for manual acknowledgement that it has been reset (i.e. 
fence_ack_manual -n 200.0.0.102)
Feb 28 13:20:29 buba fenced[7573]: fence "gump" success
Feb 28 13:20:32 buba clurgmgrd[7581]: <notice> Taking over resource 
group coucou from down member (null)
Feb 28 13:20:32 buba clurgmgrd[7581]: <notice> Resource group coucou started

Then gump came and rejoined the cluster: syslog of buba:
Feb 28 13:23:47 buba kernel: CMAN: node gump rejoining

I put the script on gump(always with clusvcadm):
Feb 28 13:24:50 buba clurgmgrd[7581]: <notice> Stopping resource group 
coucou
Feb 28 13:24:50 buba clurgmgrd[7581]: <notice> Resource group coucou is 
stopped
Feb 28 13:24:50 buba clurgmgrd[7581]: <notice> Resource group coucou is 
now running on member 2

Then I re rebooted (:)) gump and there came the problems:
Gump was removed from the cluster
Feb 28 13:25:57 buba kernel: CMAN: removing node gump from the cluster : 
Missed too many heartbeats
Feb 28 13:25:57 buba fenced[7573]: gump not a cluster member after 0 sec 
post_fail_delay
Feb 28 13:25:57 buba fenced[7573]: fencing node "gump"
Feb 28 13:26:03 buba fence_manual: Node 200.0.0.102 needs to be reset 
before recovery can procede.  Waiting for 200.0.0.102 to rejoin the 
cluster or for manual acknowledgement that it has been reset (i.e. 
fence_ack_manual -n 200.0.0.102)
Feb 28 13:26:14 buba fenced[7573]: fence "gump" success

And there, the rgmanager did nothing
when I looked to /proc/cluster/services I had:
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           1   2 run       -
[1]

DLM Lock Space:  "Magma"                             3   4 run       -
[1]

User:            "usrm::manager"                     2   3 recover 2 -
[1]

Whereas I had, during the first reboot of gump:

Service          Name                              GID LID State     Code
Fence Domain:    "default"                           1   2 run       -
[1]

DLM Lock Space:  "Magma"                             3   4 run       -
[1]

User:            "usrm::manager"                     2   3 run       -
[1]

Then I tried to bring gump back:

And there is what I had in gump:
[root at gump ~]# cat /proc/cluster/services
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           1   2 run       -
[1 2]

User:            "usrm::manager"                     0   3 join      
S-1,80,2
[]

So there, nothing worked, I hopelessly tried to restart the rgmanager on 
the two nodes, but nothing worked, I had states where in gump

[root at gump ~]# cat /proc/cluster/services
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           1   2 run       -
[1 2]

DLM Lock Space:  "Magma"                             3   6 run       -
[2]

User:            "usrm::manager"                     4   5 run       -
[2]

and in buba:
[root at buba ~]# cat /proc/cluster/services
Service          Name                              GID LID State     Code
Fence Domain:    "default"                           1   2 run       -
[1 2]

(buba seems not to have any clurgmrgrd running, even if I started the 
rgmanager...)

I don't know if it's a bug of the rgmanager or if I'm doing something 
wrong, but I don't understand why during the first reboot everything 
worked and nothing then...


From mtilstra at redhat.com  Mon Feb 28 14:02:36 2005
From: mtilstra at redhat.com (Michael Conrad Tadpol Tilstra)
Date: Mon, 28 Feb 2005 08:02:36 -0600
Subject: [Linux-cluster] GFS 6.0.2-24 + NFS (ALSO)
In-Reply-To: <200502271110.56310.cjkovacs@verizon.net>
References: <200502261157.48143.cjkovacs@verizon.net>
	<200502271110.56310.cjkovacs@verizon.net>
Message-ID: <4223247C.4030500@redhat.com>

Corey Kovacs wrote:
> Also, on this same cluster, when using the "gulm stonith" fencing module in 
> clumanager, I get errors generated by ...
> 
> log_err("Protocol Mismatch: We're %#x and They're %#x\n",   GIO_WIREPROT_VERS, 
> x_proto); which, by looking at the surrounding code seems to indicate the the 
> fence device login is failing.

I would guess that would be from the "gulm stonith" module trying to 
connect to gulm.  It rather sounds like someone didn't rebuild the rpm 
that has that module in it recently.

Every time a change is made to the structures that gulm uses, the 
protocol version is incremented.  You get that error message when trying 
to mix two versions.

-- 
Michael Conrad Tadpol Tilstra
What sane person could live in this world and not be crazy?

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 256 bytes
Desc: OpenPGP digital signature
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050228/0e8622c9/attachment.sig>

From pcaulfie at redhat.com  Mon Feb 28 15:02:38 2005
From: pcaulfie at redhat.com (Patrick Caulfield)
Date: Mon, 28 Feb 2005 15:02:38 +0000
Subject: [Linux-cluster] node kicked out of cluster
In-Reply-To: <1109036063.24124.7.camel@ibm-c.pdx.osdl.net>
References: <1109036063.24124.7.camel@ibm-c.pdx.osdl.net>
Message-ID: <20050228150238.GC18006@tykepenguin.com>

OK, I think we've found it. Or to be more accurate, Dave found it.

There was a counter that was rolling back to zero and causing problems. This
would explain why your tests always failed after a similar amount of time.

The fix in CVS. Let us know how you get on. 

-- 

patrick


From rajkum2002 at rediffmail.com  Mon Feb 28 15:14:47 2005
From: rajkum2002 at rediffmail.com (Raj  Kumar)
Date: 28 Feb 2005 15:14:47 -0000
Subject: [Linux-cluster] client doesnt start when lock master is not
	ready
Message-ID: <20050228151447.776.qmail@webmail28.rediffmail.com>

Thanks Adam! This explains the cause of the problem.

Yes, I started experiencing this problem after the upgrade from GFS-6.0.0-15 to GFS-6.0.2-24. 

Thank you,
Raj


On Thu, 24 Feb 2005 Adam Manthei wrote :
>On Thu, Feb 24, 2005 at 04:45:27PM -0000, Raj  Kumar wrote:
> > Hi All,
> >
> > We have a two node system using GFS. One of them is the lock server and
> > other is just client. We restarted our servers recently and brought the
> > lock client before bringing up the lock master. lock_gulmd is set to
> > restart at levels 3, 4 and 5. The lock client system just hungup with the
> > message "Starting lock_gulmd..." in the boot process. It's clear that this
> > situation happened since lock master server wasn't available then. When
> > the lock master server started the lock client system started successfully.
>
>This is the desired behavior.  Adjust the following value in
>/etc/sysconfig/gfs if you don't like it's behavior.
>
># GULM_QUORUM_TIMEOUT -- amount of time to wait for there to be a master
>#     before giving up.  If GULM_QUORUM_TIMEOUT is positive, then we will
>#     wait GULM_QUORUM_TIMEOUT seconds before giving up and failing when
>#     a master server is not found.  If GULM_QUORUM_TIMEOUT is zero, then
>#     wait indefinately for a master server.  If GULM_QUORUM_TIMEOUT is
>#     negative, just start lock_gulmd and not worry about whether it is
>#     quorate.
>GULM_QUORUM_TIMEOUT=300
>
> > I noticed before client system started even when lock master was not
> > available and the status of the lock_gulmd on client was set to "pending".
> > But now the system doesnt start until the master server is also started.
>
>Did you have the system mounting GFS automatically?  Apparently not since it
>would have "hung" there too.  The client node should have eventually timed
>out after 5 minutes without a master server to log into.
>
> > Has this changed recently?
>
>Define recently... sort of need the version information you are using :)
>
>My guess is that since you are complaining about this behavior, you just
>upgraded from GFS-6.0.0-15 to GFS-6.0.2-24.  From the rpm change log:
>
>* Mon Nov 15 2004 Chris Feist <cfeist at redhat.com> 6.0.2-0
>- init.d/lock_gulmd will not start if quorum is not established after
>   a specified time (rbz135732).
>- init.d/lock_gulmd will not stop if GFS is mounted (rbz135730).
>- pool init.d scripts no longer hang on startup until console input
>   is provided (rbz137382).
>
>
> > It is possible that other administrators in the
> > group may have to restart the system at times. If they start the client
> > before master (or worse they dont start master at all) then the system will
> > not complete its boot process and other services remain unavailable.
>
>Your nodes won't be able to mount GFS if there cluster the gulm servers
>aren't quorate, so what's the problem?
>
> > I like
> > the system to complete its boot process and have the lock_gulmd stay in
> > pending state until master comes back. Is there any trick to achieve this
> > behavior?
>
>GULM_QUORUM_TIMEOUT=-1
>
>One other suggestion.  I usually start sshd immediately after networking on
>my machines so that I can get into them as soon as possible.  This often
>helps when dealing with complaints of this nature.
>--
>Adam Manthei  <amanthei at redhat.com>
>
>--
>Linux-cluster mailing list
>Linux-cluster at redhat.com
>http://www.redhat.com/mailman/listinfo/linux-cluster
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050228/dab4e6b3/attachment.htm>

From oracle at provocation.net  Mon Feb 28 20:31:43 2005
From: oracle at provocation.net (Zenon Panoussis)
Date: Mon, 28 Feb 2005 21:31:43 +0100
Subject: [Linux-cluster] Wire encryption
Message-ID: <42237FAF.3050106@provocation.net>


Having read the (excellent) HOWTO at
https://www.redhat.com/docs/manuals/enterprise/RHEL-3-Manual/cluster-suite/
but not even started testing GFS, I miss some information on how the
nodes communicate with each-other. What protocols and ports are used?

The reason I ask is that I wonder whether GFS, together with stunnel
or similar, could be used to create a SAN that's physically connected
through the internet, more or less along the ideas of sfs (www.fs.net).
Has anyone seen any documentation of such attempts? Is there any reason
why the idea could be flawed by definition?

Z


-- 
The best defence against logic is ignorance.


From treed at ultraviolet.org  Thu Feb 24 17:02:02 2005
From: treed at ultraviolet.org (Tracy R Reed)
Date: Thu, 24 Feb 2005 09:02:02 -0800
Subject: [Linux-cluster] GFS and iscsi?
In-Reply-To: <20050224164527.6384.qmail@webmail29.rediffmail.com>
References: <20050224164527.6384.qmail@webmail29.rediffmail.com>
Message-ID: <20050224170202.GD24154@ultraviolet.org>

What SAN/LAN technology are most people using with GFS these days? I know
it is traditionally used with fibrechannel but that remains very expensive
technology. I recently heard from someone contemplating using it with
iscsi. Is that known to work?

-- 
Tracy Reed
http://ultraviolet.org
This message is cryptographically signed for your protection.
Info: http://copilotconsulting.com/sig
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050224/3ec326a4/attachment.sig>

From bkorb at veritas.com  Thu Feb 24 18:03:53 2005
From: bkorb at veritas.com (Bruce Korb)
Date: Thu, 24 Feb 2005 10:03:53 -0800
Subject: [Linux-cluster] usage.txt hint
Message-ID: <200502241003.53736.bkorb@veritas.com>

Hi,

I'm just getting started figuring this stuff out.
I tend to avoid doing things as root, so I choked.
It would be useful to mention that the build must
be done as root with the files and directories
modifiable by root.

Attached is a quick hack I threw together several
years ago for keeping build clutter separate from
source trees.  This build seems to require building
in the source tree.

Regards, Bruce
-------------- next part --------------
A non-text attachment was scrubbed...
Name: clone.c
Type: text/x-csrc
Size: 13812 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20050224/2fec4304/attachment.bin>

From bkorb at veritas.com  Thu Feb 24 20:55:14 2005
From: bkorb at veritas.com (Bruce Korb)
Date: Thu, 24 Feb 2005 12:55:14 -0800
Subject: [Linux-cluster] Re: usage.txt hint
In-Reply-To: <200502241003.53736.bkorb@veritas.com>
References: <200502241003.53736.bkorb@veritas.com>
Message-ID: <200502241255.15075.bkorb@veritas.com>

On Thursday 24 February 2005 10:03 am, Bruce Korb wrote:
> Hi,
> 
> I'm just getting started figuring this stuff out.

Another hint to the novice:  the kernel sources not only have to
be configured, but built as well!  Anyway, I did so and get
this in my build of the clustering software:

  CC [M]  /home/bkorb/RHCluster/build-cluster/gfs-kernel/src/gfs/quota.o
/home/bkorb/RHCluster/build-cluster/gfs-kernel/src/gfs/quota.c: \
 In function `print_quota_message':
/home/bkorb/RHCluster/build-cluster/gfs-kernel/src/gfs/quota.c:993: \
 warning: passing arg 2 of pointer to function makes integer from pointer without a cast
/home/bkorb/RHCluster/build-cluster/gfs-kernel/src/gfs/quota.c:993: \
 warning: passing arg 3 of pointer to function makes pointer from integer without a cast
/home/bkorb/RHCluster/build-cluster/gfs-kernel/src/gfs/quota.c:993: \
 error: too few arguments to function
make[5]: *** [/home/bkorb/RHCluster/build-cluster/gfs-kernel/src/gfs/quota.o] Error 1
make[4]: *** [_module_/home/bkorb/RHCluster/build-cluster/gfs-kernel/src/gfs] Error 2
make[4]: Leaving directory `/usr/src/linux-build'
make[3]: *** [all] Error 2
make[3]: Leaving directory `/home/bkorb/RHCluster/build-cluster/gfs-kernel/src/gfs'
make[2]: *** [install] Error 2
make[2]: Leaving directory `/home/bkorb/RHCluster/build-cluster/gfs-kernel/src'
make[1]: *** [install] Error 2
make[1]: Leaving directory `/home/bkorb/RHCluster/build-cluster/gfs-kernel'
make: *** [all] Error 2

I can unwind the issue farther than the source below, but
if there is already a known issue & resolution, then I'd avoid
the bother:

static int
print_quota_message(struct gfs_sbd *sdp, struct gfs_quota_data *qd, char *type)
{
	ENTER(GFN_PRINT_QUOTA_MESSAGE)
	struct tty_struct *tty;
	char *line;
	int len;

	line = kmalloc(256, GFP_KERNEL);
	if (!line)
		RETURN(GFN_PRINT_QUOTA_MESSAGE, -ENOMEM);

	len = snprintf(line, 256, "GFS: fsid=%s: quota %s for %s %u\r\n",
		       sdp->sd_fsname, type,
		       (test_bit(QDF_USER, &qd->qd_flags)) ? "user" : "group",
		       qd->qd_id);

	if (current->signal) {
		tty = current->signal->tty;
		if (tty && tty->driver->write)
			tty->driver->write(tty, line, len);  <<--------- line at issue
//  Of course, the usage of "len" is bogus, too.  You have not verified
//  that "len" is less than 256.  If you a priori know it is less than
//  256, then you could as well use "sprintf" instead.
	}

	kfree(line);

	RETURN(GFN_PRINT_QUOTA_MESSAGE, 0);
}


From bkorb at veritas.com  Thu Feb 24 21:15:01 2005
From: bkorb at veritas.com (Bruce Korb)
Date: Thu, 24 Feb 2005 13:15:01 -0800
Subject: [Linux-cluster] print_quota_message
Message-ID: <200502241315.01105.bkorb@veritas.com>

It was missing the "from user" argument to the write call:

static int
print_quota_message(struct gfs_sbd *sdp, struct gfs_quota_data *qd, char *type)
{
	struct tty_struct *tty;
	ENTER(GFN_PRINT_QUOTA_MESSAGE)

	/* avoid doing work if we're not going to emit the text */
	if (current->signal == 0)
		RETURN(GFN_PRINT_QUOTA_MESSAGE, 0);

	tty = current->signal->tty;
	if ((tty == NULL) || (tty->driver->write == NULL))
		RETURN(GFN_PRINT_QUOTA_MESSAGE, 0);

	{
		int len;
		char *line = kmalloc(256, GFP_KERNEL);
		if (!line)
			RETURN(GFN_PRINT_QUOTA_MESSAGE, -ENOMEM);

		len = snprintf(line, 256, "GFS: fsid=%s: quota %s for %s %u\r\n",
			       sdp->sd_fsname, type,
			       (test_bit(QDF_USER, &qd->qd_flags))
			       ? "user" : "group", qd->qd_id);
		if (len > 255)
			len = 255;

		tty->driver->write(tty, 0, line, len);
		kfree(line);
	}

	RETURN(GFN_PRINT_QUOTA_MESSAGE, 0);
}