From Fredrik.Hudner at evry.com Fri May 3 13:33:38 2013
From: Fredrik.Hudner at evry.com (Fredrik Hudner)
Date: Fri, 3 May 2013 13:33:38 +0000
Subject: [Linux-cluster] fence_vmware_soap sslv3 alert certificate unknown
Message-ID: <64275F83AE588D4EBF165BF5D8783D41042D3B@ccdex003>

Dear all,

I have a pacemaker cluster and need to set up a stonith fencing agent, in this case fence_vmware_soap.

Environment:
CentOS 6.3
fence-agents.x86_64

I'm running the command manually with different options:

# fence_vmware_soap -o off -a vcenter-address -l drift\vcenter_tdtestclu -p password -n tdtestclu02 -u 443
Unable to connect/login to fencing device

# fence_vmware_soap -o off -a 192.168.231.31 -l drift\vcenter_tdtestclu -p password -z -n tdtestclu02 -u 443
No handlers could be found for logger "suds.client"
Unable to connect/login to fencing device

In vCenter's (5.1) system logs I can see the following errors:

2013-05-03T14:00:07.031+02:00 [07800 error 'Default'] [0] error:14094416:SSL routines:SSL3_READ_BYTES:sslv3 alert certificate unknown
2013-05-03T14:00:07.031+02:00 [07800 error 'Default'] SSLStreamImpl::DoServerHandshake (000000005d11ce30) SSL_accept failed. Dumping SSL error queue:
2013-05-03T14:00:07.031+02:00 [07800 warning 'ProxySvc'] SSL Handshake failed for stream TCPStreamWin32(socket=TCP(fd=31640) local=vcenter-address:443, peer=vcenter-address:53876), error: class Vmacore::Ssl::SSLException(SSL Exception: error:14094416:SSL routines:SSL3_READ_BYTES:sslv3 alert certificate unknown)

The question is: is the unknown certificate the real problem here? And if so, on which host is it actually missing (the source host, vCenter, or the target host)?

Any other clues on how to get this to work are much appreciated (and if you need more information, please let me know).

Kind regards
/Fred

From mgrac at redhat.com Fri May 3 14:13:56 2013
From: mgrac at redhat.com (Marek Grac)
Date: Fri, 03 May 2013 16:13:56 +0200
Subject: [Linux-cluster] fence_vmware_soap sslv3 alert certificate unknown
In-Reply-To: <64275F83AE588D4EBF165BF5D8783D41042D3B@ccdex003>
References: <64275F83AE588D4EBF165BF5D8783D41042D3B@ccdex003>
Message-ID: <5183C624.1000807@redhat.com>

On 05/03/2013 03:33 PM, Fredrik Hudner wrote:
> Dear all,
>
> I have a pacemaker cluster and need to set up a stonith fencing agent, in this case fence_vmware_soap.
>
> Environment:
> CentOS 6.3
> fence-agents.x86_64
>
> I'm running the command manually with different options:
>
> # fence_vmware_soap -o off -a vcenter-address -l drift\vcenter_tdtestclu -p password -n tdtestclu02 -u 443
> Unable to connect/login to fencing device
>
> # fence_vmware_soap -o off -a 192.168.231.31 -l drift\vcenter_tdtestclu -p password -z -n tdtestclu02 -u 443
> No handlers could be found for logger "suds.client"
> Unable to connect/login to fencing device

It looks like you are trying to connect to the API on port 443 without using SSL (on the command line you can use --ssl; there is no need to use -u).

m,
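For reference, a sketch of the invocation Marek's advice points to -- the address, login, and node name are the placeholders used in this thread, not verified values. Note also that an unquoted backslash in drift\vcenter_tdtestclu is consumed by the shell, so the login should be quoted:

# check status first; safer than powering the node off
# (-z enables SSL; with it set, the agent uses the default SSL port, so -u can be dropped)
fence_vmware_soap -a vcenter-address -l 'drift\vcenter_tdtestclu' -p password -z -n tdtestclu02 -o status

Running with -o status (or -o list, to enumerate VMs the credentials can see) confirms connectivity before trying a real fencing action like -o off.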
From Fredrik.Hudner at evry.com Fri May 3 14:25:55 2013
From: Fredrik.Hudner at evry.com (Fredrik Hudner)
Date: Fri, 3 May 2013 14:25:55 +0000
Subject: [Linux-cluster] fence_vmware_soap sslv3 alert certificate unknown
In-Reply-To: <5183C624.1000807@redhat.com>
References: <64275F83AE588D4EBF165BF5D8783D41042D3B@ccdex003> <5183C624.1000807@redhat.com>
Message-ID: <64275F83AE588D4EBF165BF5D8783D41042DC4@ccdex003>

On 05/03/2013 03:33 PM, Fredrik Hudner wrote:
> Dear all,
>
> I have a pacemaker cluster and need to set up a stonith fencing agent, in this case fence_vmware_soap.
>
> Environment:
> CentOS 6.3
> fence-agents.x86_64
>
> I'm running the command manually with different options:
>
> # fence_vmware_soap -o off -a vcenter-address -l drift\vcenter_tdtestclu -p password -n tdtestclu02 -u 443
> Unable to connect/login to fencing device
>
> # fence_vmware_soap -o off -a 192.168.231.31 -l drift\vcenter_tdtestclu -p password -z -n tdtestclu02 -u 443
> No handlers could be found for logger "suds.client"
> Unable to connect/login to fencing device

It looks like you are trying to connect to the API on port 443 without using SSL (on the command line you can use --ssl; there is no need to use -u).

m,

I thought the -z option did that? Besides, if I don't use -u it defaults to port 23 (telnet), and they will never open that in the firewall for me :)

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

From Fredrik.Hudner at evry.com Sat May 4 13:50:33 2013
From: Fredrik.Hudner at evry.com (Fredrik Hudner)
Date: Sat, 4 May 2013 13:50:33 +0000
Subject: [Linux-cluster] fence_vmware_soap sslv3 alert certificate unknown
In-Reply-To: <64275F83AE588D4EBF165BF5D8783D41042DC4@ccdex003>
References: <64275F83AE588D4EBF165BF5D8783D41042D3B@ccdex003> <5183C624.1000807@redhat.com> <64275F83AE588D4EBF165BF5D8783D41042DC4@ccdex003>
Message-ID: <64275F83AE588D4EBF165BF5D8783D41042FCA@ccdex003>

On 05/03/2013 03:33 PM, Fredrik Hudner wrote:
> [...]

It looks like you are trying to connect to the API on port 443 without using SSL (on the command line you can use --ssl; there is no need to use -u).

m,

I thought the -z option did that? Besides, if I don't use -u it defaults to port 23 (telnet), and they will never open that in the firewall for me :)

I tried with the --ssl option and still get the same result.
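One way to look at the handshake outside the fence agent is openssl's s_client -- a quick check only, using the placeholder address from this thread:

# show the certificate chain vCenter presents and the verify result
openssl s_client -connect vcenter-address:443

The "sslv3 alert certificate unknown" seen in the vCenter log during SSL_accept is an alert sent by the connecting peer, which suggests the client side rejected vCenter's certificate during the handshake rather than vCenter rejecting the client.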
From mgrac at redhat.com Thu May 9 07:20:54 2013
From: mgrac at redhat.com (Marek Grac)
Date: Thu, 09 May 2013 09:20:54 +0200
Subject: [Linux-cluster] fence_vmware_soap sslv3 alert certificate unknown
In-Reply-To: <64275F83AE588D4EBF165BF5D8783D41042DC4@ccdex003>
References: <64275F83AE588D4EBF165BF5D8783D41042D3B@ccdex003> <5183C624.1000807@redhat.com> <64275F83AE588D4EBF165BF5D8783D41042DC4@ccdex003>
Message-ID: <518B4E56.9030809@redhat.com>

On 05/03/2013 04:25 PM, Fredrik Hudner wrote:
> [...]
> I thought the -z option did that?
> Besides, if I don't use -u it defaults to port 23 (telnet), and they will never open that in the firewall for me :)

Yes, you are right (there is no need to set a port if it is the default one and -z/--ssl is set). Which version of VMware do you have? Take a look at https://access.redhat.com/site/articles/2860 to check that it is supported and that no work-arounds are needed.

m,

From ssloh at singnet.com.sg Fri May 10 06:35:04 2013
From: ssloh at singnet.com.sg (ssloh)
Date: Fri, 10 May 2013 14:35:04 +0800
Subject: [Linux-cluster] cluster issue
References:
Message-ID: <288507E9FAC14412BC43756C95DF9EB3@vince>

----- Original Message -----
From: "santosh lohar"
To:
Sent: Tuesday, September 28, 2010 2:44 PM
Subject: [Linux-cluster] cluster issue

Hi all,

I am facing a problem with SGE and FlexLM licensing; details are below:

*Hardware:* IBM 3650, 2 quad-core CPUs, 16 GB RAM; two nodes plus one master node, connected through an IB switch.

*Software:* ROCKS 5.1 / OS: RHEL 4 (mars hill) / Fluent / MSC Mentat.

Problem:

1. When I submit jobs with SGE, "qhost -F MDAdv" shows the updated status of licenses issued and available, but when I submit jobs outside SGE it does not recognize the latest status of the license tokens.

2. When jobs are submitted beyond 4 CPUs, cluster computation slows down.

Kindly suggest what to do in this case; thanks in advance.

Regards
Santosh

On Mon, Sep 27, 2010 at 11:07 PM, wrote:

> Send Linux-cluster mailing list submissions to
> linux-cluster at redhat.com
>
> To subscribe or unsubscribe via the World Wide Web, visit
> https://www.redhat.com/mailman/listinfo/linux-cluster
> or, via email, send a message with subject or body 'help' to
> linux-cluster-request at redhat.com
>
> You can reach the person managing the list at
> linux-cluster-owner at redhat.com
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Linux-cluster digest..."
>
> Today's Topics:
>
> 1. Unable to patch conga (fosiul alam)
> 2. Re: ricci is very unstable in one nodes (Paul M. Dyer)
> 3. Re: problem with quorum at cluster boot (brem belguebli)
> 4. Re: ricci is very unstable in one nodes (fosiul alam)
> 5. Re: ricci is very unstable in one nodes (fosiul alam)
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Mon, 27 Sep 2010 17:02:20 +0100
> From: fosiul alam
> To: linux clustering
> Subject: [Linux-cluster] Unable to patch conga
> Message-ID:
> Content-Type: text/plain; charset="iso-8859-1"
>
> Hi,
> Due to the same issue (I see the exact same problem in my luci interface),
> I am trying to patch conga.
>
> I downloaded
> http://mirrors.kernel.org/centos/5/os/SRPMS/conga-0.12.2-12.el5.centos.1.src.rpm
> rpm -i conga-0.12.2-12.el5.centos.1.src.rpm
> cd /usr/src/redhat/SOURCES
> tar -xvzf conga-0.12.2.tar.gz
> patch -p0 < /path/to/where_the_patch/ricci.patch
>
> [root at beaver SOURCES]# cd conga-0.12.2
>
> Now I am facing a problem with the install:
>
> ./autogen.sh --include_zope_and_plone=yes
> Zope-2.9.8-final.tgz passed sha512sum test
> Plone-2.5.5.tar.gz passed sha512sum test
> cat: clustermon.spec.in.in: No such file or directory
>
> Run `./configure` to configure conga build,
> or `make srpms` to build conga and clustermon srpms
> or `make rpms` to build all rpms
>
> [root at beaver conga-0.12.2]# ./configure --include_zope_and_plone=yes
> D-BUS version 1.1.2 detected -> major 1, minor 1
> missing zope directory, extract zope source-code into it and try again
>
> Now, how do I tell ./configure where zope and plone are?
> Do I need zope and plone at all?
>
> Please give me some advice.
>
> Fosiul
> -------------- next part --------------
> An HTML attachment was scrubbed...
> URL: <https://www.redhat.com/archives/linux-cluster/attachments/20100927/21959f19/attachment.html>
>
> ------------------------------
>
> Message: 2
> Date: Mon, 27 Sep 2010 11:55:28 -0500 (CDT)
> From: "Paul M. Dyer"
> To: linux clustering
> Subject: Re: [Linux-cluster] ricci is very unstable in one nodes
> Message-ID: <1480320.10.1285606528829.JavaMail.root at athena>
> Content-Type: text/plain; charset=utf-8
>
> http://rhn.redhat.com/errata/RHBA-2010-0716.html
>
> It appears that this problem has been fixed in this errata.
>
> I installed the luci and ricci updates and did some light testing. So far,
> the timeout 11111 error has not shown up.
>
> Paul
>
> ----- Original Message -----
> From: "fosiul alam"
> To: "linux clustering"
> Sent: Monday, September 27, 2010 10:48:27 AM
> Subject: Re: [Linux-cluster] ricci is very unstable in one nodes
>
> Hi,
> I am trying to patch ricci; let's see how it goes.
>
> But clusvcadm is failing as well:
>
> [root at http1 ~]# clusvcadm -e httpd1 -m http1.xxxx.local
> Member http1.xxxx.local trying to enable service:httpd1...Invalid
> operation for resource
>
> Here http1 is where I was trying to run the service from luci.
>
> What could be the problem?
> Is there any way to find out if there is a problem with the config?
>
> On 27 September 2010 16:26, Ben Turner <bturner at redhat.com> wrote:
>
> RHEL 5.6 hasn't been released yet, so your package probably contains the
> problem. I'm not sure how in sync CentOS is with RHEL, or whether they
> patch earlier, so I cannot give you a time frame for when it will be in
> CentOS or whether they have already patched it. The problem in that BZ is
> more of an annoyance; you usually just have to retry a time or two and it
> works. If you can't get luci working properly with your service at all,
> you should try enabling the service through the command line with
> clusvcadm -e. If it is not working from the command line either, then
> there is a problem with the service config.
>
> -Ben
>
> ----- "fosiul alam" <expertalert at gmail.com> wrote:
>
> > Hi Ben,
> > Thanks.
> >
> > I named this cluster mysql-server, but I have not installed the mysql
> > database in there yet.
> >
> > Both luci and ricci on the luci server and node1 are running this
> > version:
> >
> > luci-0.12.2-12.el5.centos.1
> > ricci-0.12.2-12.el5.centos.1
> >
> > Do you think this version has the problem as well?
> > Thanks for your help.
> >
> > On 24 September 2010 15:33, Ben Turner <bturner at redhat.com> wrote:
> >
> > There is an issue with ricci timeouts that was fixed recently:
> >
> > https://bugzilla.redhat.com/show_bug.cgi?id=564490
> >
> > I'm not sure, but you may be hitting that bug. Symptoms include: luci
> > isn't able to get the status from the node, timeouts when querying
> > ricci, etc. The fix should be released with 5.6.
> >
> > On the mysql service there are some options that you need to set. Here
> > are all the options available to that agent:
> >
> > mysql -- Defines a MySQL database server
> >
> > Attribute: Description
> > config_file: Define configuration file
> > listen_address: Define an IP address for MySQL server. If the address
> >   is not given, then the first IP address from the service is taken.
> > mysqld_options: Other command-line options for mysqld
> > name: Name
> > ref: Reference to existing mysql resource in the resources section
> > service_name: Inherit the service name
> > shutdown_wait: Wait X seconds for correct end of service shutdown
> > startup_wait: Wait X seconds for correct end of service startup
> > __enforce_timeouts: Consider a timeout for operations as fatal
> > __failure_expire_time: Amount of time before a failure is forgotten
> > __independent_subtree: Treat this and all children as an independent subtree
> > __max_failures: Maximum number of failures before returning a failure to a status check
> >
> > If I recall correctly you may need to tweak:
> >
> > shutdown_wait: Wait X seconds for correct end of service shutdown
> > startup_wait: Wait X seconds for correct end of service startup
> >
> > There can be problems relocating the DB if it takes too long to
> > start/shutdown. If you are having problems relocating with luci it may
> > be a good idea to test with:
> >
> > # clusvcadm -r <service> -m <member>
> >
> > -Ben
> >
> > ----- "fosiul alam" <expertalert at gmail.com> wrote:
> >
> > > Hi,
> > > I have a 4-node cluster. It was running fine, but today one node is
> > > giving trouble.
> > >
> > > From the luci GUI interface, when I try to relocate a service onto
> > > this node, or relocate from this node to another node, the GUI shows:
> > >
> > > Unable to retrieve batch 1908047789 status from
> > > beaver.domain.local:11111: clusvcadm start failed to start httpd1:
> > > Starting cluster service "httpd1" on node "http1.domain.local" -- You
> > > will be redirected in 5 seconds.
> > > also:
> > >
> > > The ricci agent for this node is unresponsive. Node-specific
> > > information is not available at this time.
> > >
> > > But ricci is running on the problematic node:
> > > ricci 7324 0.0 0.1 58876 2932 ? S
> > >
> > > There is no firewall running:
> > >
> > > iptables -L
> > > Chain INPUT (policy ACCEPT)
> > > target prot opt source destination
> > >
> > > Chain FORWARD (policy ACCEPT)
> > > target prot opt source destination
> > >
> > > Chain OUTPUT (policy ACCEPT)
> > > target prot opt source destination
> > >
> > > Chain RH-Firewall-1-INPUT (0 references)
> > > target prot opt source destination
> > >
> > > Port 11111 is listening:
> > >
> > > netstat -an | grep 11111
> > > tcp 0 0 0.0.0.0:11111 0.0.0.0:* LISTEN
> > >
> > > But still ricci is very unstable, and I can't relocate any service
> > > onto this node or away from this node.
> > > From the problematic node, if I type clustat:
> > >
> > > Cluster Status for ng1 @ Thu Sep 23 20:24:02 2010
> > > Member Status: Quorate
> > >
> > > Member Name / ID / Status
> > > beaver.xxx.local / 1 / Online, rgmanager  (luci is running from this server)
> > > publicdns1.xxxx.local / 2 / Online, rgmanager
> > > http1.xxxx.local / 3 / Online, Local, rgmanager
> > > mail01.xxxxx.local / 4 / Online, rgmanager
> > >
> > > Service Name / Owner (Last) / State
> > > service:httpd1 / mail01.xxxx.local / started
> > > service:mysql-server / http1.xxxx.local / started  <-- this is the problematic node
> > > service:public-dns / publicdns1.xxxxxx.local / started
> > >
> > > I can't move the service mysql-server from this node, or relocate any
> > > service onto this node. I am very confused.
> > >
> > > What shall I do to fix this issue?
> > >
> > > Thanks for your advice.
> > >
> > > --
> > > Linux-cluster mailing list
> > > Linux-cluster at redhat.com
> > > https://www.redhat.com/mailman/listinfo/linux-cluster
>
> ------------------------------
>
> Message: 3
> Date: Mon, 27 Sep 2010 19:05:06 +0200
> From: brem belguebli
> To: linux clustering
> Subject: Re: [Linux-cluster] problem with quorum at cluster boot
> Message-ID:
> Content-Type: text/plain; charset="iso-8859-1"
>
> The configuration you are trying to build -- 2 cluster nodes (1 vote each)
> plus a quorum disk with 1 vote, making expected_votes = 3 -- must remain up
> if you lose 1 of the members, as long as the remaining node still accesses
> the quorum disk, because there are still 2 active votes (1 remaining node
> + 1 quorum disk), and 2 > expected_votes/2.
>
> The quorum (majority) must be strictly greater than expected_votes/2
> (51% or more) in order for service to continue. (A configuration sketch
> follows below this message excerpt.)
>
> 2010/9/27 Bennie R Thomas
>
> > Try setting your expected votes to 2 or 1.
> >
> > Your cluster is hanging with one node because it wants 3 votes.
> >
> > From: Brem Belguebli / To: linux clustering / Date: 09/25/2010 10:30 AM
> > Subject: Re: [Linux-cluster] problem with quorum at cluster boot
> >
> > On Fri, 2010-09-24 at 12:52 -0400, Jason_Henderson at Mitel.com wrote:
> >
> > > I think you still need two_node="1" in your conf file if you want a
> > > single node to become quorate.
> >
> > two_node="1" is only valid if you do not have a quorum disk.
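For illustration, a minimal cluster.conf vote layout matching the two-node-plus-qdisk setup discussed above -- a sketch only; node names, the qdisk label, and all values are invented, not taken from any poster's configuration:

<!-- expected_votes = 2 node votes + 1 qdisk vote = 3; quorum is then 2 -->
<cman expected_votes="3"/>
<clusternodes>
  <clusternode name="node1.example.com" nodeid="1" votes="1"/>
  <clusternode name="node2.example.com" nodeid="2" votes="1"/>
</clusternodes>
<!-- the quorum disk contributes its vote as long as one node can still reach it -->
<quorumd label="myqdisk" votes="1"/>

With expected_votes="3", quorum is 2, so one surviving node plus the quorum disk stays quorate; and as noted above, two_node="1" should not be combined with a quorum disk.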
> > > linux-cluster-bounces at redhat.com wrote on 09/24/2010 12:38:17 PM:
> > >
> > > > Hello,
> > > >
> > > > I have a 2-node cluster with a qdisk quorum partition;
> > > > each node has 1 vote and the qdisk has 1 vote too; in cluster.conf
> > > > I have this explicit declaration:
> > > >
> > > > When I have both nodes active, cman_tool status tells me this:
> > > >
> > > > Version: 6.1.0
> > > > Nodes: 2
> > > > Expected votes: 3
> > > > Quorum device votes: 1
> > > > Total votes: 3
> > > > Node votes: 1
> > > > Quorum: 2
> > > >
> > > > Then, if I power off a node these values, as expected, change this way:
> > > > Nodes: 1
> > > > Total votes: 2
> > > >
> > > > and the cluster is still quorate and functional.
> > > >
> > > > The problem is if I power off both nodes and then power on only one
> > > > of them: in this case the single node does not become quorate and
> > > > the cluster does not start. I have to power on both nodes to have
> > > > the cluster (and the services on the cluster) working.
> > > >
> > > > I'd like the cluster to work (and boot) even with a single node
> > > > (i.e., if one of the nodes has a hardware failure and is down, I
> > > > still want to be able to reboot the working node and have it bring
> > > > up the cluster correctly).
> > > >
> > > > Any hints? (Thanks for reading all this.)
> > > >
> > > > --
> > > > bye,
> > > > emilio
>
> ------------------------------
>
> Message: 4
> Date: Mon, 27 Sep 2010 18:31:31 +0100
> From: fosiul alam
> To: linux clustering
> Subject: Re: [Linux-cluster] ricci is very unstable in one nodes
> Content-Type: text/plain; charset="iso-8859-1"
>
> Hi,
> Thanks for your advice. Currently I have:
>
> luci-0.12.2-12.el5.centos.1
> ricci-0.12.2-12.el5.centos.1
>
> Are these the same RPMs as:
>
> luci-0.12.2-12.el5_5.4.i386.rpm ?
> ricci-0.12.2-12.el5_5.4.i386.rpm ?
>
> Thanks
>
> On 27 September 2010 17:55, Paul M. Dyer wrote:
> [...]
>
> ------------------------------
>
> Message: 5
> Date: Mon, 27 Sep 2010 18:37:44 +0100
> From: fosiul alam
> To: linux clustering
> Subject: Re: [Linux-cluster] ricci is very unstable in one nodes
> Content-Type: text/plain; charset="iso-8859-1"
>
> Hi, in addition to my previous email, have a look at this one.
>
> From http1 (where I am trying to relocate a service):
>
> [root at http1 ~]# clusvcadm -e httpd1 -m http1.xxxx.local
> Member http1.xxxx.local trying to enable service:httpd1...Success
> Warning: service:httpd1 is now running on mail01.xxxx.local
>
> So it reports Success, but it actually is not: the service ended up on
> mail01 instead of http1.
>
> Thanks again
>
> On 27 September 2010 18:31, fosiul alam wrote:
> [...]
> -------------- next part --------------
> An HTML attachment was scrubbed...
> URL: <https://www.redhat.com/archives/linux-cluster/attachments/20100927/4101fdf9/attachment.html>
>
> ------------------------------
>
> --
> Linux-cluster mailing list
> Linux-cluster at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
> End of Linux-cluster Digest, Vol 77, Issue 23
> *********************************************

--
Santosh

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

From Fredrik.Hudner at evry.com Fri May 10 06:58:08 2013
From: Fredrik.Hudner at evry.com (Fredrik Hudner)
Date: Fri, 10 May 2013 06:58:08 +0000
Subject: [Linux-cluster] fence_vmware_soap sslv3 alert certificate unknown
In-Reply-To: <518B4E56.9030809@redhat.com>
References: <64275F83AE588D4EBF165BF5D8783D41042D3B@ccdex003> <5183C624.1000807@redhat.com> <64275F83AE588D4EBF165BF5D8783D41042DC4@ccdex003> <518B4E56.9030809@redhat.com>
Message-ID: <64275F83AE588D4EBF165BF5D8783D41043F4F@ccdex003>

-----Original message-----
From: linux-cluster-bounces at redhat.com [mailto:linux-cluster-bounces at redhat.com] On behalf of Marek Grac
Sent: 9 May 2013 09:21
To: linux-cluster at redhat.com
Subject: Re: [Linux-cluster] fence_vmware_soap sslv3 alert certificate unknown

On 05/03/2013 04:25 PM, Fredrik Hudner wrote:
> [...]

Yes, you are right (there is no need to set a port if it is the default one and -z/--ssl is set). Which version of VMware do you have? Take a look at https://access.redhat.com/site/articles/2860 to check that it is supported and that no work-arounds are needed.

m,

Thanks for the reply, Marek. I think I have got around the login problem with vCenter: I investigated its log files, and it seems I have to provide an SSL certificate to get access. So now I have to learn how to convert a Windows certificate into one usable on Linux, but I think I understand that part.

Many thanks
/Fredrik

--
Linux-cluster mailing list
Linux-cluster at redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster
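On the "convert a Windows cert" step above: certificates exported from Windows are typically PKCS#12 bundles (.pfx/.p12) or DER-encoded (.cer), while most Linux tools expect PEM. A sketch with openssl -- the file names are invented, and which form vCenter actually hands out depends on the installation:

# PKCS#12 bundle (certificate + private key) to PEM; -nodes leaves the key unencrypted
openssl pkcs12 -in exported.pfx -out cert.pem -nodes

# DER-encoded certificate to PEM
openssl x509 -inform der -in exported.cer -out cert.pem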
From delphine.ramalingom at univ-reunion.fr Mon May 13 06:28:53 2013
From: delphine.ramalingom at univ-reunion.fr (Delphine Ramalingom)
Date: Mon, 13 May 2013 10:28:53 +0400
Subject: [Linux-cluster] error clusvcadm
Message-ID: <51908825.2010902@univ-reunion.fr>

Hello,

I have a problem and I need some help.

Our Linux cluster was stopped for maintenance in the server room, but an error occurred during the shutdown procedure:

Local machine disabling service:HA_MGMT...Failure

The cluster was then powered off electrically. Since the restart, I have not succeeded in restarting the services with clusvcadm. I get this message:

clusvcadm -e HA_MGMT
Local machine trying to enable service:HA_MGMT...Aborted; service failed
and
startFilesystem: Could not match LABEL=postfix with a real device

Do you have a solution for me?

Thanks a lot in advance.

Regards
Delphine

From torajveersingh at gmail.com Mon May 13 06:37:20 2013
From: torajveersingh at gmail.com (Rajveer Singh)
Date: Mon, 13 May 2013 12:07:20 +0530
Subject: [Linux-cluster] error clusvcadm
In-Reply-To: <51908825.2010902@univ-reunion.fr>
References: <51908825.2010902@univ-reunion.fr>
Message-ID:

Hi Delphine,

It seems there is some filesystem crash. Please share your /var/log/messages and /etc/cluster/cluster.conf to help you further.

Regards,
Rajveer Singh

On Mon, May 13, 2013 at 11:58 AM, Delphine Ramalingom <delphine.ramalingom at univ-reunion.fr> wrote:
> [...]
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From delphine.ramalingom at univ-reunion.fr Mon May 13 07:32:27 2013
From: delphine.ramalingom at univ-reunion.fr (Delphine Ramalingom)
Date: Mon, 13 May 2013 11:32:27 +0400
Subject: [Linux-cluster] error clusvcadm
In-Reply-To:
References: <51908825.2010902@univ-reunion.fr>
Message-ID: <5190970B.7030908@univ-reunion.fr>

Hi,

This is the cluster.conf:

[root at titan0 11:29:14 ~]# cat /etc/cluster/cluster.conf
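For the "Could not match LABEL=postfix with a real device" error above: the filesystem resource looks the label up at mount time, so a first check is whether any block device still carries that label after the power cycle. A sketch only -- device names are invented, and this assumes an ext2/3/4 volume:

# list known filesystem labels and see whether "postfix" is among them
blkid | grep -i postfix

# if the device exists but has lost its label, restore it (ext2/3/4)
e2label /dev/mapper/vg_ha-lv_postfix postfix

If the underlying device is missing entirely (for example, SAN or multipath paths that did not come back after the electrical shutdown), the label cannot be matched until the device itself reappears, which is consistent with Rajveer's request to check /var/log/messages.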