[Linux-cluster] openais issue

brem belguebli brem.belguebli at gmail.com
Tue Sep 29 16:51:35 UTC 2009


I use log_level=7 to have more debugging info.

It seems 4 is not enough.
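For reference, this is the kind of change I mean; a minimal sketch of the
logging setting in /etc/cluster/cluster.conf (check the exact attribute names
against the RGManager wiki page mentioned below):

   <!-- raise rgmanager verbosity to debug; bump config_version and
        propagate the file as you normally do -->
   <rm log_level="7">
           ...
   </rm>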

Brem


2009/9/29, Paras pradhan <pradhanparas at gmail.com>:
> With a log_level of 3 I got only this:
>
> Sep 29 10:31:31 cvtst1 rgmanager: [7170]: <notice> Shutting down
> Cluster Service Manager...
> Sep 29 10:31:31 cvtst1 clurgmgrd[6673]: <notice> Shutting down
> Sep 29 10:31:41 cvtst1 clurgmgrd[6673]: <notice> Shutdown complete, exiting
> Sep 29 10:31:41 cvtst1 rgmanager: [7170]: <notice> Cluster Service
> Manager is stopped.
> Sep 29 10:31:42 cvtst1 clurgmgrd[7224]: <notice> Resource Group
> Manager Starting
> Sep 29 10:39:06 cvtst1 rgmanager: [10327]: <notice> Shutting down
> Cluster Service Manager...
> Sep 29 10:39:16 cvtst1 rgmanager: [10327]: <notice> Cluster Service
> Manager is stopped.
> Sep 29 10:39:16 cvtst1 clurgmgrd[10380]: <notice> Resource Group
> Manager Starting
> Sep 29 10:39:52 cvtst1 clurgmgrd[10380]: <notice> Member 1 shutting down
>
> I do not know what the last line means.
>
> rgmanager version I am running is:
> rgmanager-2.0.52-1.el5.centos
>
> I don't know what has gone wrong.
>
> Thanks
> Paras.
>
>
> On Mon, Sep 28, 2009 at 6:41 PM, brem belguebli
> <brem.belguebli at gmail.com> wrote:
> > You mean it stopped successfully on all the nodes but is failing to
> > start only on node cvtst1?
> >
> > Look at the following page to make rgmanager more verbose; it'll
> > help with debugging:
> >
> > http://sources.redhat.com/cluster/wiki/RGManager
> >
> > See the Logging Configuration section.
> >
> >
> >
> >
> > 2009/9/29 Paras pradhan <pradhanparas at gmail.com>:
> >> Brem,
> >>
> >> When I try to restart rgmanager on all the nodes, this time I do not
> >> see rgmanager running on the first node, but I do see it on the other
> >> 2 nodes.
> >>
> >> Log on the first node:
> >>
> >> Sep 28 18:13:58 cvtst1 clurgmgrd[24099]: <notice> Resource Group
> >> Manager Starting
> >> Sep 28 18:17:29 cvtst1 rgmanager: [24627]: <notice> Shutting down
> >> Cluster Service Manager...
> >> Sep 28 18:17:29 cvtst1 clurgmgrd[24099]: <notice> Shutting down
> >> Sep 28 18:17:39 cvtst1 clurgmgrd[24099]: <notice> Shutdown complete, exiting
> >> Sep 28 18:17:39 cvtst1 rgmanager: [24627]: <notice> Cluster Service
> >> Manager is stopped.
> >> Sep 28 18:17:40 cvtst1 clurgmgrd[24679]: <notice> Resource Group
> >> Manager Starting
> >>
> >> -
> >> It seems the service is running, but I do not see rgmanager running in clustat.
> >>
> >>
> >> I don't know what is going on.
> >>
> >> Thanks
> >> Paras.
> >>
> >>
> >> On Mon, Sep 28, 2009 at 5:46 PM, brem belguebli
> >> <brem.belguebli at gmail.com> wrote:
> >>> Paras,
> >>>
> >>> Another thing: it would have been more useful to have a DEBUG trace of
> >>> a start, not of a stop.
> >>>
> >>> That's why I was asking you to first stop the VM manually on all your
> >>> nodes, then stop rgmanager on all the nodes to reset any potentially
> >>> wrong states you may have, and then restart rgmanager.
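> >>>
> >>> Roughly, the sequence would be something like this (a sketch, assuming
> >>> the guest is named guest1 and the stock init scripts; adapt to your setup):
> >>>
> >>>    # on every node: stop the guest outside of cluster control
> >>>    xm shutdown guest1       # or xm destroy guest1 if it won't stop
> >>>    service rgmanager stop
> >>>
> >>>    # once rgmanager is down everywhere, on every node:
> >>>    service rgmanager start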
> >>>
> >>> If your VM is configured to autostart, this will make it start.
> >>>
> >>> It should normally fail (as it does now). Send out your newly created
> >>> DEBUG file.
> >>>
> >>> 2009/9/29 brem belguebli <brem.belguebli at gmail.com>:
> >>>> Hi Paras,
> >>>>
> >>>>
> >>>> I don't know the Xen/cluster combination well, but if I remember
> >>>> correctly, I think I've read somewhere that when using Xen you have to
> >>>> declare the use_virsh=0 key in the VM definition in cluster.conf.
> >>>>
> >>>> This would make rgmanager use xm commands instead of virsh. The DEBUG
> >>>> output clearly shows that you are using virsh to manage your VM instead
> >>>> of xm commands.
> >>>> Check out the RH docs about virtualization.
> >>>>
> >>>> I'm not 100% sure about that; I may be completely wrong.
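> >>>>
> >>>> If that's the case, the vm line in cluster.conf would get that extra
> >>>> attribute; a hypothetical fragment (verify the attribute name against
> >>>> the RH docs):
> >>>>
> >>>>    <vm name="guest1" domain="myfd1" path="/vms" use_virsh="0"
> >>>>        autostart="1" exclusive="0" recovery="restart"/>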
> >>>>
> >>>> Brem
> >>>>
> >>>> 2009/9/28 Paras pradhan <pradhanparas at gmail.com>:
> >>>>> The only thing I noticed after stopping the VM with xm on all nodes and
> >>>>> starting it with clusvcadm is the message:
> >>>>>
> >>>>> "Virtual machine guest1 is blocked"
> >>>>>
> >>>>> The whole DEBUG file is attached.
> >>>>>
> >>>>>
> >>>>> Thanks
> >>>>> Paras.
> >>>>>
> >>>>> On Fri, Sep 25, 2009 at 5:53 PM, brem belguebli
> >>>>> <brem.belguebli at gmail.com> wrote:
> >>>>>> There's a problem with the script that is called by rgmanager to start
> >>>>>> the VM; I don't know what causes it.
> >>>>>>
> >>>>>> Maybe you should try something like:
> >>>>>>
> >>>>>> 1) stop the VM on all nodes with xm commands
> >>>>>> 2) edit the /usr/share/cluster/vm.sh script and add the following
> >>>>>> lines (right after the #!/bin/bash line):
> >>>>>>   exec >/tmp/DEBUG 2>&1
> >>>>>>   set -x
> >>>>>> 3) start the VM with clusvcadm -e vm:guest1
> >>>>>>
> >>>>>> It should fail as it did before.
> >>>>>>
> >>>>>> Edit the /tmp/DEBUG file and you will be able to see where it
> >>>>>> fails (it may generate a lot of debug output).
> >>>>>>
> >>>>>> 4) remove the debug lines from /usr/share/cluster/vm.sh
> >>>>>>
> >>>>>> Post the DEBUG file if you're not able to see where it fails.
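> >>>>>>
> >>>>>> Put together, the whole procedure would look roughly like this (just a
> >>>>>> sketch, with guest1 as the domain name):
> >>>>>>
> >>>>>>    # 1) on every node
> >>>>>>    xm shutdown guest1
> >>>>>>    # 2) add the two debug lines right after #!/bin/bash in
> >>>>>>    #    /usr/share/cluster/vm.sh
> >>>>>>    # 3) start it under cluster control, then read the trace
> >>>>>>    clusvcadm -e vm:guest1
> >>>>>>    less /tmp/DEBUG
> >>>>>>    # 4) when done, remove the debug lines from vm.sh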
> >>>>>>
> >>>>>> Brem
> >>>>>>
> >>>>>> 2009/9/26 Paras pradhan <pradhanparas at gmail.com>:
> >>>>>>> No, I am not starting it manually, nor using automatic init scripts.
> >>>>>>>
> >>>>>>> I started the vm using: clusvcadm -e vm:guest1
> >>>>>>>
> >>>>>>> I have just stopped it using clusvcadm -s vm:guest1. For a few seconds
> >>>>>>> it says guest1 is started, but after a while I can see guest1 on all
> >>>>>>> three nodes.
> >>>>>>>
> >>>>>>> clustat says:
> >>>>>>>
> >>>>>>>  Service Name        Owner (Last)        State
> >>>>>>>  ------- ----        ----- ------        -----
> >>>>>>>  vm:guest1           (none)              stopped
> >>>>>>>
> >>>>>>> But I can see the VM in xm li.
> >>>>>>>
> >>>>>>> This is what I can see from the log:
> >>>>>>>
> >>>>>>>
> >>>>>>> Sep 25 17:19:01 cvtst1 clurgmgrd[4298]: <notice> start on vm "guest1"
> >>>>>>> returned 1 (generic error)
> >>>>>>> Sep 25 17:19:01 cvtst1 clurgmgrd[4298]: <warning> #68: Failed to start
> >>>>>>> vm:guest1; return value: 1
> >>>>>>> Sep 25 17:19:01 cvtst1 clurgmgrd[4298]: <notice> Stopping service vm:guest1
> >>>>>>> Sep 25 17:19:02 cvtst1 clurgmgrd[4298]: <notice> Service vm:guest1 is
> >>>>>>> recovering
> >>>>>>> Sep 25 17:19:15 cvtst1 clurgmgrd[4298]: <notice> Recovering failed
> >>>>>>> service vm:guest1
> >>>>>>> Sep 25 17:19:16 cvtst1 clurgmgrd[4298]: <notice> start on vm "guest1"
> >>>>>>> returned 1 (generic error)
> >>>>>>> Sep 25 17:19:16 cvtst1 clurgmgrd[4298]: <warning> #68: Failed to start
> >>>>>>> vm:guest1; return value: 1
> >>>>>>> Sep 25 17:19:16 cvtst1 clurgmgrd[4298]: <notice> Stopping service vm:guest1
> >>>>>>> Sep 25 17:19:17 cvtst1 clurgmgrd[4298]: <notice> Service vm:guest1 is
> >>>>>>> recovering
> >>>>>>>
> >>>>>>>
> >>>>>>> Paras.
> >>>>>>>
> >>>>>>> On Fri, Sep 25, 2009 at 5:07 PM, brem belguebli
> >>>>>>> <brem.belguebli at gmail.com> wrote:
> >>>>>>>> Have you started your VM via rgmanager (clusvcadm -e vm:guest1) or
> >>>>>>>> using xm commands outside of cluster control (or maybe through an
> >>>>>>>> automatic init script)?
> >>>>>>>>
> >>>>>>>> When clustered, you should never start services (manually or through
> >>>>>>>> an automatic init script) outside of cluster control.
> >>>>>>>>
> >>>>>>>> The thing to do would be to stop your VM on all the nodes with the
> >>>>>>>> appropriate xm command (I'm not using Xen myself) and try to start it
> >>>>>>>> with clusvcadm.
> >>>>>>>>
> >>>>>>>> Then see if it is started on all nodes (send the clustat output).
> >>>>>>>>
> >>>>>>>>
> >>>>>>>>
> >>>>>>>> 2009/9/25 Paras pradhan <pradhanparas at gmail.com>:
> >>>>>>>>> OK, please see below. My VM is running on all nodes though clustat
> >>>>>>>>> says it is stopped.
> >>>>>>>>>
> >>>>>>>>> --
> >>>>>>>>> [root at cvtst1 ~]# clustat
> >>>>>>>>> Cluster Status for test @ Fri Sep 25 16:52:34 2009
> >>>>>>>>> Member Status: Quorate
> >>>>>>>>>
> >>>>>>>>>  Member Name        ID   Status
> >>>>>>>>>  ------ ----        ---- ------
> >>>>>>>>>  cvtst2             1    Online, rgmanager
> >>>>>>>>>  cvtst1             2    Online, Local, rgmanager
> >>>>>>>>>  cvtst3             3    Online, rgmanager
> >>>>>>>>>
> >>>>>>>>>  Service Name       Owner (Last)        State
> >>>>>>>>>  ------- ----       ----- ------        -----
> >>>>>>>>>  vm:guest1          (none)              stopped
> >>>>>>>>> [root at cvtst1 ~]#
> >>>>>>>>>
> >>>>>>>>>
> >>>>>>>>> ---
> >>>>>>>>> Output of xm li on cvtst1:
> >>>>>>>>>
> >>>>>>>>> --
> >>>>>>>>> [root at cvtst1 ~]# xm li
> >>>>>>>>> Name                                      ID Mem(MiB) VCPUs State   Time(s)
> >>>>>>>>> Domain-0                                   0     3470     2 r-----  28939.4
> >>>>>>>>> guest1                                     7      511     1 -b----   7727.8
> >>>>>>>>>
> >>>>>>>>> Output of xm li on cvtst2:
> >>>>>>>>>
> >>>>>>>>> --
> >>>>>>>>> [root at cvtst2 ~]# xm li
> >>>>>>>>> Name                                      ID Mem(MiB) VCPUs State   Time(s)
> >>>>>>>>> Domain-0                                   0     3470     2 r-----  31558.9
> >>>>>>>>> guest1                                    21      511     1 -b----   7558.2
> >>>>>>>>> ---
> >>>>>>>>>
> >>>>>>>>> Thanks
> >>>>>>>>> Paras.
> >>>>>>>>>
> >>>>>>>>>
> >>>>>>>>>
> >>>>>>>>> On Fri, Sep 25, 2009 at 4:22 PM, brem belguebli
> >>>>>>>>> <brem.belguebli at gmail.com> wrote:
> >>>>>>>>>> It looks like it's not.
> >>>>>>>>>>
> >>>>>>>>>> Can you send the output of clustat from when the VM is running on
> >>>>>>>>>> multiple nodes at the same time?
> >>>>>>>>>>
> >>>>>>>>>> And by the way, another one after having stopped it (clusvcadm -s vm:guest1)?
> >>>>>>>>>>
> >>>>>>>>>>
> >>>>>>>>>>
> >>>>>>>>>> 2009/9/25 Paras pradhan <pradhanparas at gmail.com>:
> >>>>>>>>>>> Is anyone having an issue like mine? The virtual machine service is
> >>>>>>>>>>> not being handled properly by the cluster.
> >>>>>>>>>>>
> >>>>>>>>>>>
> >>>>>>>>>>> Thanks
> >>>>>>>>>>> Paras.
> >>>>>>>>>>>
> >>>>>>>>>>> On Mon, Sep 21, 2009 at 9:55 AM, Paras pradhan <pradhanparas at gmail.com> wrote:
> >>>>>>>>>>>> OK, here is my cluster.conf file:
> >>>>>>>>>>>>
> >>>>>>>>>>>> --
> >>>>>>>>>>>> [root at cvtst1 cluster]# more cluster.conf
> >>>>>>>>>>>> <?xml version="1.0"?>
> >>>>>>>>>>>> <cluster alias="test" config_version="9" name="test">
> >>>>>>>>>>>>        <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
> >>>>>>>>>>>>        <clusternodes>
> >>>>>>>>>>>>                <clusternode name="cvtst2" nodeid="1" votes="1">
> >>>>>>>>>>>>                        <fence/>
> >>>>>>>>>>>>                </clusternode>
> >>>>>>>>>>>>                <clusternode name="cvtst1" nodeid="2" votes="1">
> >>>>>>>>>>>>                        <fence/>
> >>>>>>>>>>>>                </clusternode>
> >>>>>>>>>>>>                <clusternode name="cvtst3" nodeid="3" votes="1">
> >>>>>>>>>>>>                        <fence/>
> >>>>>>>>>>>>                </clusternode>
> >>>>>>>>>>>>        </clusternodes>
> >>>>>>>>>>>>        <cman/>
> >>>>>>>>>>>>        <fencedevices/>
> >>>>>>>>>>>>        <rm>
> >>>>>>>>>>>>                <failoverdomains>
> >>>>>>>>>>>>                        <failoverdomain name="myfd1" nofailback="0" ordered="1" restricted="0">
> >>>>>>>>>>>>                                <failoverdomainnode name="cvtst2" priority="3"/>
> >>>>>>>>>>>>                                <failoverdomainnode name="cvtst1" priority="1"/>
> >>>>>>>>>>>>                                <failoverdomainnode name="cvtst3" priority="2"/>
> >>>>>>>>>>>>                        </failoverdomain>
> >>>>>>>>>>>>                </failoverdomains>
> >>>>>>>>>>>>                <resources/>
> >>>>>>>>>>>>                <vm autostart="1" domain="myfd1" exclusive="0" max_restarts="0"
> >>>>>>>>>>>>                    name="guest1" path="/vms" recovery="restart" restart_expire_time="0"/>
> >>>>>>>>>>>>        </rm>
> >>>>>>>>>>>> </cluster>
> >>>>>>>>>>>> [root at cvtst1 cluster]#
> >>>>>>>>>>>> ------
> >>>>>>>>>>>>
> >>>>>>>>>>>> Thanks!
> >>>>>>>>>>>> Paras.
> >>>>>>>>>>>>
> >>>>>>>>>>>>
> >>>>>>>>>>>> On Sun, Sep 20, 2009 at 9:44 AM, Volker Dormeyer <volker at ixolution.de> wrote:
> >>>>>>>>>>>>> On Fri, Sep 18, 2009 at 05:08:57PM -0500,
> >>>>>>>>>>>>> Paras pradhan <pradhanparas at gmail.com> wrote:
> >>>>>>>>>>>>>> I am using Cluster Suite for HA of Xen virtual machines. Now I am
> >>>>>>>>>>>>>> having another problem: when I start my Xen VM on one node, it also
> >>>>>>>>>>>>>> starts on the other nodes. Which daemon controls this?
> >>>>>>>>>>>>>
> >>>>>>>>>>>>> This is usually done by clurgmgrd (which is part of the rgmanager
> >>>>>>>>>>>>> package). To me, this sounds like a configuration problem. Maybe
> >>>>>>>>>>>>> you can post your cluster.conf?
> >>>>>>>>>>>>>
> >>>>>>>>>>>>> Regards,
> >>>>>>>>>>>>> Volker
> >>>>>>>>>>>>>



