[Linux-cluster] Problem deleting running VM from rgmanager

Digimer lists at alteeve.ca
Sat Jul 27 00:58:52 UTC 2013


I rebuilt the VM and deleted it a second time and it worked properly... 
I hate bugs like that.

digimer

On 26/07/13 20:40, Digimer wrote:
> Hi all,
>
>    I've got a problem where I deleted a running VM from the cluster using;
>
> ccs -h localhost --activate --sync --password "secret" --rmvm vm01-win7
>
>    This kind of worked, in that the VM was removed from cluster.conf,
> but 'clustat' still shows it. The logs from the call are:
>
> =====
> Jul 26 20:19:01 an-c05n01 ricci[18020]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:01 an-c05n01 ricci[18066]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:01 an-c05n01 ricci[18069]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:01 an-c05n01 ricci[18071]: Executing
> '/usr/libexec/ricci/ricci-worker -f /var/lib/ricci/queue/1428781577'
> Jul 26 20:19:01 an-c05n01 ricci[18075]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:01 an-c05n01 ricci[18077]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:01 an-c05n01 ricci[18080]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:01 an-c05n01 ricci[18082]: Executing
> '/usr/libexec/ricci/ricci-worker -f /var/lib/ricci/queue/881799278'
> Jul 26 20:19:03 an-c05n01 ricci[18088]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:03 an-c05n01 ricci[18090]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:03 an-c05n01 ricci[18093]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:03 an-c05n01 ricci[18095]: Executing
> '/usr/libexec/ricci/ricci-worker -f /var/lib/ricci/queue/439919971'
> Jul 26 20:19:03 an-c05n01 modcluster: Updating cluster.conf
> Jul 26 20:19:03 an-c05n01 corosync[3479]:   [QUORUM] Members[2]: 1 2
> Jul 26 20:19:03 an-c05n01 ricci[18140]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:03 an-c05n01 ricci[18170]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:03 an-c05n01 rgmanager[3710]: Reconfiguring
> Jul 26 20:19:03 an-c05n01 ricci[18194]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:03 an-c05n01 ricci[18234]: Executing
> '/usr/libexec/ricci/ricci-worker -f /var/lib/ricci/queue/446527166'
> Jul 26 20:19:04 an-c05n01 ricci[18457]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:04 an-c05n01 ricci[18496]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:04 an-c05n01 ricci[18528]: Executing '/usr/bin/virsh nodeinfo'
> Jul 26 20:19:04 an-c05n01 ricci[18560]: Executing
> '/usr/libexec/ricci/ricci-worker -f /var/lib/ricci/queue/1207456461'
> Jul 26 20:19:04 an-c05n01 modcluster: Updating cluster.conf
> Jul 26 20:19:04 an-c05n01 corosync[3479]:   [QUORUM] Members[2]: 1 2
> Jul 26 20:19:05 an-c05n01 kernel: vbr2: port 4(vnet2) entering disabled
> state
> Jul 26 20:19:05 an-c05n01 kernel: device vnet2 left promiscuous mode
> Jul 26 20:19:05 an-c05n01 kernel: vbr2: port 4(vnet2) entering disabled
> state
> Jul 26 20:19:06 an-c05n01 rgmanager[3710]: vm:vm01-win7 removed from the
> config, but I am not stopping it.
> Jul 26 20:19:06 an-c05n01 rgmanager[3710]: Reconfiguring
> Jul 26 20:19:07 an-c05n01 ntpd[2794]: Deleting interface #16 vnet2,
> fe80::fc54:ff:fea5:37ea#123, interface stats: received=0, sent=0,
> dropped=0, active_time=135 secs
> =====
>
> However, clustat still shows;
>
> =====
> Cluster Status for an-cluster-05 @ Fri Jul 26 20:37:17 2013
> Member Status: Quorate
>
>   Member Name                             ID   Status
>   ------ ----                             ---- ------
>   an-c05n01.alteeve.ca                        1 Online, rgmanager
>   an-c05n02.alteeve.ca                        2 Online, Local, rgmanager
>
>   Service Name                   Owner (Last)                   State
>   ------- ----                   ----- ------                   -----
>   service:storage_n01            an-c05n01.alteeve.ca           started
>   service:storage_n02            an-c05n02.alteeve.ca           started
>   vm:vm01-win7                   an-c05n02.alteeve.ca           started
>   vm:vm02-rhel6                  an-c05n02.alteeve.ca           started
>   vm:vm03-debian7                an-c05n01.alteeve.ca           started
>   vm:vm04-solaris11              an-c05n02.alteeve.ca           started
>   vm:vm05-win2008r2              an-c05n02.alteeve.ca           started
>   vm:vm06-win8                   an-c05n01.alteeve.ca           started
>   vm:vm07-win2012                an-c05n02.alteeve.ca           started
>   vm:vm08-freebsd9               an-c05n01.alteeve.ca           started
>   vm:vm09-suse11                 an-c05n01.alteeve.ca           started
> =====
>
> Trying to stop it produces;
>
> =====
> an-c05n02:~# clusvcadm -d vm:vm01-win7
> Local machine disabling vm:vm01-win7...Failure
> =====
>
> CentOS 6.4, fully up to date; rgmanager-3.0.12.1-17.el6.x86_64
>


-- 
Digimer
Papers and Projects: https://alteeve.ca/w/
What if the cure for cancer is trapped in the mind of a person without 
access to education?




More information about the Linux-cluster mailing list