[linux-lvm] mirrored LV + cmirror problem

Jonathan Brassow jbrassow at redhat.com
Fri Feb 15 15:41:43 UTC 2008


Are all the packages rhel4.6 as well, or have you compiled pkgs  
yourself?

What was the load you had on the system?

The messages I see from dm-cmirror suggest that it is properly  
shutting down in the face of the failure... However, before it has  
finished, we can see "Failed to remove faulty devices in vgtest- 
lvtest".  This suggests to me that clvmd is not waiting long enough  
for the shutdown to complete, but I only see 3 seconds of the log.   
When was the device failure initiated?

  brassow


On Feb 15, 2008, at 5:36 AM, Lajkó Attila wrote:

> Hello,
>
>
> I have a problem with clvmd an cmirror:
>
> We have a two nodes cluster (RHEL4.6). I created a mirrored LV on a  
> clustered volume group on 2 iscsi LUNS (VTrak M200i).
> When i disconnect one of the LUNs - simulating a storage problem -   
> the mirrored LV doesn't go to linear mode, the LVM commands (lvs,  
> lvconvert, etc.) get stuck and the GFS file system is not accessible  
> (on both nodes).
>
> What is see in /var/log/messages:
>
> Feb 15 12:29:26 el42 kernel: dm-cmirror: server_complete_resync_work  
> - Setting recovery_halted = 1
> Feb 15 12:29:26 el42 kernel: dm-cmirror: Log flush failure: -5 -EIO
> Feb 15 12:29:26 el42 last message repeated 4 times
> Feb 15 12:29:26 el42 kernel: dm-cmirror: Log flush failure: -5 -EIO
> Feb 15 12:29:26 el42 kernel: dm-cmirror: Recovery halted due to  
> error on ItlWCmkP
> Feb 15 12:29:26 el42 lvm[4929]: WARNING: dev_open(/dev/mapper/ 
> mirrp3) called while suspended
> Feb 15 12:29:26 el42 kernel: dm-cmirror: LOG INFO:
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   uuid: LVM- 
> zEHPYfjtLCL7yqQhsG2kcPzthyLbyBPd7xlok1gd7NHgXR3l2XaVQWEVItlWCmkP
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   uuid_ref    : 1
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   log type    : disk
> Feb 15 12:29:26 el42 kernel: dm-cmirror:  ?region_count: 320
> Feb 15 12:29:26 el42 kernel: dm-cmirror:  ?sync_count  : 320
> Feb 15 12:29:26 el42 kernel: dm-cmirror:  ?sync_search : 320
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   in_sync     : YES
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   suspended   : NO
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   recovery_halted : YES
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   server_id   : 2
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   server_valid: YES
> Feb 15 12:29:26 el42 kernel: dm-cmirror: cluster_presuspend:  
> recovery halted on ItlWCmkP(1)
> Feb 15 12:29:26 el42 kernel: dm-cmirror: cluster_postsuspend
> Feb 15 12:29:26 el42 kernel: dm-cmirror: Telling everyone I'm  
> suspending (ItlWCmkP)
> Feb 15 12:29:26 el42 kernel: dm-cmirror: LRT_MASTER_LEAVING(13):  
> (ItlWCmkP)
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   starter     : 2
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   co-ordinator: 0
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   node_count  : 0
> Feb 15 12:29:26 el42 kernel: dm-cmirror: LRT_MASTER_LEAVING(13):  
> (ItlWCmkP)
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   starter     : 2
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   co-ordinator: 0
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   node_count  : 2
> Feb 15 12:29:26 el42 kernel: dm-cmirror: LRT_ELECTION(10): (ItlWCmkP)
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   starter     : 2
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   co-ordinator: 57005
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   node_count  : 0
> Feb 15 12:29:26 el42 kernel: dm-cmirror: LRT_ELECTION(10): (ItlWCmkP)
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   starter     : 2
> Feb 15 12:29:26 el42 lvm[4929]: WARNING: dev_open(/etc/lvm/lvm.conf)  
> called while suspended
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   co-ordinator: 1
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   node_count  : 2
> Feb 15 12:29:26 el42 kernel: dm-cmirror: LRT_SELECTION(11): (ItlWCmkP)
> Feb 15 12:29:26 el42 kernel: dm-cmirror:   starter     : 2
> Feb 15 12:29:27 el42 kernel: dm-cmirror:   co-ordinator: 1
> Feb 15 12:29:27 el42 kernel: dm-cmirror:   node_count  : 2
> Feb 15 12:29:27 el42 kernel: dm-cmirror: LRT_MASTER_ASSIGN(12):  
> (ItlWCmkP)
> Feb 15 12:29:27 el42 kernel: dm-cmirror:   starter     : 2
> Feb 15 12:29:27 el42 kernel: dm-cmirror:   co-ordinator: 1
> Feb 15 12:29:27 el42 lvm[4929]: Failed to remove faulty devices in  
> vgtest-lvtest
> Feb 15 12:29:27 el42 kernel: dm-cmirror:   node_count  : 1
> Feb 15 12:29:27 el42 kernel: dm-cmirror: Suspending now (ItlWCmkP)
> Feb 15 12:29:28 el42 lvm[4929]: No longer monitoring mirror device  
> vgtest-lvtest for events
>
> Regards,
> Attila Lajkó
>
> _______________________________________________
> linux-lvm mailing list
> linux-lvm at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-lvm
> read the LVM HOW-TO at http://tldp.org/HOWTO/LVM-HOWTO/





More information about the linux-lvm mailing list