[linux-lvm] mirrored LV + cmirror problem
Jonathan Brassow
jbrassow at redhat.com
Fri Feb 15 15:41:43 UTC 2008
Are all the packages rhel4.6 as well, or have you compiled pkgs
yourself?
What was the load you had on the system?
The messages I see from dm-cmirror suggest that it is properly
shutting down in the face of the failure... However, before it has
finished, we can see "Failed to remove faulty devices in vgtest-
lvtest". This suggests to me that clvmd is not waiting long enough
for the shutdown to complete, but I only see 3 seconds of the log.
When was the device failure initiated?
brassow
On Feb 15, 2008, at 5:36 AM, Lajkó Attila wrote:
> Hello,
>
>
> I have a problem with clvmd an cmirror:
>
> We have a two nodes cluster (RHEL4.6). I created a mirrored LV on a
> clustered volume group on 2 iscsi LUNS (VTrak M200i).
> When i disconnect one of the LUNs - simulating a storage problem -
> the mirrored LV doesn't go to linear mode, the LVM commands (lvs,
> lvconvert, etc.) get stuck and the GFS file system is not accessible
> (on both nodes).
>
> What is see in /var/log/messages:
>
> Feb 15 12:29:26 el42 kernel: dm-cmirror: server_complete_resync_work
> - Setting recovery_halted = 1
> Feb 15 12:29:26 el42 kernel: dm-cmirror: Log flush failure: -5 -EIO
> Feb 15 12:29:26 el42 last message repeated 4 times
> Feb 15 12:29:26 el42 kernel: dm-cmirror: Log flush failure: -5 -EIO
> Feb 15 12:29:26 el42 kernel: dm-cmirror: Recovery halted due to
> error on ItlWCmkP
> Feb 15 12:29:26 el42 lvm[4929]: WARNING: dev_open(/dev/mapper/
> mirrp3) called while suspended
> Feb 15 12:29:26 el42 kernel: dm-cmirror: LOG INFO:
> Feb 15 12:29:26 el42 kernel: dm-cmirror: uuid: LVM-
> zEHPYfjtLCL7yqQhsG2kcPzthyLbyBPd7xlok1gd7NHgXR3l2XaVQWEVItlWCmkP
> Feb 15 12:29:26 el42 kernel: dm-cmirror: uuid_ref : 1
> Feb 15 12:29:26 el42 kernel: dm-cmirror: log type : disk
> Feb 15 12:29:26 el42 kernel: dm-cmirror: ?region_count: 320
> Feb 15 12:29:26 el42 kernel: dm-cmirror: ?sync_count : 320
> Feb 15 12:29:26 el42 kernel: dm-cmirror: ?sync_search : 320
> Feb 15 12:29:26 el42 kernel: dm-cmirror: in_sync : YES
> Feb 15 12:29:26 el42 kernel: dm-cmirror: suspended : NO
> Feb 15 12:29:26 el42 kernel: dm-cmirror: recovery_halted : YES
> Feb 15 12:29:26 el42 kernel: dm-cmirror: server_id : 2
> Feb 15 12:29:26 el42 kernel: dm-cmirror: server_valid: YES
> Feb 15 12:29:26 el42 kernel: dm-cmirror: cluster_presuspend:
> recovery halted on ItlWCmkP(1)
> Feb 15 12:29:26 el42 kernel: dm-cmirror: cluster_postsuspend
> Feb 15 12:29:26 el42 kernel: dm-cmirror: Telling everyone I'm
> suspending (ItlWCmkP)
> Feb 15 12:29:26 el42 kernel: dm-cmirror: LRT_MASTER_LEAVING(13):
> (ItlWCmkP)
> Feb 15 12:29:26 el42 kernel: dm-cmirror: starter : 2
> Feb 15 12:29:26 el42 kernel: dm-cmirror: co-ordinator: 0
> Feb 15 12:29:26 el42 kernel: dm-cmirror: node_count : 0
> Feb 15 12:29:26 el42 kernel: dm-cmirror: LRT_MASTER_LEAVING(13):
> (ItlWCmkP)
> Feb 15 12:29:26 el42 kernel: dm-cmirror: starter : 2
> Feb 15 12:29:26 el42 kernel: dm-cmirror: co-ordinator: 0
> Feb 15 12:29:26 el42 kernel: dm-cmirror: node_count : 2
> Feb 15 12:29:26 el42 kernel: dm-cmirror: LRT_ELECTION(10): (ItlWCmkP)
> Feb 15 12:29:26 el42 kernel: dm-cmirror: starter : 2
> Feb 15 12:29:26 el42 kernel: dm-cmirror: co-ordinator: 57005
> Feb 15 12:29:26 el42 kernel: dm-cmirror: node_count : 0
> Feb 15 12:29:26 el42 kernel: dm-cmirror: LRT_ELECTION(10): (ItlWCmkP)
> Feb 15 12:29:26 el42 kernel: dm-cmirror: starter : 2
> Feb 15 12:29:26 el42 lvm[4929]: WARNING: dev_open(/etc/lvm/lvm.conf)
> called while suspended
> Feb 15 12:29:26 el42 kernel: dm-cmirror: co-ordinator: 1
> Feb 15 12:29:26 el42 kernel: dm-cmirror: node_count : 2
> Feb 15 12:29:26 el42 kernel: dm-cmirror: LRT_SELECTION(11): (ItlWCmkP)
> Feb 15 12:29:26 el42 kernel: dm-cmirror: starter : 2
> Feb 15 12:29:27 el42 kernel: dm-cmirror: co-ordinator: 1
> Feb 15 12:29:27 el42 kernel: dm-cmirror: node_count : 2
> Feb 15 12:29:27 el42 kernel: dm-cmirror: LRT_MASTER_ASSIGN(12):
> (ItlWCmkP)
> Feb 15 12:29:27 el42 kernel: dm-cmirror: starter : 2
> Feb 15 12:29:27 el42 kernel: dm-cmirror: co-ordinator: 1
> Feb 15 12:29:27 el42 lvm[4929]: Failed to remove faulty devices in
> vgtest-lvtest
> Feb 15 12:29:27 el42 kernel: dm-cmirror: node_count : 1
> Feb 15 12:29:27 el42 kernel: dm-cmirror: Suspending now (ItlWCmkP)
> Feb 15 12:29:28 el42 lvm[4929]: No longer monitoring mirror device
> vgtest-lvtest for events
>
> Regards,
> Attila Lajkó
>
> _______________________________________________
> linux-lvm mailing list
> linux-lvm at redhat.com
> https://www.redhat.com/mailman/listinfo/linux-lvm
> read the LVM HOW-TO at http://tldp.org/HOWTO/LVM-HOWTO/
More information about the linux-lvm
mailing list