[dm-devel] Re: [RFC][PATCH] dm-mirror: fix data corruption

Mikulas Patocka mpatocka at redhat.com
Mon Aug 31 21:39:53 UTC 2009



On Sun, 30 Aug 2009, malahal at us.ibm.com wrote:

> Mikulas Patocka [mpatocka at redhat.com] wrote:
> > 
> > > How do bios queued in ms->failures are processed later? It seems that
> > > the bios stay in ms->failures forever, and the upper layer can not
> > > receive "success" for those bios. Don't we need a mechanism to block/unblock
> > > write bios to fix this issue?
> > 
> > They are resubmitted with DM_ENDIO_REQUEUE on noflush suspend. My patch 
> > has a bug that they aren't --- but I will provide a better patch, also 
> > without this periodic polling of ms->failures queue.
> 
> Trying to verify this patch. Mikulas, did you provide a better patch
> yet? Does this patch work at all?
> 
> I would like to verify if this patch works with devices that fail
> temporarily. I will plan on using dm-flakey devices for testing purposes.
> 
> Thanks, Malahal.

Hi

I uploaded patches for this bug at: 
http://people.redhat.com/mpatocka/patches/kernel/mirror-race/

But note! They were never tried. When I wanted to try them, I found out 
that dmeventd is totally nonworking in upstream code (it doesn't pass 
"handle_errors" argument and has some crashes and signal errors), so I 
didn't test them with dmeventd. Dmeventd must be fixed first, then we can 
work on this bug.

Mikulas




More information about the dm-devel mailing list