[Cluster-devel] [GFS2] flush the log if a transaction can't allocate space

Steven Whitehouse swhiteho at redhat.com
Mon Mar 26 08:32:07 UTC 2007


Hi,

Now applied to the -nmw git tree. Thanks,

Steve.

On Fri, 2007-03-23 at 14:51 -0600, Benjamin Marzinski wrote:
> This is a fix for bz #208514. When GFS2 frees up space, the freed blocks
> aren't available for reuse until the resource group is successfully written
> to the on-disk journal. So in rare cases, GFS2 operations fail, reporting
> that the filesystem is out of space, when in reality they are just waiting for
> a log flush. For instance, on a 1 GB filesystem, if I repeatedly write 10 MB
> to a file and then truncate it, after a hundred iterations the write will
> fail with -ENOSPC, even though the filesystem is just 1% full.
> 
> The attached patch calls a log flush in these cases.  I tested this patch
> fairly heavily to check if there were any locking issues that I missed, and
> it seems to work just fine. Also, this patch only does the log flush if
> get_local_rgrp makes a complete loop of resource groups without skipping
> any due to locking issues. The code would be slightly simpler if it just always
> did the log flush after the first failed pass, and you could only ever have
> to go through the loop twice, instead of up to three times. However, I guessed
> that failing to find a rg simply due to locking issues would be common enough
> to skip the log flush in that case, but I'm not certain that this is the right
> way to go. Either way, I don't suppose this code will be hit all that often.
> 
> Signed-off-by: Benjamin E. Marzinski <bmarzins at redhat.com>
> 
> plain text document attachment (flush_if_full.patch)
> diff -urpN gfs2-2.6-nmw-new-clean/fs/gfs2/rgrp.c gfs2-2.6-nmw-patched/fs/gfs2/rgrp.c
> --- gfs2-2.6-nmw-new-clean/fs/gfs2/rgrp.c	2007-03-19 17:23:35.000000000 -0500
> +++ gfs2-2.6-nmw-patched/fs/gfs2/rgrp.c	2007-03-22 12:43:39.000000000 -0500
> @@ -27,6 +27,7 @@
>  #include "trans.h"
>  #include "ops_file.h"
>  #include "util.h"
> +#include "log.h"
>  
>  #define BFITNOENT ((u32)~0)
>  
> @@ -941,9 +942,13 @@ static int get_local_rgrp(struct gfs2_in
>  			rgd = gfs2_rgrpd_get_first(sdp);
>  
>  		if (rgd == begin) {
> -			if (++loops >= 2 || !skipped)
> +			if (++loops >= 3)
>  				return -ENOSPC;
> +			if (!skipped)
> +				loops++;
>  			flags = 0;
> +			if (loops == 2)
> +				gfs2_log_flush(sdp, NULL);
>  		}
>  	}
>  
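To make the new loop accounting in the hunk above easier to follow, here is a small userspace model of the "rgd == begin" branch. It is only a sketch: struct sim_result and simulate_get_local_rgrp() are hypothetical names, the flush is a stand-in for gfs2_log_flush(sdp, NULL), and it assumes that "skipped" is never reset between passes (that part of the function is not shown in the hunk) and that no resource group fits until the log has been flushed.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>

/* Outcome of one simulated allocation attempt (hypothetical type). */
struct sim_result {
	int passes;	/* complete passes over the resource groups */
	int flushes;	/* number of simulated log flushes */
	int error;	/* 0 on success, -ENOSPC on failure */
};

/*
 * Model of the wrap-around branch in the patched get_local_rgrp().
 * skipped_first_pass: a try-lock failed during the first pass.
 * fits_after_flush:   a resource group has room once the log is flushed.
 * Assumptions: "skipped" is never reset between passes, and nothing
 * fits before the flush.
 */
static struct sim_result simulate_get_local_rgrp(bool skipped_first_pass,
						 bool fits_after_flush)
{
	struct sim_result r = { 0, 0, 0 };
	int loops = 0;
	int skipped = skipped_first_pass ? 1 : 0;
	bool flushed = false;

	for (;;) {
		/* One complete pass over all resource groups. */
		r.passes++;
		assert(r.passes <= 3);	/* the patch allows at most three */
		if (flushed && fits_after_flush)
			return r;	/* allocation succeeds */

		/* Same counter logic as the "rgd == begin" branch. */
		if (++loops >= 3) {
			r.error = -ENOSPC;
			return r;
		}
		if (!skipped)
			loops++;
		if (loops == 2) {
			flushed = true;	/* stands in for gfs2_log_flush() */
			r.flushes++;
		}
	}
}
```

Under those assumptions, a clean first pass flushes immediately and gives up after the second pass, while a first pass that skipped resource groups gets one blocking retry before the flush, for three passes in total. In both cases the log is flushed exactly once before -ENOSPC is returned.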

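For anyone wanting to exercise the write-then-truncate case described in the quoted report, the following is one possible way to drive it from userspace. The original post does not say which tool was used, so this is an assumption rather than the reporter's test; write_truncate_loop() is a hypothetical helper, and the path, iteration count, and size are whatever the caller passes in.

```c
#include <assert.h>
#include <errno.h>
#include <fcntl.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

/*
 * Write `bytes` bytes to `path`, truncate the file back to zero, and
 * repeat `iterations` times. Returns 0 if every step succeeded, or the
 * negative errno of the first failure (e.g. -ENOSPC).
 */
static int write_truncate_loop(const char *path, int iterations, size_t bytes)
{
	char *buf;
	int fd, ret = 0;

	assert(bytes > 0);
	buf = malloc(bytes);
	if (!buf)
		return -ENOMEM;
	memset(buf, 'x', bytes);

	fd = open(path, O_WRONLY | O_CREAT | O_TRUNC, 0644);
	if (fd < 0) {
		ret = -errno;
		free(buf);
		return ret;
	}

	for (int i = 0; i < iterations && ret == 0; i++) {
		size_t done = 0;

		/* write() may be partial, so loop until the buffer is out. */
		while (done < bytes) {
			ssize_t n = write(fd, buf + done, bytes - done);
			if (n < 0) {
				if (errno == EINTR)
					continue;
				ret = -errno;
				break;
			}
			done += (size_t)n;
		}
		/* Free the blocks again and rewind for the next round. */
		if (ret == 0 && ftruncate(fd, 0) < 0)
			ret = -errno;
		if (ret == 0 && lseek(fd, 0, SEEK_SET) < 0)
			ret = -errno;
	}

	close(fd);
	free(buf);
	return ret;
}
```

To match the report, call it with a path on a 1 GB GFS2 mount, 100 iterations, and 10 * 1024 * 1024 bytes. On an unpatched kernel the expectation from the report is a -ENOSPC return partway through.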


