[Linux-cachefs] [PATCH v2 3/8] nfs: Move to using the alternate fallback fscache I/O API

Trond Myklebust trondmy at hammerspace.com
Fri Oct 1 14:29:26 UTC 2021


On Fri, 2021-10-01 at 07:11 -0400, David Wysochanski wrote:
> 
> 
> On Fri, Sep 17, 2021 at 11:05 AM David Howells <dhowells at redhat.com>
> wrote:
> > 
> > Move NFS to using the alternate fallback fscache I/O API instead of
> > the old
> > upstream I/O API as that is about to be deleted.  The alternate API
> > will
> > also be deleted at some point in the future as it's dangerous (as
> > is the
> > old API) and can lead to data corruption if the backing filesystem
> > can
> > insert/remove bridging blocks of zeros into its extent list[1].
> > 
> > The alternate API reads and writes pages synchronously, with the
> > intention
> > of allowing removal of the operation management framework and
> > thence the
> > object management framework from fscache.
> > 
> > The preferred change would be to use the netfs lib, but the new I/O
> > API can
> > be used directly.  It's just that as the cache now needs to track
> > data for
> > itself, caching blocks may exceed page size...
> > 
> > Changes
> > =======
> > ver #2:
> >   - Changed "deprecated" to "fallback" in the new function
> > names[2].
> > 
> > Signed-off-by: David Howells <dhowells at redhat.com>
> > cc: Trond Myklebust <trond.myklebust at hammerspace.com>
> > cc: Anna Schumaker <anna.schumaker at netapp.com>
> > cc: linux-nfs at vger.kernel.org
> > cc: linux-cachefs at redhat.com
> > Link: https://lore.kernel.org/r/YO17ZNOcq+9PajfQ@mit.edu [1]
> > Link:
> > https://lore.kernel.org/r/CAHk-=wiVK+1CyEjW8u71zVPK8msea=qPpznX35gnX+s8sXnJTg@mail.gmail.com/
> > [2]
> > Link:
> > https://lore.kernel.org/r/163162771421.438332.11563297618174948818.stgit@warthog.procyon.org.uk/
> > # rfc
> > ---
> > 
> >  fs/nfs/file.c    |   14 +++--
> >  fs/nfs/fscache.c |  161 +++++++-----------------------------------
> > ------------
> >  fs/nfs/fscache.h |   85 ++++-------------------------
> >  fs/nfs/read.c    |   25 +++-----
> >  fs/nfs/write.c   |    7 ++
> >  5 files changed, 55 insertions(+), 237 deletions(-)
> > 
> > diff --git a/fs/nfs/file.c b/fs/nfs/file.c
> > index aa353fd58240..209dac208477 100644
> > --- a/fs/nfs/file.c
> > +++ b/fs/nfs/file.c
> > @@ -416,7 +416,7 @@ static void nfs_invalidate_page(struct page
> > *page, unsigned int offset,
> >         /* Cancel any unstarted writes on this page */
> >         nfs_wb_page_cancel(page_file_mapping(page)->host, page);
> > 
> > -       nfs_fscache_invalidate_page(page, page->mapping->host);
> > +       wait_on_page_fscache(page);
> >  }
> > 
> >  /*
> > @@ -432,7 +432,12 @@ static int nfs_release_page(struct page *page,
> > gfp_t gfp)
> >         /* If PagePrivate() is set, then the page is not freeable
> > */
> >         if (PagePrivate(page))
> >                 return 0;
> > -       return nfs_fscache_release_page(page, gfp);
> > +       if (PageFsCache(page)) {
> > +               if (!(gfp & __GFP_DIRECT_RECLAIM) || !(gfp &
> > __GFP_FS))
> > +                       return false;
> > +               wait_on_page_fscache(page);
> > +       }

I've found this generally not to be safe. The VM calls ->release_page()
from a variety of contexts, and often fails to report it correctly in
the gfp flags. That's particularly true of the stuff in mm/vmscan.c.
This is why we have the check above that vetos page removal upon
PagePrivate() being set.
> 

-- 
Trond Myklebust
Linux NFS client maintainer, Hammerspace
trond.myklebust at hammerspace.com






More information about the Linux-cachefs mailing list