[Virtio-fs] [RFC PATCH 0/7] Inotify support in FUSE and virtiofs

Amir Goldstein amir73il at gmail.com
Wed Nov 17 06:40:57 UTC 2021


On Wed, Nov 17, 2021 at 12:12 AM Ioannis Angelakopoulos
<iangelak at redhat.com> wrote:
>
>
>
> On Tue, Nov 16, 2021 at 12:10 AM Stef Bon <stefbon at gmail.com> wrote:
>>
>> Hi Ioannis,
>>
>> I see that you have been working on making fsnotify work on virtiofs.
>> Earlier you contacted me since I've written this:
>>
>> https://github.com/libfuse/libfuse/wiki/Fsnotify-and-FUSE
>>
>> and send you my patches on 23 june.
>> I want to mention first that I would have appreciated it if you would
>> have reacted to me after I've sent you my patches. I did not get any
>> reaction from you. Maybe these patches (which differ from what you
>> propose now, but there is also a lot in common) have been an
>> inspiration for you.
>>
>> Second, what I've written about is that with network filesystems (eg a
>> backend shared with other systems) fsnotify support in FUSE has some
>> drawbacks.
>> In a network environment, where a network fs is part of making people
>> collaborate, it's very useful to have information on who did what on
>> which host, and also when. Simply a message "a file has been created
>> in the folder you watch" is not enough. For example, if you are part
>> of a team, and assigned to your team is a directory on a server where
>> you can work on some shared documents. Now in this example there is a
>> planning, and some documents have to be written. In that case you want
>> to be informed that someone in your team has started a document (by
>> creating it) by the system.
>>
> I agree that the out of band approach you propose is actually more powerful, since it can
> provide the client with more information than remote fsnotify. However, in the virtiofs setup
> your approach might not be as efficient.
>
> Specifically, the information on who did what might not make sense to the guest within QEMU, since all the
> virtiofs filesystem operations are handled by viritofsd on the host and the guest does not know about the
> server or any other guests. Vivek, correct me here if I am wrong.
>
> Thus, for now at least, it might be sufficient for the guest to know that just a remote event
> occurred.
>
>> This "extended" information will never get through fsnotify.
>>
>> Other info useful to you as team member:
>>
>> -  you have become member of another team: sbon at anotherteam.example.org
>> -  diskspace and/or quota shortage reported by networksystem
>> -  new teammember, teammember left
>> -  your "rights" or role in the network/team have been changed (for
>> example from reader to reader and writer to some documents)
>>
>> What I want to say is that in a network where lots of people work
>> together in teams/projects, (and I want Linux to play a role there, as
>> desktop/workstation) communication is very important, and all these
>> messages should be supported by the system. My idea is the support of
>> watching fs events with FUSE filesystems should go through userspace,
>> and not via the kernel (cause fs events are part of your setup in the
>> network, together with all other tools to make people collaborate like
>> chat/call/text, and because mentioned above extended info about the
>> who on what host etc is not supported by fsnotify).
>> There should be a fs event watcher which takes care of all watches on
>> behalf of applications during a session, similar to gamin and FAM once
>> did (not used anymore?).
>> When receiving a request from one of the applications this fsevent
>> watcher will use inotify and/or fanotify for local fs's only. With a
>> FUSE fs, it should contact (via a socket) this fs that a watch has
>> been set on an inode with a certain mask.
>> If the FUSE fs does not support this, fallback on normal inotify/fanotify.
>> This way extended info is possible.
>>
>> Is this extended information also useful for virtiofs?
>>
> Also based on your explanation, your out of band approach is specific to FUSE filesystems.
> Granted, with your approach there is less complexity in the kernel and more flexibility since
> the event notification occurs solely in user-space.
> However, during the discussion with Amir and Jan about potential routes we could take to support the remote
> fanotify/inotify/fsnotify one important concern was that the API should be able to support other
> network/remote filesystems if needed and not only FUSE filesystems.
> It seems that your approach would require a lot of work (correct me if I am wrong) to be adopted
> by other network filesystems.
>
> Finally, user-space applications should also be aware of your new API, which will probably result in a non-negligible effort by app developers to adopt it or change their existing apps. The remote inotify/fsnotify ( the current implementation) even though it has many limitations, relies on the existing API and should require less modifications in user space apps. That is why we chose the remote inotify/fsnotify route.
>

The way you depict the options seems like either applications are not
aware of the UAPI
changes or they need to be modified to adapt to the changes.

I actually think that the much better approach would be to deal with
most of the UAPI
complexity in a library, so applications may need to be rebuilt or
adapted to use a new
library, but going forward, the library would abstract most of the
complexity from the
applications.

The holy grail would be a portable library, such as this go library [1].
It is quite hard to design an API that would abstract all the
different capabilities
on Linux-inotify/Linux-fanotify/MacOS-fsevents/Win-USNJournal and more.

The second best would be a library to abstract the ever growing complexity
of Linux inotify/fanotify UAPI from applications.
I had already made the first step with adapting libinotifytools to fanotify [2].
We could continue down that path or start creating/improving a
different library.

The point is that abstracting different capabilities of remote fs notifications
(i.e. cifs, virtiofs, generic FUSE) is going to be challenging, so starting the
design from a user library API and deriving the needed pieces from the
kernel UAPI is the right way to go IMO.

The library approach will have the advantage that some remote fs capabilities
(e.g. for cifs) will be available for old kernels as well and in
theory, the library
could also use some standardized OOB notifications channel to get changed
made on the host from VM guest tools as Stef proposed.

> That said, I do not see a reason why both implementations cannot co-exist
> and have the user-space applications choose which approach they want.
>

True, an OOB channel and kernel generic remote fs support can both exist,
but it would be best if the application was not aware of either.
The library would pick the facility based on the requested functionality,
availability of the facilities in the filesystem and sysadmin/user policy.

Thanks,
Amir.

[1] https://github.com/fsnotify/fsnotify
[2] https://github.com/inotify-tools/inotify-tools/pull/134





More information about the Virtio-fs mailing list