On Mon, 2019-05-13 at 11:04 -0500, Bruce Dubbs wrote: > On 5/13/19 4:08 AM, Karel Zak wrote: > > On Mon, May 13, 2019 at 01:33:22PM +0800, Ian Kent wrote: > > > Some of you may know that David Howells is working on getting > > > a new system call fsinfo() merged into the Linux kernel. > > > > > > This system call will provide access to information about mounted > > > mounts without having to read and parse file based mount tables > > > such as /proc/self/mountinfo, etc. > > > > > > Essentially all mounts have an id and one can get the id of a > > > mount by it's path and then use that to obtain a large range > > > of information about it. > > > > > > The information can include a list of mounts within the mount > > > which can be used to traverse a tree of mounts or the id used > > > to lookup information on an individual mount without the need > > > to traverse a file based mount table. > > > > > > I'd like to update libmount to use the fsinfo() system call > > > because I believe using file based methods to get mount > > > information introduces significant overhead that can be > > > avoided. > > > > > > Because the fsinfo() system call provides a very different way > > > to get information > > > about mounts, and having looked at the current > > > code, I'm wondering what will be > > > the best way to go about it. > > > > > > Any suggestions about the way this could best be done, given > > > that the existing methods must still work, will be very much > > > appreciated. > > > > It would be nice to start with some low-level things to read info > > about a target (mountpoint) into libmnt_fs, something like: > > > > int mnt_fsinfo_fill_fs(chat char *tgt, struct libmnt_fs *fs) > > > > and fill create a complete mount table by fsinfo(): > > > > int mnt_fsinfo_fill_table(struct libmnt_table *tab) > > > > ... probably add fsinfo.c to code to keep it all together. > > > > So, after then we can use these functions in our code. > > > > The nice place where is ugly overhead with the current mountinfo is > > context_umount.c code, see lookup_umount_fs() and > > mnt_context_find_umount_fs(). In this code we have mountpoint and we > > need more information about it (due to redirection to umount.<type> > > helpers, userspace mount options, etc.). It sounds like ideal to use > > mnt_fsinfo_fill_fs() if possible. > > > > The most visible change will be to use mnt_fsinfo_fill_table() with in > > mnt_table_parse_file() if the file name is "/proc/self/mountinfo". > > This will be huge improvement as we use this function in systemd on > > each mount table change... > > > > The question is how easily will be to replace mountinfo with fsinfo(). > > I may be stating the obvious, but this proposal does not appear to > simplify anything because it is kernel version dependent. From what I > understand, the new and old methods will both need to be supported for > quite some time. Yes, it won't really simplify the code base overall because of the need to support kernel versions that may not have the system call. But what I didn't talk about is there's a real problem handling large mount tables with the current method of reading the proc file system mount tables and these tables can get very large at times. And this is also about processes being flooded with notifications due to heavy mount/umount activity and then re-reading the entire mount table (or at least half on average) on every one because there's no other way to locate the mount they are looking for. I think the situation with util-linux isn't so bad in this respect but I still believe keeping the in-memory mount table up to date should see improvement. And libmount is used by quite a number of problematic applications so improving it will translate to improvement in those applications too. Ultimately I'll need to look at other applications (perhaps persuade them to use libmount). There's also the large number of notifications itself but I'm still not sure how to improve that. There will be a notifications implementation to accompany the recent mount-API/fsinfo changes as well so hopefully we'll be able to improve the situation with the implementation of that. > > I'm not suggesting that the changes not be made, but I suggest going slow. The changes will be fairly difficult because the util-linux mount handling is quite complex. And the fact that the fsinfo() patch series hasn't been merged yet means this isn't going to be done quickly (at least not "rushed" anyway). But it does need to be done ahead of the merge so we can work out what's missing in the fsinfo() implementation and try to have things added/fixed prior to the upstream merge. Ian