On Thu, 2020-07-02 at 14:41 -0500, Benjamin Marzinski wrote: > On Thu, Jul 02, 2020 at 04:45:21PM +0000, Martin Wilck wrote: > > > > What's wrong with deferred remove? After all, the user explicitly > > asked > > for a flush. As long as some other process has the device open, it > > won't be removed. That's why I like the O_EXCL idea, which will > > allow > > small programs like blkid to access the device, but will cause all > > attempts to mount or add stacked devices to fail until the device > > is > > actually removed. I see no reason no to do this, as it's a race > > anyway > > if some other process opens the device when we're supposed to flush > > it > > and the opencount already dropped to 0. By using O_EXCL, we just > > increase multipathd's chances to win the race and do what the user > > asked for. > > I'm not actually a fan of deferred remove in general. It leaves the > device in this weird state were it is there but no longer openable. Ok, I didn't expect that ;-) AFAICS, devices in DEFERRED REMOVE state are actually still openable. I just tested this once more on a 5.3 kernel. As long as the device is opened by some process and thus not removed, it can be opened by other processes, and is not deleted until the last opener closes it. It's even possible to create new device mapper layers like kpartx partitions on top of a DM device in DEFERRED REMOVE state. > I > wish I had originally dealt with deferred removes by having > multipathd > occasionally try to flush devices with no paths, or possibly listen > for > notifications that the device has been closed, and flush then. > > My specific objections here are that not all things that open a > device > for longer than an instant do so with O_EXCL. So it's very possible > that you run "multipath -F", it returns having removed a number of > unused devices. But some of the devices it didn't remove were opened > without O_EXCL, and they will stick around for a while, and then > suddenly disappear. Even if they don't say around for that long, > this > is a can still effect scripts or other programs that are expecting to > check the device state immediately after calling multipath -F, and > having it not change a second or so later. So far multipath -f/-F > will > not return until it has removed all the removeable devices (and > waited > for them to be removed from udev). I think it should stay this way. I see. That's a valid point. IMHO it'd be better if the kernel didn't allow any new access to devices in "deferred remove" state, and possibly sent a REMOVE uevent and hide the device immediately after the deferred remove ioctl. That would also be closer to how "lazy umount" (umount -l) behaves. But I'm certainly overlooking some subtle semantic issues. @Mike, Zdenek: perhaps you can explain why "deferred remove" behaves like this? Martin -- Dr. Martin Wilck <mwilck@xxxxxxxx>, Tel. +49 (0)911 74053 2107 SUSE Software Solutions Germany GmbH HRB 36809, AG Nürnberg GF: Felix Imendörffer -- dm-devel mailing list dm-devel@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/dm-devel