On 2024-11-18 18:55:23 -0600, Steve French wrote:
Looks like you dropped the patch:
"smb: No need to wait for work when cleaning up cached directories"
Otherwise for the four remaining patches, looks like the first patch
stayed the same (trivial comment change).
Can you remind me which of these three changed:
smb: Don't leak cfid when reconnect races with open_cached_dir
smb: prevent use-after-free due to open_cached_dir error paths
smb: During unmount, ensure all cached dir instances drop their dentry
All the substantive changes are in the last patch. I should have clarified,
but I just folded the changes from "smb: No need to wait for work when
cleaning up cached directories" into that patch, as well.
On Mon, Nov 18, 2024 at 3:53 PM Paul Aurich <paul@xxxxxxxxxxxxxx> wrote:
v2:
- Added locking in closed_all_cached_dirs()
- Replaced use of the cifsiod_wq with a new workqueue used for dropping cached
dir dentries, and split out the "drop dentry" work from "potential
SMB2_close + cleanup" work so that close_all_cached_dirs() doesn't block on
server traffic, but can ensure all "drop dentry" work has run.
- Repurposed the (essentially unused) cfid->fid_lock to protect cfid->dentry
The SMB client cached directory functionality can either leak a cfid if
open_cached_dir() races with a reconnect, or can have races between the
unmount process and cached dir cleanup/lease breaks that all lead to
a cached_dir instance not dropping its dentry ref in close_all_cached_dirs().
These all manifest as a pair of BUGs when unmounting:
[18645.013550] BUG: Dentry ffff888140590ba0{i=1000000000080,n=/} still in use (2) [unmount of cifs cifs]
[18645.789274] VFS: Busy inodes after unmount of cifs (cifs)
These issues started with the lease directory cache handling introduced in
commit ebe98f1447bb ("cifs: enable caching of directories for which a lease is
held"), and go away if I mount with 'nohandlecache'.
I'm able to reproduce the "Dentry still in use" errors by connecting to an
actively-used SMB share (the server organically generates lease breaks) and
leaving these running for 'a while':
- while true; do cd ~; sleep 1; for i in {1..3}; do cd /mnt/test/subdir; echo $PWD; sleep 1; cd ..; echo $PWD; sleep 1; done; echo ...; done
- while true; do iptables -F OUTPUT; mount -t cifs -a; for _ in {0..2}; do ls /mnt/test/subdir/ | wc -l; done; iptables -I OUTPUT -p tcp --dport 445 -j DROP; sleep 10; echo "unmounting"; umount -l -t cifs -a; echo "done unmounting"; sleep 20; echo "recovering"; iptables -F OUTPUT; sleep 10; done
('a while' is anywhere from 10 minutes to overnight. Also, it's not the
cleanest reproducer, but I stopped iterating once I had something that was
even remotely reliable for me...)
This series attempts to fix these, as well as a use-after-free that could
occur because open_cached_dir() explicitly frees the cached_fid, rather than
relying on reference counting.
Paul Aurich (4):
smb: cached directories can be more than root file handle
smb: Don't leak cfid when reconnect races with open_cached_dir
smb: prevent use-after-free due to open_cached_dir error paths
smb: During unmount, ensure all cached dir instances drop their dentry
fs/smb/client/cached_dir.c | 228 +++++++++++++++++++++++++------------
fs/smb/client/cached_dir.h | 6 +-
fs/smb/client/cifsfs.c | 14 ++-
fs/smb/client/cifsglob.h | 3 +-
fs/smb/client/inode.c | 3 -
fs/smb/client/trace.h | 3 +
6 files changed, 179 insertions(+), 78 deletions(-)
--
2.45.2
--
Thanks,
Steve