On Wed, Jul 22, 2015 at 07:58:24AM +1000, NeilBrown wrote:
> On Thu, 16 Jul 2015 16:51:48 -0400 "J. Bruce Fields"
> <bfields@xxxxxxxxxxxx> wrote:
>
> > On Thu, Jul 16, 2015 at 09:40:46AM +1000, NeilBrown wrote:
> > > On Wed, 15 Jul 2015 17:07:56 -0400 "J. Bruce Fields"
> > > <bfields@xxxxxxxxxxxx> wrote:
> > >
> > > > > Wow.... this is turning out to be a lot more complex than I imagined at
> > > > > first (isn't that always the way!).
> > > > >
> > > > > There is a lot of good stuff here, but I think we can probably make it
> > > > > simpler and so even better.
> > > >
> > > > I'm still not convinced that the expkey
> >
> > (Sorry, I meant an entry in the export cache, not the expkey cache.)
>
> They are very closely related.  An incoming filehandle has its 'fs'
> identifier mapped through the expkey cache to get an "export point".
> Then the "export point" plus "client identifier" are mapped through the
> export cache to get export options.
>
> So the "export point" thing in the expkey cache should really be the
> same as the thing in the export cache.
>
> >
> > > > should have a dentry reference
> > > > in the key in the first place.  Fixing that would fix the immediate
> > > > problem.
> > >
> > > ???  If we removed the dentry, how would you export a subdirectory of a
> > > filesystem?
> >
> > I've been wondering if the export cache should really be keyed on the
> > string representation of the path instead of the struct path.  That's
> > what the userspace interface uses.
>
> That makes sense for handling updates to the cache from user-space.
> I'm not sure it is so good for handling mapping from file-handle to
> export flags.
>
> >
> > There's a related bug: if there are mountpoints at both /a and /a/b,
>
> Just to make sure I'm clear on what you are saying, this would be
> achieved by, e.g.
>
>    mkdir -p /a/b
>    mount /dev/sdX /a/b
>    mount /dev/sdY /a
>
> so the mount of /dev/sdX is completely unreachable from '/'.
>
> Correct?

Actually my reproducer used something like:

    server# mount --bind /a
    server# mount --bind /a/b

then a v3 mount of / and "ls /mnt/a/b" from the client.

> > then thanks to the lookup-underneath-mountpoint behavior of the server,
> > an NFSv3 client looking that up will end up going underneath the first
> > mountpoint and doing an export cache lookup for
> >
> >    (vfsmnt, dentry) == (/, /a/b)
>
> Maybe this step is where the bug is.  "/a/b" is not really a valid name.
> Should d_path() check for paths that have been mounted over, and attach
> "(unreachable)" to the end of the path, similar to "(deleted)".
>
> sys_getcwd() can give you "(unreachable)" when in a filesystem that has
> e.g. been lazy-unmounted.  Maybe we want something similar for a
> mounted-over filesystem???
>
> >
> > When the server gets a response that starts with "/a/b", it interprets
> > that as applying to the path (/a, /a/b), so doesn't recognize it as
> > resolving the query about (/, /a/b).
> >
> > Well, at least I assume that's why I see "ls" hang if I run "ls
> > /mnt/a/b" on the client.  And there may be some better fix, but I always
> > figured the root (hah) problem here was due to indexing the cache on
> > struct path while the upcall interface uses the full path string.
>
>
> Sounds like a very odd corner case - how did you stumble on to it?

I dug through my old email, but that may be lost in the mists of time....

My memory is that I ran across a similar hang while testing some mountd
changes, but couldn't reproduce it.  (It might have involved a change to
the exports?)  So came up with this case by inspection.

I've had this nagging todo to work out if there are other interesting
consequences of the fact that the cache is internally keyed on one thing
and appears to mountd to be keyed on another.  (And that there's a
complicated many<->many relationship between those two things.)  But I
haven't gotten to it.  Could be all unlikely corner cases, for all I
know.

--b.
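
For anyone following along, here is a rough toy model of the key mismatch described above, written in Python rather than kernel C. Everything in it is hypothetical: ToyExportCache, upcall_string() and key_from_reply() are illustrative names, not nfsd or mountd interfaces, and the rule used to turn a reply's path string back into a (mount, name) pair is only a stand-in chosen to produce the pair named in the message. It is a sketch of the suspected failure mode, not a description of the actual code.

```python
from typing import Dict, List, Optional, Tuple

# Toy stand-in for the internal key: (mountpoint, full name of the dentry).
PathKey = Tuple[str, str]


def upcall_string(key: PathKey) -> str:
    """Flatten a key to the path string userspace sees; the mount is lost."""
    _mount, name = key
    return name


def key_from_reply(path: str, mounts: List[str]) -> PathKey:
    """Assumed rule: pair the path with the deepest mountpoint strictly above it."""
    best = "/"
    for m in mounts:
        prefix = m.rstrip("/") + "/"
        if path.startswith(prefix) and len(m) > len(best):
            best = m
    return (best, path)


class ToyExportCache:
    """Cache keyed on (mount, name); a value of None means 'upcall pending'."""

    def __init__(self, mounts: List[str]) -> None:
        self.mounts = mounts
        self.entries: Dict[PathKey, Optional[str]] = {}
        self.upcalls: List[str] = []

    def lookup(self, key: PathKey) -> Optional[str]:
        if key not in self.entries:
            self.entries[key] = None            # mark the query as pending
            self.upcalls.append(upcall_string(key))
        return self.entries[key]

    def reply(self, path: str, options: str) -> None:
        # The reply carries only a string, so it is re-resolved to a key.
        self.entries[key_from_reply(path, self.mounts)] = options


cache = ToyExportCache(["/", "/a", "/a/b"])
query = ("/", "/a/b")            # lookup made underneath the first mountpoint
first = cache.lookup(query)      # pending; the upcall asks about "/a/b"
cache.reply("/a/b", "rw")        # the answer is filed under ("/a", "/a/b")
second = cache.lookup(query)     # the original query is still pending
```

In this model the reply is stored under a different key than the one that triggered the upcall, so the original query never completes, which would match a client-side "ls" that hangs.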