Adding some people who pay more attention to nfs in network namespaces than I usually do. Oleg Nesterov <oleg@xxxxxxxxxx> writes: > On 07/28, Eric W. Biederman wrote: >> >> > This untested patch should fix it without any need to worry about >> > dynamic behavior. > > Yeees ;) I was thinking about this change too, but I have no idea > what this ->nodename actually means for nfs. > > If you are going to send this patch - great! Just batting it around for now, and hoping we have the right combination of eyeballs look at it. There are more places in nfs that have questionable uses of utsname() instead of init_utsname(). So I think we probably need a more comprehensive patch at the very least. nfsclnt_reclaim is never called from userspace. nfsclnt_cancel is called from a callback that seems to have no guarantee of an approprate userspace context. nfsclnt_proc is called from rpc_ops and I did not spend the time to see if that could be a userspace context. So I really don't think using utsname() aka current->nsproxy->uts_ns makes sense in nlmclnt_setlockargs. We most definitely have an inconsistency in nfs. I am a little hesitant to suggest my patch because it is likely to have strange effects on nfs in a network namespace. On the flip side the code is broken anyway, we might as well at least use a version that is guarnateed not to cause a null pointer dereference in the kernel. This is the first time someone has seen a problem since late 2006 since Serge updated most of the references to use system_utsname but I think we have just been lucky. >> Although I am wondering if we have a few other spots >> > where the dynamic behavior might be iffy. > > Yes. And perhaps the patch which moves exit_task_namespaces() after > exit_task_work() makes sense anyway (the patch I showed). > > (but if you change nlmclnt_setlockargs() then it is not 3.11 material). > > The original motivation for 8aac62706 was the leak reported by Andrey, > but that leak should be also fixed by e7b2c406. "Move exit_task_namespaces() > from exit_notify() to do_exit()" is still fine imho, the reason for > exit_task_namespaces() from the middle of exit_notify() has gone away. > > But perhaps it would be better if work->func() could use ->nsproxy even > if the task is PF_EXITING. So far there is nothing in the nfs code that would suggest allowing work->func() being able to use ->nsproxy would make this code any better. I think that would just paper over the problem we are seeing right now. >> > Serge do you remember any of this? >> > >> > On a good day I can follow the nfs code but it takes quite a while. I >> > feel the same way about filesystems locks so I am not really certain >> > what is going on. >> > >> > Eric >> > >> > diff --git a/fs/lockd/clntproc.c b/fs/lockd/clntproc.c >> > index 9760ecb..6643cfc 100644 >> > --- a/fs/lockd/clntproc.c >> > +++ b/fs/lockd/clntproc.c >> > @@ -128,11 +128,11 @@ static void nlmclnt_setlockargs(struct nlm_rqst *req, struct file_lock *fl) >> > >> > nlmclnt_next_cookie(&argp->cookie); >> > memcpy(&lock->fh, NFS_FH(file_inode(fl->fl_file)), sizeof(struct nfs_fh)); >> > - lock->caller = utsname()->nodename; >> > + lock->caller = init_utsname()->nodename; >> > lock->oh.data = req->a_owner; >> > lock->oh.len = snprintf(req->a_owner, sizeof(req->a_owner), "%u@%s", >> > (unsigned int)fl->fl_u.nfs_fl.owner->pid, >> > - utsname()->nodename); >> > + init_utsname()->nodename); >> > lock->svid = fl->fl_u.nfs_fl.owner->pid; >> > lock->fl.fl_start = fl->fl_start; >> > lock->fl.fl_end = fl->fl_end; >> > >> > Eric -- To unsubscribe from this list: send the line "unsubscribe linux-nfs" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html