Due to a recent commit (d151ddc00498 - fs: Update i_[ug]id_(read|write) to translate relative to s_user_ns), inodes under /proc/sys have -1 written to their i_uid/i_gid members if a containing userns does not have entries for root in the uid/gid_map. This wouldn't normally matter, because these values are not used for access checks. However, a later change (0bd23d09b874 - Don't modify inodes with a uid or gid unknown to the vfs) changes the kernel to prevent opens for write if the i_uid/i_gid field in the inode is -1, even if the /proc/sys-specific access checks would otherwise pass. This causes a problem: in a userns without root mapping, even the namespace creator cannot write to e.g. /proc/sys/kernel/shmmax. This change fixes the problem by overriding i_uid/i_gid back to GLOBAL_ROOT_UID/GID. Tested: Used a repro program that creates a user namespace without any mapping and stat'ed /proc/$PID/root/proc/sys/kernel/shmmax from outside. Before the change, it shows uid/gid of 65534, with the change it's 0. Signed-off-by: Radoslaw Burny <rburny@xxxxxxxxxx> --- fs/proc/proc_sysctl.c | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/fs/proc/proc_sysctl.c b/fs/proc/proc_sysctl.c index c5cbbdff3c3d..67379a389658 100644 --- a/fs/proc/proc_sysctl.c +++ b/fs/proc/proc_sysctl.c @@ -499,6 +499,10 @@ static struct inode *proc_sys_make_inode(struct super_block *sb, if (root->set_ownership) root->set_ownership(head, table, &inode->i_uid, &inode->i_gid); + else { + inode->i_uid = GLOBAL_ROOT_UID; + inode->i_gid = GLOBAL_ROOT_GID; + } out: return inode; -- 2.20.0.rc0.387.gc7a69e6b6c-goog