On Thu, Dec 16, 2021 at 04:00:40PM -0500, Stefan Berger wrote: > > On 12/16/21 07:50, Christian Brauner wrote: > > On Thu, Dec 16, 2021 at 12:43:09AM -0500, Stefan Berger wrote: > > > From: Stefan Berger <stefanb@xxxxxxxxxxxxx> > > > > > > The goal of this series of patches is to start with the namespacing of > > > IMA and support auditing within an IMA namespace (IMA-ns) as the first > > > step. > > > > > > In this series the IMA namespace is piggy backing on the user namespace > > > and therefore an IMA namespace gets created when a user namespace is > > > created. The advantage of this is that the user namespace can provide > > > the keys infrastructure that IMA appraisal support will need later on. > > > > > > We chose the goal of supporting auditing within an IMA namespace since it > > > requires the least changes to IMA. Following this series, auditing within > > > an IMA namespace can be activated by a user running the following lines > > > that rely on a statically linked busybox to be installed on the host for > > > execution within the minimal container environment: > > > > > > mkdir -p rootfs/{bin,mnt,proc} > > > cp /sbin/busybox rootfs/bin > > > cp /sbin/busybox rootfs/bin/busybox2 > > > echo >> rootfs/bin/busybox2 > > > PATH=/bin unshare --user --map-root-user --mount-proc --pid --fork \ > > > --root rootfs busybox sh -c \ > > > "busybox mount -t securityfs /mnt /mnt; \ > > > busybox echo 'audit func=BPRM_CHECK mask=MAY_EXEC' > /mnt/ima/policy; \ > > > busybox2 cat /mnt/ima/policy" > > > > > > [busybox2 is used to demonstrate 2 measurements; see below] > > > > > > Following the audit log on the host the last line cat'ing the IMA policy > > > inside the namespace would have been audited. Unfortunately the auditing > > > line is not distinguishable from one stemming from actions on the host. > > > The hope here is that Richard Brigg's container id support for auditing > > > would help resolve the problem. > > > > > > The following lines added to a suitable IMA policy on the host would > > > cause the execution of the commands inside the container (by uid 1000) > > > to be measured and audited as well on the host, thus leading to two > > > auditing messages for the 'busybox2 cat' above and log entries in IMA's > > > system log. > > > > > > echo -e "measure func=BPRM_CHECK mask=MAY_EXEC uid=1000\n" \ > > > "audit func=BPRM_CHECK mask=MAY_EXEC uid=1000\n" \ > > > > /sys/kernel/security/ima/policy > > > > > > The goal of supporting measurement and auditing by the host, of actions > > > occurring within IMA namespaces, is that users, particularly root, > > > should not be able to evade the host's IMA policy just by spawning > > > new IMA namespaces, running programs there, and discarding the namespaces > > > again. This is achieved through 'hierarchical processing' of file > > > accesses that are evaluated against the policy of the namespace where > > > the action occurred and against all namespaces' and their policies leading > > > back to the root IMA namespace (init_ima_ns). > > Note that your worst-case is 32 levels (maximum supported userns > > nesting) where each ima namespace defines a separate policy. > > > > So make sure you don't run into locking issues when hierarchically > > processing rules. So far I think it's fine since the locks aren't held > > across the hierarchial walk but are dropped and reaqcuired for each > > level. > > > > But that could still mean a lot of contention on iint->mutex since this > > lock is global, i.e. in this context: for all ima namespaces. You might > > want to consider coming up with some rough ideas for how to solve this > > _if_ this becomes a problem in the future. > > > The plan is that each IMA namespace will have its own rbtree with its own > set of iints. We cannot do it all at the same time, so this will take while > until things can be completely moved over into a per-IMA namespace rbtree > and each IMA namespace becomes fully independent. Ok, good to hear that you have already thought about that. > > > > > > > The patch series adds support for a virtualized SecurityFS with a few > > > new API calls that are used by IMA namespacing. Only the data relevant > > > to the IMA namespace are shown. The files and directories of other > > > security subsystems (TPM, evm, Tomoyo, safesetid) are not showing > > > up when secruityfs is mounted inside a user namespace. > > > > > > Much of the code leading up to the virtualization of SecurityFS deals > > > with moving IMA's variables from various files into the IMA namespace > > > structure called 'ima_namespace'. When it comes to determining the > > > current IMA namespace I took the approach to get the current IMA > > > namespace (get_current_ns()) on the top level and pass the pointer all > > > the way down to those functions that now need access to the ima_namespace > > > to get to their variables. This later on comes in handy once hierarchical > > > processing is implemented in this series where we walk the list of > > > namespaces backwards and again need to pass the pointer into functions. > > Just to repeat the point from earlier reviews, all those functions need > > to be guaranteed to call from syscall context. Functions that operate on > > files have different semantics. > > > You mean files in general or SecurityFS files in particular? Files in general but I was specifically referring to securityfs here as that's the relevant bit here. > > > > > > > This patch also introduces usage of CAP_MAC_ADMIN to allow access to the > > > IMA policy via reduced capabilities. We would again later on use this > > > capability to allow users to set file extended attributes for IMA appraisal > > > support. > > > > > > My tree with these patches is here: > > > > > > git fetch https://github.com/stefanberger/linux-ima-namespaces v5.15+imans.v7.posted > > > > > > Regards, > > > Stefan > > > > > > v7: > > > - Dropped 2 patches related to key queues; using &init_ima_ns for all calls > > > from functions related to key queues where calls need ima_namespace > > > - Moved ima_namespace to security/integrity/ima/ima.h > > > - Extended API descriptions with ns parameter where needed > > > - Using init_ima_ns in functions related to appraisal and xattrs > > > - SecurityFS: Using ima_ns_from_file() to get ns pointer > > > - Reformatted to 80 columns per line > > Since we're starting to be fairly along I would ask you to please write > > detailed commit messages for the next revision. > > Expand the existing commit texts, is that what you suggest that I do? Yeah, right now they don't really explain the how and why. What I'm saying is that the commit messages - at least for the namespace part and the securityfs part - aren't detailed enough. > > > > > > I would also like to see all links for prior versions of this patchset > > in the commit message since the discussion has been fairly extensive so > > for this series it makes a lot of sense. So something like: > > > > Link: https://lore.kernel.org/r/$MSGID (v1) > > Link: https://lore.kernel.org/r/$MSGID (v2) > > Link: https://lore.kernel.org/r/$MSGID (v3) > > Link: https://lore.kernel.org/r/$MSGID (v4) > > Link: https://lore.kernel.org/r/$MSGID (v5) > > Link: https://lore.kernel.org/r/$MSGID (v6) > > Link: https://lore.kernel.org/r/$MSGID (v7) > > Signed-off-by: meh > > Signed-off-by: mih > > Signed-off-by: muh > > So that's a link per patch to all its previous versions? When Mimi ends up sending the PR for this she can then put links to all previous versions in the pull request mail or in the tag. I'm only suggesting you do it that way. Since this will be Mimi's PR it'll be her decision. So for now it would just be nice to put links to previous versions at the end of your cover letter. Because if you rework the series as I suggested you won't have a 1:1 correspondence between patches anymore as you did vor v1 to v7.