On Fri, Jul 28, 2017 at 1:55 PM, Tyler Hicks <tyhicks@xxxxxxxxxxxxx> wrote: > This patch creates a read-only sysctl containing an ordered list of > seccomp actions that the kernel supports. The ordering, from left to > right, is the lowest action value (kill) to the highest action value > (allow). Currently, a read of the sysctl file would return "kill trap > errno trace allow". The contents of this sysctl file can be useful for > userspace code as well as the system administrator. > > The path to the sysctl is: > > /proc/sys/kernel/seccomp/actions_avail > > libseccomp and other userspace code can easily determine which actions > the current kernel supports. The set of actions supported by the current > kernel may be different than the set of action macros found in kernel > headers that were installed where the userspace code was built. > > In addition, this sysctl will allow system administrators to know which > actions are supported by the kernel and make it easier to configure > exactly what seccomp logs through the audit subsystem. Support for this > level of logging configuration will come in a future patch. > > Signed-off-by: Tyler Hicks <tyhicks@xxxxxxxxxxxxx> > --- > > * Changes since v4: > - move device_initcall() into CONFIG_SYSCTL ifdef > - mark the seccomp_actions_avail string as const > - adjust for new reStructuredText format > > Documentation/sysctl/kernel.txt | 1 + > Documentation/userspace-api/seccomp_filter.rst | 16 ++++++++ > kernel/seccomp.c | 51 ++++++++++++++++++++++++++ > 3 files changed, 68 insertions(+) > > diff --git a/Documentation/sysctl/kernel.txt b/Documentation/sysctl/kernel.txt > index bac23c1..995c42c 100644 > --- a/Documentation/sysctl/kernel.txt > +++ b/Documentation/sysctl/kernel.txt > @@ -74,6 +74,7 @@ show up in /proc/sys/kernel: > - reboot-cmd [ SPARC only ] > - rtsig-max > - rtsig-nr > +- seccomp/ ==> Documentation/userspace-api/seccomp_filter.rst > - sem > - sem_next_id [ sysv ipc ] > - sg-big-buff [ generic SCSI device (sg) ] > diff --git a/Documentation/userspace-api/seccomp_filter.rst b/Documentation/userspace-api/seccomp_filter.rst > index f71eb5e..35fc7cb 100644 > --- a/Documentation/userspace-api/seccomp_filter.rst > +++ b/Documentation/userspace-api/seccomp_filter.rst > @@ -169,7 +169,23 @@ The ``samples/seccomp/`` directory contains both an x86-specific example > and a more generic example of a higher level macro interface for BPF > program generation. > > +Sysctls > +======= > + > +Seccomp's sysctl files can be found in the ``/proc/sys/kernel/seccomp/`` > +directory. Here's a description of each file in that directory: > + > +``actions_avail``: > + A read-only ordered list of seccomp return values (refer to the > + ``SECCOMP_RET_*`` macros above) in string form. The ordering, from > + left-to-right, is the least permissive return value to the most > + permissive return value. > > + The list represents the set of seccomp return values supported > + by the kernel. A userspace program may use this list to > + determine if the actions found in the ``seccomp.h``, when the > + program was built, differs from the set of actions actually > + supported in the current running kernel. > > Adding architecture support > =========================== > diff --git a/kernel/seccomp.c b/kernel/seccomp.c > index 98b59b5..6bff068 100644 > --- a/kernel/seccomp.c > +++ b/kernel/seccomp.c > @@ -17,11 +17,13 @@ > #include <linux/audit.h> > #include <linux/compat.h> > #include <linux/coredump.h> > +#include <linux/kmemleak.h> > #include <linux/sched.h> > #include <linux/sched/task_stack.h> > #include <linux/seccomp.h> > #include <linux/slab.h> > #include <linux/syscalls.h> > +#include <linux/sysctl.h> > > #ifdef CONFIG_HAVE_ARCH_SECCOMP_FILTER > #include <asm/syscall.h> > @@ -922,3 +924,52 @@ long seccomp_get_filter(struct task_struct *task, unsigned long filter_off, > return ret; > } > #endif > + > +#ifdef CONFIG_SYSCTL > + > +/* Human readable action names for friendly sysctl interaction */ > +#define SECCOMP_RET_KILL_NAME "kill" > +#define SECCOMP_RET_TRAP_NAME "trap" > +#define SECCOMP_RET_ERRNO_NAME "errno" > +#define SECCOMP_RET_TRACE_NAME "trace" > +#define SECCOMP_RET_ALLOW_NAME "allow" > + > +static const char seccomp_actions_avail[] = SECCOMP_RET_KILL_NAME " " > + SECCOMP_RET_TRAP_NAME " " > + SECCOMP_RET_ERRNO_NAME " " > + SECCOMP_RET_TRACE_NAME " " > + SECCOMP_RET_ALLOW_NAME; > + > +static struct ctl_path seccomp_sysctl_path[] = { > + { .procname = "kernel", }, > + { .procname = "seccomp", }, > + { } > +}; > + > +static struct ctl_table seccomp_sysctl_table[] = { > + { > + .procname = "actions_avail", > + .data = (void *) &seccomp_actions_avail, > + .maxlen = sizeof(seccomp_actions_avail), > + .mode = 0444, > + .proc_handler = proc_dostring, > + }, > + { } > +}; > + > +static int __init seccomp_sysctl_init(void) > +{ > + struct ctl_table_header *hdr; > + > + hdr = register_sysctl_paths(seccomp_sysctl_path, seccomp_sysctl_table); > + if (!hdr) > + pr_warn("seccomp: sysctl registration failed\n"); > + else > + kmemleak_not_leak(hdr); > + > + return 0; > +} > + > +device_initcall(seccomp_sysctl_init) > + > +#endif /* CONFIG_SYSCTL */ Looks good. And for the record, the BPF return values, while not checked in seccomp_check_filter(), are part of ABI and the kernel will behave differently for unexpected values. For example, an older kernel encountering the future SECCOMP_RET_LOG will treat it as SECCOMP_RET_KILL since it's missing from the switch statement in __seccomp_filter(). A question about patch ordering: should the new seccomp action introspection patch maybe follow this one, so they're together in the series (they provide the same information)? -Kees -- Kees Cook Pixel Security -- To unsubscribe from this list: send the line "unsubscribe linux-api" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html