On Thu, May 07, 2020 at 02:47:05PM -0400, Rafael Aquini wrote: > On Thu, May 07, 2020 at 02:43:16PM -0400, Rafael Aquini wrote: > > On Thu, May 07, 2020 at 06:22:57PM +0000, Luis Chamberlain wrote: > > > On Thu, May 07, 2020 at 02:06:31PM -0400, Rafael Aquini wrote: > > > > diff --git a/kernel/sysctl.c b/kernel/sysctl.c > > > > index 8a176d8727a3..b80ab660d727 100644 > > > > --- a/kernel/sysctl.c > > > > +++ b/kernel/sysctl.c > > > > @@ -1217,6 +1217,13 @@ static struct ctl_table kern_table[] = { > > > > .extra1 = SYSCTL_ZERO, > > > > .extra2 = SYSCTL_ONE, > > > > }, > > > > + { > > > > + .procname = "panic_on_taint", > > > > + .data = &panic_on_taint, > > > > + .maxlen = sizeof(unsigned long), > > > > + .mode = 0644, > > > > + .proc_handler = proc_doulongvec_minmax, > > > > + }, > > > > > > You sent this out before I could reply to the other thread on v1. > > > My thoughts on the min / max values, or lack here: > > > > > > Valid range doesn't mean "currently allowed defined" masks. > > > > > > For example, if you expect to panic due to a taint, but a new taint type > > > you want was not added on an older kernel you would be under a very > > > *false* sense of security that your kernel may not have hit such a > > > taint, but the reality of the situation was that the kernel didn't > > > support that taint flag only added in future kernels. > > > > > > You may need to define a new flag (MAX_TAINT) which should be the last > > > value + 1, the allowed max values would be > > > > > > (2^MAX_TAINT)-1 > > > > > > or > > > > > > (1<<MAX_TAINT)-1 > > > > > > Since this is to *PANIC* I think we do want to test ranges and ensure > > > only valid ones are allowed. > > > > > > > Ok. I'm thinking in: > > > > diff --git a/kernel/sysctl.c b/kernel/sysctl.c > > index 8a176d8727a3..ee492431e7b0 100644 > > --- a/kernel/sysctl.c > > +++ b/kernel/sysctl.c > > @@ -1217,6 +1217,15 @@ static struct ctl_table kern_table[] = { > > .extra1 = SYSCTL_ZERO, > > .extra2 = SYSCTL_ONE, > > }, > > + { > > + .procname = "panic_on_taint", > > + .data = &panic_on_taint, > > + .maxlen = sizeof(unsigned long), > > + .mode = 0644, > > + .proc_handler = proc_doulongvec_minmax, > > + .extra1 = SYSCTL_ZERO, > > + .extra2 = (1 << TAINT_FLAGS_COUNT << 1) - 1, > ^^^^^^^^ > Without that crap, obviously. Sorry. That was a screw up on my side, > when copyin' and pasting. I really think that the implications of this needs a bit further review, hence the wider CCs. Since this can trivially crash a system, I think we need to be careful about this value. First, proc_doulongvec_minmax() will not suffice alone, we'll *at least* want to check for capable(CAP_SYS_ADMIN)) as in proc_taint(). Second first note that we *always* build proc_taint(), if just CONFIG_PROC_SYSCTL is enabled. That has been the way since it got merged via commit 34f5a39899f3f ("Add TAINT_USER and ability to set taint flags from userspace") since v2.6.21. We need to evaluate if this little *new* knob you are introducing merits its own kconfig tucked away under debugging first. The ship has already sailed for proc_taint(). Anyone with CAP_SYS_ADMIN can taint. The good thing is that proc_taint() added its own TAINT_USER, *but*, hey it didn't use it. A panic-on-taint system would be able to tell if a panic was caused by proc_taint() throught the stack trace only. If panic-on-taint proc was used *later* after a custom taint was set or happened naturally, no panic would trigger since your panic-on-taint check on your patch only happens on add_taint(). This means that for those thinking about using this for QA or security purposes, the only sensible *reliable* way to use panic-on-taint would be through cmdline, from boot. Post-boot means to enable this would either need to check existing taint flags, or we'd want to a way to check if this was not added post boot. Also, a post-booteed system with panic-on-taint could easily allow for reductions of the intended goal, thereby allowing one to cheat. I think a new TAINT_MODIFIED for use when proc_taint() is used is worth considering. Ted? Even though 'M' is taken -- I think its silly to rely on the character to be anything of meaning, once we run out of the alphabet letters that will be the way anyway, unless we-redo this a bit. Note we use value for when this is on and off, typically an empty space when a taint is not seen. The good thing is that proc_taint() only *increments* taint, it doesn't remove taints. Are we OK with panic-on-taint only with CAP_SYS_ADMIN? I can see this building up to a "testing" solution to ensure / gaurantee no bugs have happened during QA, but since QA would want the same binary for production it is hard to see this enabled for QA but not production. To resolve that last concern, if we do go with moving this under a kconfig value, a simple cmdline append would address the concerns. Ie, even if you enabled this mechanism through its kconfig you would not be able to modify the panic-on-tain unless you passed a cmdline option. Note that Vlastimil has some patches which are visible on linux-next, but not yet merged on Linus' tree, which enable these params to be set on the cmdline too now, so perhaps yet-another cmdline param is not needed anymore. I *think* that a cmdline route to enable this would likely remove the need for the kernel config for this. But even with Vlastimil's work merged, I think we'd want yet-another value to enable / disable this feature. Do we need yet-another-taint flag to tell us that this feature was enabled? Luis