Re: [RFC][PATCH 0/5] arch: atomic rework

"Paul E. McKenney" <paulmck@xxxxxxxxxxxxxxxxxx> · Tue, 18 Feb 2014 11:42:18 -0800

On Tue, Feb 18, 2014 at 06:23:47PM +0000, Peter Sewell wrote:
> On 18 February 2014 17:16, Paul E. McKenney <paulmck@xxxxxxxxxxxxxxxxxx> wrote:
> > On Tue, Feb 18, 2014 at 08:49:13AM -0800, Linus Torvalds wrote:
> >> On Tue, Feb 18, 2014 at 7:31 AM, Torvald Riegel <triegel@xxxxxxxxxx> wrote:
> >> > On Mon, 2014-02-17 at 16:05 -0800, Linus Torvalds wrote:
> >> >> And exactly because I know enough, I would *really* like atomics to be
> >> >> well-defined, and have very clear - and *local* - rules about how they
> >> >> can be combined and optimized.
> >> >
> >> > "Local"?
> >>
> >> Yes.
> >>
> >> So I think that one of the big advantages of atomics over volatile is
> >> that they *can* be optimized, and as such I'm not at all against
> >> trying to generate much better code than for volatile accesses.
> >>
> >> But at the same time, that can go too far. For example, one of the
> >> things we'd want to use atomics for is page table accesses, where it
> >> is very important that we don't generate multiple accesses to the
> >> values, because parts of the values can be change *by*hardware* (ie
> >> accessed and dirty bits).
> >>
> >> So imagine that you have some clever global optimizer that sees that
> >> the program never ever actually sets the dirty bit at all in any
> >> thread, and then uses that kind of non-local knowledge to make
> >> optimization decisions. THAT WOULD BE BAD.
> >
> > Might as well list other reasons why value proofs via whole-program
> > analysis are unreliable for the Linux kernel:
> >
> > 1.      As Linus said, changes from hardware.
> >
> > 2.      Assembly code that is not visible to the compiler.
> >         Inline asms will -normally- let the compiler know what
> >         memory they change, but some just use the "memory" tag.
> >         Worse yet, I suspect that most compilers don't look all
> >         that carefully at .S files.
> >
> >         Any number of other programs contain assembly files.
> >
> > 3.      Kernel modules that have not yet been written.  Now, the
> >         compiler could refrain from trying to prove anything about
> >         an EXPORT_SYMBOL() or EXPORT_SYMBOL_GPL() variable, but there
> >         is currently no way to communicate this information to the
> >         compiler other than marking the variable "volatile".
> >
> >         Other programs have similar issues, e.g., via dlopen().
> >
> > 4.      Some drivers allow user-mode code to mmap() some of their
> >         state.  Any changes undertaken by the user-mode code would
> >         be invisible to the compiler.
> >
> > 5.      JITed code produced based on BPF: https://lwn.net/Articles/437981/
> >
> > And probably other stuff as well.
> 
> interesting list.  So are you saying that value-range-analysis and
> such-like (I say glibly, without really knowing what "such-like"
> refers to here) are fundamentally incompatible with
> the kernel code, or can you think of some way to tell the compiler a
> bound on the footprint of the "unseen" changes in each of those cases?

Other than the "volatile" keyword, no.

Well, I suppose you could also create a function that changed the
variables in question, then arrange to never call it, but in such a way
that the compiler could not prove that it was never called.  But ouch!

							Thanx, Paul

--
To unsubscribe from this list: send the line "unsubscribe linux-arch" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html