On Wed, Jan 13, 2016 at 02:26:16PM -0800, Leonid Yegoshin wrote: > On 01/13/2016 02:45 AM, Will Deacon wrote: > >> > >I don't think the address dependency is enough on its own. By that > >reasoning, the following variant (WRC+addr+addr) would work too: > > > > > >P0: > >Wx = 1 > > > >P1: > >Rx == 1 > ><address dep> > >Wy = 1 > > > >P2: > >Ry == 1 > ><address dep> > >Rx = 0 > > > > > >So are you saying that this is also forbidden? > >Imagine that P0 and P1 are two threads that share a store buffer. What > >then? > > OK, I collected answers and it is: > > In MIPS R6 this test passes OK, I mean - P2: Rx = 1 if Ry is read as 1. > By design. > > However, it is unclear that happens in MIPS R2 1004K. How can it be unclear? If, for example, the outcome is permitted on that CPU, then your original reasoning for the WRC+sync+addr doesn't apply there and SYNC is not transitive. That's what I'm trying to get to the bottom of. Does the MIPS kernel target a particular CPU at compile time? > Moreover, there are voices against guarantee that it will be in future > and that voices point me to Documentation/memory-barriers.txt section "DATA > DEPENDENCY BARRIERS" examples which require SYNC_RMB between loading > address/index and using that for loading data based on that address or index > for shared data (look on CPU2 pseudo-code): > >To deal with this, a data dependency barrier or better must be inserted > >between the address load and the data load: > > > > CPU 1 CPU 2 > > =============== =============== > > { A == 1, B == 2, C = 3, P == &A, Q == &C } > > B = 4; > > <write barrier> > > WRITE_ONCE(P, &B); > > Q = READ_ONCE(P); > > <data dependency barrier> <----------- > >SYNC_RMB is here > > D = *Q; > ... > >Another example of where data dependency barriers might be required is > >where a > >number is read from memory and then used to calculate the index for an > >array > >access: > > > > CPU 1 CPU 2 > > =============== =============== > > { M[0] == 1, M[1] == 2, M[3] = 3, P == 0, Q == 3 } > > M[1] = 4; > > <write barrier> > > WRITE_ONCE(P, 1); > > Q = READ_ONCE(P); > > <data dependency barrier> <------------ > >SYNC_RMB is here > > D = M[Q]; > > That voices say that there is a legitimate reason to relax HW here for > performance if SYNC_RMB is needed anyway to work with this sequence of > shared data. Are you saying that MIPS needs to implement [smp_]read_barrier_depends? > And all that is out-of-topic here in my mind. I just want to be sure that > this patchset still provides a use of a specific lightweight SYNCs on MIPS > vs bold and heavy generalized "SYNC 0" in any case. We may be highjacking the thread slightly, but there are much bigger issues at play here if you want to start using lightweight barriers to implement relaxed kernel primitives such as smp_load_acquire and smp_store_release. Will