On Thu, 28 Sep 2017 17:51:15 +0200 Peter Zijlstra <peterz@xxxxxxxxxxxxx> wrote: > On Fri, Sep 29, 2017 at 01:01:12AM +1000, Nicholas Piggin wrote: > > That's fine. If a user is not bound to a subset of CPUs, they could > > also cause disturbances with other syscalls and faults, taking locks, > > causing tlb flushes and IPIs and things. > > So on the big SGI class machines we've had trouble with > for_each_cpu() loops before, and IIRC the biggest Power box is not too > far from that 1-2K CPUs IIRC. This is a loop in process context, interrupts on, can reschedule, no locks held though. The biggest power boxes are more tightly coupled than those big SGI systems, but even so just plodding along taking and releasing locks in turn would be fine on those SGI ones as well really. Not DoS level. This is not a single mega hot cache line or lock that is bouncing over the entire machine, but one process grabbing a line and lock from each of 1000 CPUs. Slight disturbance sure, but each individual CPU will see it as 1/1000th of a disturbance, most of the cost will be concentrated in the syscall caller. > > Bouncing that lock across the machine is *painful*, I have vague > memories of cases where the lock ping-pong was most the time spend. > > But only Power needs this, all the other architectures are fine with the > lockless approach for MEMBAR_EXPEDITED_PRIVATE. Yes, we can add an iterator function that power can override in a few lines. Less arch specific code than this proposal. > > The ISYNC variant of the same however appears to want TIF flags or > something to aid a number of archs, the rq->lock will not help there. The SYNC_CORE? Yes it seems different. I think that's another issue though. Thanks, Nick