On 04/21/2017 04:55 AM, Reshetova, Elena wrote: >>>> Please don't send any more conversions until those have been resolved. >> >> Like I suggested months ago, how about doing an efficient implementation of >> refcount_t which doesn't use the bloated cmpxchg loop? Then there would be >> no >> need for endless performance arguments. In fact, in PaX there are already >> example implementations for several architectures. It's unfortunate that >> they're still being ignored for some reason. > > Providing efficient arch. specific implementation for refcount_t is > likely the next step to make performance-sensitive places happy. > However, in order to do it, we will need to measure for each subsystem > a) current atomic_t usage - base measurement > b) pure refcount_t usage impact > c) arch. specific refcount_t impact > Otherwise I have a chicken and egg problem here: people want better > performance and want to see arch. specific code for this, but in order > to convince maintainer in need of arch. specific code, I need to show > that we do have indeed a performance issue. Sorry, but that's a big load of... You have it so easy - the code is completely standalone, building a small test framework around it and measuring performance in _user space_ is trivial. Don't go around benchmarking conversions in specific subsystems. Generate numbers showing how refcount_t compares to atomic_t for various (valid) use cases. Once you have that, convincing people to adopt it would be so much easier. So stop talking about excuses, and start producing some numbers. If you can't do that, my level of interest in converting anything in block is zero. -- Jens Axboe