在 2020/8/17 下午5:58, Wei Yang 写道: > On Sun, Aug 16, 2020 at 11:47:56AM +0800, Alex Shi wrote: >> pageblock_flags is used as long, since every pageblock_flags is just 4 >> bits, 'long' size will include 8(32bit machine) or 16 pageblocks' flags, >> that flag setting has to sync in cmpxchg with 7 or 15 other pageblock >> flags. It would cause long waiting for sync. >> >> If we could change the pageblock_flags variable as char, we could use >> char size cmpxchg, which just sync up with 2 pageblock flags. it could >> relief much false sharing in cmpxchg. >> > > If my understanding is correct, CPU reads data in the size of cacheline. > Define a variable a char or other, doesn't help on false sharing. > > Correct me, if not. Right and not, Cacheline false sharing is right. but after that, cmpxchg still need to compare the pointed data, if the data is long, it need to compare long word and make sure nothing changes in the long word. If we narrow the comparsion data to byte, cmpxchg will just sync up on a byte. So it looks like there are 2 level false sharing here. Thanks Alex for (;;) { - old_word = cmpxchg(&bitmap[word_bitidx], word, (word & ~mask) | flags); - if (word == old_word) + old_byte = cmpxchg(&bitmap[byte_bitidx], byte, (byte & ~mask) | flags); + if (byte == old_byte) break; - word = old_word; + byte = old_byte; }