Re: [patch 1/3] IA64: Slim down __clear_bit_unlock #2

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Please have a look at the patch below.

Taking this opportunity, in addition:
- I removed the unnecessary barrier() from __clear_bit_unlock().
 ia64_st4_rel_nta() makes sure all the modifications are globally
 seen before the bit is seen to be off.
- I made __clear_bit() modeled after __set_bit() and __change_bit().
- I corrected some comments sating that a memory barrier is provided,
 yet in reality, it is the acquisition side of the memory barrier only.
- I corrected some comments, e.g. test_and_clear_bit() was peaking
 about "bit to set".

Here is the code generated from my and the old versions.
(Though I do not know why "and" is moved in the 2nd bundle.):

test_new()
{
       __clear_bit_unlock(3, &data);
}

test_old()
{
       old__clear_bit_unlock(3, &data);
}

0000000000000000 <test_new>:
  0:   02 00 00 00 01 00       [MII]       nop.m 0x0
  6:   e0 00 04 00 48 00                   addl r14=0,r1;;
  c:   00 00 04 00                         nop.i 0x0
 10:   0b 10 00 1c 10 10       [MMI]       ld4 r2=[r14];;
 16:   f0 b8 0b 58 44 00                   and r15=-9,r2
 1c:   00 00 04 00                         nop.i 0x0;;
 20:   0a 00 3c 1c b6 11       [MMI]       st4.rel.nta [r14]=r15;;
 26:   00 00 00 02 00 00                   nop.m 0x0
 2c:   01 70 00 84                         mov r8=r14
 30:   1d 00 00 00 01 00       [MFB]       nop.m 0x0
 36:   00 00 00 02 00 80                   nop.f 0x0
 3c:   08 00 84 00                         br.ret.sptk.many b0;;

0000000000000040 <test_old>:
 40:   02 00 00 00 01 00       [MII]       nop.m 0x0
 46:   e0 00 04 00 48 00                   addl r14=0,r1;;
 4c:   00 00 04 00                         nop.i 0x0
 50:   03 10 00 1c b0 10       [MII]       ld4.acq r2=[r14]
 56:   00 00 00 02 00 e0                   nop.i 0x0;;
 5c:   71 17 b0 88                         and r15=-9,r2;;
 60:   0a 00 3c 1c b6 11       [MMI]       st4.rel.nta [r14]=r15;;
 66:   00 00 00 02 00 00                   nop.m 0x0
 6c:   01 70 00 84                         mov r8=r14
 70:   1d 00 00 00 01 00       [MFB]       nop.m 0x0
 76:   00 00 00 02 00 80                   nop.f 0x0
 7c:   08 00 84 00                         br.ret.sptk.many b0;;

Signed-off-by: Zoltan Menyhart, <Zoltan.Menyhart@xxxxxxxx>

Thanks,

Zoltan Menyhart


--- include/asm-ia64/bitops.h-old	2007-12-21 02:25:48.000000000 +0100
+++ include/asm-ia64/bitops.h-tmp	2008-01-15 13:16:26.000000000 +0100
@@ -122,38 +122,40 @@
 }
 
 /**
- * __clear_bit_unlock - Non-atomically clear a bit with release
+ * __clear_bit_unlock - Non-atomically clears a bit in memory with release
+ * @nr: Bit to clear
+ * @addr: Address to start counting from
  *
- * This is like clear_bit_unlock, but the implementation uses a store
+ * Similarly to clear_bit_unlock, the implementation uses a store
  * with release semantics. See also __raw_spin_unlock().
  */
 static __inline__ void
-__clear_bit_unlock(int nr, volatile void *addr)
+__clear_bit_unlock(int nr, void *addr)
 {
-	__u32 mask, new;
-	volatile __u32 *m;
+	__u32 * const m = (__u32 *) addr + (nr >> 5);
+	__u32 const new = *m & ~(1 << (nr & 31));
 
-	m = (volatile __u32 *)addr + (nr >> 5);
-	mask = ~(1 << (nr & 31));
-	new = *m & mask;
-	barrier();
 	ia64_st4_rel_nta(m, new);
 }
 
 /**
  * __clear_bit - Clears a bit in memory (non-atomic version)
+ * @nr: the bit to clear
+ * @addr: the address to start counting from
+ *
+ * Unlike clear_bit(), this function is non-atomic and may be reordered.
+ * If it's called on the same region of memory simultaneously, the effect
+ * may be that only one operation succeeds.
  */
 static __inline__ void
 __clear_bit (int nr, volatile void *addr)
 {
-	volatile __u32 *p = (__u32 *) addr + (nr >> 5);
-	__u32 m = 1 << (nr & 31);
-	*p &= ~m;
+	*((__u32 *) addr + (nr >> 5)) &= ~(1 << (nr & 31));
 }
 
 /**
  * change_bit - Toggle a bit in memory
- * @nr: Bit to clear
+ * @nr: Bit to toggle
  * @addr: Address to start counting from
  *
  * change_bit() is atomic and may not be reordered.
@@ -178,7 +180,7 @@
 
 /**
  * __change_bit - Toggle a bit in memory
- * @nr: the bit to set
+ * @nr: the bit to toggle
  * @addr: the address to start counting from
  *
  * Unlike change_bit(), this function is non-atomic and may be reordered.
@@ -197,7 +199,7 @@
  * @addr: Address to count from
  *
  * This operation is atomic and cannot be reordered.  
- * It also implies a memory barrier.
+ * It also implies the acquisition side of the memory barrier.
  */
 static __inline__ int
 test_and_set_bit (int nr, volatile void *addr)
@@ -247,11 +249,11 @@
 
 /**
  * test_and_clear_bit - Clear a bit and return its old value
- * @nr: Bit to set
+ * @nr: Bit to clear
  * @addr: Address to count from
  *
  * This operation is atomic and cannot be reordered.  
- * It also implies a memory barrier.
+ * It also implies the acquisition side of the memory barrier.
  */
 static __inline__ int
 test_and_clear_bit (int nr, volatile void *addr)
@@ -272,7 +274,7 @@
 
 /**
  * __test_and_clear_bit - Clear a bit and return its old value
- * @nr: Bit to set
+ * @nr: Bit to clear
  * @addr: Address to count from
  *
  * This operation is non-atomic and can be reordered.  
@@ -292,11 +294,11 @@
 
 /**
  * test_and_change_bit - Change a bit and return its old value
- * @nr: Bit to set
+ * @nr: Bit to change
  * @addr: Address to count from
  *
  * This operation is atomic and cannot be reordered.  
- * It also implies a memory barrier.
+ * It also implies the acquisition side of the memory barrier.
  */
 static __inline__ int
 test_and_change_bit (int nr, volatile void *addr)
@@ -315,8 +317,12 @@
 	return (old & bit) != 0;
 }
 
-/*
- * WARNING: non atomic version.
+/**
+ * __test_and_change_bit - Change a bit and return its old value
+ * @nr: Bit to change
+ * @addr: Address to count from
+ *
+ * This operation is non-atomic and can be reordered.
  */
 static __inline__ int
 __test_and_change_bit (int nr, void *addr)

[Index of Archives]     [Linux Kernel]     [Sparc Linux]     [DCCP]     [Linux ARM]     [Yosemite News]     [Linux SCSI]     [Linux x86_64]     [Linux for Ham Radio]

  Powered by Linux