On Sun, Apr 21, 2019 at 06:02:35PM +0100, Al Viro wrote: > On Thu, Apr 18, 2019 at 03:50:19PM -0700, Andrew Morton wrote: > > On Wed, 17 Apr 2019 17:22:00 +0530 Bharath Vedartham <linux.bhar@xxxxxxxxx> wrote: > > > > > This patch fixes the sparse warning: > > > > > > fs/reiserfs//xattr.c:453:28: warning: incorrect type in return > > > expression (different base types) > > > fs/reiserfs//xattr.c:453:28: expected unsigned int > > > fs/reiserfs//xattr.c:453:28: got restricted __wsum > > > fs/reiserfs//xattr.c:453:28: warning: incorrect type in return > > > expression (different base types) > > > fs/reiserfs//xattr.c:453:28: expected unsigned int > > > fs/reiserfs//xattr.c:453:28: got restricted __wsum > > > > > > csum_partial returns restricted integer __wsum whereas xattr_hash > > > expects a return type of __u32. > > > > > > ... > > > > > > --- a/fs/reiserfs/xattr.c > > > +++ b/fs/reiserfs/xattr.c > > > @@ -450,7 +450,7 @@ static struct page *reiserfs_get_page(struct inode *dir, size_t n) > > > > > > static inline __u32 xattr_hash(const char *msg, int len) > > > { > > > - return csum_partial(msg, len, 0); > > > + return (__force __u32)csum_partial(msg, len, 0); > > > } > > > > > > int reiserfs_commit_write(struct file *f, struct page *page, > > > > hm. Conversion from int to __u32 should be OK - why is sparse being so > > picky here? > > Because csum_partial() returns __wsum_t, not int. > > > Why is the __force needed, btw? > > So that accidental mixing of those csums (both 16bit and 32bit) with > host- or net-endian would be caught. > > And I'm not at all sure reiserfs xattr_hash() doesn't bugger it up, actually. > > Recall that 16bit inet csum is the sum of 16bit words (treated as host-endian) > modulo 0xffff, i.e. the entire buffer interpreted as host-endian integer > taken modulo 0xffff. That has a lovely property - memory representation > of that value is the same whether we'd done calculations on b-e or l-e > host; the reason is that modulo 65535 byteswap is the same as multiplying > by 256, so the sum of byteswapped 16bit values modulo 65535 is byteswapped > sum of original values. > > csum_partial() is sum of 32bit words (treated as host-endian) modulo 0xffffffff, > i.e. the entire buffer treated as host-endian number modulo 0xffffffff. > It is convenient when we want to calculate the 16bit csum - 0xffffffff is > a multiple of 0xffff, so residue modulo 0xffffffff determines the residue > modulo 0xffff; that's what csum_fold() is. > > However, result of csum_partial() on big- and little-endian hosts > does *not* have the same property. Consider e.g. an array {0, 0, 0, 128, > 0, 0, 0, 128}. csum_partial of that on l-e will be (2^31 + 2^31)mod(2^32 - 1), > i.e. 1, with {1, 0, 0, 0} as memory representation. 16bit csum will > again be 1, with {1, 0} as memory representation. On big-endian we > get (128 + 128)mod(2^32 - 1), i.e. 256, with {0, 0, 1, 0} as memory > representation. 16bit csum is again 256, stored as {1, 0}, i.e. > the same as if we'd done everything on l-e; however, raw csum_partial() > values have different memory representations. They certainly are > different as host-endian (and so are 16bit csums). > > Reiserfs takes csum_partial() on buffer, interprets it as host-endian > and stores it little-endian on disk. When fetching those it does > the same calculation and fails on mismatch. However, if the > store had been done on little-endian host and load - on big-endian > one we *will* get mismatch almost all the time. Treating ->rx_hash > as __wsum_t (and not doing that cpu_to_le32()) would lower the > frequency of mismatches, but still would be broken. Storing > a 16bit csum (declared as __sum16_t, again, without cpu_to_le...()) > would be endian-safe, but that's not what reiserfs folks wanted > (16 bits of csum instead of 32, for starters). > > IOW, what sparse has caught here is a genuine endianness bug; images > created on little-endian host and mounted on big-endian (or vice > versa) will see csum mismatches when trying to fetch xattrs. > Broken since > commit 0b1a6a8ca8a78c2e068b04acf97479ee89a024ac > Author: Andrew Morton <akpm@xxxxxxxx> > Date: Sun May 9 23:59:13 2004 -0700 > > [PATCH] reiserfs: xattr support > > From: Chris Mason <mason@xxxxxxxx> > > From: jeffm@xxxxxxxx > > reiserfs support for xattrs > > ISTR some discussions of reiserfs layout endianness problems, but > that had been many years ago and I could be wrong; I _think_ > the conclusion had been "it sucks, but we can't do anything > without breaking existing filesystem images". Not sure if that > was the same bug or something different, though. Hi Al, Thanks for your detailed explanation. I learnt quite a bit from it. I agree we should not "supress" this bug. I have noticed in the reiserfs code that, a checksum mismatch only causes a warning? Even if there is a checksum mismatch, data still is copied to the buffer? What is the point of the checksum over here? Thanks