Aneesh Kumar K.V wrote: > On Fri, Nov 21, 2008 at 11:22:04AM -0600, Eric Sandeen wrote: >> Aneesh Kumar K.V wrote: >>> We need to make sure we update the block bitmap and clear >>> EXT4_BG_BLOCK_UNINIT flag with sb_bgl_lock held. We look >>> at EXT4_BG_BLOCK_UNINIT and reinit the block bitmap each >>> time in ext4_read_block_bitmap (introduced by >>> c806e68f5647109350ec546fee5b526962970fd2 ) >> Can you add details about the failure mode(s) of this race, so people >> (i.e. me) have an idea which bugs they've seen that it might address? >> > > ext4_read_block_bitmap does > > spin_lock(sb_bgl_lock(EXT4_SB(sb), block_group)); > if (desc->bg_flags & cpu_to_le16(EXT4_BG_BLOCK_UNINIT)) { > ext4_init_block_bitmap(sb, bh, block_group, desc); > > the above ext4_init_block_bitmap actually zero out the block bitmap. > > Now on the block allocation side we do > > mb_set_bits(sb_bgl_lock(sbi, ac->ac_b_ex.fe_group), bitmap_bh->b_data, > ac->ac_b_ex.fe_start, ac->ac_b_ex.fe_len); > > spin_lock(sb_bgl_lock(sbi, ac->ac_b_ex.fe_group)); > if (gdp->bg_flags & cpu_to_le16(EXT4_BG_BLOCK_UNINIT)) { > gdp->bg_flags &= cpu_to_le16(~EXT4_BG_BLOCK_UNINIT); > > ie on allocation we update the bitmap then we take the sb_bgl_lock > and clear the EXT4_BG_BLOCK_UNINIT flag. What can happen is a > parallel ext4_read_block_bitmap can zero out the bitmap in between > the above mb_set_bits and spin_lock(sb_bg_lock..) > > Result of this race is > a) blocks getting allocated multiple times > b) File corruption because two files have same blocks allocated > c) mb_free_blocks called multiple times on the same block Thanks - And do any of these cases lead to BUG(), WARNING(), ext3_error(), etc messages that people may one day google for? -Eric -- To unsubscribe from this list: send the line "unsubscribe linux-ext4" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html