Re: [PATCH] replaced BUG() with return -EIO from ext4_ext_get_blocks

Eric Sandeen <sandeen@xxxxxxxxxx> · Fri, 11 Dec 2009 14:02:58 -0600

Frank Mayhar wrote:
> On Fri, 2009-12-11 at 16:06 +0200, Surbhi Palande wrote:
>> This patch fixes the upstream bug# 14286. When the address of an extent
>> corresponding to a given block is NULL and the tree is being traversed for
>> fetching such an address, a -EIO should be reported instead of a BUG(). This
>> situation should normally not occur. However if it does, then the system
>> should be rendered usable.
>>
>> Signed-off-by: Surbhi Palande <surbhi.palande@xxxxxxxxxxxxx>
>> ---
>>  fs/ext4/extents.c |    7 ++++++-
>>  1 files changed, 6 insertions(+), 1 deletions(-)
>>
>> diff --git a/fs/ext4/extents.c b/fs/ext4/extents.c
>> index 3a7928f..51f87f3 100644
>> --- a/fs/ext4/extents.c
>> +++ b/fs/ext4/extents.c
>> @@ -3190,7 +3190,12 @@ int ext4_ext_get_blocks(handle_t *handle, struct inode *inode,
>>  	 * this situation is possible, though, _during_ tree modification;
>>  	 * this is why assert can't be put in ext4_ext_find_extent()
>>  	 */
>> -	BUG_ON(path[depth].p_ext == NULL && depth != 0);
>> +	if (path[depth].p_ext == NULL && depth != 0) {
>> +		err = -EIO;
>> +		printk(KERN_ERR "\n ext4 fs error in %s,%s,%s while reading a block ", \
>> +					__FILE__, __func__, __LINE__);
>> +		goto out2;
>> +	}
>>  	eh = path[depth].p_hdr;
>>  
>>  	ex = path[depth].p_ext;
> 
> As it happens, I fixed this locally but, as I considered it a band-aid
> fix at the time, I never pushed it to you guys.  My version of the fix
> is below; it dumps more information before returning the EIO.  I don't
> have a strong preference between the two versions but I would like to
> see the commentary included and the extra information is often useful
> during debugging.

My first thought was that this was a bandaid too, but I guess it can
come about due to on-disk corruption for any reason, so it should
be handled gracefully, and I suppose this approach seems fine.

I think the original plan was that a BUG() should be primarily for things
that would arise from a programming error, though EIO & graceful shutdown
even for that class of errors would probably be best.

Since this is catching on-disk corruption, though, it'd be better to call
ext4_error() and let the mount-time error-handling policy decide what to do.

I like having more info but below seems awfully wordy ;)  Maybe the first
printk would suffice, and switching it to an ext4_error() would be best,
I think.

-Eric

> Signed-off-by:  Frank Mayhar <fmayhar@xxxxxxxxxx>
> 
>  fs/ext4/extents.c |   18 +++++++++++++++++-
>  1 files changed, 17 insertions(+), 1 deletions(-)
> 
> diff --git a/fs/ext4/extents.c b/fs/ext4/extents.c
> index f7bdd55..7aa0bf6 100644
> --- a/fs/ext4/extents.c
> +++ b/fs/ext4/extents.c
> @@ -2859,8 +2859,24 @@ int ext4_ext_get_blocks(handle_t *handle, struct inode *inode,
>  	 * consistent leaf must not be empty;
>  	 * this situation is possible, though, _during_ tree modification;
>  	 * this is why assert can't be put in ext4_ext_find_extent()
> +	 *
> +	 * We don't want to panic in this case, as it can lead to a crash
> +	 * loop; instead we want to catch the error and abort.
>  	 */
> -	BUG_ON(path[depth].p_ext == NULL && depth != 0);
> +	if (path[depth].p_ext == NULL && depth != 0) {
> +		printk(KERN_ERR
> +		       "EXT4-fs (%s): corrupt extent node ino %lu iblock %d"
> +		       " depth %d pblock %lld\n",
> +		       inode->i_sb->s_id, inode->i_ino, iblock, depth,
> +		       path[depth].p_block);
> +		if (!path[depth].p_hdr)
> +			path[depth].p_hdr = ext_block_hdr(path[depth].p_bh);
> +		printk(KERN_ERR "EXT4-fs (%s): eh_entries %d eh_max %d\n",
> +		       inode->i_sb->s_id, path[depth].p_hdr->eh_entries,
> +		       path[depth].p_hdr->eh_max);
> +		err = -EIO;
> +		goto out2;
> +	}
>  	eh = path[depth].p_hdr;
>  
>  	ex = path[depth].p_ext;

--
To unsubscribe from this list: send the line "unsubscribe linux-ext4" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html