Re: Does raid5 have error handling while reading?

"NeilBrown" <neilb@xxxxxxx> · Fri, 22 May 2009 21:21:38 +1000 (EST)

On Fri, May 22, 2009 7:13 pm, Xu Yuehai wrote:
>>
>> If chunk_aligned_read results in an error, then the request
>> is re-tried through the stripe cache.  I would call that "error
>> handling".
>
> I think you mean that error() and raid5_align_endio are invoked and
> the read request is sent to conf->retry_read_aligned_list, then the
> raid5d will handle it.

Yes, that sounds right.

>
>> If the cache finds that a drives is failed, the data is calculated
>> from the other drives, though the code is not a straight forward
>> as it used to be.
>> See fetch_block5 which sets STRIPE_OP_COMPUTE_BLK which causes
>> ops_run_compute5 to do the work (or to schedule the work on separate
>> hardware).
>>
> I notice that the bit STRIPE_OP_COMPUTE_BLK is set if only s->uptodate
> == disks - 1, however, suppose there is just a single read request, I
> fail to find out why this condition is satisfied.

If there is just one read for a block on a device that has failed,
then fetch_block5 will schedule read requests (Wantread) on all
the other devices.  Once all those reads complete Uptodate will
be set, so s->update will indeed equal disks - 1, and so
COMPUTE_BLK will get set.

NeilBrown

--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html