Re: RAID5 reconstruction ?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 31/05/2009 12:54, Goswin von Brederlow wrote:
SandeepKsinha <sandeepksinha@xxxxxxxxx> writes:
On Sat, 2009-05-30 at 20:55 +0200, Goswin von Brederlow wrote:
And just when I hit send I thought of something else.

Instead of the initial sync when creating a raid the bitmap could just
mark all blocks as unused. Much faster raid creation.

This really sounds like a good option. This would have a slight hit
for writes which I believe will compensate for later re-constructions,
replacing a disk, mirror resysnc and many more operation.

What hit? Currently with bitmap support a write will set the block to
"unclean", write the data, write the parity and set the block to
"clean". Setting the "used" bit along the way should not cost much.

Only difference I see is that the bitmap would have to have finer
granularity so one "used" bit covers one filesystem block (4k usualy).
Otherwise you could only "use" blocks but not "unuse" them again when
the filesystem frees them in 4k chunks.

I think the whole thing probably ought to be done in such a way as to support the pass-down and pass-through of TRIM/DISCARD commands, which I vaguely recall from previous discussions operate at sector granularity. The idea would be for md to be able to use a bitmap (or other some other data structure for a free/used block/sector list) when operating over devices which don't support TRIM/DISCARD themselves, but take advantage of the devices' own capability when it's there - and since it'll be SSDs, we'd want to avoid repeatedly rewriting a bitmap since the point of TRIM/DISCARD is to help SSDs manage wear levelling.

I am assuming that devices supporting TRIM/DISCARD are able to indicate whether a given sector is used or free; if they don't and just return arbitary data we would have to keep a bitmap (or whatever) in md to be able to support TRIM/DISCARD at all.

Of course any bitmap (or whatever) might still be optimised if we know md and its clients never use anything smaller than e.g. 4k.

Cheers,

John.
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux