Re: RAID5 Performance

Adam Goryachev <mailinglists@xxxxxxxxxxxxxxxxxxxxxx> · Thu, 28 Jul 2016 23:57:14 +1000

On 28/07/2016 23:28, Peter Grandi wrote:
[ ... ]
That largely explains why in the tests I have mentioned small
sync write IOPS for many "consumerish" flash SSDs top at around
100, instead of the usual > 10,000 for small non-sync writes.
[ ... ]

To summarize the preceding long discussion:

* The stats reported show a low level of IOPS being carried out.

* The critical part of the workload seems to be synchronous
   small writes.

* Probably then the primary issue is the use of flash SSDs that
   have a limited number of IOPS for small synchronous writes.

* A secondary issue is that RAID5 results in RMW for small
   writes.

There are two possible options:

* Replace the flash SSDs with those that are known to deliver
   high (at least > 10,000 single threaded) small synchronous
   write IOPS.
Is there a "known" SSD that you would suggest? My problem is that Intel 
spec sheets seem to suggest that there is little performance difference 
across the range of SSD's, so it's really not clear which SSD model I 
should buy. Obviously it's not something I can afford to buy one of each 
and test them either.
* Relax the requirement for synchronous writes on *both* the
   primary and secondary DRBD servers, if feeling lucky.
I have the following entries for DRBD which were suggested by linbit 
(which previously lifted performance from abysmal to more than 
sufficient around 2+ years ago). I guess we are demanding more from the 
system now, and we have added the 530 model drives later...

        disk-barrier no;
        disk-flushes no;
        md-flushes no;

I've not configured anything special for LVM/MD/iSCSI/xen in relation to 
cache/buffer/etc, and windows has disabled the write back buffer option 
(within the VM).
The third option, which is to change the workload so that it
does not emit small synchronous writes to the storage layer,
seems not practical in the context.

Ideally the system would also be switched from RAID5 to RAID10
to avoid the large penalty on small writes at the RAID level
too.

That may be considered expensive, but as I wrote:

[ ... ] it requires a storage layer that has to cover all
possible IO workloads optimally, [ ... ]
That will be the final optimisation/fallback option if still needed. I'd 
prefer to avoid that as it limits the capacity of the system, and will 
obviously cost more.

Do you have any other suggestions or ideas that might assist?

Thanks,
Adam
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html