On 12/02/2015 10:28 AM, Robert Kierski wrote: > Thanks for the response. > > Nice try... But, the reason I’m using the 3.18.4 kernel is that it has the parallelization. I've got group_thread_cnt set to 32. I'm watching the CPU's with mpstat, and they're pretty much idle. I'm also watching the system traces with perf. It claims that only 11.9% of my time is spent doing the xor. Hmm. Ok. > I've got my CS set at 128k. I have noticed that if I set the CS to 32k, the TP is about 2x. I'm pretty sure the problem is that the 1M writes I'm doing are being broken into 4K pages, and then reassembled before going to disk. I think you're right. What is your stripe cache size? > Also, this is independent of the IO Scheduler. I've tried all 3 and got the same results. If your stripe cache is too small, sequential writes with large chunks can exhaust the cache before complete stripes are written, turning all of those partial stripe writes into read-modify-write cycles. -- To unsubscribe from this list: send the line "unsubscribe linux-raid" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html