On Tuesday November 20, bs@xxxxxxxxx wrote: > > My personal (wild) guess for this problem is, that there is somewhere a global > lock, preventing all other CPUs to do something. At 100%s (at 80 MB/s) > there's probably not left any time frame to wake up the other CPUs or its > sufficiently small to only allow high priority kernel threads to do > something. > When I limit the sync to 40MB/s each resync-CPU has to wait sufficiently long > to allow the other CPUs to wake up. > > md doesn't hold any locks that would interfere with other parts of the kernel from working. I cannot imagine what would be causing your problems. The resync thread makes a point of calling cond_resched() periodically so that it will let other processes run even if it constantly has work to do. If you have nothing that could write to the RAID6 arrays, then I cannot see how the resync could affect the rest of the system except to reduce the amount of available CPU time. And as CPU is normally much faster than drives, you wouldn't expect that effect to be very great. Very strange. Can you do 'alt-sysrq-T' when it is frozen and get the process traces from the kernel logs? Can you send me "cat /proc/mdstat" after the resyn has started, but before the system has locked up. I'm sorry that I cannot suggest anything more useful. NeilBrown - To unsubscribe from this list: send the line "unsubscribe linux-raid" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html