On Sat, 23 Sep 2023 17:18:00 +0200 Joel Parthemore <joel@xxxxxxxxxxxxxxx> wrote: > I didn't want to try that again until I had confirmation that the > out-of-sync wouldn't (or shouldn't) be an issue. (I had tried it once > before, but the system had somehow swapped /dev/md126 and /dev/md127 so > that /dev/md126 became the container and /dev/md127 the RAID-5 array, > which confused me. So I stopped experimenting further until I had a > chance to write to the list.) > > The array is assembled read only, and this time both /dev/md126 and > /dev/md127 are looking like I expect them to. I started dd to make a > backup image using dd if=/dev/md126 of=/dev/sdc bs=64K > conv=noerror,sync. (The EXT4 file store on the 2TB RAID-5 array is about > 900GB full.) At first, it was running most of the time and just > occasionally in uninterruptible sleep, but the periods of > uninterruptible sleep quickly started getting longer. Now it seems to be > spending most but not quite all of its time in uninterruptible sleep. Is > this some kind of race condition? Anyway, I'll leave it running > overnight to see if it completes. > > Accessing the RAID array definitely isn't locking things up this time. I > can go in and look at the partition table, for example, no problem. > Access is awfully slow, but I assume that's because of whatever dd is or > isn't doing. > > By the way, I'm using kernel 6.5.3, which isn't the latest (that would > be 6.5.5) but is close. Maybe it's an HDD issue, one of them did have some unreadable sectors in the past, although the firmware has not decided to do anything about that, such as reallocating them and recording that in SMART. Check if one of the drives is holding up things, with a command like iostat -x 2 /dev/sd? If you see 100% next to one of the drives, and much less for others, that one might be culprit. -- With respect, Roman