Re: hung grow

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sun, Oct 08 2017, Curt wrote:

>> You get this because sdf1 says:
>>
>>    Raid Devices : 7
>>   Total Devices : 7
>>
>> while sda1 (for example) says:
>>
>>    Raid Devices : 8
>>   Total Devices : 6
>> Preferred Minor : 127
>>
>>   Reshape pos'n : 3799296 (3.62 GiB 3.89 GB)
>>   Delta Devices : 1 (7->8)
>>
>> mdadm cannot reconcile this difference.
>>
>> It appears that sdf1 was never involved in any reshape.
>> So you need to revert the reshape before trying to include sdf1 into the
>> array.  Clearly you need at least 6 devices that were involved in
>> the reshape to do this.
>> I haven't been following closely ... do you have 6 such devices?
>>
>> NeilBrown
>>
>
> Correct.  Which I thought was sorta the point, but could have
> completely misunderstood it, sdf was restored from a "faulty" drive
> that was out before the reshape.  Whether I have 6 devices depends on
> how picky things are.  I've got 5 that should be in sync, the 6th not,
> but it was involved in the reshape.
>
> Short version, is I shot myself in the foot on this one. Reshape never
> got anywhere, but need to try to revert and save what data I can.

Hmmm... (goes back and looks at more of thread..)

Ahhh .. you had an array which was rebuilding two spares, and you
told it to start reshaping... Interesting.
Theoretically that should work.  Was it deliberate? (I cannot seem to
find the start of the thread).

Looking at the list of "current --examine output", it appears that
 /dev/sdg1
 /dev/sdd1
 /dev/sdc1
 /dev/sda1
 /dev/sde1

 /dev/sdb

are all valid devices with the same event counts.  They are the six that
you need.
To confirm that names haven't changed, you can:
  mdadm --examine  /dev/sdg1 /dev/sdd1 /dev/sdc1 /dev/sda1 /dev/sde1 \
  /dev/sdb | grep Events

and confirm all the numbers are the same.
Then do the same and grep for "this" and confirm all the Raid Disk
numbers are different.

Interesting that /dev/sdb is a whole device and the rest are partitions.
I assume you know about that and why it is.

What happens if you run the --assemble --update=revert-reshape command on
these 6 devices (without --force)??

NeilBrown

Attachment: signature.asc
Description: PGP signature


[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux