Re: RAID 6, 6 device array - all devices lost superblock

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Phil,

The correct findHash - python tool on github - pturmel/findHash?

- Peter

On Sun, Aug 28, 2022 at 7:24 PM Peter Sanders <plsander@xxxxxxxxx> wrote:
>
> Phil,
>
> fstab from the working config -
>
> # <file system> <mount point>   <type>  <options>       <dump>  <pass>
> # / was on /dev/sda1 during installation
> UUID=50976432-b750-4809-80ac-3bbdd2773163 /               ext4
> errors=remount-ro 0       1
> # /home was on /dev/sda6 during installation
> UUID=eb93a2c4-0190-41fa-a41d-7a5966c6bc47 /home           ext4
> defaults        0       2
> # /var was on /dev/sda5 during installation
> UUID=d1aa6d1f-3ee9-48a8-9350-b15149f738c4 /var            ext4
> defaults        0       2
> /dev/sr0        /media/cdrom0   udf,iso9660 user,noauto     0       0
> /dev/sr1        /media/cdrom1   udf,iso9660 user,noauto     0       0
> # raid array
> /dev/md0    /mnt/raid6    ext4    defaults    0    2
>
> No LVM, one large EXT4 partition
>
> I have several large files ( NEF and various mpg files) I can identify
> and have backup copies available.
>
> I have the overlays created. 300G for each of the six drives.
>
> - Peter
>
> On Sun, Aug 28, 2022 at 3:49 PM John Stoffel <john@xxxxxxxxxxx> wrote:
> >
> > >>>>> "Phil" == Phil Turmel <philip@xxxxxxxxxx> writes:
> >
> > Phil> Pssst! John,
> > Phil> All of my comments were attributed to Peter by your mail client. ):
> >
> > Yeah... sometimes my mail reader gets confused when it cites previous
> > emails.  I shoudl probably just drop to > only from now on.
> >
> >
> > Phil> On 8/28/22 14:45, John Stoffel wrote:
> > >>>>>>> "Peter" == Peter Sanders <plsander@xxxxxxxxx> writes:
> > >>
> > Peter> It was set up on the device level, not partitions.  (I remember
> > Peter> getting some advice on the web that device was better than
> > Peter> partition... Yay for internet advice)
> > >>
> > >> Yeah, this is NOT good advice.  Generally systems will not destroy
> > >> existing partition tables, but if they see an empty (to them)
> > >> disk... all bets are off.
> > >>
> > Peter> I'm surveying my other disks to see what I have available to do
> > Peter> the overlay attempt.
> > >>
> > >> They're small.  They are sparse files, so just follow the
> > >> instructions.
> > >>
> > Peter> What are the size of the overlay files going to end up being?
> > >>
> > >> Not too large, but it depends on how much data is written to the
> > >> overlayfs to get your data back.  If you follow the instructions on
> > >> this page:
> > >>
> > >> https://raid.wiki.kernel.org/index.php/Recovering_a_damaged_RAID
> > >>
> > >> It says to create a sparse file for each disk that is 1% of the size
> > >> of the disk.  This can add up... you might need to add a blank disk to
> > >> your system to hold these.
> > >>
> > >> In this case, if you think you know which order the disks were in, you
> > >> could try to create the RAID6 array (but only using the overlayfs
> > >> devices!!!!!)  I can't stress this enough.
> > >>
> > >>
> > Peter> I did run into UEFI vs AHCI issues early in the process.. they
> > Peter> are all set to non-UEFI.
> > >>
> > >> That's good.
> > >>
> > Peter> OS update was onto a new SSD...
> > >>
> > >> Ok.  Do you have the old OS disk around by any chance?  That might
> > >> give some pointers to how the disks are setup..  You could look in
> > >> /var/tmp/initrd/... for old mdadm.conf files, which might give more
> > >> details.
> > >>
> > Peter> On Sun, Aug 28, 2022, 12:47 Phil Turmel <philip@xxxxxxxxxx> wrote:
> > >>
> > Peter> Hi Peter, et al,
> > >>
> > Peter> On 8/28/22 05:54, Wols Lists wrote:
> > >>>> On 28/08/2022 10:14, Wols Lists wrote:
> > >>>>> Currently I have no /dev/md* devices.
> > >>>>> I have access to the old mdadm.conf file - have tried assembling with
> > >>>>> it, with the default mdadm.conf, and with no mdadm.conf file in /etc
> > >>>>> and /etc/mdadm.
> > >>>>>
> > >>>>> It looks like the drives weren't partitioned :-( I think you're into
> > >>>>> forensics.
> > >>
> > Peter> It is too soon to say this.  The supplied mdadm.conf file does not
> > Peter> contain specific partition information.  It is possible the partition
> > Peter> tables have just been wiped.
> > >>
> > >>>> Whoops - my system froze while I was originally writing my reply, and I
> > >>>> forgot to put this into my rewrite ...
> > >>>>
> > >>>> Look up overlays in the wiki. I've never done it myself, but a fair few
> > >>>> people have said the instructions worked a treat.
> > >>>>
> > >>>> You're basically making the drives read-only (all writes get dumped into
> > >>>> the overlay file), and then re-creating the array over the top, so you
> > >>>> can test whether you got it right. If you don't, you just ditch the
> > >>>> overlays and start again, if you did get it right you can recreate the
> > >>>> array for real.
> > >>>>
> > >>>> Cheers,
> > >>>> Wol
> > >>
> > Peter> On 8/28/22 11:10, John Stoffel wrote:
> > >>>>>>>>> "Peter" == Peter Sanders <plsander@xxxxxxxxx> writes:
> > >>>>
> > Peter> have a RAID 6 array, 6 devices.  Been running it for years without much issue.
> > Peter> Had hardware issues with my system - ended up replacing the
> > Peter> motherboard, video card, and power supply and re-installing the OS
> > Peter> (Debian 11).
> > >>>>
> > >>>> Can you give us details on the old vs new motherboard/cpu?  It might
> > >>>> be that you need to tweak the BIOS of the motherboard to expose the
> > >>>> old SATA formats as well.
> > >>>>
> > >>>> Did you install debian onto a fresh boot disk?  Is your BIOS setup to
> > >>>> only do the new form of booting from UEFI devices, so maybe check your
> > >>>> BIOS settings that the data drives are all in AHCI mode, or possibly
> > >>>> even in IDE mode.  It all depends on how old the original hardware
> > >>>> was.
> > >>>>
> > >>>> I just recenly upgraded from a 2010 MB/CPU combo and I had to tweak
> > >>>> the BIOS defaults to see my disks.  I guess I should do a clean
> > >>>> install from a blank disk, but I wanted to minimize downtime.
> > >>
> > Peter> It is important to end up in AHCI mode on all MOBO ports.  If not set
> > Peter> that way now, please change them.
> > >>
> > >>>> Wols has some great advice here, and I heartily recommend that you use
> > >>>> overlayfs when doing your testing.  Check the RAID WIKI for
> > >>>> suggestions.
> > >>
> > Peter> Concur.
> > >>
> > >>>> And don't panic!  Your data is probably there, but just missing the
> > >>>> super blocks or partition tables.
> > >>
> > Peter> Both, I suspect.
> > >>
> > Peter> On 8/27/22 22:00, Peter Sanders wrote:
> > >>>> lsdrv ------------------------
> > >>>> PCI [nvme] 01:00.0 Non-Volatile memory controller: Phison Electronics
> > >>>> Corporation E12 NVMe Controller (rev 01)
> > >>>> └nvme nvme0 PCIe SSD                                 {21112925606047}
> > >>>>   └nvme0n1 238.47g [259:0] Partitioned (dos)
> > >>>>    ├nvme0n1p1 485.00m [259:1] ext4 {f38776ac-1ce9-4fc8-ba50-94844b9f504e}
> > >>>>    │└Mounted as /dev/nvme0n1p1 @ /boot
> > >>>>    ├nvme0n1p2 1.00k [259:2] Partitioned (dos)
> > >>>>    ├nvme0n1p5 60.54g [259:3] ext4 {5ee1c3c0-3a05-466c-9f98-f5807c8d813b}
> > >>>>    │└Mounted as /dev/nvme0n1p5 @ /
> > >>>>    ├nvme0n1p6 93.13g [259:4] ext4 {9064169f-4fe3-4836-a906-28c1b445cdff}
> > >>>>    │└Mounted as /dev/nvme0n1p6 @ /var
> > >>>>    ├nvme0n1p7 37.00m [259:5] ext4 {25e161ad-94a0-4298-afaf-18e2433766ee}
> > >>>>    ├nvme0n1p8 82.89g [259:6] ext4 {ac874071-d759-4d33-b32f-83272f3eacd9}
> > >>>>    │└Mounted as /dev/nvme0n1p8 @ /home
> > >>>>    └nvme0n1p9 1.41g [259:7] swap {02cef84b-9a9d-4a0a-973c-fda1a78c533c}
> > >>>> PCI [pata_jmicron] 26:00.1 IDE interface: JMicron Technology Corp.
> > >>>> JMB368 IDE controller (rev 10)
> > >>>> └scsi 0:0:0:0 MAD DOG  LS-DVDRW TSH652M {MAD_DOG_LS-DVDRW_TSH652M}
> > >>>>   └sr0 1.00g [11:0] Empty/Unknown
> > >>>> PCI [ahci] 26:00.0 SATA controller: JMicron Technology Corp. JMB363
> > >>>> SATA/IDE Controller (rev 10)
> > >>>> └scsi 2:x:x:x [Empty]
> > >>>> PCI [ahci] 2b:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD]
> > >>>> FCH SATA Controller [AHCI mode] (rev 51)
> > >>>> ├scsi 6:0:0:0 ATA      TOSHIBA HDWD130  {477ALBNAS}
> > >>>> │└sda 2.73t [8:0] Partitioned (PMBR)
> > >>>> └scsi 7:0:0:0 ATA      TOSHIBA HDWD130  {Y7211KPAS}
> > >>>>   └sdc 2.73t [8:32] Partitioned (gpt)
> > >>>> PCI [ahci] 2c:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD]
> > >>>> FCH SATA Controller [AHCI mode] (rev 51)
> > >>>> ├scsi 8:0:0:0 ATA      WDC WD30EZRX-00D {WD-WCC1T0668790}
> > >>>> │└sdb 2.73t [8:16] Partitioned (gpt)
> > >>>> ├scsi 9:0:0:0 ATA      WDC WD30EZRX-00D {WD-WCC4N0091255}
> > >>>> │└sdd 2.73t [8:48] Partitioned (gpt)
> > >>>> ├scsi 12:0:0:0 ATA      WDC WD30EZRX-00M {WD-WCAWZ2669166}
> > >>>> │└sde 2.73t [8:64] Partitioned (gpt)
> > >>>> └scsi 13:0:0:0 ATA      TOSHIBA HDWD130  {477ABEJAS}
> > >>>>   └sdf 2.73t [8:80] Partitioned (gpt)
> > >>
> > Peter> Unfortunately, my lsdrv tool is not able to reconstruct missing parts.
> > Peter> It is most useful when used on a *good* system and *saved* for help
> > Peter> diagnosing *future* problems.
> > >>
> > Peter> Please share your /etc/fstab, and if you were using LVM on top of the
> > Peter> raid, share your lvm.conf and anything in /etc/lvm/backup.
> > >>
> > Peter> Please describe the layer(s) that were on top of the raid.
> > >>
> > Peter> We need to help you look for signatures, and it helps to be selective in
> > Peter> what signatures to look for.
> > >>
> > Peter> After that, we will want to figure out your raid's chunk size and data
> > Peter> offsets.  If you know of a particular large file (8MB or larger) that is
> > Peter> sure to be in the raid and you happen to have a copy tucked away, then
> > Peter> my findHash[1] tool might be able to definitively determine those
> > Peter> values.  (Time consuming, though.)
> > >>
> > Peter> Meanwhile, don't do *anything* that would write to those drives.
> > >>
> > Peter> Phil
> > >>
> > Peter> [1] https://github.com/pturmel/findHash
> > >>
> >




[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux