Hi Guy.
I am still having problems. I'm sorry; I'm new to mdadm :(
I tried running the mdadm command you suggested:
mdadm /dev/md0 -C -c 64 -l raid5 -p left-symmetric -n 4 /dev/hdc1 missing /dev/hdg1 /dev/hdh1
However, it gave me the error: mdadm: -C would set mode to create, but it is already manage.
mdadm --examine /dev/md0 gives the message "mdadm: /dev/md0 is too small for md".
mdadm --detail --test /dev/md0 gives the message "mdadm: md device /dev/md0 does not appear to be active".
fdisk is unable to read /dev/md0, and fsck exits with the following error:
---------
fsck.ext3: Invalid argument while trying to open /dev/md0

The superblock could not be read or does not describe a correct ext2
filesystem. If the device is valid and it really contains an ext2
filesystem (and not swap or ufs or something else), then the superblock
is corrupt, and you might try running e2fsck with an alternate superblock:
    e2fsck -b 8193 <device>
---------
How do I get out of the manage mode, and force mdadm into the create mode?
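Reading the mdadm man page, it sounds like mdadm assumes manage mode whenever the array device is given before any mode option, so maybe I just need to put --create in front of /dev/md0? Something like this (an untested guess, with the same parameters as above but the long option names spelled out):

  # untested guess: same parameters as before, but the mode option comes first
  mdadm --create /dev/md0 --chunk=64 --level=raid5 --parity=left-symmetric \
        --raid-devices=4 /dev/hdc1 missing /dev/hdg1 /dev/hdh1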
Saurabh.
Guy wrote:
Ok, to preserve the data, when you create a new array you must use the same parameters as before. If you get the order of the disks, chunk size, or layout wrong, then creating the parity will overwrite previous data.
It would have been best to assemble the array with 1 disk missing. Then the re-sync would not occur, and if the data was bad you could try again after correcting your mistake. I think a create with 1 disk missing would also be safe, but that would change the superblock. I have never needed to create over a previous array where I wanted to retain the data. Even with a create, if you did it wrong, you could just stop the array and create it again with the correct options, as long as you keep a disk missing each time.
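For what it's worth, an assemble with one member left out would look roughly like this (a sketch; hdd1 is just the example member to leave out, and --run is, I believe, what is needed to start the array while it is degraded):

  # assemble degraded, leaving /dev/hdd1 out so no re-sync can start
  mdadm --assemble --run /dev/md0 /dev/hdc1 /dev/hdg1 /dev/hdh1

  # read-only check of what is actually running
  cat /proc/mdstat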
Since you did a create, I assume the re-sync started. As I said, this would not damage your data if the parameters were the same as the previous array. If anything was different, the re-sync trashed some of your data, enough that I don't think you can recover. Since fsck could not recognize the filesystem, I believe you are out of luck.
I don't understand why "mdadm --detail" gives errors if you did a create. Even an assemble should look better than what you show.
Now let's pretend your data is still there. :) If this is the case you must have done the create with the wrong parameters. If this is the correct listing of your array when it was working:

raiddev /dev/md0
        raid-level              5
        nr-raid-disks           4
        nr-spare-disks          0
        persistent-superblock   1
        parity-algorithm        left-symmetric
        chunk-size              64
        device                  /dev/hdc1
        raid-disk               0
        device                  /dev/hdd1
        raid-disk               1
        device                  /dev/hdg1
        raid-disk               2
        device                  /dev/hdh1
        raid-disk               3
then try this mdadm command:
mdadm /dev/md0 -C -c 64 -l raid5 -p left-symmetric -n 4 /dev/hdc1 /dev/hdd1 /dev/hdg1 missing

or

mdadm /dev/md0 -C -l raid5 -n 4 /dev/hdc1 /dev/hdd1 /dev/hdg1 missing
-c 64 is the default.
-p left-symmetric is the default.
We leave 1 disk missing so that a re-sync can't happen. If for some reason you think you know which disk is least likely to have the correct data, then make that disk the missing one. For example, maybe hdd1 had the most bad spots during the ddrescue copy; that disk should be the one left "missing".
mdadm /dev/md0 -C -c 64 -l raid5 -p left-symmetric -n 4 /dev/hdc1 missing /dev/hdg1 /dev/hdh1
You must maintain the order and position of the disks.
If this does not yield good data, stop the array and try again with a different missing disk. Since you had 2 disks go bad, I think it is a safe bet that the 2 disks that did not fail have good data. So no need to try them as missing.
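A sketch of that check-and-retry loop (I have put -C before the device name here; fsck -n and a read-only mount only read, so nothing gets overwritten while you look):

  # check the current guess without writing to the array
  fsck -n /dev/md0            # or: mount -o ro /dev/md0 /mnt

  # if the data looks wrong, stop the array and try a different disk as "missing"
  mdadm --stop /dev/md0
  mdadm -C /dev/md0 -c 64 -l raid5 -p left-symmetric -n 4 \
        /dev/hdc1 missing /dev/hdg1 /dev/hdh1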
I am making some of this up as I go along! But I have had to do something similar. This part is confusing and not really worth reading unless you like scary stories! I did not have a backup!

I had a disk fail. Just a read error, I think. The re-sync to the spare worked. Since this happens when you get a read error, I did a remove and add of the failed disk, so it became the new spare. I then failed the previous spare so the data would be put back on the original disk. (This is a normal event for me. I get a read error on 1 of my 14 disks 2 to 4 times per year. The disk is not really bad, it just needs to re-locate the bad sector.)

Then, while the spare was re-syncing, other disks on the same SCSI bus failed. The disk had more than a bad sector, and I guess the bad disk affected the bus. My array went down. I used dd to verify each disk, and the bad disk did affect the other disks. I unplugged the bad disk and the other disks were fine again. I replaced the disk and partitioned it. Then I did an assemble with the force option (since I had about 4 "failed" disks) and failed the old spare again. The re-sync failed again; the new disk seemed bad. I need SCSI adapters to use my SCA-80 disks, so I determined the SCSI-68/SCA-80 adapter was bad and replaced it.

I did another assemble with the force option. The system decided the new disk was more current than another disk, so that other disk was being re-built based on the new disk. I noticed this after the fsck did not work very well. I stopped the array and did an assemble with the just-trashed disk marked as missing; I did not list the new disk, but I did list the old spare, which was now good. Now the array worked just fine. I did an add of the trashed disk, and after the re-sync the array was in good shape. I added the new disk as a spare, then failed the old spare. The re-sync went fine. I removed and added the old spare, so it is the spare again. I did not need to keep the same disk as the spare, but by doing so I know the logical order of my 14 disks (plus spare).

If anyone understands the above mess, let me know!
I don't know of any hardware RAID that would allow you to correct a problem like that. The support group would just say: your data is gone, sorry, restore from your backup, have a nice day! I have had an HP FC-60 that had a similar problem, caused by a read error during a re-sync after another disk failed. ddrescue could have helped, but there was no such access to the disks.
Anyway, I hope this helps! Have a nice day!
Guy
-----Original Message-----
From: linux-raid-owner@xxxxxxxxxxxxxxx
[mailto:linux-raid-owner@xxxxxxxxxxxxxxx] On Behalf Of Saurabh Barve
Sent: Thursday, October 21, 2004 10:07 PM
To: Guy
Cc: linux-raid@xxxxxxxxxxxxxxx
Subject: Re: Persistent superblock error
OK. Maybe I wasn't clear enough. This is from my earlier post (http://marc.theaimsgroup.com/?l=linux-raid&m=109753064507208&w=2):
I am running a server that has four 250 GB hard drives in a RAID 5 configuration. Recently, two of the hard drives failed. I copied the data bitwise from one of the failed hard drives (/dev/hdc1) to another (/dev/hdd1) using dd_rescue (http://www.garloff.de/kurt/linux/ddrescue/). The failed hard drive had about 300 bad blocks (I checked using the badblocks utility).
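For reference, the copy and the bad-block check described above would have been roughly these commands (the exact options are my assumption, not a record of what was actually run):

  # bit-for-bit copy from the failing drive to the replacement, continuing past read errors
  dd_rescue /dev/hdc1 /dev/hdd1

  # read-only scan for unreadable blocks on the failing drive
  badblocks -v /dev/hdc1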
I tried to use the two new disks to recover the RAID data, working from an assumption in this post:
http://www.spinics.net/lists/raid/msg03502.html
However, that didn't seem to work. I'd like to recover the data if possible, but I'm pretty sure I had all of it backed up. So, I just tried to force the creation of the array using mkraid /dev/md0 --really-force. That seemed to give me problems when running fsck, so I rebooted the server and tried creating the array using mdadm.
That is when I was getting the errors. Hopefully, this made things a bit clearer.
If you think I'm spamming on the list, you can mail me at sa@xxxxxxxxxxxxxxxxxxxx
Thanks, Saurabh.
On Oct 21, 2004, at 6:57 PM, Guy wrote:
You said: "Any advice on why my RAID array will not run?" Yes, you have 2 failed disks!
    Number   Major   Minor   RaidDevice State
       0      22       1         0      active sync   /dev/hdc1
       1      22      65         1      faulty        /dev/hdd1
       2       0       0         2      faulty removed
       3      34      65         3      active sync   /dev/hdh1
       4      34       1         4      spare         /dev/hdg1
If you replaced 2 disks from a RAID5, then the data is lost! Are you sure this is what you wanted to do?
How did you assemble with 2 failed/replaced disks?
I am confused. Based on the output of "mdadm --detail", the array should be down.
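A quick read-only check of what the kernel actually thinks is running would be:

  cat /proc/mdstat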
Do you want to recover the data?

If yes: stop doing stuff if you hope to save the data! And don't lose the failed disks! They may not be bad. And label them so you know which is which, if it's not too late! If both disks are really bad (unreadable), then the data is gone. Post another message with the status of the failed disks.

If no: just re-create the array with the current disks and move on. Use mkfs to create a new filesystem.
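If you do give up on the data, the start-over path is roughly this (it destroys whatever is left on the array; mkfs.ext3 is my assumption about the filesystem type):

  # re-create the array from scratch with all four current disks
  mdadm -C /dev/md0 -c 64 -l raid5 -p left-symmetric -n 4 \
        /dev/hdc1 /dev/hdd1 /dev/hdg1 /dev/hdh1

  # new filesystem on top of it
  mkfs.ext3 /dev/md0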
Guy
-----Original Message-----
From: linux-raid-owner@xxxxxxxxxxxxxxx
[mailto:linux-raid-owner@xxxxxxxxxxxxxxx] On Behalf Of Saurabh Barve
Sent: Thursday, October 21, 2004 6:55 PM
To: linux-raid@xxxxxxxxxxxxxxx
Subject: Persistent superblock error
Hi,
I recently lost 2 hard drives in my 4-drive RAID-5 array. I replaced the
two faulty hard drives and tried rebuilding the array. However, I
continue to get errors.
I created the raid array using mdadm:

mdadm --assemble /dev/md0 --update=summaries /dev/hdc1 /dev/hdd1 /dev/hdg1 /dev/hdh1
I then tried to run fsck on /dev/md0 to make sure there were no file
system errors. However, fsck returned error messages saying that the
number of cylinders according to the superblock was 183833712, while the
physical size of the device was 183104736.
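One way to see where that size mismatch comes from might be to look at what each member's RAID superblock reports (read-only; device names as in the raidtab below):

  mdadm --examine /dev/hdc1
  mdadm --examine /dev/hdd1
  mdadm --examine /dev/hdg1
  mdadm --examine /dev/hdh1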
Running fdisk on /dev/md0 returned the following:
--------------
Device contains neither a valid DOS partition table, nor Sun, SGI or OSF
disklabel
Building a new DOS disklabel. Changes will remain in memory only,
until you decide to write them. After that, of course, the previous
content won't be recoverable.
The number of cylinders for this disk is set to 183104736.
There is nothing wrong with that, but this is larger than 1024,
and could in certain setups cause problems with:
1) software that runs at boot time (e.g., old versions of LILO)
2) booting and partitioning software from other OSs
   (e.g., DOS FDISK, OS/2 FDISK)
Warning: invalid flag 0x0000 of partition table 4 will be corrected by w(rite)
Command (m for help): p
Disk /dev/md0: 749.9 GB, 749996998656 bytes
2 heads, 4 sectors/track, 183104736 cylinders
Units = cylinders of 8 * 512 = 4096 bytes
---------------
I tried inspecting the raid array with mdadm - 'mdadm --detail --test /dev/md0'. It gave me the following results:
------------------
/dev/md0:
        Version : 00.90.00
  Creation Time : Thu Oct 21 16:27:20 2004
     Raid Level : raid5
     Array Size : 732418944 (698.49 GiB 749.100 GB)
    Device Size : 244139648 (232.83 GiB 249.100 GB)
   Raid Devices : 4
  Total Devices : 4
Preferred Minor : 0
    Persistence : Superblock is persistent

    Update Time : Thu Oct 21 16:38:59 2004
          State : dirty, degraded
 Active Devices : 2
Working Devices : 3
 Failed Devices : 1
  Spare Devices : 1

         Layout : left-symmetric
     Chunk Size : 64K

           UUID : c7a73d47:a072c630:7693d236:dff40ca6
         Events : 0.6

    Number   Major   Minor   RaidDevice State
       0      22       1         0      active sync   /dev/hdc1
       1      22      65         1      faulty        /dev/hdd1
       2       0       0         2      faulty removed
       3      34      65         3      active sync   /dev/hdh1
       4      34       1         4      spare         /dev/hdg1
--------------------
My /etc/raidtab file reads like this:
---------
raiddev /dev/md0
        raid-level              5
        nr-raid-disks           4
        nr-spare-disks          0
        persistent-superblock   1
        parity-algorithm        left-symmetric
        chunk-size              64
        device                  /dev/hdc1
        raid-disk               0
        device                  /dev/hdd1
        raid-disk               1
        device                  /dev/hdg1
        raid-disk               2
        device                  /dev/hdh1
        raid-disk               3
----------
Any advice on why my RAID array will not run?
Thanks, Saurabh.