Hi, all. I’ve had a 5-drive RAID5 mdraid array that’s been running fine for several years, but recently had a drive begin to fail with SMART errors. I replaced the drive, then decided to add another drive and reshape the array to RAID6 for added redundancy in case any of the other drives failed in future. I put the backup file on an external USB drive without realising that mdraid would be writing to the backup file throughout the reshape operation, and the USB drive disconnected itself midway through the reshape. The reshape continued on without any apparent critical errors, though, so I left it rather than try to stop and resume it. The reshape appeared to complete this morning without reporting any issues, but when I rebooted the machine it started in emergency mode because it failed to mount the filesystems on the RAID array. In one case mount can’t find the ext4 superblock at all, in two other cases the ext4 filesystem can be found but appears corrupted. I’m just wondering if anyone might have any suggestions about how best to attempt recovery. There’s nothing on the array I ultimately can’t live without, but I’d like to retrieve any data I can. /var/log/syslog output when starting the reshape operation: Sep 28 11:19:22 bibliotheca kernel: [ 706.039584] md/raid:md0: raid level 6 active with 5 out of 6 devices, algorithm 18 Sep 28 11:19:23 bibliotheca mdadm[889]: RebuildStarted event detected on md device /dev/md0 Sep 28 11:19:23 bibliotheca kernel: [ 706.952130] md: reshape of RAID array md0 Sep 28 11:19:23 bibliotheca kernel: [ 706.952136] md: minimum _guaranteed_ speed: 1000 KB/sec/disk. Sep 28 11:19:23 bibliotheca kernel: [ 706.952139] md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for reshape. Sep 28 11:19:23 bibliotheca kernel: [ 706.952147] md: using 128k window, over a total of 3906885120k. Sep 28 11:19:25 bibliotheca mdadm[889]: RebuildFinished event detected on md device /dev/md0 Sep 28 11:36:05 bibliotheca mdadm[889]: RebuildStarted event detected on md device /dev/md0 /var/log/syslog output when the USB drive disconnected: Sep 29 16:54:00 bibliotheca kernel: [107184.545094] EXT4-fs warning (device sdh1): ext4_end_bio:329: I/O error -5 writing to inode 12 (offset 0 size 0 starting block 35329) Sep 29 16:54:00 bibliotheca kernel: [107184.545102] Buffer I/O error on device sdh1, logical block 34817 Sep 29 16:54:00 bibliotheca kernel: [107184.545149] Buffer I/O error on device sdh1, logical block 34818 Sep 29 16:54:00 bibliotheca kernel: [107184.545183] Buffer I/O error on device sdh1, logical block 34819 Sep 29 16:54:00 bibliotheca kernel: [107184.545217] Buffer I/O error on device sdh1, logical block 34820 … Sep 29 16:54:00 bibliotheca kernel: [107184.548901] Aborting journal on device sdh1-8. Sep 29 16:54:00 bibliotheca kernel: [107184.549451] JBD2: Error -5 detected when updating journal superblock for sdh1-8. Sep 29 16:54:00 bibliotheca kernel: [107184.550633] EXT4-fs error (device sdh1): ext4_journal_check_start:56: Detected aborted journal Sep 29 16:54:00 bibliotheca kernel: [107184.551804] EXT4-fs (sdh1): Remounting filesystem read-only Sep 29 16:54:00 bibliotheca kernel: [107184.552420] EXT4-fs (sdh1): previous I/O error to superblock detected Sep 29 16:54:00 bibliotheca kernel: [107184.553155] EXT4-fs (sdh1): ext4_writepages: jbd2_start: 9223372036854775807 pages, ino 12; err -30 /var/log/syslog output when the reshape finished: Oct 3 04:27:10 bibliotheca kernel: [404373.812923] md: md0: reshape done. Oct 3 04:27:10 bibliotheca mdadm[889]: RebuildFinished event detected on md device /dev/md0 Oct 3 04:27:10 bibliotheca kernel: [404373.836909] md: recovery of RAID array md0 Oct 3 04:27:10 bibliotheca kernel: [404373.836915] md: minimum _guaranteed_ speed: 1000 KB/sec/disk. Oct 3 04:27:10 bibliotheca kernel: [404373.836918] md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for recovery. Oct 3 04:27:10 bibliotheca kernel: [404373.836926] md: using 128k window, over a total of 3906885120k. Oct 3 04:27:10 bibliotheca kernel: [404373.836930] md: resuming recovery of md0 from checkpoint. Oct 3 04:27:10 bibliotheca kernel: [404373.836940] md: md0: recovery done. Oct 3 04:27:10 bibliotheca kernel: [404373.837219] EXT4-fs error (device sdh1): ext4_wait_block_bitmap:503: comm mdadm: Cannot read block bitmap - block_group = 1, block_bitmap = 1026 Oct 3 04:27:10 bibliotheca kernel: [404373.839207] EXT4-fs error (device sdh1): ext4_discard_preallocations:4021: comm mdadm: Error loading buddy information for 1 Oct 3 04:27:11 bibliotheca mdadm[889]: RebuildStarted event detected on md device /dev/md0 Oct 3 04:27:11 bibliotheca mdadm[889]: RebuildFinished event detected on md device /dev/md0 Oct 3 04:27:11 bibliotheca mdadm[889]: SpareActive event detected on md device /dev/md0, component device /dev/sdg mdadm and lsdrv output: $ sudo mdadm -D /dev/md0 /dev/md0: Version : 1.2 Creation Time : Wed Jun 26 21:00:29 2013 Raid Level : raid6 Array Size : 15627540480 (14903.58 GiB 16002.60 GB) Used Dev Size : 3906885120 (3725.90 GiB 4000.65 GB) Raid Devices : 6 Total Devices : 6 Persistence : Superblock is persistent Update Time : Mon Oct 3 10:57:44 2016 State : clean Active Devices : 6 Working Devices : 6 Failed Devices : 0 Spare Devices : 0 Layout : left-symmetric-6 Chunk Size : 512K Name : bibliotheca:0 (local to host bibliotheca) UUID : d7c7fad6:fa50ed87:1d51f480:3d6405a5 Events : 2565608 Number Major Minor RaidDevice State 6 8 65 0 active sync /dev/sde1 1 8 49 1 active sync /dev/sdd1 2 8 1 2 active sync /dev/sda1 3 8 33 3 active sync /dev/sdc1 5 8 17 4 active sync /dev/sdb1 7 8 96 5 active sync /dev/sdg $ sudo mdadm -E /dev/sda1 /dev/sda1: Magic : a92b4efc Version : 1.2 Feature Map : 0x0 Array UUID : d7c7fad6:fa50ed87:1d51f480:3d6405a5 Name : bibliotheca:0 (local to host bibliotheca) Creation Time : Wed Jun 26 21:00:29 2013 Raid Level : raid6 Raid Devices : 6 Avail Dev Size : 7813771264 (3725.90 GiB 4000.65 GB) Array Size : 15627540480 (14903.58 GiB 16002.60 GB) Used Dev Size : 7813770240 (3725.90 GiB 4000.65 GB) Data Offset : 262144 sectors Super Offset : 8 sectors Unused Space : before=262064 sectors, after=1024 sectors State : clean Device UUID : bf1149d3:6289a253:e441df54:9d8a41a8 Update Time : Mon Oct 3 10:57:44 2016 Checksum : f578645a - correct Events : 2565608 Layout : left-symmetric-6 Chunk Size : 512K Device Role : Active device 2 Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing) $ sudo mdadm -E /dev/sdb1 /dev/sdb1: Magic : a92b4efc Version : 1.2 Feature Map : 0x0 Array UUID : d7c7fad6:fa50ed87:1d51f480:3d6405a5 Name : bibliotheca:0 (local to host bibliotheca) Creation Time : Wed Jun 26 21:00:29 2013 Raid Level : raid6 Raid Devices : 6 Avail Dev Size : 7813771264 (3725.90 GiB 4000.65 GB) Array Size : 15627540480 (14903.58 GiB 16002.60 GB) Used Dev Size : 7813770240 (3725.90 GiB 4000.65 GB) Data Offset : 262144 sectors Super Offset : 8 sectors Unused Space : before=262064 sectors, after=1024 sectors State : clean Device UUID : a3233875:73036f81:b64b4098:e6244f43 Update Time : Mon Oct 3 10:57:44 2016 Checksum : a3a2946d - correct Events : 2565608 Layout : left-symmetric-6 Chunk Size : 512K Device Role : Active device 4 Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing) $ sudo mdadm -E /dev/sdc1 /dev/sdc1: Magic : a92b4efc Version : 1.2 Feature Map : 0x0 Array UUID : d7c7fad6:fa50ed87:1d51f480:3d6405a5 Name : bibliotheca:0 (local to host bibliotheca) Creation Time : Wed Jun 26 21:00:29 2013 Raid Level : raid6 Raid Devices : 6 Avail Dev Size : 7813771264 (3725.90 GiB 4000.65 GB) Array Size : 15627540480 (14903.58 GiB 16002.60 GB) Used Dev Size : 7813770240 (3725.90 GiB 4000.65 GB) Data Offset : 262144 sectors Super Offset : 8 sectors Unused Space : before=262064 sectors, after=1024 sectors State : clean Device UUID : 9d0277b9:c20b549a:40b6788c:5dbb7cfa Update Time : Mon Oct 3 10:57:44 2016 Checksum : ac2c7cb6 - correct Events : 2565608 Layout : left-symmetric-6 Chunk Size : 512K Device Role : Active device 3 Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing) $ sudo mdadm -E /dev/sdd1 /dev/sdd1: Magic : a92b4efc Version : 1.2 Feature Map : 0x0 Array UUID : d7c7fad6:fa50ed87:1d51f480:3d6405a5 Name : bibliotheca:0 (local to host bibliotheca) Creation Time : Wed Jun 26 21:00:29 2013 Raid Level : raid6 Raid Devices : 6 Avail Dev Size : 7813771264 (3725.90 GiB 4000.65 GB) Array Size : 15627540480 (14903.58 GiB 16002.60 GB) Used Dev Size : 7813770240 (3725.90 GiB 4000.65 GB) Data Offset : 262144 sectors Super Offset : 8 sectors Unused Space : before=262064 sectors, after=1024 sectors State : clean Device UUID : b7da88c1:d3d78671:76a4b284:49530d65 Update Time : Mon Oct 3 10:57:44 2016 Checksum : ee3ba700 - correct Events : 2565608 Layout : left-symmetric-6 Chunk Size : 512K Device Role : Active device 1 Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing) $ sudo mdadm -E /dev/sde1 /dev/sde1: Magic : a92b4efc Version : 1.2 Feature Map : 0x0 Array UUID : d7c7fad6:fa50ed87:1d51f480:3d6405a5 Name : bibliotheca:0 (local to host bibliotheca) Creation Time : Wed Jun 26 21:00:29 2013 Raid Level : raid6 Raid Devices : 6 Avail Dev Size : 7813771264 (3725.90 GiB 4000.65 GB) Array Size : 15627540480 (14903.58 GiB 16002.60 GB) Used Dev Size : 7813770240 (3725.90 GiB 4000.65 GB) Data Offset : 262144 sectors Super Offset : 8 sectors Unused Space : before=262056 sectors, after=1024 sectors State : clean Device UUID : 8d552590:9541d3e9:bf926301:be485b84 Update Time : Mon Oct 3 10:57:44 2016 Bad Block Log : 512 entries available at offset 72 sectors Checksum : d12b6fa3 - correct Events : 2565608 Layout : left-symmetric-6 Chunk Size : 512K Device Role : Active device 0 Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing) $ sudo mdadm -E /dev/sdg /dev/sdg: Magic : a92b4efc Version : 1.2 Feature Map : 0x0 Array UUID : d7c7fad6:fa50ed87:1d51f480:3d6405a5 Name : bibliotheca:0 (local to host bibliotheca) Creation Time : Wed Jun 26 21:00:29 2013 Raid Level : raid6 Raid Devices : 6 Avail Dev Size : 7813775024 (3725.90 GiB 4000.65 GB) Array Size : 15627540480 (14903.58 GiB 16002.60 GB) Used Dev Size : 7813770240 (3725.90 GiB 4000.65 GB) Data Offset : 262144 sectors Super Offset : 8 sectors Unused Space : before=262056 sectors, after=4784 sectors State : clean Device UUID : 1163b8d4:01cecee1:070f39f0:9562f32b Update Time : Mon Oct 3 10:57:44 2016 Bad Block Log : 512 entries available at offset 72 sectors Checksum : a427ae64 - correct Events : 2565608 Layout : left-symmetric-6 Chunk Size : 512K Device Role : Active device 5 Array State : AAAAAA ('A' == active, '.' == missing, 'R' == replacing) $ python lsdrv PCI [ata_piix] 00:1f.2 IDE interface: Intel Corporation 82801IB (ICH9) 2 port SATA Controller [IDE mode] (rev 02) ├scsi 0:0:0:0 ATA ST4000DM000-1F21 │└sda 3.64t [8:0] Empty/Unknown │ └sda1 3.64t [8:1] Empty/Unknown │ └md0 14.55t [9:0] MD v1.2 raid6 (6) clean, 512k Chunk {None} │ │ Empty/Unknown │ ├dm-2 10.00t [252:2] Empty/Unknown │ ├dm-3 4.00t [252:3] Empty/Unknown │ │└dm-5 4.00t [252:5] Empty/Unknown │ └dm-4 567.58g [252:4] Empty/Unknown └scsi 1:0:0:0 ATA ST4000DM000-1F21 └sdb 3.64t [8:16] Empty/Unknown └sdb1 3.64t [8:17] Empty/Unknown └md0 14.55t [9:0] MD v1.2 raid6 (6) clean, 512k Chunk {None} Empty/Unknown PCI [ata_piix] 00:1f.5 IDE interface: Intel Corporation 82801I (ICH9 Family) 2 port SATA Controller [IDE mode] (rev 02) ├scsi 2:0:0:0 ATA ST4000DM000-1F21 │└sdc 3.64t [8:32] Empty/Unknown │ └sdc1 3.64t [8:33] Empty/Unknown │ └md0 14.55t [9:0] MD v1.2 raid6 (6) clean, 512k Chunk {None} │ Empty/Unknown └scsi 3:0:0:0 ATA ST4000DM000-1F21 └sdd 3.64t [8:48] Empty/Unknown └sdd1 3.64t [8:49] Empty/Unknown └md0 14.55t [9:0] MD v1.2 raid6 (6) clean, 512k Chunk {None} Empty/Unknown USB [usb-storage] Bus 001 Device 002: ID 0781:5580 SanDisk Corp. SDCZ80 Flash Drive {AA010803151943020344} └scsi 4:0:0:0 SanDisk Extreme └sdf 29.22g [8:80] Empty/Unknown ├sdf1 243.00m [8:81] Empty/Unknown │└Mounted as /dev/sdf1 @ /boot ├sdf2 1.00k [8:82] Empty/Unknown └sdf5 14.60g [8:85] Empty/Unknown ├dm-0 12.72g [252:0] Empty/Unknown │└Mounted as /dev/mapper/bibliotheca-root @ / └dm-1 1.88g [252:1] Empty/Unknown PCI [ahci] 03:00.0 SATA controller: ASMedia Technology Inc. ASM1062 Serial ATA Controller (rev 02) ├scsi 5:0:0:0 ATA ST4000DM000-2AE1 │└sde 3.64t [8:64] Empty/Unknown │ └sde1 3.64t [8:65] Empty/Unknown │ └md0 14.55t [9:0] MD v1.2 raid6 (6) clean, 512k Chunk {None} │ Empty/Unknown └scsi 6:0:0:0 ATA ST4000DM000-2AE1 └sdg 3.64t [8:96] Empty/Unknown └md0 14.55t [9:0] MD v1.2 raid6 (6) clean, 512k Chunk {None} Empty/Unknown PCI [pata_marvell] 04:00.0 IDE interface: Marvell Technology Group Ltd. 88SE6121 SATA II / PATA Controller (rev b2) ├scsi 7:0:0:0 LITE-ON DVDRW SOHW-832S {2004062400030158} │└sr0 1.00g [11:0] Empty/Unknown └scsi 8:x:x:x [Empty] Other Block Devices ├loop0 0.00k [7:0] Empty/Unknown ├loop1 0.00k [7:1] Empty/Unknown ├loop2 0.00k [7:2] Empty/Unknown ├loop3 0.00k [7:3] Empty/Unknown ├loop4 0.00k [7:4] Empty/Unknown ├loop5 0.00k [7:5] Empty/Unknown ├loop6 0.00k [7:6] Empty/Unknown ├loop7 0.00k [7:7] Empty/Unknown ├ram0 64.00m [1:0] Empty/Unknown ├ram1 64.00m [1:1] Empty/Unknown ├ram2 64.00m [1:2] Empty/Unknown ├ram3 64.00m [1:3] Empty/Unknown ├ram4 64.00m [1:4] Empty/Unknown ├ram5 64.00m [1:5] Empty/Unknown ├ram6 64.00m [1:6] Empty/Unknown ├ram7 64.00m [1:7] Empty/Unknown ├ram8 64.00m [1:8] Empty/Unknown ├ram9 64.00m [1:9] Empty/Unknown ├ram10 64.00m [1:10] Empty/Unknown ├ram11 64.00m [1:11] Empty/Unknown ├ram12 64.00m [1:12] Empty/Unknown ├ram13 64.00m [1:13] Empty/Unknown ├ram14 64.00m [1:14] Empty/Unknown └ram15 64.00m [1:15] Empty/Unknown-- To unsubscribe from this list: send the line "unsubscribe linux-raid" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html