badblocks seem to be causing problems with raid6 - badblocks list replicating all all drives

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hello,

I posted a while back about getting buffer i/o errors in my dmesg logs to my raid array, something along the lines of this:

[158219.456484] EXT4-fs warning (device md4): ext4_end_bio:329: I/O error -5 writing to inode 125274714 (offset 176160768 size 8388608 starting block 4955235712)
[158219.456487] Buffer I/O error on device md4, logical block 4955235584
[158219.456490] Buffer I/O error on device md4, logical block 4955235585
[158219.456491] Buffer I/O error on device md4, logical block 4955235586
[158219.456491] Buffer I/O error on device md4, logical block 4955235587
[158219.456492] Buffer I/O error on device md4, logical block 4955235588
[158219.456493] Buffer I/O error on device md4, logical block 4955235589
[158219.456494] Buffer I/O error on device md4, logical block 4955235590
[158219.456495] Buffer I/O error on device md4, logical block 4955235591
[158219.456496] Buffer I/O error on device md4, logical block 4955235592
[158219.456497] Buffer I/O error on device md4, logical block 4955235593
[158219.456580] EXT4-fs warning (device md4): ext4_end_bio:329: I/O error -5 writing to inode 125274714 (offset 176160768 size 8388608 starting block 4955235456) [158219.456663] EXT4-fs warning (device md4): ext4_end_bio:329: I/O error -5 writing to inode 125274714 (offset 176160768 size 8388608 starting block 4955235200) [158219.456747] EXT4-fs warning (device md4): ext4_end_bio:329: I/O error -5 writing to inode 125274714 (offset 176160768 size 8388608 starting block 4955234944) [158219.456829] EXT4-fs warning (device md4): ext4_end_bio:329: I/O error -5 writing to inode 125274714 (offset 176160768 size 8388608 starting block 4955234688) [158219.456912] EXT4-fs warning (device md4): ext4_end_bio:329: I/O error -5 writing to inode 125274714 (offset 176160768 size 8388608 starting block 4955234432) [158469.158278] EXT4-fs warning (device md4): ext4_end_bio:329: I/O error -5 writing to inode 123995503 (offset 0 size 8388608 starting block 4970080384)
[158469.158281] buffer_io_error: 1526 callbacks suppressed

I am now using the latest mainline kernel, 4.3.0 and I believe something is going wrong with the badblocks implementation.

I originally had 3 drives, all with the same badblocks list. This array has been running a while so I have no idea how these 3 discs all ended up with the same list of badblocks.

Now, if I remove any drive, which has no badblock entries, and re-add it. Once the sync is complete I end up with another drive with the same badblocks list.

At the moment 5 of the drives in the array all have the following entries (exactly the same):

Bad-blocks on /dev/sdi1:
          1938038928 for 512 sectors
          1938039440 for 512 sectors
          1938977144 for 512 sectors
          1938977656 for 512 sectors
          3303750816 for 512 sectors
          3303751328 for 512 sectors
          3313648904 for 512 sectors
          3313649416 for 512 sectors
          3313651976 for 512 sectors
          3313652488 for 512 sectors
          3418023432 for 512 sectors
          3418023944 for 512 sectors
          3418024456 for 512 sectors
          3418024968 for 512 sectors
          3418037768 for 512 sectors
          3418038280 for 512 sectors
          3418038792 for 512 sectors
          3418039304 for 512 sectors
          3418112520 for 512 sectors
          3418113032 for 512 sectors
          3418113544 for 512 sectors
          3418114056 for 512 sectors
          3418114568 for 512 sectors
          3418115080 for 512 sectors
          3418124808 for 512 sectors
          3418125320 for 512 sectors
          3418165768 for 512 sectors
          3418166280 for 512 sectors
          3418187272 for 512 sectors
          3418187784 for 512 sectors
          3418213224 for 512 sectors
          3418213736 for 512 sectors
          3418214248 for 512 sectors
          3418214760 for 512 sectors
          3418215272 for 512 sectors
          3418215784 for 512 sectors
          3420607528 for 512 sectors
          3420608040 for 512 sectors
          3420626984 for 512 sectors
          3420627496 for 512 sectors
          3448897824 for 512 sectors
          3448898336 for 512 sectors
          3458897888 for 512 sectors
          3458898400 for 512 sectors
          3519403992 for 512 sectors
          3519404504 for 512 sectors
          3617207456 for 512 sectors
          3617207968 for 512 sectors


How can I clear the badblocks list on all the drives? Something seems very wrong and I believe I only actually have 1 faulty disc (I have run smartctl long tests on all drives, only 1 failed).

If I can't clear them, how can I get ext4 to recognise the badblocks within the array so that it no longer attempts to write to those blocks?

Do the blocks in the list above map to blocks on a the physical harddrive, or to blocks on the md device - IE: If that block list was passed to ext4 filesystem as bad sectors, would that be the correct location on the array or are those the badblocks on one of the harddrives in the array.

Thanks
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux