Re: weird XFS problem

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]



On 2012-01-22, Boris Epstein <borepstein@xxxxxxxxx> wrote:
>
> Also, here's somethine else I have discovered. Apparently there is an
> potential intermittent RAID disk trouble. At least I found the following in
> the system log:
>
> Jan 22 09:17:53 nrims-bs kernel: 3w-9xxx: scsi6: AEN: ERROR (0x04:0x0026):
> Drive ECC error reported:port=4, unit=0.
> Jan 22 09:17:53 nrims-bs kernel: 3w-9xxx: scsi6: AEN: ERROR (0x04:0x002D):
> Source drive error occurred:port=4, unit=0.

Which 3ware controller is this?  I have had lots of problems with the
3ware 9550SX controller and WD-EA[RD]S drives in a similar
configuration.  (Yes, I know all about the EARS drives, but they work
mostly fine with the 3ware 9650 controller, so I suspect some weird
interaction between the cheap drives and the old not-so-great
controller.  I also suspect an intermittently failing port, which I'll
be testing more later this week.)

> Jan 22 09:55:23 nrims-bs kernel: 3w-9xxx: scsi6: AEN: WARNING
> (0x04:0x000F): SMART threshold exceeded:port=9.
> Jan 22 09:55:23 nrims-bs kernel: 3w-9xxx: scsi6: AEN: WARNING
> (0x04:0x000F): SMART threshold exceeded:port=9.
> Jan 22 09:56:17 nrims-bs kernel: 3w-9xxx: scsi6: AEN: INFO (0x04:0x000B):
> Rebuild started:unit=0.

What does your RAID look like?  Are you using the 3ware's RAID6 (in
which case it's not a 9550) or mdraid?  Are the 3ware errors in the logs
across a large number of ports or just a few?  Have you used the drive
tester for your drives to verify that they're still good?  On all my
other systems, when the controller has reported a failure, and I've run
it through the tester, it's reported a failure.  (Often when my 9550
reports a failure the drive passes all tests.)

If you happen to have real RAID drive models, you may also try
contacting LSI support.  They will steadfastly refuse to help if you
have desktop-edition drives, but can be at least somewhat helpful if you
have enterprise drives.

--keith


-- 
kkeller@xxxxxxxxxxxxxxxxxxxxxxxxxx


_______________________________________________
CentOS mailing list
CentOS@xxxxxxxxxx
http://lists.centos.org/mailman/listinfo/centos


[Index of Archives]     [CentOS]     [CentOS Announce]     [CentOS Development]     [CentOS ARM Devel]     [CentOS Docs]     [CentOS Virtualization]     [Carrier Grade Linux]     [Linux Media]     [Asterisk]     [DCCP]     [Netdev]     [Xorg]     [Linux USB]
  Powered by Linux