Re: possible HighPoint RocketRAID 2720SGL failure

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 22/09/17 11:08, Roger Heflin wrote:
If it is the marvell issue I had before then quit doing smartctl
commands (disable all smart queries of any sort) as that seemed to
massively increase the reliablity.  It did not completely fix the
issues, it just made them happen a lot less often.

I now tried this and it did not help. I stopped all smart access, but running
a md 'check' fails as before (all the disks disappear). Each test fails at a
different address. This time the machine was mostly idle when the check was
running.

I should note that this HighPoint card was running without any problem for
4 years.

Maybe a driver issue? I was running f19 until a few weeks ago, and the failures
all happened after I upgraded to f22 (and now on f26).

Eyal

good luck, I finally just gave up and quit using marvell controllers.

On Thu, Sep 21, 2017 at 7:20 AM, Roman Mamedov <rm@xxxxxxxxxxx> wrote:
On Thu, 21 Sep 2017 21:12:36 +1000
Eyal Lebedinsky <eyal@xxxxxxxxxxxxxx> wrote:

It looks like the controller failed as all 7 disks disappeared together and did not respond
to any i/o or even smart.

After power off/on things look OK. The raid6 did a very short recovery, then the ext4 fs did
a quick recovery. fsck found no problems.

I later started a raid 'check' but it failed in less that an hour (out of 10) in the same way.
A day later I tried again and it failed within 15 minutes.

So far it looks like nothing was lost but I am uncomfortable with this situation.
No surprise here...

The controller did not log any errors.

Does this look familiar to anyone?

The controller is based on the Marvell 9485 chip and Marvell SATA/RAID
controllers seem to have a bad reputation for reliability:

https://www.jethrocarr.com/2013/11/24/adventures-in-io-hell/
https://www.youtube.com/watch?v=010urq9wY3A

I have also faced some CRC errors or disk drop-outs/reconnects on 9123 cards,
and in one case all disks (or possibly the controller itself) disappear from
the system until reboot on a 88SX7042 based controller.

--
With respect,
Roman

--
Eyal Lebedinsky (eyal@xxxxxxxxxxxxxx)
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux