On Mon, Sep 1, 2014 at 9:36 PM, Andrew Robertson <andyrobertson101@xxxxxxxxx> wrote: > Hi, > > I have an Adaptec 71605H HBA that's randomly failing to detect any > drives at boot. I have two systems with this HBA, and both are > showing the exact same behavior. I can reproduce this randomly about > 3 out of 4 times, where most of the time it comes up where "lsscsi" > shows no drives attached (other than my boot disk, not attached to > this HBA) - but then occasionally it works fine (~1 out of 4 reboots) > and detects the drives. > > On a "good" boot, the kernel messages include: > [ 27.575091] pm80xx pm8001_exec_internal_task_abort 834:TMF task timeout. > [ 27.754640] sas: --- Exit sas_scsi_recover_host: busy: 0 failed: 0 tries: 1 > > On a "bad" boot, it doesn't include these messages; instead it hangs > for ~60 seconds after "Enter sas_scsi_recover_host busy" and then just > continues to boot (and that seems to never recover). It also never > cleanly shuts down in this state (it hangs forever on "modprobe -q -r > ipmi_devintf" and I have to power-cycle the machine -- I suspect this > is more of a symptom that the kernel is busy doing other things - > possibly still trying to init scsi - isntead an ipmi-specific issue). > > I'm happy to test patches/etc on this system if necessary -- and/or if > someone can help point me in the right direction, I'd appreciate it. Hi Andy, Can you share the following details? 1. Fw version in the card you have. Use below command to get the fw_version sys file. 2. Expander and drive details and how you connected it to HBA? 3. Can you enable full logs and hot plug the expander/devices and share the log if the discovery fails. Find /sys -iname "fw_version" Find /sys -iname logging_level Echo oxfff > "logging_level sys file" Also can you test with latest Linux stable release? Thanks, Suresh > > Thanks, > Andy ��.n��������+%������w��{.n�����{������ܨ}���Ơz�j:+v�����w����ޙ��&�)ߡ�a����z�ޗ���ݢj��w�f