RE: Bug 14579 - Devices disappear... and Bug 14577 - Data corruption with Adaptec

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hello Lukas,


> -----Original Message-----
> From: Lukas Kolbe [mailto:lkolbe@xxxxxxxxxxxxxxxxxxxxxxxx]
> Sent: Tuesday, November 17, 2009 7:53 PM
> To: Desai, Kashyap
> Cc: linux-scsi@xxxxxxxxxxxxxxx
> Subject: Re: Bug 14579 - Devices disappear... and Bug 14577 - Data
> corruption with Adaptec
> 
> Desai, Kashyap wrote:
> 
> >Subject line is related to *Adaptec* and there are some places LSI
> >related issue is pointed out. Little confusing to me. Is it possible to
> >rewrite what is an issue related to LSI card?
> 
> Sorry for that one. This system has an Adaptec Controller for its
> Storage array and an LSI controller for the tape library. Bug 14577 is
> about a possible data corruption on 2.6.32-rc6 that seems to be either a
> hardware error (currently trying to find that out) or a regression in
> 2.6.32-rc6, as 2.6.30 is very happy with its storage.
OK. In data corruption condition only LSI driver and controller are involved? I mean can I nullify Adaptec controller's roll in your test?
> 
> Finally, the real problem here is Bug 14579 that is about the systems
> problems when using the tape library.
> 
> >From dmesg log I can figure out 3.04.07 is mpt fusion driver version.
> >Please update LSI driver using latest upstream driver version 3.04.13.
> And see what a result is.
> 
> Thanks for the pointer. Linus' current tree contains 3.04.12 - where can
> I find 3.04.13?

It is there in 2.6.32-rc5. Not sure in which exact rc version it is included, but I have 2.6.32-rc5 tree in my setup and for that kernel mptfusion version is 3.104.13
> 
> >- Kashyap
> 
> Kind regards,
> Lukas Kolbe
> 
> 
> >-----Original Message-----
> >From: linux-scsi-owner@xxxxxxxxxxxxxxx [mailto:linux-scsi-
> owner@xxxxxxxxxxxxxxx] On Behalf Of Sascha Frey
> >Sent: Friday, November 13, 2009 4:28 AM
> >To: linux-scsi@xxxxxxxxxxxxxxx
> >Cc: Lukas Kolbe
> >Subject: Re: Bug 14579 - Devices disappear... and Bug 14577 - Data
> corruption with Adaptec
> >
> >Hi,
> >
> >Lukas Kolbe wrote:
> >>we'd really appreciate any hints and help we can get for the following
> >>bugs:
> >>http://bugzilla.kernel.org/show_bug.cgi?id=14579
> >
> >We've done some further testing:
> >it's very hard to trigger this bug. Sometimes the machine freezes after
> >a few minutes into tape access and sometimes it works days - or even
> >weeks - without any problem.
> >
> >The bug only appears during tape I/O (regardless of which tape program is
> >used: btape, dd or tar).
> >In most cases the tape write ends with an input/output error. After this
> >error occurred, any access to the tape library robot (connected through
> >the SAS interface of the first drive) fails:
> >
> ># mtx unload 1 1
> >Unloading drive 1 into Storage Element 1...mtx: Request Sense: Long
> Report=yes
> >mtx: Request Sense: Valid Residual=no
> >mtx: Request Sense: Error Code=70 (Current)
> >mtx: Request Sense: Sense Key=Illegal Request
> >mtx: Request Sense: FileMark=no
> >mtx: Request Sense: EOM=no
> >mtx: Request Sense: ILI=no
> >mtx: Request Sense: Additional Sense Code = 53
> >mtx: Request Sense: Additional Sense Qualifier = 01
> >mtx: Request Sense: BPV=no
> >mtx: Request Sense: Error in CDB=no
> >mtx: Request Sense: SKSV=no
> >MOVE MEDIUM from Element Address 257 to 4096 Failed
> >
> >After resetting the scsi bus (echo "- - -" >
> >/sys/class/scsi_host/host5/scan) the tape drives are revitalized, but
> >the changer device disappears. Even after a cold restart of the whole
> >library the device keeps missing.
> >
> >Yet another problem: restting the SCSI bus of the LSI SAS HBA sometimes
> >results in a hardy freeze (console stuck; no log messages).
> >
> >> [...]
> >>
> >>I do not believe it's a hardware fault at the moment as the machine
> >>ran OK under Solaris for a few weeks (including successful btape runs).
> >>
> >
> >The very same piece of hardware worked fine using Solaris 10 with heavy
> >disk and tape I/O at the same time for two months.
> >
> >We really prefer using Linux instead, but we're in pressure of time.
> >
> >
> >We appreciate any help resolving this bug!
> >
> >
> >
> >
> >Regards,
> >Sascha Frey
> >
> >--
> >To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
> >the body of a message to majordomo@xxxxxxxxxxxxxxx
> >More majordomo info at  http://vger.kernel.org/majordomo-info.html
> >

--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [SCSI Target Devel]     [Linux SCSI Target Infrastructure]     [Kernel Newbies]     [IDE]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux ATA RAID]     [Linux IIO]     [Samba]     [Device Mapper]
  Powered by Linux