Re: libata EH appears to be NFG up to 2.6.17 (at least).

Mark Lord <liml@xxxxxx> · Fri, 07 Jul 2006 09:03:29 -0400

Jeff Garzik wrote:
Mark Lord wrote:
Enable libata-scsi to report correct sense data on errors.

Signed-off-by:  Mark Lord <mlord@xxxxxxxxx>
---

--- linux/drivers/scsi/libata-scsi.c.orig    2006-07-06 
17:09:54.000000000 -0400
+++ linux/drivers/scsi/libata-scsi.c    2006-07-06 17:17:43.000000000 
-0400
@@ -667,6 +667,13 @@
     qc->ap->ops->tf_read(qc->ap, tf);
 
     /*
+     * Restore the error bit, which got cleared when the
+     * interrupt handler first read the ata_status.
+     */
+    if (qc->err_mask & AC_ERR_DEV)
+        tf->command |= ATA_ERR;
+
+    /*
      * Use ata_to_sense_error() to map status register bits


Again it's an LLDD issue.  Your answer of "all LLDDs" sounds a bit 
suspicious.

AC_ERR_DEV should not be set unless (a) some code noticed that ATA_ERR 
was already present in the Status register, or (b) some code set 
AC_ERR_DEV for some reason other than ATA_ERR presence.  Both of those 
factors depend heavily on LLDD behavior.

So that *is* the meaning of AC_ERR_DEV --> "LLDD saw ATA_ERR" ?
I'm just not sure whether to check for that one bit,
or for any bit set in qc->err_mask.

Of course even doing this is not at all foolproof -- the code in libata-scsi
will then go and read the ATA status again (along with all of the others),
and the drive/software may well have cleared/changed bits by then.

In fact, you've even had isolated bug reports from the past month with
exactly this happening -- where the reported error status was 0x50
by the time libata-scsi got hold of it.

I dunno what the new EH does (yet), but the only good way to deal with this
is for the ATA status & error values to be read *and* saved on command completion,
and then reused by any later code that needs to do detailed sense reporting
or other diagnostics.

With NCQ/TCQ, this can be tough to achieve, possibly requiring the LLDD to 
drop to legacy mode on error, and capture the values there.  Of course, by then
it's too late to be perfect, as the host controller has probably already read
and discarded the drive status at least once.  (the PDC sata_qstor device
is an exception there -- they save/return failed status during queuing).

In summary, when a command fails, we need to save the failed status and
error values that we saw when we decided it fails.  Reading them again later
is not optimal, because they can change in the interim (asynchronous notifications,
or just drives that like to clear ERR after status is read).

I'm out today, but I'll try and fiddle with this some more on the weekend.
I have several host controllers here to try this with.

Cheers
-
: send the line "unsubscribe linux-ide" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html