Vlad Codrea wrote: > ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen > ata2.00: (BMDMA stat 0x25) > ata2.00: cmd a0/01:00:00:00:00/00:00:00:00:00/a0 tag 0 cdb 0x12 data 36 in > res 58/00:02:00:24:00/00:00:00:00:00/a0 Emask 0x2 (HSM violation) > ata2: soft resetting port > ata2: port is slow to respond, please be patient (Status 0xd8) > ata2: port failed to respond (30 secs, Status 0xd8) > ATA: abnormal status 0xD8 on port 0x177 > ATA: abnormal status 0xD8 on port 0x177 Okay, now that you're on libata driver, it's easier for me to debug. Can you apply the attached patch over 2.6.20 and report what the kernel says? (the patch will apply with some noise, it's okay) Thanks. -- tejun
diff --git a/drivers/ata/libata-core.c b/drivers/ata/libata-core.c index 14629a3..387235f 100644 --- a/drivers/ata/libata-core.c +++ b/drivers/ata/libata-core.c @@ -4453,9 +4453,13 @@ fsm_start: if (likely(status & (ATA_ERR | ATA_DF))) /* device stops HSM for abort/error */ qc->err_mask |= AC_ERR_DEV; - else + else { /* HSM violation. Let EH handle this */ + ata_port_printk(ap, KERN_WARNING, + "!DRQ on HSM_ST_FIRST (0x%x)\n", + status); qc->err_mask |= AC_ERR_HSM; + } ap->hsm_task_state = HSM_ST_ERR; goto fsm_start; @@ -4547,13 +4551,17 @@ fsm_start: if (likely(status & (ATA_ERR | ATA_DF))) /* device stops HSM for abort/error */ qc->err_mask |= AC_ERR_DEV; - else + else { + ata_port_printk(ap, KERN_WARNING, + "!DRQ on HSM_ST (0x%x)\n", + status); /* HSM violation. Let EH handle this. * Phantom devices also trigger this * condition. Mark hint. */ qc->err_mask |= AC_ERR_HSM | AC_ERR_NODEV_HINT; + } ap->hsm_task_state = HSM_ST_ERR; goto fsm_start; @@ -4579,8 +4587,12 @@ fsm_start: status = ata_wait_idle(ap); } - if (status & (ATA_BUSY | ATA_DRQ)) + if (status & (ATA_BUSY | ATA_DRQ)) { + ata_port_printk(ap, KERN_WARNING, + "BUSY|DRQ on ERR|DF (0x%x)\n", + status); qc->err_mask |= AC_ERR_HSM; + } /* ata_pio_sectors() might change the * state to HSM_ST_LAST. so, the state