Hello, Matthew. On Mon, May 11, 2015 at 01:34:40PM -0700, Matthew Garrett wrote: > On Mon, May 11, 2015 at 1:28 PM, Tejun Heo <tj@xxxxxxxxxx> wrote: > > > CommWake isn't the problem here. SError is being dumped just for > > information. The disk is reporting failure on a write command which > > is diagnosed as "device error" and thus the link is not reset. It's > > really the device actively reporting command failure. > > Ok, that makes sense. Is there any practical way for us to identify > why the device might be doing that? It seems to be limited to the LPM > case, but this is (theoretically) in the same configuration that the > firmware programmed, so it's a little surprising. Modern ATA spec do implement extended error reporting and Hannes recently (not mainline yet) added support for it and the kernel will print out sense codes if the device reports it ("NCQ Autosense xx/xx/xx"). Another vector could be SMART error log which is accessible through smartctl, but it's quite possible that the ABRT bit is the only thing the device is exposing at all. Thanks. -- tejun -- To unsubscribe from this list: send the line "unsubscribe linux-ide" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html