On 05/14/2015 10:13 AM, Tejun Heo wrote:
> Hello, Wen.
>
> On Wed, May 13, 2015 at 08:35:19PM -0500, wenxiong@xxxxxxxxxxxxxxxxxx wrote:
>> From: Wen Xiong <wenxiong@xxxxxxxxxxxxxxxxxx>
>>
>> This patch adds the callback functions to support EEH error
>> recovery in ahci driver. Also adds the code in ahci_error_handler
>> to issue an MMIO load then check if it is in EEH. If it is in EEH,
>> ahci_error_handler will wait until EEH recovery is completed.
>
> Can you please explain why we would want this? What does it buy us?

So, on the Power platform, the pci_error_handlers map to our EEH
recovery. In that case, without this patch, if we hit any sort of PCIe
error, we won't be able to recover and we'll lose all access to the
ahci disks. This could be the adapter trying to access an invalid DMA
address due to a transient hardware issue, or it could be due to a
driver bug giving the adapter an invalid address. It could also be
other various PCIe errors that cause our PCIe bridge chip to isolate
the device and place it into the EEH "frozen" state.

When this occurs, if the driver associated with the hardware does not
have these handlers registered, powerpc arch kernel code will hotplug
remove the adapter, recover the adapter, then hotplug add it back. This
works OK for some devices, but generally not so well for storage
devices with mounted filesystems, which would tend to go readonly in
this case.

-Brian

--
Brian King
Power Linux I/O
IBM Linux Technology Center

--
To unsubscribe from this list: send the line "unsubscribe linux-ide" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
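
The two recovery paths described above (driver callbacks versus a hotplug
remove/add cycle) can be sketched as a small userspace toy model. This is
not kernel code and not the patch under discussion: every type and function
name in it (model_device, model_err_handlers, model_recover, the demo_*
callbacks) is hypothetical, and the fs_writable flag is a simplifying
assumption standing in for "a mounted filesystem tends to go readonly when
its disk is hot-removed".

```c
/* Userspace toy model of the two recovery paths. NOT kernel code:
 * every type and function name below is hypothetical. */
#include <stdbool.h>
#include <stddef.h>

enum model_outcome {
    MODEL_DRIVER_RECOVERED, /* callbacks ran, device never left the system */
    MODEL_HOTPLUG_CYCLED,   /* device was removed, recovered, and re-added */
    MODEL_NOT_FROZEN,       /* no error pending, nothing to do */
    MODEL_FAILED            /* driver could not bring the device back */
};

struct model_device;

/* Stand-in for the role played by the driver's error-handler callbacks. */
struct model_err_handlers {
    void (*error_detected)(struct model_device *dev);
    bool (*slot_reset)(struct model_device *dev);
    void (*resume)(struct model_device *dev);
};

struct model_device {
    const struct model_err_handlers *handlers; /* NULL: none registered */
    bool frozen;      /* isolated by the bridge ("frozen" state) */
    bool io_quiesced; /* driver has stopped issuing I/O */
    bool fs_writable; /* assumption: lost if the disk is hot-removed */
};

/* Example callbacks a storage driver might provide in this model. */
void demo_error_detected(struct model_device *dev)
{
    dev->io_quiesced = true; /* stop touching the device */
}

bool demo_slot_reset(struct model_device *dev)
{
    return !dev->frozen; /* re-init succeeds only once the slot is thawed */
}

void demo_resume(struct model_device *dev)
{
    dev->io_quiesced = false; /* restart I/O */
}

const struct model_err_handlers demo_handlers = {
    .error_detected = demo_error_detected,
    .slot_reset = demo_slot_reset,
    .resume = demo_resume,
};

enum model_outcome model_recover(struct model_device *dev)
{
    const struct model_err_handlers *h = dev->handlers;

    if (!dev->frozen)
        return MODEL_NOT_FROZEN;

    if (h && h->error_detected && h->slot_reset && h->resume) {
        h->error_detected(dev);
        dev->frozen = false; /* platform resets the slot */
        if (!h->slot_reset(dev))
            return MODEL_FAILED;
        h->resume(dev);
        return MODEL_DRIVER_RECOVERED;
    }

    /* No handlers: hotplug remove, recover the adapter, hotplug add. */
    dev->fs_writable = false; /* disk vanished under the mounted fs */
    dev->frozen = false;
    return MODEL_HOTPLUG_CYCLED;
}
```

In this model, calling model_recover() on a frozen device whose handlers
field points at demo_handlers leaves fs_writable untouched, while the same
call on a device with handlers set to NULL takes the hotplug path and clears
it. That is the difference the patch is meant to buy for ahci disks.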