https://bugzilla.kernel.org/show_bug.cgi?id=215704

--- Comment #1 from Michael Evans (mjevans1983@xxxxxxxxx) ---

I should add some context.

I want the kernel ata / sd layers to handle unresponsive devices so that, ideally, some kind of 'this path is slow, but you can keep waiting' message is given to upper layers. New commands should be soft-failed with a busy state or something similar that conveys a status of 'stalled' rather than 'error' (so far). I would also hope that any such stall is handled as a barrier for the device, and that any other outstanding requests are retried unless they too are returned with errors.

Somehow events such as the dmesg entry that follows add up to enough errors to 'fault' the device and knock it out of the pool (during a repair scrub). Thus I was looking for a Documentation file that covers the timeout configuration file and gives guidance on whether or how it should be tuned in relation to other aspects of the disks.

The disk with these responses is a Seagate Exos X16 (ST16000NM001G-2KK103), firmware SN03, believed to be ATA ACS-4, 4k sector, CMR. It shows no errors (no pending / remapped sectors, no logged failed sectors).

[ 1362.163151] ata3.00: exception Emask 0x10 SAct 0x60000000 SErr 0x280100 action 0x6 frozen
[ 1362.163184] ata3.00: irq_stat 0x08000000, interface fatal error
[ 1362.163200] ata3: SError: { UnrecovData 10B8B BadCRC }
[ 1362.163216] ata3.00: failed command: READ FPDMA QUEUED
[ 1362.163230] ata3.00: cmd 60/c0:e8:28:48:d2/03:00:d9:03:00/40 tag 29 ncq dma 491520 in
                        res 40/00:f0:e8:4b:d2/00:00:d9:03:00/40 Emask 0x10 (ATA bus error)
[ 1362.163272] ata3.00: status: { DRDY }
[ 1362.163283] ata3.00: failed command: READ FPDMA QUEUED
[ 1362.163297] ata3.00: cmd 60/40:f0:e8:4b:d2/00:00:d9:03:00/40 tag 30 ncq dma 32768 in
                        res 40/00:f0:e8:4b:d2/00:00:d9:03:00/40 Emask 0x10 (ATA bus error)
[ 1362.163338] ata3.00: status: { DRDY }
[ 1362.163350] ata3: hard resetting link
[ 1362.476057] ata3: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[ 1362.506459] ata3.00: ACPI cmd ef/10:06:00:00:00:00 (SET FEATURES) succeeded
[ 1362.506465] ata3.00: ACPI cmd f5/00:00:00:00:00:00 (SECURITY FREEZE LOCK) filtered out
[ 1362.506467] ata3.00: ACPI cmd b1/c1:00:00:00:00:00 (DEVICE CONFIGURATION OVERLAY) filtered out
[ 1362.564800] ata3.00: ACPI cmd ef/10:06:00:00:00:00 (SET FEATURES) succeeded
[ 1362.564815] ata3.00: ACPI cmd f5/00:00:00:00:00:00 (SECURITY FREEZE LOCK) filtered out
[ 1362.564817] ata3.00: ACPI cmd b1/c1:00:00:00:00:00 (DEVICE CONFIGURATION OVERLAY) filtered out
[ 1362.603044] ata3.00: configured for UDMA/133
[ 1362.603061] sd 2:0:0:0: [sdc] tag#29 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[ 1362.603065] sd 2:0:0:0: [sdc] tag#29 Sense Key : Illegal Request [current]
[ 1362.603067] sd 2:0:0:0: [sdc] tag#29 Add. Sense: Unaligned write command
[ 1362.603070] sd 2:0:0:0: [sdc] tag#29 CDB: Read(16) 88 00 00 00 00 03 d9 d2 48 28 00 00 03 c0 00 00
[ 1362.603071] I/O error, dev sdc, sector 16539338792 op 0x0:(READ) flags 0x700 phys_seg 15 prio class 0
[ 1362.603129] zio pool=REDACTED vdev=/dev/disk/by-partlabel/REDACTED error=5 type=1 offset=... size=491520 flags=40080cb0
[ 1362.603239] sd 2:0:0:0: [sdc] tag#30 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[ 1362.603276] sd 2:0:0:0: [sdc] tag#30 Sense Key : Illegal Request [current]
[ 1362.603332] sd 2:0:0:0: [sdc] tag#30 Add. Sense: Unaligned write command
[ 1362.603337] sd 2:0:0:0: [sdc] tag#30 CDB: Read(16) 88 00 00 00 00 03 d9 d2 4b e8 00 00 00 40 00 00
[ 1362.603389] I/O error, dev sdc, sector 16539339752 op 0x0:(READ) flags 0x700 phys_seg 1 prio class 0
[ 1362.603738] zio pool=REDACTED vdev=/dev/disk/by-partlabel/REDACTED error=5 type=1 offset=... size=32768 flags=1808b0
[ 1362.604011] ata3: EH complete

FAULTED 17 0 0 too many errors (repairing)
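
For anyone cross-checking the log, the two Read(16) CDBs can be decoded by hand to confirm they match the sector numbers and transfer sizes reported elsewhere in the same entries. This is only a sketch: it assumes the usual READ(16) layout (opcode 0x88, 8-byte big-endian LBA in bytes 2-9, 4-byte big-endian transfer length in bytes 10-13) and 512-byte logical blocks, and the function name decode_read16 is made up for illustration.

```python
def decode_read16(cdb_hex: str, block_size: int = 512):
    """Return (lba, block_count, byte_count) for a READ(16) CDB hex dump.

    Assumption: block_size is the logical block size; 512 is used here
    because the kernel log reports sectors in 512-byte units.
    """
    cdb = bytes.fromhex(cdb_hex)  # whitespace between byte pairs is accepted
    if len(cdb) != 16 or cdb[0] != 0x88:
        raise ValueError("not a READ(16) CDB")
    lba = int.from_bytes(cdb[2:10], "big")      # bytes 2..9: starting LBA
    blocks = int.from_bytes(cdb[10:14], "big")  # bytes 10..13: transfer length
    return lba, blocks, blocks * block_size


if __name__ == "__main__":
    # CDBs copied from the tag#29 and tag#30 entries in the log above.
    for cdb in (
        "88 00 00 00 00 03 d9 d2 48 28 00 00 03 c0 00 00",
        "88 00 00 00 00 03 d9 d2 4b e8 00 00 00 40 00 00",
    ):
        lba, blocks, nbytes = decode_read16(cdb)
        print(f"LBA {lba}, {blocks} blocks, {nbytes} bytes")
```

Under those assumptions, tag#29 decodes to LBA 16539338792 with 960 blocks (491520 bytes) and tag#30 to LBA 16539339752 with 64 blocks (32768 bytes), which agrees with the "I/O error ... sector" lines and the "ncq dma" sizes, and shows that the two failed reads were adjacent on disk.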