https://bugzilla.kernel.org/show_bug.cgi?id=208215 Bug ID: 208215 Summary: HPSA P410 resetting logical Direct-Access never complete 5.7.1 Product: IO/Storage Version: 2.5 Kernel Version: 5.7.1 Hardware: All OS: Linux Tree: Mainline Status: NEW Severity: normal Priority: P1 Component: SCSI Assignee: linux-scsi@xxxxxxxxxxxxxxx Reporter: voronkovaa@xxxxxxxxx Regression: No I have a problem with HPSA P410 on two of my nodes with Kernel 5.7.1-1.el7.elrepo.x86_64 with CentOS 7 Here are the logs: 2020-06-16T14:59:00.8117 warning kern kernel [679613.058375] hpsa 0000:06:00.0: scsi 0:1:0:0: resetting logical Direct-Access HP LOGICAL VOLUME RAID-0 SSDSmartPathCap- En- Exp=1 2020-06-16T14:59:23.3999 info kern kernel [679635.647794] libceph: osd0 down 2020-06-16T14:59:23.3999 info kern kernel [679635.648599] libceph: osd6 down 2020-06-16T14:59:24.4468 warning kern kernel [679636.694762] rbd: rbd1: encountered watch error: -107 2020-06-16T14:59:24.4886 warning kern kernel [679636.736747] rbd: rbd2: encountered watch error: -107 2020-06-16T14:59:28.4377 info kern kernel [679640.685700] libceph: osd5 down 2020-06-16T14:59:36.6272 warning kern kernel [679648.874179] hpsa 0000:06:00.0: Controller lockup detected: 0x0015002f after 30 2020-06-16T14:59:36.6272 warning kern kernel [679648.875554] hpsa 0000:06:00.0: controller lockup detected: LUN:0000004000000000 CDB:01040000000000000000000000000000 2020-06-16T14:59:36.6272 warning kern kernel [679648.875591] hpsa 0000:06:00.0: failed 15 commands in fail_all 2020-06-16T14:59:36.6272 warning kern kernel [679648.876650] hpsa 0000:06:00.0: Controller lockup detected during reset wait 2020-06-16T14:59:36.6272 warning kern kernel [679648.876655] hpsa 0000:06:00.0: scsi 0:1:0:0: reset logical failed Direct-Access HP LOGICAL VOLUME RAID-0 SSDSmartPathCap- En- Exp=1 2020-06-16T14:59:36.6272 info kern kernel [679648.876667] sd 0:1:0:2: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6272 info kern kernel [679648.876672] sd 0:1:0:0: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6348 info kern kernel [679648.883168] sd 0:1:0:0: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6348 info kern kernel [679648.884214] sd 0:1:0:0: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6357 info kern kernel [679648.885286] sd 0:1:0:1: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6367 info kern kernel [679648.886297] sd 0:1:0:0: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6377 info kern kernel [679648.887301] sd 0:1:0:2: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6395 info kern kernel [679648.888269] sd 0:1:0:2: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6395 info kern kernel [679648.889193] sd 0:1:0:3: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6419 info kern kernel [679648.890076] sd 0:1:0:0: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6419 info kern kernel [679648.891496] sd 0:1:0:0: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6447 info kern kernel [679648.893012] sd 0:1:0:0: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6447 info kern kernel [679648.894114] sd 0:1:0:0: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6466 info kern kernel [679648.895182] sd 0:1:0:0: Device offlined - not ready after error recovery 2020-06-16T14:59:36.6466 info kern kernel [679648.896204] sd 0:1:0:2: [sdc] tag#477 FAILED Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK cmd_age=66s 2020-06-16T14:59:36.6489 info kern kernel [679648.897223] sd 0:1:0:2: [sdc] tag#477 CDB: Read(10) 28 00 00 ed 13 90 00 00 08 00 2020-06-16T14:59:36.6489 err kern kernel [679648.898309] blk_update_request: I/O error, dev sdc, sector 15537040 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0 2020-06-16T14:59:36.6523 info kern kernel [679648.899489] sd 0:1:0:0: [sda] tag#469 FAILED Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK cmd_age=70s 2020-06-16T14:59:36.6523 err kern kernel [679648.899956] sd 0:1:0:0: rejecting I/O to offline device 2020-06-16T14:59:36.6523 info kern kernel [679648.900659] sd 0:1:0:0: [sda] tag#469 CDB: Read(10) 28 00 14 78 2f e0 00 00 10 00 2020-06-16T14:59:36.6524 err kern kernel [679648.901820] blk_update_request: I/O error, dev sda, sector 929142240 op 0x0:(READ) flags 0x3000 phys_seg 1 prio class 0 2020-06-16T14:59:36.6537 err kern kernel [679648.903092] blk_update_request: I/O error, dev sda, sector 343420896 op 0x0:(READ) flags 0x80700 phys_seg 2 prio class 0 2020-06-16T14:59:36.6537 info kern kernel [679648.903138] sd 0:1:0:0: [sda] tag#470 FAILED Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK cmd_age=70s /usr/sbin/hpssacli ctrl all show detail Smart Array P410 in Slot 1 Bus Interface: PCI Slot: 1 Serial Number: PACCRID122902CV Cache Serial Number: PBCDH0CRH2K24K Controller Status: OK Hardware Revision: C Firmware Version: 6.64 Rebuild Priority: Medium Expand Priority: Medium Surface Scan Delay: 3 secs Surface Scan Mode: Idle Parallel Surface Scan Supported: No Queue Depth: Automatic Monitor and Performance Delay: 60 min Elevator Sort: Enabled Degraded Performance Optimization: Disabled Inconsistency Repair Policy: Disabled Wait for Cache Room: Disabled Surface Analysis Inconsistency Notification: Disabled Post Prompt Timeout: 15 secs Cache Board Present: True Cache Status: OK Cache Status Details: The current array controller had valid data stored in its battery/capacitor backed write cache the last time it was reset or was powered up. This indicates that the system may not have been shut down gracefully. The array controller has automatically written, or has attempted to write, this data to the drives. This message will continue to be displayed until the next reset or power-cycle of the array controller. Cache Ratio: 25% Read / 75% Write Drive Write Cache: Disabled Total Cache Size: 512 MB Total Cache Memory Available: 400 MB No-Battery Write Cache: Disabled Cache Backup Power Source: Capacitors Battery/Capacitor Count: 1 Battery/Capacitor Status: OK SATA NCQ Supported: True Number of Ports: 2 Internal only Driver Name: hpsa Driver Version: 3.4.20 Driver Supports HPE SSD Smart Path: True PCI Address (Domain:Bus:Device.Function): 0000:06:00.0 Sanitize Erase Supported: False Primary Boot Volume: logicaldrive 1 (600508B1001C3DAA9705279AD5D8DABA) Secondary Boot Volume: None -- You are receiving this mail because: You are the assignee for the bug.