Hi John, In my humble opinion what we observed very earlier was with respect smp_request()/response() crash and this patch resolved it. Given that the issue was not only specific to ARM, issue was observed on x86 too with IOMMU enabled. Device discovery went fine post application of this patch on x86. What we are observing right now on error handling/timeouts for commands on drives will be altogether different issue that needs separate debugging on ARM platform with separate patch since this is a very initial execution of pm80xx driver on ARM platform. This patch is acceptable. Let me know your further views. Thanks, Ajish On 26/11/2021 15:35, John Garry wrote: > /* > @@ -4280,8 +4283,9 @@ static int pm80xx_chip_smp_req(struct pm8001_hba_info *pm8001_ha, > pm8001_ha->smp_exp_mode = SMP_INDIRECT; > > > - tmp_addr = cpu_to_le64((u64)sg_dma_address(&task->smp_task.smp_req)); > - preq_dma_addr = (char *)phys_to_virt(tmp_addr); > + smp_req = &task->smp_task.smp_req; > + to = kmap(sg_page(smp_req)); This should be a kmap_atomic() as well, as I see the following for when CONFIG_DEBUG_ATOMIC_SLEEP is enabled: [ 27.222116] dump_backtrace+0x0/0x2b4 [ 27.225774] show_stack+0x1c/0x30 [ 27.229084] dump_stack_lvl+0x68/0x84 [ 27.232741] dump_stack+0x20/0x3c [ 27.236049] __might_resched+0x1d4/0x240 [ 27.239967] __might_sleep+0x70/0xd0 [ 27.243536] pm80xx_chip_smp_req+0x2c4/0x56c [ 27.247802] pm8001_task_exec.constprop.0+0x718/0x770 [ 27.252848] pm8001_queue_command+0x1c/0x2c [ 27.257026] smp_execute_task_sg+0x1e8/0x370 [ 27.261289] sas_ex_phy_discover+0x29c/0x31c [ 27.265553] smp_ata_check_ready+0x74/0x190 [ 27.269729] ata_wait_ready+0xd0/0x224 [ 27.273474] ata_wait_after_reset+0x78/0xac [ 27.277652] sas_ata_hard_reset+0xf0/0x18c [ 27.281742] ata_do_reset.constprop.0+0x80/0x9c [ 27.286266] ata_eh_reset+0xba4/0x1170 [ 27.290008] ata_eh_recover+0x4b0/0x1b40 [ 27.293924] ata_do_eh+0x8c/0x110 [ 27.297232] ata_std_error_handler+0x80/0xb0 [ 27.301495] ata_scsi_port_error_handler+0x3d4/0x9d0 [ 27.306454] async_sas_ata_eh+0x70/0xf8 [ 27.310285] async_run_entry_fn+0x5c/0x1e0 [ 27.314375] process_one_work+0x378/0x630 [ 27.318379] worker_thread+0xa8/0x6bc [ 27.322033] kthread+0x214/0x230 [ 27.325256] ret_from_fork+0x10/0x20 [ 27.328825] pm80xx0:: pm80xx_chip_smp_req 4292:SMP REQUEST INDIRECT MODE But I don't think that this is the problem which causes error handling to kick in later, as discussed in this thread. > + payload = to + smp_req->offset; > > /* INDIRECT MODE command settings. Use DMA */ > if (pm8001_ha->smp_exp_mode == SMP_INDIRECT) { > @@ -4289,7 +4293,7 @@ static int pm80xx_chip_smp_req(struct pm8001_hba_info *pm8001_ha, > /* for SPCv indirect mode. Place the top 4 bytes of > * SMP Request header here. */ > for (i = 0; i < 4; i++) > - smp_cmd.smp_req16[i] = *(preq_dma_addr + i); > + smp_cmd.smp_req16[i] = *(payload + i);