On 10/02/2015 10:21 AM, Johannes Thumshirn wrote: > Removing a SCSI target via scsi_remove_target() suspected to be racy. When a > sibling get's removed from the list it can occassionly happen that one CPU is > stuck endlessly looping around this code block > > list_for_each_entry(starget, &shost->__targets, siblings) { > if (starget->state == STARGET_DEL) > continue; > > Resulting in the following hard lockup. > > Kernel panic - not syncing: Watchdog detected hard LOCKUP on cpu 0 > [...] > Call Trace: > [<ffffffff8100471d>] dump_trace+0x7d/0x2d0 > [<ffffffff81004a04>] show_stack_log_lvl+0x94/0x170 > [<ffffffff81005cc1>] show_stack+0x21/0x50 > [<ffffffff8151aa75>] dump_stack+0x41/0x51 > [<ffffffff8151545a>] panic+0xc8/0x1d7 > [<ffffffff810fbdda>] watchdog_overflow_callback+0xba/0xc0 > [<ffffffff811336c8>] __perf_event_overflow+0x88/0x240 > [<ffffffff8101e3aa>] intel_pmu_handle_irq+0x1fa/0x3e0 > [<ffffffff81522836>] perf_event_nmi_handler+0x26/0x40 > [<ffffffff81521fcd>] nmi_handle.isra.2+0x8d/0x180 > [<ffffffff815221e6>] do_nmi+0x126/0x3c0 > [<ffffffff8152159b>] end_repeat_nmi+0x1a/0x1e > [<ffffffffa00212e8>] scsi_remove_target+0x68/0x240 [scsi_mod] > [<ffffffff81072742>] process_one_work+0x172/0x420 > [<ffffffff810733ba>] worker_thread+0x11a/0x3c0 > [<ffffffff81079d34>] kthread+0xb4/0xc0 > [<ffffffff81528cd8>] ret_from_fork+0x58/0x90 > > This patch decouples the list traversal for targets and the reaping of SCSI > targets by moving to be removed targets to a separate reap list. Entries in > this list can then be removed by the SCSI layer in a lockless manner. > > This was discovered by a partner in a 24h stress test. > > Signed-off-by: Johannes Thumshirn <jthumshirn@xxxxxxx> > --- > drivers/scsi/scsi_sysfs.c | 14 +++++++------- > 1 file changed, 7 insertions(+), 7 deletions(-) > Reviewed-by: Hannes Reinecke <hare@xxxxxxx> Cheers, Hannes -- Dr. Hannes Reinecke zSeries & Storage hare@xxxxxxx +49 911 74053 688 SUSE LINUX GmbH, Maxfeldstr. 5, 90409 Nürnberg GF: F. Imendörffer, J. Smithard, J. Guild, D. Upmanyu, G. Norton HRB 21284 (AG Nürnberg) -- To unsubscribe from this list: send the line "unsubscribe linux-scsi" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html