Hi all,

On Monday 07 April 2008 03:07:38 Wakko Warner wrote:
> Moore, Eric wrote:
> > The other fellow was having his controller go into fault state, that is
> > not the case here. Here commands are not completing back from [0:0:0:0]
> > device in the timeout period, typically 30 seconds. Its hard to tell
> > from

From my point of view, the error handler is far too quiet by default. Below
are some patches to improve the situation. I tested these patches with
Infortrend SCSI RAID systems and LSI SCSI HBAs.

With these patches, the error handler now reports why it was activated. We
also had the problem that the error handler was activated in an endless loop;
several of these patches are meant to prevent this.

http://www.pci.uni-heidelberg.de/tc/usr/bernd/downloads/scsi/2.6.22-eh-patches.tar.bz2

From the series file:

print_eh_activation.patch
scsi_error_limit.patch
scsi_error_state.patch
soft_error_requeue.patch
starget_quiesce_ignore_offlined.patch
scsi_eh_did_no_connect.patch
fusion_tip.patch

All except fusion_tip.patch are patches for the error handler.
fusion_tip.patch replaces the in-kernel MPT Fusion driver with a more recent
version I got from Eric.

Cheers,
Bernd

--
Bernd Schubert
Q-Leap Networks GmbH