Re: [PATCH] scsi_lib: don't decrement busy counters when inserting commands

Andrew Vasquez <andrew.vasquez@xxxxxxxxxx> · Tue, 6 Jan 2009 11:48:24 -0800

On Tue, 06 Jan 2009, James Bottomley wrote:

> On Mon, 2009-01-05 at 23:01 -0800, Andrew Vasquez wrote:
> > This hunk is causing a panic during early SCSI init-time of the server
> > boot disk -- sg_free_table+0x51 (RIP), scsi_release_buffers+0xd4,
> > scsi_io_completion+0x324.  IAC, in looking through the code
> > scsi_end_request() returns NULL when the command has already been
> > requeued (via scsi_requeue_command()).  Modifying your original patch
> > to release-buffers prior to scsi_end_request()'s call to
> > scsi_requeue_command() and dropping the post-scsi_end_request() call
> > to scsi_release_buffers() fixes *both* the panic as well as the original
> > problem I reported regarding commands returned with a DID_RESET status
> > being requeued in an incomplete state.
> 
> Gosh this is a subtle nasty mess.

:O

> > Here's the updated patch.  Any other holes I missed?
> 
> Yes, I'm afraid ... if scsi_release_buffers() is moved into
> scsi_end_request() then it has to be called for every NULL returning
> path otherwise we leak buffers.  The missing release is the NULL return
> at the bottom of the function.  However, you can't call
> scsi_release_buffers() there because we've lost the request and the
> scsi_bidi_cmd() check tries to touch it.
> 
> Hopefully this version has all the holes plugged, if you could give it a
> spin...

Ok... The results look promising -- no panics and I/Os are resuming
properly after being returned with DID_RESET statuses.  Thanks for
working through this.

--
av
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html