On 06/10/2009 02:00 AM, FUJITA Tomonori wrote: > On Tue, 09 Jun 2009 16:32:15 +0300 > Boaz Harrosh <bharrosh@xxxxxxxxxxx> wrote: > >> On 06/09/2009 04:10 PM, FUJITA Tomonori wrote: >>> On Tue, 09 Jun 2009 14:53:51 +0300 >>> Boaz Harrosh <bharrosh@xxxxxxxxxxx> wrote: >>> >>>> Please do not revert. This is the point of all this. >>>> >>>> If there is no leak, You should NULL out the req->bio >>>> for now, and for 2.6.31 change the code to do >>>> blk_end_request_all(). That's what blk_end_request does, >>>> since you are doing your own completion then set req->bio >>>> to null after you're done. (And before put_request) >>>> >>>> This stuff is good for error paths to catch leaks, please >>>> leave it? >>> Has this your good stuff found any bio leak bugs in mainline? In >>> addition, breaking working code is not a proper development style. >>> >> It has found for me in error paths. That's why I put it. >> >> I Issue bsg bidi commands every day all day, and never seen this. >> What driver are you using? and can you post the stack trace. > > See the original mail. I already said, "BSG SMP requests get the > following warnings". I use mptsas however all BSG SMP users hit this > bug. The stack trace is not useful because the bsg users don't call > blk_put_request directly. > > But If you want to see: > > Call Trace: > [<ffffffff80320349>] ? __blk_put_request+0x52/0xc0 > [<ffffffff8022fd26>] warn_slowpath_common+0x77/0xa4 > [<ffffffff8022fd62>] warn_slowpath_null+0xf/0x11 > [<ffffffff80320349>] __blk_put_request+0x52/0xc0 > [<ffffffff803206d6>] ? blk_put_request+0x20/0x46 > [<ffffffff803206e4>] blk_put_request+0x2e/0x46 > [<ffffffff80327f64>] blk_complete_sgv4_hdr_rq+0x1a8/0x1b7 > [<ffffffff80328a36>] bsg_ioctl+0x1b4/0x1eb > [<ffffffff8032dfea>] ? __up_read+0x1c/0x9a > [<ffffffff804aab20>] ? _spin_unlock_irqrestore+0x3f/0x47 > [<ffffffff802a0f10>] vfs_ioctl+0x2a/0x78 > [<ffffffff802a13cc>] do_vfs_ioctl+0x46e/0x4aa > [<ffffffff80246384>] ? up_read+0x26/0x2b > [<ffffffff8020b8e9>] ? retint_swapgs+0xe/0x13 > [<ffffffff802a144a>] sys_ioctl+0x42/0x65 > [<ffffffff8020aeab>] system_call_fastpath+0x16/0x1b > >> The driver does something wrong. bsg over scsi-ml does not have this >> problem. Why? > > Because scsi-ml calls blk_end_request() but BSG SMP users don't. > > That is a violation of block API. All block drivers must call blk_end_request(). If they do not, then they can not for example be called from inside Kernel. They relay on special bsg behavior that always uses map_user. This must be fixed. So you see that WARN_ON was right on the spot ;) >>> Anyway, setting req->bio in bsg works. Either is fine by me. >>> >>> >>> Jens, can you please send either patch to Linus now? >>> >>> = >>> From: FUJITA Tomonori <fujita.tomonori@xxxxxxxxxxxxx> >>> Subject: [PATCH] bsg: setting rq->bio to NULL >>> >>> Due to commit 1cd96c242a829d52f7a5ae98f554ca9775429685 ("block: WARN >>> in __blk_put_request() for potential bio leak"), BSG SMP requests get >>> the false warnings: >>> >>> WARNING: at block/blk-core.c:1068 __blk_put_request+0x52/0xc0() >>> >>> This sets rq->bio to NULL to avoid that false warnings. >>> >>> >>> Signed-off-by: FUJITA Tomonori <fujita.tomonori@xxxxxxxxxxxxx> >>> --- >>> block/bsg.c | 3 +++ >>> 1 files changed, 3 insertions(+), 0 deletions(-) >>> >>> diff --git a/block/bsg.c b/block/bsg.c >>> index 206060e..dd81be4 100644 >>> --- a/block/bsg.c >>> +++ b/block/bsg.c >>> @@ -315,6 +315,7 @@ out: >>> blk_put_request(rq); >>> if (next_rq) { >>> blk_rq_unmap_user(next_rq->bio); >> I do not understand this here please explain? We have called blk_rq_map_user() >> and have bailed out on error later, without calling blk_execute_rq*. Now usually >> the bios are *double* referenced, one for the usual call of blk_end_request() that will >> release bios once, and second for the blk_rq_unmap_user() that will release second time. >> But here you only call blk_rq_unmap_user() don't you need to call blk_end_request() also? > > If I understand correctly, blk_end_request() doesn't release bios of a > request that blk_rq_map_user was called against. > This is the part I do not understand. The comment in map_user says that the bio ref count is taken twice, so when reference drops once at blk_end_request() the bio is not yet freed, but will so in unmap_user. But if blk_end_request() is not called then there is no double drop of reference. I do want to understand, how is the bio gets freed, is it forced some how? > You can test this without any SAS hardware. Let me know when you find > a bio leak here. How can I test this without SAS hardware? Please point me to documentation. I do believe you that there is no leak, I just want to understand why? And any way, all "BSG SMP" drivers must be fixed to call blk_end_request(). They can not be dependent on the specific ULD that calls them. Thanks Boaz -- To unsubscribe from this list: send the line "unsubscribe linux-scsi" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html