Re: [patchset 0/4] osd: Stop usage of blk_rq_append_bio

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, 2009-05-14 at 17:53 +0300, Boaz Harrosh wrote:
> On 05/13/2009 05:47 PM, James Bottomley wrote:
> > On Wed, 2009-05-13 at 17:36 +0300, Boaz Harrosh wrote:
> >> On 05/13/2009 05:28 PM, Boaz Harrosh wrote:
> >>> On 05/12/2009 02:25 PM, Jens Axboe wrote:
> >>>> On Thu, May 07 2009, Boaz Harrosh wrote:
> >>>>> Osd library needs to submit pre-allocated bios, form several sources.
> >>>>> osdblk exofs and pNFS-layout driver all have prepared bios for IO submission.
> >>>>> On top of that the osd library needs to append additional segments to the
> >>>>> IO memory, for get/set attributes and more.
> >>>>>
> >>>>> All these are done today by use of a temporary hack - blk_rq_append_bio.
> >>>>> This is bad on few accounts.
> >>>>> 1. blk_rq_append_bio was not meant to be exported and is very specific to its users.
> >>>>> 2. blk_rq_append_bio does not support chained bios.
> >>>>> 3. blk_rq_append_bio does not bounce the bio and therefore current osd implementation
> >>>>>    has a bug.
> >>>>>
> >>>>> The proposed solution adds two new fixtures to the block layer, and a corresponding
> >>>>> fixing patch to osd. These are:
> >>>>>
> >>>>> [PATCH 1/4] allow blk_rq_map_kern to append to requests
> >>>>> [PATCH 2/4] libosd: Use new blk_rq_map_kern
> >>>>>
> >>>>>   This is originally a James patch and it's used, to let blk_rq_map_kern append it's buffer
> >>>>>   to existing bio, and there for is able to be called multiple times in a loop, to append
> >>>>>   multiple segments. This API can also be useful for scsi/block targets that have segment
> >>>>>   information in some other memory structure (like scatterlist) and wants to set it into
> >>>>>   a request. Until such time that they have a proper support for mapping scatterlists directly.
> >>>>>   (Since above called on long lists might not be good for performance)
> >>>>>
> >>>>>   Here in osd it makes tons of sense, and should be considered for inclusion.
> >>>>>   (The patches are based on linus-tip but should patch on block tree)
> >>>>>
> >>>>> [RFC 3/4] New blk_make_request(), takes bio, returns a request
> >>>>> [RFC 4/4] libosd: Use of new blk_make_request
> >>>>>   
> >>>>>   Here I propose a new block API, that will support proper delegation of a bio
> >>>>>   to a full request. Please read inside the patch descriptions for details.
> >>>>>   After this patch both osd and block layer will have the proper support for osdblk
> >>>>>   driver as well as future needs.
> >>>>>   These patches also eliminate the last use of blk_rq_append_bio which can be now un-exported.
> >>>>>
> >>>>>   These two patches conflic with Tejun's branch and are based on linus-tip. Upon positive review
> >>>>>   I will serialize them with Tejun and submit them properly. But first they must be agreed upon.
> >>>>>   Jens, I specially need your opinion on this?
> >>>> Looks sane to me. Can you resubmit against 'for-2.6.31' of the block git
> >>>> repo?
> >>>>
> >>> Thanks Jens.
> >>>
> >>> I have done the rebase and ran some tests, however I was unable to test these patches
> >>> as is, because there are some inter tree fallouts.
> >>>
> >>> Jens, James, Stephan, I please need your help
> >>>
> >>> The situation is like that.
> >>> - Both block/for-next and scsi/master are based on an old osd upstream-point (v2.6.30-rc3--ce8a7424)
> >>> - Linus tip has important OSD patches that went in via scsi-rc-fixes which changed Wire format
> > 
> > So just pull them into Linus head and build on that ... as long as you
> > explain what the base was, I can rebase scsi-misc (or run a post merge
> > tree) to cope.  It needs rebasing anyway to redo the mvsas patches.
> > 
> >>> - If I try and merge block/for-next ontop of plain linus/master I get a merge conflict
> >>> - If I try merge scsi/master block/for-next I get build errors / conflicts
> > 
> > This is the problem of the renames ... I think we need a block postmerge
> > tree to fix this up, but that probably needs sorting out first.
> > 
> >>> So there is no sane tree point that I can test on.
> >>>
> >>> It would be nice if both Jens block/for-next and scsi-misc/master could be rebased on Linus rc5++
> >>> and resolve these conflicts. (And scsi-misc conflicts with Tejun's cleanups be put in a second stage
> >>> tree)
> >>>
> >>> Should I send the patches as is half tested? Or wait for things to settle after I tested them
> >>> with all changes included?
> >>>
> >>> I have cut a new osd/linux-next branch which is based, not on linus, but on v2.6.30-rc3--ce8a7424
> >>> the base point for block/for-next and scsi-misc/master. So in next it should all come together
> >>> well, and I will try to clone tomorrow's next and test on top of that.
> >>>
> >> This will not work I have one patch [3/4] New blk_make_request(), takes bio, returns a request
> >> which will conflict with block/for-next if I rebase it on v2.6.30-rc3--ce8a7424.
> >>
> >> Should I cut osd/linux-next on top of block/for-next ?
> > 
> > What you really want is on the combination of the necessary trees.  If
> > it's only block, then sure ... if it's block and SCSI, that's postmerge
> > territory.
> > 
> > James
> > 
> > 
> 
> Thank you for your reply.
> 
> I did more-less what you said rebased block/for-next on linus-tip and
> fixed the merge with scsi-misc as per Stephan advise. Plus my patches
> last. Test ran well.
> 
> I see that you rebased by now, though I suspect the fc's blk_end_request
> call will fail to build if merged with block tree.

Yes, that's the bit we need a postmerge tree for.  It has to build on
it's own in scsi-misc, but it's making use of an API Tejun is altering,
so the block postmerge has to do the API alteration based on the SCSI
tree.

> I'm also seeing some recent changes to block git so I suspect that Jens is
> in the middle of rebasing too. (I hope)
> 
> As far as OSD, I managed to separate the two block-based changes to osd
> from the rest of the changes scheduled for 2.6.31 in such a way that they do not
> conflict and can merge either way (block first or scsi-misc first).
> 
> I'll repost all these patches. Sunday hopfuly after Jens rebases.

OK ... thanks.

James


--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [SCSI Target Devel]     [Linux SCSI Target Infrastructure]     [Kernel Newbies]     [IDE]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux ATA RAID]     [Linux IIO]     [Samba]     [Device Mapper]
  Powered by Linux