Re: [PATCH] xfs: hold xfs_buf locked between shortform->leaf conversion and the addition of an attribute

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Fri, Aug 11, 2017 at 12:22:04PM +1000, Dave Chinner wrote:
> On Thu, Aug 10, 2017 at 02:32:33PM -0400, Brian Foster wrote:
> > On Thu, Aug 10, 2017 at 10:55:48AM -0700, Darrick J. Wong wrote:
> > > On Thu, Aug 10, 2017 at 10:52:49AM -0400, Brian Foster wrote:
> > > > On Thu, Aug 10, 2017 at 03:09:09PM +0300, Alex Lyakas wrote:
> > ...
> > > > OTOH, just adding deferred ops buffer relogging might not be too much
> > > > trouble either. ;) Anyways, thoughts?
> > > 
> > > I don't think it'd be difficult to add a _defer_bjoin operation that
> > > maintains a list of buffers that we need to bhold across rolls.
> > > 
> > > I think xfs_buf->b_list is only used for delwri buffers, and a buffer
> > > cannot be part of a transaction /and/ on a delwri list at the same time,
> > > right?  So it shouldn't be hard to whip something up and couple this
> > > patch to that.
> > > 
> > 
> > Hmm.. so if a buffer is modified, logged, committed, put on the AIL and
> > pushed, xfs_buf_item_push() locks it, puts it on the delwri queue and
> > unlocks. At that point, I _think_ it may be possible for another thread
> > to lock the buffer and join it to a new transaction. The delwri submit
> > skips the buffer if it has become pinned or locked since the delwri
> > queue (though I'm wondering if that unlocked pin check is racy against
> > locked buffer modifications. I suppose that would require a full
> > lock->pin->unlock cycle between the pin check and trylock however).
> 
> If it does race, we still catch pinned buffers in xfs_buf_submit() and
> block there on them. SO a race is just sub-optimal behaviour, not a
> bug.
> 

Ah I see, thanks.

> > The question I have for buffer relogging is what's the best way to track
> > the parts of the buffer that need to be relogged after a roll?
> > Copy/translate the dirty (xfs_buf_log_format) segment map(s)?
> 
> Just mark it ordered?
> 
> That way it goes through the transaction commit, pinned and put into
> the CIL and  gets moved forward in the AIL when the log checkpoints.
> We don't need to relog the actual contents in this case, just ensure
> it moves forward in the AIL appropriately while we hold it locked.

Hmm.. is it safe to mark a previously logged and AIL resident buffer
ordered in a subsequent transaction? The problem in this particular
example is that the empty leaf buffer is logged, committed and unpinned
(and thus AIL resident). We want to relog the buffer to move it forward
in the AIL on the next transaction because we're holding it locked and
thus it cannot be written back (and thus could pin the log tail).

If we mark the buffer ordered in the subsequent transaction and that
transaction commits/checkpoints to the log, don't we push the buffer
forward in the AIL to a checkpoint that doesn't have the originally
logged data..? IOW, it seems like if this does end up pushing the tail
of the log and we crash, we've thrown away checkpointed but not written
back metadata and potentially corrupted the fs. Hm?

In reading through some of this code, perhaps it doesn't matter which
part of the buffer we relog. We just need to make sure that the buffer
is marked dirty in the next transaction (i.e., via the log item
descriptor). If the buffer is held across transactions, I think that
means the underlying log item can't go away because the buffer can't be
written back. IIUC, the bli hangs around until writeback completion
precisely to allow this kind of relogging to occur by retaining the
dirty bitmap associated the buffer across potentially separate
transactions. If I'm following all that correctly (?), perhaps a new
xfs_trans_buf_relog() helper that just flags a buffer/transaction dirty
would suffice. Thoughts?

Brian

> The ordered flag is removed at each commit the buffer goes through,
> so as soon as we commit the final transaction it'll go back to
> behaving like a normal buffer...
> 
> Cheers,
> 
> Dave.
> -- 
> Dave Chinner
> david@xxxxxxxxxxxxx
> --
> To unsubscribe from this list: send the line "unsubscribe linux-xfs" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe linux-xfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [XFS Filesystem Development (older mail)]     [Linux Filesystem Development]     [Linux Audio Users]     [Yosemite Trails]     [Linux Kernel]     [Linux RAID]     [Linux SCSI]


  Powered by Linux