On Tue, Sep 03, 2013 at 02:25:07PM -0400, Brian Foster wrote:
> An inode free operation can have several effects on the finobt. If
> all inodes have been freed and the chunk deallocated, we remove the
> finobt record. If the inode chunk was previously full, we must
> insert a new record based on the existing inobt record. Otherwise,
> we modify the record in place.
>
> Create the xfs_ifree_finobt() function to identify the potential
> scenarios and update the finobt appropriately.

The first thing I'd do is factor all the inobt manipulation code in
xfs_difree() out into a xfs_difree_inobt() helper function. Have it
return the record and offset that are then passed to your new helper,
xfs_difree_finobt(). That way xfs_difree() really becomes the setup
function for the two btree operations rather than containing one set
of modifications and calling a function to do the other...

> Signed-off-by: Brian Foster <bfoster@xxxxxxxxxx>
> ---
>  fs/xfs/xfs_ialloc.c | 120 ++++++++++++++++++++++++++++++++++++++++++++++++++++
>  1 file changed, 120 insertions(+)
>
> diff --git a/fs/xfs/xfs_ialloc.c b/fs/xfs/xfs_ialloc.c
> index 516f4af..96f71b5 100644
> --- a/fs/xfs/xfs_ialloc.c
> +++ b/fs/xfs/xfs_ialloc.c
> @@ -198,6 +198,117 @@ xfs_inobt_insert(
>  }
>
>  /*
> + * Free an inode in the free inode btree.
> + */
> +STATIC int
> +xfs_ifree_finobt(
> +	struct xfs_mount		*mp,
> +	struct xfs_trans		*tp,
> +	struct xfs_buf			*agbp,
> +	struct xfs_inobt_rec_incore	*ibtrec,/* inobt record */
> +	int				offset)	/* offset of inode */
> +{
> +	struct xfs_agi			*agi = XFS_BUF_TO_AGI(agbp);
> +	xfs_agnumber_t			agno = be32_to_cpu(agi->agi_seqno);
> +	struct xfs_btree_cur		*cur;
> +	struct xfs_inobt_rec_incore	rec;
> +	int				error;
> +	int				i;
> +
> +	if (!xfs_sb_version_hasfinobt(&mp->m_sb))
> +		return 0;

There's that vector thing again...

> +
> +	cur = xfs_inobt_init_cursor(mp, tp, agbp, agno, XFS_BTNUM_FINO);
> +
> +	error = xfs_inobt_lookup(cur, ibtrec->ir_startino, XFS_LOOKUP_EQ, &i);
> +	if (error)
> +		goto error;
> +
> +	if (i == 1) {
> +		int j;
> +		/*
> +		 * Read and update the existing record.
> +		 */
> +		error = xfs_inobt_get_rec(cur, &rec, &j);
> +		if (error)
> +			goto error;
> +		XFS_WANT_CORRUPTED_GOTO(j == 1, error);
> +
> +		rec.ir_free |= XFS_INOBT_MASK(offset);
> +		rec.ir_freecount++;
> +
> +		XFS_WANT_CORRUPTED_GOTO((rec.ir_free == ibtrec->ir_free) &&
> +					(rec.ir_freecount == ibtrec->ir_freecount),
> +					error);
> +	}

I can't say I'm a great fan of the layout of the logic. Yes, there are
lots of cases to handle. It looks like:

	lookup()
	if (found)
		modify in place

	if (found && full && deleting chunks)
		delete record
	else if (!found && no record)
		insert record
	else if (found)
		update record
	else
		corruption!

I think it would be better to get the "!found" case out of the way
at the start, i.e.:

	if (i == 0) {
		if (ibtrec->ir_freecount == 1)
			insert record
		else
			CORRUPTION
		goto out;
	}

	/* found a record, no need to check i == 1 anymore */
	ASSERT(i == 1);

	/* read and update */

	if (full && deleting chunks)
		delete record
	else
		update record

> +
> +	/*
> +	 * The content of inobt records should always match between the inobt
> +	 * and finobt. The lifecycle of records in the finobt is different from
> +	 * the inobt in that the finobt only tracks records with at least one
> +	 * free inode. This is to optimize lookup for inode allocation purposes.
> +	 * The following checks fix up the finobt appropriately based on the
> +	 * state of the record subsequent to the current operation.
> +	 */
> +
> +	if ((i == 1) &&
> +	    (rec.ir_freecount == XFS_IALLOC_INODES(mp) &&
> +	     !(mp->m_flags & XFS_MOUNT_IKEEP))) {
> +		/*
> +		 * We have an existing finobt record. If all inodes are free
> +		 * and we're in !ikeep mode, the entire inode chunk has been
> +		 * deallocated. Remove the record from the finobt.
> +		 */
> +		error = xfs_btree_delete(cur, &i);
> +		if (error)
> +			goto error;
> +		ASSERT(i == 1);
> +	} else if ((i == 0) && (ibtrec->ir_freecount == 1)) {
> +		/*
> +		 * No existing finobt record and the inobt record has a single
> +		 * free inode. This means we've freed an inode in a previously
> +		 * fully allocated chunk. Insert a new record into the finobt
> +		 * based on the current inobt record.
> +		 */
> +		cur->bc_rec.i.ir_startino = ibtrec->ir_startino;
> +		cur->bc_rec.i.ir_free = ibtrec->ir_free;
> +		cur->bc_rec.i.ir_freecount = ibtrec->ir_freecount;
> +		error = xfs_btree_insert(cur, &i);
> +		if (error)
> +			goto error;
> +		ASSERT(i == 1);

That's rather similar to the code in xfs_inobt_insert(). Indeed, if
you write a helper - xfs_inobt_insert_rec() - for this, then rather
than modifying xfs_inobt_lookup() to take extra parameters as I
wondered about for the previous patch, leave it alone and pass the
parameters to xfs_inobt_insert_rec() instead. Then this code is
functionally identical to the xfs_inobt_insert() done during
allocation....

Cheers,

Dave.
-- 
Dave Chinner
david@xxxxxxxxxxxxx
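
The "!found first" ordering suggested in the review can be modelled in
isolation. What follows is only a rough userspace sketch, not kernel
code: struct toy_rec, enum toy_action, toy_ifree_finobt() and
TOY_INODES_PER_CHUNK are all hypothetical stand-ins (for the incore
record, the btree operation to perform, the function under review and
XFS_IALLOC_INODES(mp) respectively), and the btree lookup result is
simply passed in as a bool.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Assumption: 64 inodes per chunk, standing in for XFS_IALLOC_INODES(mp). */
#define TOY_INODES_PER_CHUNK	64

/* Hypothetical stand-in for struct xfs_inobt_rec_incore. */
struct toy_rec {
	uint32_t	startino;	/* first inode of the chunk */
	int		freecount;	/* number of free inodes */
	uint64_t	free;		/* bitmask, set bit = free inode */
};

/* Hypothetical: which finobt operation the caller should perform. */
enum toy_action {
	TOY_INSERT,
	TOY_DELETE,
	TOY_UPDATE,
	TOY_CORRUPT,
};

/*
 * Decide what to do to the finobt after an inode has been freed.
 *
 * found:  result of the finobt lookup (i == 1 in the patch)
 * rec:    finobt record; updated in place, or filled in on insert
 * ibtrec: inobt record, already updated for the free
 * offset: offset of the freed inode within the chunk
 * ikeep:  true if empty chunks are kept (XFS_MOUNT_IKEEP set)
 */
static enum toy_action
toy_ifree_finobt(
	bool			found,
	struct toy_rec		*rec,
	const struct toy_rec	*ibtrec,
	int			offset,
	bool			ikeep)
{
	assert(offset >= 0 && offset < TOY_INODES_PER_CHUNK);

	/* Get the "!found" case out of the way first. */
	if (!found) {
		/* Only a previously full chunk may lack a finobt record. */
		if (ibtrec->freecount != 1)
			return TOY_CORRUPT;
		*rec = *ibtrec;
		return TOY_INSERT;
	}

	/* Found a record: read and update, then cross-check with the inobt. */
	rec->free |= (uint64_t)1 << offset;
	rec->freecount++;
	if (rec->free != ibtrec->free || rec->freecount != ibtrec->freecount)
		return TOY_CORRUPT;

	/* All inodes free and chunks are being deleted: drop the record. */
	if (rec->freecount == TOY_INODES_PER_CHUNK && !ikeep)
		return TOY_DELETE;

	return TOY_UPDATE;
}
```

With this shape the found path never has to test the lookup result
again, and the only remaining branch is delete versus update. In the
real function each returned action would map onto a btree call
(insert, delete or update) with the usual error handling.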