On Nov 26, 2008  19:35 -0500, Theodore Ts'o wrote:
> I guess I didn't make myself clear.  I was *not* suggesting that we
> share EA's in one inode, or in one extent tree.  Instead, what I
> suggested was that instead of having a pointer to an inode, if the
> value of the EA is less than half the blocksize, it is stored in the
> EA block.  If it is between 50% and 100% of the blocksize, instead of
> pointing at inode, we point to a block.  If it is greater than a
> blocksize, we point at a block containing an EA tree.  (Which means
> for a large EA the average space overhead is 6k --- 4k for the extent
> block, plus 2k for the fragmentation cost).
>
> So this scheme very much uses separate EA's, and does not pack all of
> the EA's into a single tree.  It is deliberately kept simple precisely
> because like you I don't think it's worth it to optimize EA's.  On the
> other hand, running out of inodes is a big problem, and dynamic inodes
> is far more complicated an issue, especially if we don't have 64-bit
> inode support in the kernel and in userspace, and we need to worry
> about locality issues and how dynamic inodes work with online
> resizing.
>
> The tradeoff is that my scheme doesn't burn an inode for each large
> EA, but for EA's greater than a blocksize, we chew an extra block's
> worth of overhead.  Personally, I think it's a worthwhile tradeoff ---

The other issue is that if we are pointing to a direct extent tree
instead of a relative block in the inode then all of the normal IO
functions are not usable (ext3_getblk(), etc), and they would have to
be re-implemented.  It might be possible to fake out the extent handling
functions and do this by iterating over the blocks directly by virtue
of passing in the parent inode, but it seems prone to breakage.

Cheers, Andreas
--
Andreas Dilger
Sr. Staff Engineer, Lustre Group
Sun Microsystems of Canada, Inc.
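
For reference, the three-tier placement Ted describes in the quoted text
could be sketched roughly as below.  This is only an illustration of the
size thresholds: the enum and the function ea_choose_placement() are
hypothetical names that do not exist in ext3/ext4, and the handling of a
value that is exactly half a block or exactly one block is an assumption,
since the quoted text does not say which side those cases fall on.

```c
#include <stddef.h>

/* Hypothetical placement classes, named for this sketch only. */
enum ea_placement {
	EA_IN_EA_BLOCK,		/* value stored inside the shared EA block */
	EA_SINGLE_BLOCK,	/* EA entry points at one dedicated block */
	EA_TREE_BLOCK		/* EA entry points at a block holding an EA tree */
};

/*
 * Pick a placement from the value length and the filesystem blocksize.
 * Assumption: exactly blocksize/2 and exactly blocksize both go to the
 * single-block case.
 */
static enum ea_placement ea_choose_placement(size_t value_len,
					     size_t blocksize)
{
	if (value_len < blocksize / 2)
		return EA_IN_EA_BLOCK;
	if (value_len <= blocksize)
		return EA_SINGLE_BLOCK;
	return EA_TREE_BLOCK;
}
```

With a 4k blocksize, a 1000-byte value would stay in the EA block, a
3000-byte value would get its own block, and anything over 4096 bytes
would go through the tree block, which is where the extra block of
overhead mentioned above comes from.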