On 2011-06-23, at 1:52 AM, Akira Fujita wrote: > ext4: Fix ext4 timestamps corruption Hi Akira-san, thank you for the patch. For submitting patches, it is easier for Ted if you send the patch in a separate email with a proper subject (e.g. "[PATCH] ext4: Fix ext4 timestamp > 2038 corruption" from instead of containing the reply to an old email. Otherwise Ted has to manually reformat the patch. > Officially, ext4 can handle its timestamps until 2514 with 32bit > entries plus EPOCH_BIT (2bits). But when timestamps values use > 32+ bit (e.g. 2038-01-19 9:14:08 0x0000000080000000), we can get > corrupted values. Because sign bit is overwritten by transferring > value between kernel space and user space. > > To fix this issue, 32th bit of extra time fields in ext4_inode structure > (e.g. i_ctime_extra) are used as the sign for 64bit user space. > Because these are used only 20bits for nano-second and bottom of 2bits This should be "30 bits for nanosecond". > are for EXT4_EPOCH_BITS shift. > With this patch, ext4 supports timestamps Y1901-2514. > > The performance comparison is as follows: > ------------------------------------------------ > | | file create | ls -l | > |--------------|---------------|---------------- > |with patch | 138.2056 sec | 14.9333 sec | > |without patch | 133.4566 sec | 14.9096 sec | > ------------------------------------------------ > file count:1 million (average of 3 trials) > kernel: 3.0.0-rc3 > > There is a slightly difference, but I think it is acceptable. I'm surprised that the difference is even measurable for such a change. > Thanks and Regards, > Akira Fujita > > Signed-off-by: Akira Fujita <a-fujita@xxxxxxxxxxxxx> > --- > fs/ext4/ext4.h | 30 +++++++++++++++++++++--------- > 1 file changed, 21 insertions(+), 9 deletions(-) > > diff -Nrup -X linux-3.0-rc3-a/Documentation/dontdiff linux-3.0-rc3-a/fs/ext4/ext4.h linux-3.0-rc3-b/fs/ext4/ext4.h > --- linux-3.0-rc3-a/fs/ext4/ext4.h 2011-06-14 07:29:59.000000000 +0900 > +++ linux-3.0-rc3-b/fs/ext4/ext4.h 2011-06-23 14:18:47.000000000 +0900 > @@ -645,6 +645,7 @@ struct move_extent { > #define EXT4_EPOCH_BITS 2 > #define EXT4_EPOCH_MASK ((1 << EXT4_EPOCH_BITS) - 1) > #define EXT4_NSEC_MASK (~0UL << EXT4_EPOCH_BITS) > +#define EXT4_TIMESTAMP_SIGN_MASK 0x80000000 > > /* > * Extended fields will fit into an inode if the filesystem was formatted > @@ -665,16 +666,23 @@ struct move_extent { > static inline __le32 ext4_encode_extra_time(struct timespec *time) > { > + return cpu_to_le32((sizeof(time->tv_sec) > 4 ? > + (time->tv_sec >> 32) & > + (EXT4_EPOCH_MASK | EXT4_TIMESTAMP_SIGN_MASK) : > + time->tv_sec & EXT4_TIMESTAMP_SIGN_MASK) | > + ((time->tv_nsec << EXT4_EPOCH_BITS) & EXT4_NSEC_MASK)); > } As mentioned by Mark, this cannot work because the high bit is already used for the most significant bit of the nanoseconds. > static inline void ext4_decode_extra_time(struct timespec *time, __le32 extra) > { > + if (sizeof(time->tv_sec) > 4) { > time->tv_sec |= (__u64)(le32_to_cpu(extra) & EXT4_EPOCH_MASK) > << 32; > + if (le32_to_cpu(extra) & EXT4_TIMESTAMP_SIGN_MASK) > + time->tv_sec |= EXT4_NSEC_MASK << 32; > + } > + > + time->tv_nsec = ((le32_to_cpu(extra) & ~EXT4_TIMESTAMP_SIGN_MASK) >> > + EXT4_EPOCH_BITS); > } > > #define EXT4_INODE_SET_XTIME(xtime, inode, raw_inode) \ > @@ -696,19 +704,23 @@ do { \ > > #define EXT4_INODE_GET_XTIME(xtime, inode, raw_inode) \ > do { \ > - (inode)->xtime.tv_sec = (signed)le32_to_cpu((raw_inode)->xtime); \ > - if (EXT4_FITS_IN_INODE(raw_inode, EXT4_I(inode), xtime ## _extra)) \ > + if (EXT4_FITS_IN_INODE(raw_inode, EXT4_I(inode), xtime ## _extra)) { \ > + (inode)->xtime.tv_sec = \ > + (__u32)(le32_to_cpu((raw_inode)->xtime)); \ > ext4_decode_extra_time(&(inode)->xtime, \ > raw_inode->xtime ## _extra); \ > - else \ > + } else { \ > + (inode)->xtime.tv_sec = \ > + (signed)le32_to_cpu((raw_inode)->xtime); \ > (inode)->xtime.tv_nsec = 0; \ > + } \ > } while (0) > > #define EXT4_EINODE_GET_XTIME(xtime, einode, raw_inode) \ > do { \ > if (EXT4_FITS_IN_INODE(raw_inode, einode, xtime)) \ > - (einode)->xtime.tv_sec = \ > - (signed)le32_to_cpu((raw_inode)->xtime); \ > + (einode)->xtime.tv_sec = \ > + (__u32)(le32_to_cpu((raw_inode)->xtime)); \ > if (EXT4_FITS_IN_INODE(raw_inode, einode, xtime ## _extra)) \ > ext4_decode_extra_time(&(einode)->xtime, \ > raw_inode->xtime ## _extra); \ Cheers, Andreas -- To unsubscribe from this list: send the line "unsubscribe linux-ext4" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html