Re: [BUG] ext4 timestamps corruption

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 2011-06-23, at 1:52 AM, Akira Fujita wrote:
> ext4: Fix ext4 timestamps corruption

Hi Akira-san,
thank you for the patch.  For submitting patches, it is easier for
Ted if you send the patch in a separate email with a proper subject
(e.g. "[PATCH] ext4: Fix ext4 timestamp > 2038 corruption" from
instead of containing the reply to an old email.  Otherwise Ted has
to manually reformat the patch.

> Officially, ext4 can handle its timestamps until 2514 with 32bit
> entries plus EPOCH_BIT (2bits). But when timestamps values use
> 32+ bit (e.g. 2038-01-19 9:14:08 0x0000000080000000), we can get
> corrupted values.  Because sign bit is overwritten by transferring
> value between kernel space and user space.
> 
> To fix this issue, 32th bit of extra time fields in ext4_inode structure
> (e.g. i_ctime_extra) are used as the sign for 64bit user space.
> Because these are used only 20bits for nano-second and bottom of 2bits

This should be "30 bits for nanosecond".

> are for EXT4_EPOCH_BITS shift.
> With this patch, ext4 supports timestamps Y1901-2514.
> 
> The performance comparison is as follows:
> ------------------------------------------------
> |              | file create   |   ls -l       |
> |--------------|---------------|----------------
> |with patch    | 138.2056 sec  | 14.9333 sec   |
> |without patch | 133.4566 sec  | 14.9096 sec   |
> ------------------------------------------------
> file count:1 million (average of 3 trials)
> kernel: 3.0.0-rc3
> 
> There is a slightly difference, but I think it is acceptable.

I'm surprised that the difference is even measurable for such a
change.

> Thanks and Regards,
> Akira Fujita
> 
> Signed-off-by: Akira Fujita <a-fujita@xxxxxxxxxxxxx>
> ---
> fs/ext4/ext4.h |   30 +++++++++++++++++++++---------
> 1 file changed, 21 insertions(+), 9 deletions(-)
> 
> diff -Nrup -X linux-3.0-rc3-a/Documentation/dontdiff linux-3.0-rc3-a/fs/ext4/ext4.h linux-3.0-rc3-b/fs/ext4/ext4.h
> --- linux-3.0-rc3-a/fs/ext4/ext4.h	2011-06-14 07:29:59.000000000 +0900
> +++ linux-3.0-rc3-b/fs/ext4/ext4.h	2011-06-23 14:18:47.000000000 +0900
> @@ -645,6 +645,7 @@ struct move_extent {
> #define EXT4_EPOCH_BITS 2
> #define EXT4_EPOCH_MASK ((1 << EXT4_EPOCH_BITS) - 1)
> #define EXT4_NSEC_MASK  (~0UL << EXT4_EPOCH_BITS)
> +#define EXT4_TIMESTAMP_SIGN_MASK 0x80000000
> 
> /*
> * Extended fields will fit into an inode if the filesystem was formatted
> @@ -665,16 +666,23 @@ struct move_extent {
> static inline __le32 ext4_encode_extra_time(struct timespec *time)
> {
> +       return cpu_to_le32((sizeof(time->tv_sec) > 4 ?
> +		(time->tv_sec >> 32) &
> +		(EXT4_EPOCH_MASK | EXT4_TIMESTAMP_SIGN_MASK) :
> +		time->tv_sec & EXT4_TIMESTAMP_SIGN_MASK) |
> +		((time->tv_nsec << EXT4_EPOCH_BITS) & EXT4_NSEC_MASK));
> }

As mentioned by Mark, this cannot work because the high bit is already used
for the most significant bit of the nanoseconds.

> static inline void ext4_decode_extra_time(struct timespec *time, __le32 extra)
> {
> +	if (sizeof(time->tv_sec) > 4) {
> 	       time->tv_sec |= (__u64)(le32_to_cpu(extra) & EXT4_EPOCH_MASK)
> 			       << 32;
> +		if (le32_to_cpu(extra) & EXT4_TIMESTAMP_SIGN_MASK)
> +			time->tv_sec |= EXT4_NSEC_MASK << 32;
> +	}
> +
> +	time->tv_nsec = ((le32_to_cpu(extra) & ~EXT4_TIMESTAMP_SIGN_MASK) >>
> +				EXT4_EPOCH_BITS);
> }
> 
> #define EXT4_INODE_SET_XTIME(xtime, inode, raw_inode)			       \
> @@ -696,19 +704,23 @@ do {									       \
> 
> #define EXT4_INODE_GET_XTIME(xtime, inode, raw_inode)			       \
> do {									       \
> -	(inode)->xtime.tv_sec = (signed)le32_to_cpu((raw_inode)->xtime);       \
> -	if (EXT4_FITS_IN_INODE(raw_inode, EXT4_I(inode), xtime ## _extra))     \
> +	if (EXT4_FITS_IN_INODE(raw_inode, EXT4_I(inode), xtime ## _extra)) {   \
> +		(inode)->xtime.tv_sec =                                        \
> +				(__u32)(le32_to_cpu((raw_inode)->xtime));      \
> 		ext4_decode_extra_time(&(inode)->xtime,			       \
> 				       raw_inode->xtime ## _extra);	       \
> -	else								       \
> +	} else {							       \
> +		(inode)->xtime.tv_sec =                                        \
> +			(signed)le32_to_cpu((raw_inode)->xtime);               \
> 		(inode)->xtime.tv_nsec = 0;				       \
> +	}                                                                      \
> } while (0)
> 
> #define EXT4_EINODE_GET_XTIME(xtime, einode, raw_inode)			       \
> do {									       \
> 	if (EXT4_FITS_IN_INODE(raw_inode, einode, xtime))		       \
> -		(einode)->xtime.tv_sec = 				       \
> -			(signed)le32_to_cpu((raw_inode)->xtime);	       \
> +		(einode)->xtime.tv_sec =                                       \
> +			(__u32)(le32_to_cpu((raw_inode)->xtime));              \
> 	if (EXT4_FITS_IN_INODE(raw_inode, einode, xtime ## _extra))	       \
> 		ext4_decode_extra_time(&(einode)->xtime,		       \
> 				       raw_inode->xtime ## _extra);	       \


Cheers, Andreas





--
To unsubscribe from this list: send the line "unsubscribe linux-ext4" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Reiser Filesystem Development]     [Ceph FS]     [Kernel Newbies]     [Security]     [Netfilter]     [Bugtraq]     [Linux FS]     [Yosemite National Park]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Samba]     [Device Mapper]     [Linux Media]

  Powered by Linux