On 2019/6/4 0:23, Jaegeuk Kim wrote: > On 05/31, Chao Yu wrote: >> On 2019/5/31 0:06, Jaegeuk Kim wrote: >>> On 05/24, sunqiuyang wrote: >>>> From: Qiuyang Sun <sunqiuyang@xxxxxxxxxx> >>>> >>>> This ioctl shrinks a given length (aligned to sections) from end of the >>>> main area. Any cursegs and valid blocks will be moved out before >>>> invalidating the range. >>>> >>>> This feature can be used for adjusting partition sizes online. >>>> -- >>>> Changlog v1 ==> v2: >>>> >>>> Sahitya Tummala: >>>> - Add this ioctl for f2fs_compat_ioctl() as well. >>>> - Fix debugfs status to reflect the online resize changes. >>>> - Fix potential race between online resize path and allocate new data >>>> block path or gc path. >>>> >>>> Others: >>>> - Rename some identifiers. >>>> - Add some error handling branches. >>>> - Clear sbi->next_victim_seg[BG_GC/FG_GC] in shrinking range. >>>> -- >>>> Changelog v2 ==> v3: >>>> Implement this interface as ext4's, and change the parameter from shrunk >>>> bytes to new block count of F2FS. >>>> -- >>>> Changelog v3 ==> v4: >>>> - During resizing, force to empty sit_journal and forbid adding new >>>> entries to it, in order to avoid invalid segno in journal after resize. >>>> - Reduce sbi->user_block_count before resize starts. >>>> - Commit the updated superblock first, and then update in-memory metadata >>>> only when the former succeeds. >>>> - Target block count must align to sections. >>>> -- >>>> Changelog v4 ==> v5: >>>> Write checkpoint before and after committing the new superblock, w/o >>>> CP_FSCK_FLAG respectively, so that the FS can be fixed by fsck even if >>>> resize fails after the new superblock is committed. >>>> -- >>>> Changelog v5 ==> v6: >>>> - In free_segment_range(), reduce granularity of gc_mutex. >>>> - Add protection on curseg migration. >>>> >>>> Signed-off-by: Qiuyang Sun <sunqiuyang@xxxxxxxxxx> >>>> Signed-off-by: Chao Yu <yuchao0@xxxxxxxxxx> >>>> Signed-off-by: Sahitya Tummala <stummala@xxxxxxxxxxxxxx> >>>> --- >>>> fs/f2fs/checkpoint.c | 5 +- >>>> fs/f2fs/debug.c | 7 +++ >>>> fs/f2fs/f2fs.h | 7 +++ >>>> fs/f2fs/file.c | 28 +++++++++++ >>>> fs/f2fs/gc.c | 134 ++++++++++++++++++++++++++++++++++++++++++++++++++- >>>> fs/f2fs/segment.c | 54 +++++++++++++++++---- >>>> fs/f2fs/segment.h | 1 + >>>> fs/f2fs/super.c | 4 ++ >>>> 8 files changed, 228 insertions(+), 12 deletions(-) >>>> >>>> diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c >>>> index ed70b68..4706d0a 100644 >>>> --- a/fs/f2fs/checkpoint.c >>>> +++ b/fs/f2fs/checkpoint.c >>>> @@ -1313,8 +1313,11 @@ static void update_ckpt_flags(struct f2fs_sb_info *sbi, struct cp_control *cpc) >>>> else >>>> __clear_ckpt_flags(ckpt, CP_ORPHAN_PRESENT_FLAG); >>>> >>>> - if (is_sbi_flag_set(sbi, SBI_NEED_FSCK)) >>>> + if (is_sbi_flag_set(sbi, SBI_NEED_FSCK) || >>>> + is_sbi_flag_set(sbi, SBI_IS_RESIZEFS)) >>>> __set_ckpt_flags(ckpt, CP_FSCK_FLAG); >>>> + else >>>> + __clear_ckpt_flags(ckpt, CP_FSCK_FLAG); >>> >>> We don't need to clear this flag. >> >> During resizefs, we may face inconsistent status of filesystem's on-disk data, >> so I propose to use below flow, so once some thing breaks resizefs, fsck can >> detect the corruption by the CP_FSCK_FLAG directly. >> >> - resizefs() >> - set SBI_IS_RESIZEFS >> - do_checkpoint() >> - if (is_resizing) >> - set CP_FSCK_FLAG >> >> - clear SBI_IS_RESIZEFS >> - do_checkpoint() >> - if (!is_resizing && not_need_fsck) >> - clear CP_FSCK_FLAG >> >> It's safe to clear CP_FSCK_FLAG if there is no resizing and corruption, as once >> the inconsistency was detected we will keep SBI_NEED_FSCK in memory anyway, then >> checkpoint can set CP_FSCK_FLAG again. > > This tries to resize the image and I mostly worried whether fsck is able to fix So, Qiuyang, could you try break resizefs at some key points with power-cut, to check whether fsck can repair all corruption cases? and what is the result (resized fs or origianl fs)? > the corrupted metadata area. Moreover, I'm in doubt we really need to do this in > parallel with FS operations. What do you mean? We have wrapped main resizefs operaion with {freeze,thaw}_bdev, so there should be no parallel FS operations. Thanks,