On 2010-09-08, at 10:59, Lukas Czerner wrote: > +extern int ext4_init_inode_table(struct super_block *sb, ext4_group_t > +{ > + if (sb->s_flags & MS_RDONLY) { > + ext4_warning(sb, "Filesystem mounted read only. " > + "Won't zeroout anything!"); > + ret = 1; > + goto out; > + } Is it actually needed to print this error message? This thread shouldn't be run if the filesystem was originally mounted read-only (that is checked in ext4_register_li_request()), and if it is later remounted read-only (due to user action or filesystem error) I don't think it is critical to print a message here. The ext4lazyinit thread will be restarted when the filesystem is remounted read-write in the future. > + /* > + * Skip zeroout if the inode table is full. But we set the ZEROED > + * flag anyway, because obviously, when it is full it does not need > + * further zeroing. > + */ > + if (unlikely(num == 0)) > + goto skip_zeroout; In fact, this is still safe as long as num < EXT4_INODES_PER_GROUP(sb). > + ext4_debug("going to zero out inode table in group %d\n", > + group); the "group" should align with the '(' on the previous line. > +/* Find next suitable group adn run ext4_init_inode_table */ > +static int ext4_run_li_request(struct ext4_li_request *elr) > +{ > + for (group = elr->lr_next_group; group < ngroups; group++) { > + gdp = ext4_get_group_desc(sb, group, NULL); > + if (!gdp) { > + ret = 1; > + break; > + } > + > + if (!(gdp->bg_flags & cpu_to_le16(EXT4_BG_INODE_ZEROED))) > + break; > + } There is nowhere in the code that lr_next_group is increased. That should probably be done in this function. Otherwise, for a large filesystem with 256k groups in it there will be an O(n^2) search for the next group to zero. > +static int ext4_lazyinit_thread(void *arg) > +{ > + while (true) { > + list_for_each_safe(pos, n, &eli->li_request_list) { > + /* Handle jiffies overflow - is it even possible?*/ > + if (timeout > jiffies) > + timeout = (ULONG_MAX - timeout) + jiffies; > + else > + timeout = jiffies - timeout; I'm not sure this is needed. When you compute "jiffies - timeout", it doesn't matter if "jiffies" has suddenly wrapped, the result will be the same. > +static ext4_group_t ext4_has_uninit_itable(struct super_block *sb) > +{ > + for (group = 0; group < ngroups; group++) { > + gdp = ext4_get_group_desc(sb, group, NULL); > + if (!(gdp->bg_flags & cpu_to_le16(EXT4_BG_INODE_ZEROED))) > + break; It would be much more efficient to do this in ext4_check_descriptors(), which is already walking through all of the groups checking the validity of the group descriptors at mount time. An alternative that would speed up mounting of large filesystems is to do the ext4_check_descriptors() checks from the ext4lazyinit thread. If any groups are found to be in error (which should be very rare) they can be marked in the group descriptor with a new per-group EXT4_BG_ERROR flag and skipped for any future allocations. This would need an additional in-memory flag that tracked if the group descriptor's checksum and block pointers had been validated yet or not, so that they are not used by anything before they are checked. That is a change that could be made at some later stage, after the initial patch is landed. Cheers, Andreas -- To unsubscribe from this list: send the line "unsubscribe linux-ext4" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html