On Fri, Apr 12, 2024 at 09:27:55AM +0800, yebin (H) wrote:
> I thought of a solution that when the commit block checksum is
> incorrect, retain the first 512 bytes of data, clear the subsequent
> data, and then calculate the checksum to see if it is correct. This
> solution can distinguish whether the commit is complete for
> components that can ensure the atomicity of 512 bytes or more. But
> for HDD, it may not be able to distinguish, but it should be
> alleviated to some extent.

Yeah, we discussed something similar at the weekly ext4 call; the idea
was to change the kernel to zero out the jbd2 block before we fill in
any jbd2 tags (including in the commit block) when writing the
journal. Then in the journal replay path, if the checksum doesn't
match, we can try zeroing out everything beyond the size in the header
struct, and then retry the checksum and see if it matches.

This also has the benefit of making sure that we aren't leaking stale
(uninitialized) kernel memory to disk, which could be considered a
security vulnerability in some cases, although the likelihood that
something truly sensitive could be leaked is quite low; the attack
requires raw access to the storage device; and the exposure is similar
to what gets written to the swap device. Still, there are people who
do worry about such things.

- Ted
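
For illustration only, here is a rough user-space sketch of the two halves of that idea: zero the block before filling it in on the write side, and on replay retry the checksum after zeroing everything past the used size. It is not the jbd2 code. The names `BLOCK_SIZE`, `build_block`, `commit_block_valid`, and `used_len` are hypothetical, `zlib.crc32` is only a stand-in for the checksum jbd2 actually uses, and the expected checksum is passed in separately rather than read from a field inside the block.

```python
import zlib

BLOCK_SIZE = 4096  # hypothetical journal block size, for illustration


def checksum(block: bytes) -> int:
    # Stand-in checksum (assumption): real jbd2 checksums differ in
    # algorithm and seeding; zlib.crc32 just keeps the sketch runnable.
    return zlib.crc32(block)


def build_block(header: bytes) -> bytes:
    # Write side: start from an all-zero block, then fill in the header,
    # so nothing past the header is stale or uninitialized.
    block = bytearray(BLOCK_SIZE)
    block[:len(header)] = header
    return bytes(block)


def commit_block_valid(block: bytes, expected: int, used_len: int) -> bool:
    # Replay side: accept the block if the checksum matches as read.
    if checksum(block) == expected:
        return True
    # Otherwise zero everything beyond the used size and retry once.
    scrubbed = block[:used_len] + bytes(len(block) - used_len)
    return checksum(scrubbed) == expected
```

With this, a block whose first 512 bytes landed on disk but whose tail still holds old data passes the retry, while a block whose header bytes are damaged still fails. The retry only helps if the write side really did zero the tail, which is why the two changes go together.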