On Fri, Oct 23, 2015 at 1:56 AM, Ronny Hegewald <ronny.hegewald@xxxxxxxxx> wrote: > On Thursday 22 October 2015, Ilya Dryomov wrote: >> Well, checksum mismatches are to be expected given what we are doing >> now, but I wouldn't expect any data corruptions. Ronny writes that he >> saw frequent ext4 corruptions on krbd devices before he enabled stable >> pages, which leads me to believe that the !crc case, for which we won't >> be setting BDI_CAP_STABLE_WRITES, is going to be/remain broken. Ronny, >> could you describe it in more detail and maybe share some of those osd >> logs with bad crc messages? >> > This is from a 10 minute period from one of the OSDs. > > 23:11:02.423728 ce5dfb70 0 bad crc in data 1657725429 != exp 496797267 > 23:11:37.586411 ce5dfb70 0 bad crc in data 1216602498 != exp 1888811161 > 23:12:07.805675 cc3ffb70 0 bad crc in data 3140625666 != exp 2614069504 > 23:12:44.485713 c96ffb70 0 bad crc in data 1712148977 != exp 3239079328 > 23:13:24.746217 ce5dfb70 0 bad crc in data 144620426 != exp 3156694286 > 23:13:52.792367 ce5dfb70 0 bad crc in data 4033880920 != exp 4159672481 > 23:14:22.958999 c96ffb70 0 bad crc in data 847688321 != exp 1551499144 > 23:16:35.015629 ce5dfb70 0 bad crc in data 2790209714 != exp 3779604715 > 23:17:48.482049 c96ffb70 0 bad crc in data 1563466764 != exp 528198494 > 23:19:28.925357 cc3ffb70 0 bad crc in data 1764275395 != exp 2075504274 > 23:19:59.039843 cc3ffb70 0 bad crc in data 2960172683 != exp 1215950691 Could you share the entire log snippet for those 10 minutes? > > The filesystem corruptions are usually ones with messages of > > EXT4-fs error (device rbd4): ext4_mb_generate_buddy:757: group 155, block > bitmap and bg descriptor inconsistent: 23625 vs 23660 free clusters > > These were pretty common, at least every other day, often multiple times a > day. > > Sometimes there was a additional > > JBD2: Spotted dirty metadata buffer (dev = rbd4, blocknr = 0). There's a risk > of filesystem corruption in case of system crash. > > Another type of Filesystem corruption i experienced through kernel > compilations, that lead to the following messages. > > EXT4-fs warning (device rbd3): empty_dir:2488: bad directory (dir #282221) - > no `.' or `..' > EXT4-fs warning (device rbd3): empty_dir:2488: bad directory (dir #273062) - > no `.' or `..' > EXT4-fs warning (device rbd3): empty_dir:2488: bad directory (dir #272270) - > no `.' or `..' > EXT4-fs warning (device rbd3): empty_dir:2488: bad directory (dir #282254) - > no `.' or `..' > EXT4-fs warning (device rbd3): empty_dir:2488: bad directory (dir #273070) - > no `.' or `..' > EXT4-fs warning (device rbd3): empty_dir:2488: bad directory (dir #272308) - > no `.' or `..' > EXT4-fs error (device rbd3): ext4_lookup:1417: inode #270033: comm rm: deleted > inode referenced: 270039 > last message repeated 2 times > EXT4-fs warning (device rbd3): empty_dir:2488: bad directory (dir #271534) - > no `.' or `..' > EXT4-fs warning (device rbd3): empty_dir:2488: bad directory (dir #271275) - > no `.' or `..' > EXT4-fs warning (device rbd3): empty_dir:2488: bad directory (dir #282290) - > no `.' or `..' > EXT4-fs warning (device rbd3): empty_dir:2488: bad directory (dir #281914) - > no `.' or `..' > EXT4-fs error (device rbd3): ext4_lookup:1417: inode #270033: comm rm: deleted > inode referenced: 270039 > last message repeated 2 times > kernel: EXT4-fs error (device rbd3): ext4_lookup:1417: inode #273018: comm rm: > deleted inode referenced: 282221 > EXT4-fs error (device rbd3): ext4_lookup:1417: inode #273018: comm rm: deleted > inode referenced: 282221 > EXT4-fs error (device rbd3): ext4_lookup:1417: inode #273018: comm rm: deleted > inode referenced: 281914 > EXT4-fs error (device rbd3): ext4_lookup:1417: inode #273018: comm rm: deleted > inode referenced: 281914 > EXT4-fs error: 243 callbacks suppressed > EXT4-fs error (device rbd3): ext4_lookup:1417: inode #282002: comm cp: deleted > inode referenced: 45375 > kernel: EXT4-fs error (device rbd3): ext4_lookup:1417: inode #282002: comm cp: > deleted inode referenced: 45371 > > The result was that various files and directories in the kernel sourcedir > couldn't be accessed anymore and even fsck couldn't repair it, so i had to > finally delete it. But these ones were pretty rare. > > Another issue were the data-corruptions in the files itself, that happened > independently from the filesystem-corruptions. These happened on most days, > sometimes only once, sometimes multiple times a day. > > Newly written files that contained corrupted data seem to always have it only > at one place. These corrupt data replaced the original data from the file, but > never changed the file-size. The position of this corruptions in the files > were always different. > > Interesting part is that this corrupted parts always followed the same > pattern. First some few hundred 0x0 bytes, then a few kb (10-30) of random > binary data, that finished again with a few hundred bytes of 0x0. > > In a few cases i could trace this data back to origin from another file that > was read at the same time from the same programm. But that might be > accidentially, because other corruptions that happened in the same scenario I > couldn't trace back this way. > > In other cases that corrupt data originated from files that were deleted > recently (a few minutes ago). Which kernel was this on? You really should have reported all of this as soon as you hit it - it sounds like you've been dealing with this issue for a while. There's definitely more than stable pages in play here, I'll look into it. Thanks, Ilya -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html