Re: A lot of NILFS: bad btree node messages (readonly fs)

On Tue, Jan 08, 2013 at 03:52:30PM +0300, Vyacheslav Dubeyko wrote:
> Hi guys,
> 
> I have been trying to reproduce the issue for the last three days,
> but without success. I tried different workloads and different
> environments. As far as I know, all of you can still reproduce the
> issue, so I have additional questions.
> 
> 1. All of you see messages like these:
> 
> Jan 03 22:36:38 [kernel] [  953.289973] NILFS: bad btree node (blocknr=26229286): level = 67, flags = 0xee, nchildren = 40
> Jan 03 22:36:38 [kernel] [  953.289976] NILFS error (device sda2): nilfs_bmap_lookup_contig: broken bmap (inode number=102230)

I have already noticed that previously I had:
Jan 04 19:40:29 [kernel] [  801.937864] NILFS: bad btree node (blocknr=26229286): level = 67, flags = 0xee, nchildren = 40

And now it is:
[34101.963275] NILFS: bad btree node (blocknr=26229286): level = 84, flags = 0x89, nchildren = 13348

So level, flags and nchildren differ, but blocknr is the same.
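
The comparison above can be made mechanical. What follows is only a minimal sketch: the regular expression is inferred from the two messages quoted in this mail, not taken from the kernel source, and the helper names parse_bad_node and changed_fields are hypothetical.

```python
import re

# Pattern inferred from the two log lines quoted above (an assumption,
# not derived from the kernel's format string).
BAD_NODE_RE = re.compile(
    r"NILFS: bad btree node \(blocknr=(?P<blocknr>\d+)\): "
    r"level = (?P<level>\d+), flags = (?P<flags>0x[0-9a-fA-F]+), "
    r"nchildren = (?P<nchildren>-?\d+)"
)


def parse_bad_node(line):
    """Extract the numeric fields of one 'bad btree node' message, or None."""
    m = BAD_NODE_RE.search(line)
    if m is None:
        return None
    return {
        "blocknr": int(m["blocknr"]),
        "level": int(m["level"]),
        "flags": int(m["flags"], 16),
        "nchildren": int(m["nchildren"]),
    }


def changed_fields(a, b):
    """Names of the fields whose values differ between two parsed messages."""
    return sorted(k for k in a if a[k] != b[k])


before = parse_bad_node(
    "Jan 04 19:40:29 [kernel] [  801.937864] NILFS: bad btree node "
    "(blocknr=26229286): level = 67, flags = 0xee, nchildren = 40"
)
after = parse_bad_node(
    "[34101.963275] NILFS: bad btree node "
    "(blocknr=26229286): level = 84, flags = 0x89, nchildren = 13348"
)
print(changed_fields(before, after))  # ['flags', 'level', 'nchildren']
```

Run over a whole log, the same two helpers would show whether any block number other than 26229286 ever appears.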


> As I understand it, you still get the message for a specific block
> number (for example, blocknr=26229286) during remount. But you no
> longer get the message for that block number after umount and mount
> again, although you can then get error messages for a different block
> number. Am I correct?

Looking at my logs I only have an error for the mentioned blocknr.


> Could you share the strace output of the "cat" command for such a
> corrupted file? The syslog may contain some interesting details from
> the execution of the "cat" command. Could you check the syslog for
> interesting error messages during such an attempt?

Attached strace output.


> 3. Could you share the configuration file of your kernel (.config)? I
> suspect that your environment has some special configuration that
> mine lacks.

Attached.


> 4. Could you share the content of the nilfs_cleanerd.conf file for the
> NILFS2 partition that has this issue? Sorry if I am asking again.

Attached.



> 5. Did you have any sudden power-off before you first encountered
> the issue?

No, I think not.


Piotr Szymaniak.
-- 
 Answer without stopping to think: what is the greatest danger to company
data? Did you think of sabotage, hackers or industrial espionage?
Indeed, all of that is true, but the biggest threat to company
data is yellow sticky notes.
  -- Ben Rothke, "Computer security"

Attachment: config.bz2
Description: BZip2 compressed data

Attachment: dmesg_cat.log.bz2
Description: BZip2 compressed data

protection_period	3600
min_clean_segments	15%
max_clean_segments	25%
clean_check_interval	10
selection_policy	timestamp	# timestamp in ascend order
nsegments_per_clean	2
mc_nsegments_per_clean	4
cleaning_interval	5
mc_cleaning_interval	1
retry_interval		60
use_mmap
log_priority		info
execve("/bin/cat", ["cat", "/var/tmp/kdecache-malpa/plasma_t"...], [/* 45 vars */]) = 0
brk(0)                                  = 0x8ba0000
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0xb772a000
access("/etc/ld.so.preload", R_OK)      = -1 ENOENT (No such file or directory)
open("/etc/ld.so.cache", O_RDONLY|O_CLOEXEC) = 3
fstat64(3, {st_mode=S_IFREG|0644, st_size=105158, ...}) = 0
mmap2(NULL, 105158, PROT_READ, MAP_PRIVATE, 3, 0) = 0xb7710000
close(3)                                = 0
open("/lib/libc.so.6", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\1\1\1\0\0\0\0\0\0\0\0\0\3\0\3\0\1\0\0\0\0\315\1\0004\0\0\0"..., 512) = 512
fstat64(3, {st_mode=S_IFREG|0755, st_size=1729888, ...}) = 0
mmap2(NULL, 1743644, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0xb7566000
mprotect(0xb7709000, 4096, PROT_NONE)   = 0
mmap2(0xb770a000, 12288, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3, 0x1a3) = 0xb770a000
mmap2(0xb770d000, 11036, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_ANONYMOUS, -1, 0) = 0xb770d000
close(3)                                = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0xb7565000
set_thread_area({entry_number:-1 -> 6, base_addr:0xb75656c0, limit:1048575, seg_32bit:1, contents:0, read_exec_only:0, limit_in_pages:1, seg_not_present:0, useable:1}) = 0
mprotect(0xb770a000, 8192, PROT_READ)   = 0
mprotect(0x8052000, 4096, PROT_READ)    = 0
mprotect(0xb774c000, 4096, PROT_READ)   = 0
munmap(0xb7710000, 105158)              = 0
brk(0)                                  = 0x8ba0000
brk(0x8bc1000)                          = 0x8bc1000
open("/usr/lib/locale/locale-archive", O_RDONLY|O_LARGEFILE|O_CLOEXEC) = 3
fstat64(3, {st_mode=S_IFREG|0644, st_size=1716432, ...}) = 0
mmap2(NULL, 1716432, PROT_READ, MAP_PRIVATE, 3, 0) = 0xb73c1000
close(3)                                = 0
fstat64(1, {st_mode=S_IFREG|0664, st_size=1904, ...}) = 0
open("/var/tmp/kdecache-malpa/plasma_theme_Transparent-sima84.kcache", O_RDONLY|O_LARGEFILE) = 3
fstat64(3, {st_mode=S_IFREG|0664, st_size=84213856, ...}) = 0
fadvise64_64(3, 0, 0, POSIX_FADV_SEQUENTIAL) = 0
read(3, 0x8ba1000, 65536)               = -1 EIO (Input/output error)
write(2, "cat: ", 5cat: )                    = 5
write(2, "/var/tmp/kdecache-malpa/plasma_t"..., 62/var/tmp/kdecache-malpa/plasma_theme_Transparent-sima84.kcache) = 62
open("/usr/share/locale/locale.alias", O_RDONLY|O_CLOEXEC) = 4
fcntl64(4, F_GETFD)                     = 0x1 (flags FD_CLOEXEC)
fstat64(4, {st_mode=S_IFREG|0644, st_size=2502, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0xb7729000
read(4, "# Locale name alias data base.\n#"..., 4096) = 2502
read(4, "", 4096)                       = 0
close(4)                                = 0
munmap(0xb7729000, 4096)                = 0
open("/usr/share/locale/pl_PL.UTF-8/LC_MESSAGES/libc.mo", O_RDONLY) = -1 ENOENT (No such file or directory)
open("/usr/share/locale/pl_PL.utf8/LC_MESSAGES/libc.mo", O_RDONLY) = -1 ENOENT (No such file or directory)
open("/usr/share/locale/pl_PL/LC_MESSAGES/libc.mo", O_RDONLY) = -1 ENOENT (No such file or directory)
open("/usr/share/locale/pl.UTF-8/LC_MESSAGES/libc.mo", O_RDONLY) = -1 ENOENT (No such file or directory)
open("/usr/share/locale/pl.utf8/LC_MESSAGES/libc.mo", O_RDONLY) = -1 ENOENT (No such file or directory)
open("/usr/share/locale/pl/LC_MESSAGES/libc.mo", O_RDONLY) = 4
fstat64(4, {st_mode=S_IFREG|0644, st_size=143799, ...}) = 0
mmap2(NULL, 143799, PROT_READ, MAP_PRIVATE, 4, 0) = 0xb739d000
close(4)                                = 0
open("/usr/lib/gconv/gconv-modules.cache", O_RDONLY) = 4
fstat64(4, {st_mode=S_IFREG|0644, st_size=26242, ...}) = 0
mmap2(NULL, 26242, PROT_READ, MAP_SHARED, 4, 0) = 0xb7396000
close(4)                                = 0
write(2, ": B\305\202\304\205d wej\305\233cia/wyj\305\233cia", 26: Błąd wejścia/wyjścia) = 26
write(2, "\n", 1
)                       = 1
close(3)                                = 0
close(1)                                = 0
close(2)                                = 0
exit_group(1)                           = ?
+++ exited with 1 +++
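
In the trace above, cat gives up at its first 65536-byte read, which returns EIO, so it says nothing about the rest of the 84213856-byte file. A minimal sketch of how the remaining offsets could be probed is below. It was not run as part of this thread, the helper name find_unreadable_chunks is hypothetical, and it assumes a Unix system where os.pread is available.

```python
import errno
import os

CHUNK = 65536  # same read size that cat used in the trace above


def find_unreadable_chunks(path, chunk=CHUNK):
    """Return a list of (offset, errno name) for every chunk whose read fails.

    Unlike cat, this keeps going after an error, so it shows whether only
    the start of the file or the whole file is affected.
    """
    failures = []
    fd = os.open(path, os.O_RDONLY)
    try:
        size = os.fstat(fd).st_size
        for offset in range(0, size, chunk):
            try:
                # pread reads at an explicit offset, so a failed read
                # does not affect the position of the next one.
                os.pread(fd, chunk, offset)
            except OSError as e:
                name = errno.errorcode.get(e.errno, str(e.errno))
                failures.append((offset, name))
    finally:
        os.close(fd)
    return failures
```

Called with the path of the affected file, it would return an empty list for a healthy file and entries such as (0, 'EIO') for chunks that cannot be read.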

Attachment: signature.asc
Description: Digital signature

