Hello ext3-users,
we have a recurring kernel oops here.
We run 4 machines: 3 client nodes and 1 master. The master holds a fairly
big repository of small files; its current size is ~40GB, with ~1.2M files
in ~100 directories. We rsync changes from the master to the client nodes.
This works perfectly for 2 of the nodes, but our 3rd node oopses
"sometimes", rendering the machine unusable, and we are forced to reboot
the box (no serial console, no sysrq possible). Below are the oops and a
few details; more details, the .config, dmesg and tune2fs -l output are
here:
http://nerdbynature.de/bits/2.6.18-debian
Yes, it's a Debian kernel, 2.6.18-2-k7 to be specific; it happened with
2.6.17-2-k7 too. We haven't tried vanilla yet. All boxen have the same
hardware (amd64, 32-bit kernel + userland (Debian unstable), 1GB RAM).
The filesystem resides on an md RAID0 consisting of 2 SATA disks.
Any ideas what could cause this?
Thanks,
Christian.
f8836d37
Modules linked in: ipt_TCPMSS xt_tcpudp xt_state iptable_filter
ip_conntrack_ftp ip_conntrack_irc ip_conntrack nfnetlink ip_tables
x_tables ipv6 ipip tunnel4 dm_snapshot dm_mirror dm_mod shpchp
pci_hotplug i2c_viapro psmouse i2c_core serio_raw pcspkr evdev amd64_agp
agpgart parport_pc parport rtc floppy ide_generic r8169 uhci_hcd
ehci_hcd usbcore thermal processor fan raid0 raid1 md_mod sata_via
sd_mod libata scsi_mod via82cxxx ide_core ext3 jbd mbcache
EIP: 0060:[<f8836d37>] Not tainted VLI
EFLAGS: 00010283 (2.6.18-2-k7 #1)
[<f8832d2f>] journal_try_to_free_buffers+0x59/0x13a [jbd]
[<f8868318>] ext3_releasepage+0x0/0x61 [ext3]
[<c015bd3f>] try_to_release_page+0x34/0x46
[<c0148e23>] shrink_inactive_list+0x44b/0x71c
[<c01050ea>] do_IRQ+0x48/0x52
[<c0103692>] common_interrupt+0x1a/0x20
[<c016e3cb>] dput+0x1a/0x119
[<c016e551>] prune_one_dentry+0x68/0x74
[<f881b2ea>] mb_cache_shrink_fn+0x1d/0xb5 [mbcache]
[<c01491a3>] shrink_zone+0xaf/0xd0
[<c014962c>] kswapd+0x295/0x399
[<c012dbb1>] autoremove_wake_function+0x0/0x2d
[<c0149397>] kswapd+0x0/0x399
[<c012dae3>] kthread+0xc2/0xef
[<c012da21>] kthread+0x0/0xef
[<c0101005>] kernel_thread_helper+0x5/0xb
and for 2.6.17-2-k7:
f0872d73
Modules linked in: ipv6 ipip tunnel4 dm_snapshot dm_mirror dm_mod
shpchp pci_hotplug floppy i2c_viapro parport_pc i2c_core psmouse parport
8250_pnp serio_raw evdev amd64_agp agpgart pcspkr rtc raid10 raid6 raid5
xor multipath linear ide_generic r8169 uhci_hcd ehci_hcd usbcore thermal
processor fan raid0 raid1 md_mod sata_via sd_mod libata scsi_mod
via82cxxx ide_core ext3 jbd mbcache
EIP: 0060:[<f0872d73>] Not tainted VLI
EFLAGS: 00210246 (2.6.17-2-k7 #1)
BUG: warning at kernel/softirq.c:141/local_bh_enable()
<b0120e27> local_bh_enable+0x25/0x64 <b0218215> lock_sock+0x85/0x8d
<b021604b> sock_fasync+0x5c/0x111 <b0216fa1> sock_close+0x1e/0x2a
<b0153fcd> __fput+0x87/0x13c <b0151b57> filp_close+0x4e/0x54
<b011e0a1> put_files_struct+0x64/0xa6 <b011f006> do_exit+0x1b0/0x6be
<b0114e08> bust_spinlocks+0x3a/0x43 <b0103ee6> die+0x1d3/0x288
<b0103f76> die+0x263/0x288 <b011533a> do_page_fault+0x441/0x526
<b0114ef9> do_page_fault+0x0/0x526 <b01036f7> error_code+0x4f/0x54
<f0872d73> ext3_xattr_delete_inode+0x5/0xab [ext3] <f0865bcb>
ext3_free_inode+0x92/0x2c7 [ext3]
<f08666ee> ext3_mark_inode_dirty+0x20/0x27 [ext3] <f08687dd>
ext3_delete_inode+0xa3/0xba [ext3]
<f086873a> ext3_delete_inode+0x0/0xba [ext3] <b016809c>
generic_delete_inode+0x9e/0x101
<b0167b87> iput+0x5e/0x60 <b0166d66> dput+0xfe/0x116
<b0160a4f> sys_renameat+0x15f/0x1b9 <b01b04d2>
_atomic_dec_and_lock+0x2a/0x44
<b0160aba> sys_rename+0x11/0x15 <b0102af3>
sysenter_past_esp+0x54/0x75
BUG: warning at kernel/softirq.c:141/local_bh_enable()
<b0120e27> local_bh_enable+0x25/0x64 <b02160f4>
sock_fasync+0x105/0x111
<b0216fa1> sock_close+0x1e/0x2a <b0153fcd> __fput+0x87/0x13c
<b0151b57> filp_close+0x4e/0x54 <b011e0a1> put_files_struct+0x64/0xa6
<b011f006> do_exit+0x1b0/0x6be <b0114e08> bust_spinlocks+0x3a/0x43
<b0103ee6> die+0x1d3/0x288 <b0103f76> die+0x263/0x288
<b011533a> do_page_fault+0x441/0x526 <b0114ef9>
do_page_fault+0x0/0x526
<b01036f7> error_code+0x4f/0x54 <f0872d73>
ext3_xattr_delete_inode+0x5/0xab [ext3]
<f0865bcb> ext3_free_inode+0x92/0x2c7 [ext3] <f08666ee>
ext3_mark_inode_dirty+0x20/0x27 [ext3]
<f08687dd> ext3_delete_inode+0xa3/0xba [ext3] <f086873a>
ext3_delete_inode+0x0/0xba [ext3]
<b016809c> generic_delete_inode+0x9e/0x101 <b0167b87> iput+0x5e/0x60
<b0166d66> dput+0xfe/0x116 <b0160a4f> sys_renameat+0x15f/0x1b9
<b01b04d2> _atomic_dec_and_lock+0x2a/0x44 <b0160aba>
sys_rename+0x11/0x15
<b0102af3> sysenter_past_esp+0x54/0x75
BUG: warning at kernel/softirq.c:141/local_bh_enable()
<b0120e27> local_bh_enable+0x25/0x64 <b0269ca6>
unix_release_sock+0x5c/0x1bf
<b0216ce8> sock_release+0x11/0x85 <b0216fa9> sock_close+0x26/0x2a
<b0153fcd> __fput+0x87/0x13c <b0151b57> filp_close+0x4e/0x54
<b011e0a1> put_files_struct+0x64/0xa6 <b011f006> do_exit+0x1b0/0x6be
<b0114e08> bust_spinlocks+0x3a/0x43 <b0103ee6> die+0x1d3/0x288
<b0103f76> die+0x263/0x288 <b011533a> do_page_fault+0x441/0x526
<b0114ef9> do_page_fault+0x0/0x526 <b01036f7> error_code+0x4f/0x54
<f0872d73> ext3_xattr_delete_inode+0x5/0xab [ext3] <f0865bcb>
ext3_free_inode+0x92/0x2c7 [ext3]
<f08666ee> ext3_mark_inode_dirty+0x20/0x27 [ext3] <f08687dd>
ext3_delete_inode+0xa3/0xba [ext3]
<f086873a> ext3_delete_inode+0x0/0xba [ext3] <b016809c>
generic_delete_inode+0x9e/0x101
<b0167b87> iput+0x5e/0x60 <b0166d66> dput+0xfe/0x116
<b0160a4f> sys_renameat+0x15f/0x1b9 <b01b04d2>
_atomic_dec_and_lock+0x2a/0x44
<b0160aba> sys_rename+0x11/0x15 <b0102af3>
sysenter_past_esp+0x54/0x75
--
BOFH excuse #324:
Your packets were eaten by the terminator