Hi Bogdan, Are you running your OSD data on ext3? It seems that you are hitting some ext3 bug. Could you try changing to btrfs? This since ext is not yet fully supported. Wido On Fri, 2010-08-27 at 12:18 +0000, Bogdan Lobodzinski wrote: > Hello, > > working with ceph on my test configuration > (3 nodes Ubuntu 10.04.1 LTS, Linux 2.6.32-24-generic-pae #41-Ubuntu SMP) > After starting > svn co https://root.cern.ch/svn/root/trunk root > > on the /ceph directory, the command become stuck, and also: > root 5303 0.0 0.0 0 0 ? D Aug26 0:00 [kjournald] > root 30181 0.0 0.0 6972 2056 pts/1 D+ 13:46 0:00 /usr//bin/cosd > -i 2 -c /etc/ceph/ceph.conf > > any mount, unmount are going also to the state D. > This is a permanennt behaviour of the ceph if the command is started. > > dmesg shows: > ------------- > [99048.567704] ------------[ cut here ]------------ > [99048.568767] kernel BUG at > /build/buildd/linux-2.6.32/fs/ext3/balloc.c:1384! > [99048.568767] invalid opcode: 0000 [#1] SMP > [99048.568767] last sysfs file: /sys/devices/pci0000:00/0000:00:00.0/device > [99048.596652] Modules linked in: nfs lockd nfs_acl auth_rpcgss sunrpc ceph > crc32c libcrc32c openafs(P) fbcon tileblit font bitblit softcursor vga > vgastate mptctl radeon ttm drm_kms_helper drm bnx2 psmouse i5000_edac > edac_core agpgart serio_raw i5k_amb i2c_algo_bit shpchp dell_wmi dcdbas > usbhid mptsas mptscsih mptbase scsi_transport_sas > [99048.596652] > [99048.596652] Pid: 6258, comm: cosd Tainted: P > (2.6.32-24-generic-pae #41-Ubuntu) PowerEdge 1950 > [99048.596652] EIP: 0060:[<c026dc8d>] EFLAGS: 00210296 CPU: 3 > [99048.596652] EIP is at ext3_try_to_allocate_with_rsv+0x1cd/0x2b0 > [99048.596652] EAX: 00000027 EBX: f6dd5480 ECX: fffe48f7 EDX: 00000000 > [99048.596652] ESI: 02147fff EDI: f625e200 EBP: f5ccbc54 ESP: f5ccbc14 > [99048.596652] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 > [99048.596652] Process cosd (pid: 6258, ti=f5cca000 task=f6263300 > task.ti=f5cca000) > [99048.596652] Stack: > [99048.596652] 00000428 f14f1bc0 c026cc88 00001000 00000007 f1a80e9c > f6dd5494 02147fff > [99048.596652] <0> f70d89c0 02140000 00000428 f625d800 00000001 00000428 > f1058500 00000000 > [99048.596652] <0> f5ccbcc8 c026e048 f14f1bc0 ffffffff f6dd5480 f5ccbcb0 > f5ccbcb4 f5ccbc90 > [99048.596652] Call Trace: > [99048.596652] [<c026cc88>] ? read_block_bitmap+0x48/0x160 > [99048.596652] [<c026e048>] ? ext3_new_blocks+0x228/0x6c0 > [99048.596652] [<c024fbd7>] ? mb_cache_entry_find_first+0x67/0x80 > [99048.596652] [<c026e505>] ? ext3_new_block+0x25/0x30 > [99048.596652] [<c02809a4>] ? ext3_xattr_block_set+0x554/0x670 > [99048.596652] [<c027f589>] ? ext3_xattr_set_entry+0x29/0x350 > [99048.596652] [<c0280d8b>] ? ext3_xattr_set_handle+0x2cb/0x3e0 > [99048.596652] [<c0280f15>] ? ext3_xattr_set+0x75/0xc0 > [99048.596652] [<c0280fd6>] ? ext3_xattr_user_set+0x76/0x80 > [99048.596652] [<c022dd8c>] ? generic_setxattr+0x9c/0xb0 > [99048.596652] [<c022dcf0>] ? generic_setxattr+0x0/0xb0 > [99048.596652] [<c022e984>] ? __vfs_setxattr_noperm+0x44/0x160 > [99048.596652] [<c02fed4c>] ? cap_inode_setxattr+0x2c/0x60 > [99048.596652] [<c022eb31>] ? vfs_setxattr+0x91/0xa0 > [99048.596652] [<c022ebf8>] ? setxattr+0xb8/0x110 > [99048.596652] [<c021d512>] ? __link_path_walk+0x632/0xca0 > [99048.596652] [<c014e369>] ? enqueue_task_fair+0x39/0x80 > [99048.596652] [<c022a9bf>] ? mntput_no_expire+0x1f/0xe0 > [99048.596652] [<c022a9bf>] ? mntput_no_expire+0x1f/0xe0 > [99048.596652] [<c021be45>] ? path_put+0x25/0x30 > [99048.596652] [<c021ba8b>] ? putname+0x2b/0x40 > [99048.596652] [<c021ea6a>] ? user_path_at+0x4a/0x80 > [99048.596652] [<c0183242>] ? sys_futex+0x72/0x120 > [99048.596652] [<c022ee13>] ? sys_setxattr+0x83/0x90 > [99048.596652] [<c0109763>] ? sysenter_do_call+0x12/0x28 > [99048.596652] Code: 83 3a ff ff ff 90 39 45 dc c7 45 0c ff ff ff ff 0f 83 > 32 ff ff ff 8b 87 84 01 00 00 ba ba c6 5c c0 05 d0 00 00 00 e8 73 f1 ff<0f> > 0b eb fe 8b 45 ec 89 55 d4 05 cc 00 00 00 89 45 ec e8 fc 53 > [99048.596652] EIP: [<c026dc8d>] ext3_try_to_allocate_with_rsv+0x1cd/0x2b0 > SS:ESP 0068:f5ccbc14 > [99049.044090] ---[ end trace 35860103963ee444 ]--- > h1farm184# > -------------------- > > my ceph.conf is: > ------- > [global] > pid file = /var/run/ceph/$name.pid > debug ms = 1 > keyring = /etc/ceph/keyring.bin > ; monitors > [mon] > ;Directory for monitor files > mon data = /x02/mon$id > debug mon = 20 > debug paxos = 20 > mon lease wiggle room = 0.5 > > [mon0] > host = h1farm182 > mon addr = xxx.xxx.xx.116:6789 > [mon1] > host = h1farm183 > mon addr = xxx.xxx.xx.117:6789 > ; metadata servers > [mds] > debug mds = 20 > mds log max segments = 2 > keyring = /etc/ceph/keyring.$name > [mds0] > host = h1farm182 > [mds1] > host = h1farm183 > [osd] > sudo = true > osd data = /x02/osd$id > osd journal = /x02/osd$id/journal > osd journal size = 100 > keyring = /etc/ceph/keyring.$name > debug osd = 20 > debug journal = 20 > debug filestore = 20 > ;osd journal size = 100 > [osd0] > host = h1farm182 > [osd1] > host = h1farm183 > [osd2] > host = h1farm184 > > ------- > > Any idea how to improve the situation ? > > -- > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in > the body of a message to majordomo@xxxxxxxxxxxxxxx > More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html