Hi Bogdan, Yes, you can place your journal on a file, that is no problem. Performance wise you might want to use a block device (or partition) and a other device then the one where your data is one. Wido On Wed, 2010-09-01 at 17:21 +0200, Bogdan Lobodzinski wrote: > Hello Sage, > > replacing ext3 by btrfs my ceph test-bed survived my test comand: > svn co https://root.cern.ch/svn/root/trunk root > > I didn't try ext4. > > However, I did a few changes in my initial ceph.conf. > Could you, please, check if such a configuration is reasonable ? > Is it correct to use "osd journal" location as it is done below ? > > My new ceph.conf: > ----------- > [global] > pid file = /var/run/ceph/$name.pid > debug ms = 1 > keyring = /etc/ceph/keyring.bin > [mon] > mon data = /x01/mon$id > debug mon = 20 > debug paxos = 20 > mon lease wiggle room = 0.5 > [mon0] > host = h1farm182 > mon addr = xxx.xxx.xxx.116:6789 > [mon1] > host = h1farm183 > mon addr = xxx.xxx.xxx.117:6789 > [mds] > debug mds = 10 > mds log max segments = 2 > keyring = /etc/ceph/keyring.$name > [mds0] > host = h1farm182 > [mds1] > host = h1farm183 > [osd] > sudo = true > keyring = /etc/ceph/keyring.$name > osd data = /x02/osd$id > osd journal = /x02/osd$id/journal > osd journal size = 100 > debug osd = 20 > debug journal = 20 > debug filestore = 20 > [osd0] > host = h1farm183 > btrfs devs = /dev/sdb1 > [osd1] > host = h1farm184 > btrfs devs = /dev/sdb1 > ----------- > > Thank you for help, > > Cheers, > > Bogdan > > > On Tue, 31 Aug 2010, Bogdan Lobodzinski wrote: > > > > > Hello Sage, > > > > On Mon, 30 Aug 2010, Sage Weil wrote: > > > >> On Mon, 30 Aug 2010, Bogdan Lobodzinski wrote: > >>> > >>> Hello Sage, > >>> > >>> I moved to the kernel 2.6.35, keeping ext3 filesystem. > >>> After executing teh same command: > >>> svn co https://root.cern.ch/svn/root/trunk root > >>> > >>> System is again dead. The command and kjournald are stuck > >>> bogdan 8539 0.9 0.6 31168 22040 pts/0 DL+ 16:44 0:21 svn co > >>> https://root.cern.ch/svn/root/trunk root > >>> root 802 0.0 0.0 0 0 ? D 12:59 0:01 [kjournald] > >> > >> Hmm. Have you tried ext4? > >> > >> I stopped seeing this on my own machine with recent kernels, but it looks > >> like it isn't in fact fixed. This should be reported to the ext4 list. > >> Are you running ceph via vstart.sh or a custom ceph.conf? > > I am using vstart.sh taken from compiled by me source tarball > > ceph-0.21.tar.gz (http://ceph.newdream.net/download/) > > and the client from > > git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client-standalone.git > > > > Cheers, > > > > Bogdan > > > >> > >> sage > >> > >>> > >>> Looks like the bug is not fixed, dmesg shows: > >>> --------- > >>> [14325.304068] kernel BUG at > >>> /build/buildd/linux-maverick-2.6.35/fs/ext3/balloc.c:1385! > >>> [14325.304191] invalid opcode: 0000 [#1] SMP > >>> [14325.304263] last sysfs file: > >>> /sys/devices/pci0000:00/0000:00:00.0/device > >>> [14325.304266] Modules linked in: nfs lockd fscache nfs_acl auth_rpcgss > >>> sunrpc > >>> ceph crc32c libcrc32c radeon ttm drm_kms_helper drm mptctl psmouse agpgart > >>> i5000_edac usbhid hid edac_core i2c_algo_bit bnx2 i5k_amb dcdbas shpchp > >>> serio_raw mptsas mptscsih mptbase scsi_transport_sas > >>> [14325.304266] > >>> [14325.304266] Pid: 8391, comm: cosd Not tainted 2.6.35-14-generic > >>> #20~lucid2-Ubuntu 0DT097/PowerEdge 1950 > >>> [14325.304266] EIP: 0060:[<c0274a4d>] EFLAGS: 00210286 CPU: 1 > >>> [14325.304266] EIP is at ext3_try_to_allocate_with_rsv+0x1cd/0x2b0 > >>> [14325.304266] EAX: 00000027 EBX: c8641440 ECX: c07d7cfc EDX: 00000000 > >>> [14325.304266] ESI: 007b7fff EDI: f640fa00 EBP: f5823c50 ESP: f5823c10 > >>> [14325.304266] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 > >>> [14325.304266] Process cosd (pid: 8391, ti=f5822000 task=f6b7bf70 > >>> task.ti=f5822000) > >>> [14325.304266] Stack: > >>> [14325.304266] 000000f6 c6c11930 c0273a58 00001000 f62e549c 00000007 > >>> c8641454 > >>> 007b7fff > >>> [14325.304266] <0> f6f7e420 007b0000 000000f6 f640de00 00000001 000000f6 > >>> c4063ec0 00000000 > >>> [14325.304266] <0> f5823cc0 c0274daf c6c11930 ffffffff c8641440 f5823ca8 > >>> f5823cac c0256017 > >>> [14325.304266] Call Trace: > >>> [14325.304266] [<c0273a58>] ? read_block_bitmap+0x48/0x160 > >>> [14325.304266] [<c0274daf>] ? ext3_new_blocks+0x1ff/0x610 > >>> [14325.304266] [<c0256017>] ? mb_cache_entry_find_first+0x67/0x80 > >>> [14325.304266] [<c02751e5>] ? ext3_new_block+0x25/0x30 > >>> [14325.304266] [<c0287721>] ? ext3_xattr_block_set+0x481/0x550 > >>> [14325.304266] [<c0286490>] ? ext3_xattr_set_entry+0x20/0x2f0 > >>> [14325.304266] [<c0287b0b>] ? ext3_xattr_set_handle+0x31b/0x400 > >>> [14325.304266] [<c0287c65>] ? ext3_xattr_set+0x75/0xc0 > >>> [14325.304266] [<c0287d24>] ? ext3_xattr_user_set+0x74/0x80 > >>> [14325.304266] [<c023348b>] ? generic_setxattr+0x9b/0xb0 > >>> [14325.304266] [<c02333f0>] ? generic_setxattr+0x0/0xb0 > >>> [14325.304266] [<c0234084>] ? __vfs_setxattr_noperm+0x44/0x150 > >>> [14325.304266] [<c03017dc>] ? cap_inode_setxattr+0x2c/0x60 > >>> [14325.304266] [<c0234221>] ? vfs_setxattr+0x91/0xa0 > >>> [14325.304266] [<c02342e8>] ? setxattr+0xb8/0x110 > >>> [14325.304266] [<c0221d0e>] ? path_to_nameidata+0x1e/0x50 > >>> [14325.304266] [<c0223492>] ? link_path_walk+0x412/0x890 > >>> [14325.304266] [<c013a159>] ? enqueue_task_fair+0x39/0x80 > >>> [14325.304266] [<c022ff3f>] ? mntput_no_expire+0x1f/0xd0 > >>> [14325.304266] [<c022ff3f>] ? mntput_no_expire+0x1f/0xd0 > >>> [14325.304266] [<c022168b>] ? putname+0x2b/0x40 > >>> [14325.304266] [<c022470a>] ? user_path_at+0x4a/0x80 > >>> [14325.304266] [<c0179902>] ? sys_futex+0x72/0x120 > >>> [14325.304266] [<c0234503>] ? sys_setxattr+0x83/0x90 > >>> [14325.304266] [<c05c9bb4>] ? syscall_call+0x7/0xb > >>> [14325.304266] [<c05c0000>] ? cache_add_dev+0x73/0x195 > >>> [14325.304266] Code: 83 3a ff ff ff 90 39 45 dc c7 45 0c ff ff ff ff 0f 83 > >>> 32 > >>> ff ff ff 8b 87 80 01 00 00 ba 5a 7e 5e c0 05 d0 00 00 00 e8 83 f1 ff ff > >>> <0f> > >>> 0b eb fe 8b 45 ec 89 55 d4 05 cc 00 00 00 89 45 ec e8 ec 4b > >>> [14325.304266] EIP: [<c0274a4d>] ext3_try_to_allocate_with_rsv+0x1cd/0x2b0 > >>> SS:ESP 0068:f5823c10 > >>> [14325.326777] ---[ end trace 53e0b3b55af7a83c ]--- > >>> [14384.001261] ceph: mds0 caps stale > >>> [14413.616132] ceph: tid 33594 timed out on osd2, will reset osd > >>> [14628.992279] ceph: mds0 hung > >>> --------- > >>> > >>> as a next step I wil try to use btrfs . > >>> > >>> Cheers, > >>> > >>> Bogdan > >>> > >>> > >>> On Fri, 27 Aug 2010, Sage Weil wrote: > >>> > >>>> Hi Bogdan, > >>>> > >>>> This is a bug in the ext3 xattr code. It seems to be gone in 2.6.34 and > >>>> later. Or, you can switch to btrfs! > >>>> > >>>> sage > >>>> > >>>> > >>>> On Fri, 27 Aug 2010, Bogdan Lobodzinski wrote: > >>>> > >>>>> Hello, > >>>>> > >>>>> working with ceph on my test configuration > >>>>> (3 nodes Ubuntu 10.04.1 LTS, Linux 2.6.32-24-generic-pae #41-Ubuntu SMP) > >>>>> After starting > >>>>> svn co https://root.cern.ch/svn/root/trunk root > >>>>> > >>>>> on the /ceph directory, the command become stuck, and also: > >>>>> root 5303 0.0 0.0 0 0 ? D Aug26 0:00 > >>>>> [kjournald] > >>>>> root 30181 0.0 0.0 6972 2056 pts/1 D+ 13:46 0:00 > >>>>> /usr//bin/cosd > >>>>> -i 2 -c /etc/ceph/ceph.conf > >>>>> > >>>>> any mount, unmount are going also to the state D. > >>>>> This is a permanennt behaviour of the ceph if the command is started. > >>>>> > >>>>> dmesg shows: > >>>>> ------------- > >>>>> [99048.567704] ------------[ cut here ]------------ > >>>>> [99048.568767] kernel BUG at > >>>>> /build/buildd/linux-2.6.32/fs/ext3/balloc.c:1384! > >>>>> [99048.568767] invalid opcode: 0000 [#1] SMP > >>>>> [99048.568767] last sysfs file: > >>>>> /sys/devices/pci0000:00/0000:00:00.0/device > >>>>> [99048.596652] Modules linked in: nfs lockd nfs_acl auth_rpcgss sunrpc > >>>>> ceph > >>>>> crc32c libcrc32c openafs(P) fbcon tileblit font bitblit softcursor vga > >>>>> vgastate mptctl radeon ttm drm_kms_helper drm bnx2 psmouse i5000_edac > >>>>> edac_core agpgart serio_raw i5k_amb i2c_algo_bit shpchp dell_wmi dcdbas > >>>>> usbhid mptsas mptscsih mptbase scsi_transport_sas > >>>>> [99048.596652] > >>>>> [99048.596652] Pid: 6258, comm: cosd Tainted: P > >>>>> (2.6.32-24-generic-pae #41-Ubuntu) PowerEdge 1950 > >>>>> [99048.596652] EIP: 0060:[<c026dc8d>] EFLAGS: 00210296 CPU: 3 > >>>>> [99048.596652] EIP is at ext3_try_to_allocate_with_rsv+0x1cd/0x2b0 > >>>>> [99048.596652] EAX: 00000027 EBX: f6dd5480 ECX: fffe48f7 EDX: 00000000 > >>>>> [99048.596652] ESI: 02147fff EDI: f625e200 EBP: f5ccbc54 ESP: f5ccbc14 > >>>>> [99048.596652] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 > >>>>> [99048.596652] Process cosd (pid: 6258, ti=f5cca000 task=f6263300 > >>>>> task.ti=f5cca000) > >>>>> [99048.596652] Stack: > >>>>> [99048.596652] 00000428 f14f1bc0 c026cc88 00001000 00000007 f1a80e9c > >>>>> f6dd5494 02147fff > >>>>> [99048.596652] <0> f70d89c0 02140000 00000428 f625d800 00000001 00000428 > >>>>> f1058500 00000000 > >>>>> [99048.596652] <0> f5ccbcc8 c026e048 f14f1bc0 ffffffff f6dd5480 f5ccbcb0 > >>>>> f5ccbcb4 f5ccbc90 > >>>>> [99048.596652] Call Trace: > >>>>> [99048.596652] [<c026cc88>] ? read_block_bitmap+0x48/0x160 > >>>>> [99048.596652] [<c026e048>] ? ext3_new_blocks+0x228/0x6c0 > >>>>> [99048.596652] [<c024fbd7>] ? mb_cache_entry_find_first+0x67/0x80 > >>>>> [99048.596652] [<c026e505>] ? ext3_new_block+0x25/0x30 > >>>>> [99048.596652] [<c02809a4>] ? ext3_xattr_block_set+0x554/0x670 > >>>>> [99048.596652] [<c027f589>] ? ext3_xattr_set_entry+0x29/0x350 > >>>>> [99048.596652] [<c0280d8b>] ? ext3_xattr_set_handle+0x2cb/0x3e0 > >>>>> [99048.596652] [<c0280f15>] ? ext3_xattr_set+0x75/0xc0 > >>>>> [99048.596652] [<c0280fd6>] ? ext3_xattr_user_set+0x76/0x80 > >>>>> [99048.596652] [<c022dd8c>] ? generic_setxattr+0x9c/0xb0 > >>>>> [99048.596652] [<c022dcf0>] ? generic_setxattr+0x0/0xb0 > >>>>> [99048.596652] [<c022e984>] ? __vfs_setxattr_noperm+0x44/0x160 > >>>>> [99048.596652] [<c02fed4c>] ? cap_inode_setxattr+0x2c/0x60 > >>>>> [99048.596652] [<c022eb31>] ? vfs_setxattr+0x91/0xa0 > >>>>> [99048.596652] [<c022ebf8>] ? setxattr+0xb8/0x110 > >>>>> [99048.596652] [<c021d512>] ? __link_path_walk+0x632/0xca0 > >>>>> [99048.596652] [<c014e369>] ? enqueue_task_fair+0x39/0x80 > >>>>> [99048.596652] [<c022a9bf>] ? mntput_no_expire+0x1f/0xe0 > >>>>> [99048.596652] [<c022a9bf>] ? mntput_no_expire+0x1f/0xe0 > >>>>> [99048.596652] [<c021be45>] ? path_put+0x25/0x30 > >>>>> [99048.596652] [<c021ba8b>] ? putname+0x2b/0x40 > >>>>> [99048.596652] [<c021ea6a>] ? user_path_at+0x4a/0x80 > >>>>> [99048.596652] [<c0183242>] ? sys_futex+0x72/0x120 > >>>>> [99048.596652] [<c022ee13>] ? sys_setxattr+0x83/0x90 > >>>>> [99048.596652] [<c0109763>] ? sysenter_do_call+0x12/0x28 > >>>>> [99048.596652] Code: 83 3a ff ff ff 90 39 45 dc c7 45 0c ff ff ff ff 0f > >>>>> 83 > >>>>> 32 ff ff ff 8b 87 84 01 00 00 ba ba c6 5c c0 05 d0 00 00 00 e8 73 f1 > >>>>> ff<0f> > >>>>> 0b eb fe 8b 45 ec 89 55 d4 05 cc 00 00 00 89 45 ec e8 fc 53 > >>>>> [99048.596652] EIP: [<c026dc8d>] > >>>>> ext3_try_to_allocate_with_rsv+0x1cd/0x2b0 > >>>>> SS:ESP 0068:f5ccbc14 > >>>>> [99049.044090] ---[ end trace 35860103963ee444 ]--- > >>>>> h1farm184# > >>>>> -------------------- > >>>>> > >>>>> my ceph.conf is: > >>>>> ------- > >>>>> [global] > >>>>> pid file = /var/run/ceph/$name.pid > >>>>> debug ms = 1 > >>>>> keyring = /etc/ceph/keyring.bin > >>>>> ; monitors > >>>>> [mon] > >>>>> ;Directory for monitor files > >>>>> mon data = /x02/mon$id > >>>>> debug mon = 20 > >>>>> debug paxos = 20 > >>>>> mon lease wiggle room = 0.5 > >>>>> > >>>>> [mon0] > >>>>> host = h1farm182 > >>>>> mon addr = xxx.xxx.xx.116:6789 > >>>>> [mon1] > >>>>> host = h1farm183 > >>>>> mon addr = xxx.xxx.xx.117:6789 > >>>>> ; metadata servers > >>>>> [mds] > >>>>> debug mds = 20 > >>>>> mds log max segments = 2 > >>>>> keyring = /etc/ceph/keyring.$name > >>>>> [mds0] > >>>>> host = h1farm182 > >>>>> [mds1] > >>>>> host = h1farm183 > >>>>> [osd] > >>>>> sudo = true > >>>>> osd data = /x02/osd$id > >>>>> osd journal = /x02/osd$id/journal > >>>>> osd journal size = 100 > >>>>> keyring = /etc/ceph/keyring.$name > >>>>> debug osd = 20 > >>>>> debug journal = 20 > >>>>> debug filestore = 20 > >>>>> ;osd journal size = 100 > >>>>> [osd0] > >>>>> host = h1farm182 > >>>>> [osd1] > >>>>> host = h1farm183 > >>>>> [osd2] > >>>>> host = h1farm184 > >>>>> > >>>>> ------- > >>>>> > >>>>> Any idea how to improve the situation ? > >>>>> > >>>>> -- > >>>>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in > >>>>> the body of a message to majordomo@xxxxxxxxxxxxxxx > >>>>> More majordomo info at http://vger.kernel.org/majordomo-info.html > >>>>> > >>>>> > >>>> > >>> -- > >>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in > >>> the body of a message to majordomo@xxxxxxxxxxxxxxx > >>> More majordomo info at http://vger.kernel.org/majordomo-info.html > >>> > >>> > >> > > > -- > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in > the body of a message to majordomo@xxxxxxxxxxxxxxx > More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html