Re: Write operation is stuck

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Bogdan,

Yes, you can place your journal on a file, that is no problem.

Performance wise you might want to use a block device (or partition) and
a other device then the one where your data is one.

Wido

On Wed, 2010-09-01 at 17:21 +0200, Bogdan Lobodzinski wrote:
> Hello Sage,
> 
> replacing ext3 by btrfs my ceph test-bed survived my test comand:
> svn co https://root.cern.ch/svn/root/trunk root
> 
> I didn't try ext4.
> 
> However, I did a few changes in my initial ceph.conf.
> Could you, please, check if such a configuration is reasonable ?
> Is it correct to use "osd journal" location as it is done below ?
> 
> My new ceph.conf:
> -----------
> [global]
>         pid file = /var/run/ceph/$name.pid
>         debug ms = 1
>         keyring = /etc/ceph/keyring.bin
> [mon]
>         mon data = /x01/mon$id
>         debug mon = 20
>         debug paxos = 20
>         mon lease wiggle room = 0.5
> [mon0]
>         host = h1farm182
>         mon addr = xxx.xxx.xxx.116:6789
> [mon1]
>         host = h1farm183
>         mon addr = xxx.xxx.xxx.117:6789
> [mds]
>         debug mds = 10
>         mds log max segments = 2
>         keyring = /etc/ceph/keyring.$name
> [mds0]
>         host = h1farm182
> [mds1]
>         host = h1farm183
> [osd]
>         sudo = true
>         keyring = /etc/ceph/keyring.$name
>         osd data = /x02/osd$id
>         osd journal = /x02/osd$id/journal
>         osd journal size = 100
>         debug osd = 20
>         debug journal = 20
>         debug filestore = 20
> [osd0]
>         host = h1farm183
>         btrfs devs = /dev/sdb1
> [osd1]
>         host = h1farm184
>         btrfs devs = /dev/sdb1
> -----------
> 
> Thank you for help,
> 
> Cheers,
> 
> Bogdan
> 
> 
> On Tue, 31 Aug 2010, Bogdan Lobodzinski wrote:
> 
> >
> > Hello Sage,
> >
> > On Mon, 30 Aug 2010, Sage Weil wrote:
> >
> >> On Mon, 30 Aug 2010, Bogdan Lobodzinski wrote:
> >>> 
> >>> Hello Sage,
> >>> 
> >>> I moved to the kernel 2.6.35, keeping ext3 filesystem.
> >>> After executing teh same command:
> >>> svn co https://root.cern.ch/svn/root/trunk root
> >>> 
> >>> System is again dead. The command and kjournald are stuck
> >>> bogdan  8539  0.9  0.6  31168 22040 pts/0  DL+  16:44  0:21 svn co
> >>> https://root.cern.ch/svn/root/trunk root
> >>> root    802   0.0  0.0      0     0 ?        D  12:59  0:01 [kjournald]
> >> 
> >> Hmm.  Have you tried ext4?
> >> 
> >> I stopped seeing this on my own machine with recent kernels, but it looks
> >> like it isn't in fact fixed.  This should be reported to the ext4 list.
> >> Are you running ceph via vstart.sh or a custom ceph.conf?
> > I am using vstart.sh taken from compiled by me source tarball 
> > ceph-0.21.tar.gz  (http://ceph.newdream.net/download/)
> > and the client from
> > git://git.kernel.org/pub/scm/linux/kernel/git/sage/ceph-client-standalone.git
> >
> > Cheers,
> >
> > Bogdan
> >
> >> 
> >> sage
> >> 
> >>> 
> >>> Looks like the bug is not fixed, dmesg shows:
> >>> ---------
> >>> [14325.304068] kernel BUG at
> >>> /build/buildd/linux-maverick-2.6.35/fs/ext3/balloc.c:1385!
> >>> [14325.304191] invalid opcode: 0000 [#1] SMP
> >>> [14325.304263] last sysfs file: 
> >>> /sys/devices/pci0000:00/0000:00:00.0/device
> >>> [14325.304266] Modules linked in: nfs lockd fscache nfs_acl auth_rpcgss 
> >>> sunrpc
> >>> ceph crc32c libcrc32c radeon ttm drm_kms_helper drm mptctl psmouse agpgart
> >>> i5000_edac usbhid hid edac_core i2c_algo_bit bnx2 i5k_amb dcdbas shpchp
> >>> serio_raw mptsas mptscsih mptbase scsi_transport_sas
> >>> [14325.304266]
> >>> [14325.304266] Pid: 8391, comm: cosd Not tainted 2.6.35-14-generic
> >>> #20~lucid2-Ubuntu 0DT097/PowerEdge 1950
> >>> [14325.304266] EIP: 0060:[<c0274a4d>] EFLAGS: 00210286 CPU: 1
> >>> [14325.304266] EIP is at ext3_try_to_allocate_with_rsv+0x1cd/0x2b0
> >>> [14325.304266] EAX: 00000027 EBX: c8641440 ECX: c07d7cfc EDX: 00000000
> >>> [14325.304266] ESI: 007b7fff EDI: f640fa00 EBP: f5823c50 ESP: f5823c10
> >>> [14325.304266]  DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
> >>> [14325.304266] Process cosd (pid: 8391, ti=f5822000 task=f6b7bf70
> >>> task.ti=f5822000)
> >>> [14325.304266] Stack:
> >>> [14325.304266]  000000f6 c6c11930 c0273a58 00001000 f62e549c 00000007 
> >>> c8641454
> >>> 007b7fff
> >>> [14325.304266] <0> f6f7e420 007b0000 000000f6 f640de00 00000001 000000f6
> >>> c4063ec0 00000000
> >>> [14325.304266] <0> f5823cc0 c0274daf c6c11930 ffffffff c8641440 f5823ca8
> >>> f5823cac c0256017
> >>> [14325.304266] Call Trace:
> >>> [14325.304266]  [<c0273a58>] ? read_block_bitmap+0x48/0x160
> >>> [14325.304266]  [<c0274daf>] ? ext3_new_blocks+0x1ff/0x610
> >>> [14325.304266]  [<c0256017>] ? mb_cache_entry_find_first+0x67/0x80
> >>> [14325.304266]  [<c02751e5>] ? ext3_new_block+0x25/0x30
> >>> [14325.304266]  [<c0287721>] ? ext3_xattr_block_set+0x481/0x550
> >>> [14325.304266]  [<c0286490>] ? ext3_xattr_set_entry+0x20/0x2f0
> >>> [14325.304266]  [<c0287b0b>] ? ext3_xattr_set_handle+0x31b/0x400
> >>> [14325.304266]  [<c0287c65>] ? ext3_xattr_set+0x75/0xc0
> >>> [14325.304266]  [<c0287d24>] ? ext3_xattr_user_set+0x74/0x80
> >>> [14325.304266]  [<c023348b>] ? generic_setxattr+0x9b/0xb0
> >>> [14325.304266]  [<c02333f0>] ? generic_setxattr+0x0/0xb0
> >>> [14325.304266]  [<c0234084>] ? __vfs_setxattr_noperm+0x44/0x150
> >>> [14325.304266]  [<c03017dc>] ? cap_inode_setxattr+0x2c/0x60
> >>> [14325.304266]  [<c0234221>] ? vfs_setxattr+0x91/0xa0
> >>> [14325.304266]  [<c02342e8>] ? setxattr+0xb8/0x110
> >>> [14325.304266]  [<c0221d0e>] ? path_to_nameidata+0x1e/0x50
> >>> [14325.304266]  [<c0223492>] ? link_path_walk+0x412/0x890
> >>> [14325.304266]  [<c013a159>] ? enqueue_task_fair+0x39/0x80
> >>> [14325.304266]  [<c022ff3f>] ? mntput_no_expire+0x1f/0xd0
> >>> [14325.304266]  [<c022ff3f>] ? mntput_no_expire+0x1f/0xd0
> >>> [14325.304266]  [<c022168b>] ? putname+0x2b/0x40
> >>> [14325.304266]  [<c022470a>] ? user_path_at+0x4a/0x80
> >>> [14325.304266]  [<c0179902>] ? sys_futex+0x72/0x120
> >>> [14325.304266]  [<c0234503>] ? sys_setxattr+0x83/0x90
> >>> [14325.304266]  [<c05c9bb4>] ? syscall_call+0x7/0xb
> >>> [14325.304266]  [<c05c0000>] ? cache_add_dev+0x73/0x195
> >>> [14325.304266] Code: 83 3a ff ff ff 90 39 45 dc c7 45 0c ff ff ff ff 0f 83 
> >>> 32
> >>> ff ff ff 8b 87 80 01 00 00 ba 5a 7e 5e c0 05 d0 00 00 00 e8 83 f1 ff ff 
> >>> <0f>
> >>> 0b eb fe 8b 45 ec 89 55 d4 05 cc 00 00 00 89 45 ec e8 ec 4b
> >>> [14325.304266] EIP: [<c0274a4d>] ext3_try_to_allocate_with_rsv+0x1cd/0x2b0
> >>> SS:ESP 0068:f5823c10
> >>> [14325.326777] ---[ end trace 53e0b3b55af7a83c ]---
> >>> [14384.001261] ceph: mds0 caps stale
> >>> [14413.616132] ceph:  tid 33594 timed out on osd2, will reset osd
> >>> [14628.992279] ceph: mds0 hung
> >>> ---------
> >>> 
> >>> as a next step I wil try to use btrfs .
> >>> 
> >>> Cheers,
> >>> 
> >>> Bogdan
> >>> 
> >>> 
> >>> On Fri, 27 Aug 2010, Sage Weil wrote:
> >>> 
> >>>> Hi Bogdan,
> >>>> 
> >>>> This is a bug in the ext3 xattr code.  It seems to be gone in 2.6.34 and
> >>>> later.  Or, you can switch to btrfs!
> >>>> 
> >>>> sage
> >>>> 
> >>>> 
> >>>> On Fri, 27 Aug 2010, Bogdan Lobodzinski wrote:
> >>>> 
> >>>>> Hello,
> >>>>> 
> >>>>> working with ceph on my test configuration
> >>>>> (3 nodes Ubuntu 10.04.1 LTS, Linux 2.6.32-24-generic-pae #41-Ubuntu SMP)
> >>>>> After starting
> >>>>> svn co https://root.cern.ch/svn/root/trunk root
> >>>>> 
> >>>>> on the /ceph directory, the command become stuck, and also:
> >>>>> root      5303  0.0  0.0      0     0 ?        D    Aug26   0:00
> >>>>> [kjournald]
> >>>>> root     30181  0.0  0.0   6972  2056 pts/1    D+   13:46   0:00
> >>>>> /usr//bin/cosd
> >>>>> -i 2 -c /etc/ceph/ceph.conf
> >>>>> 
> >>>>> any mount, unmount are going also to the state D.
> >>>>> This is a permanennt behaviour of the ceph if the command is started.
> >>>>> 
> >>>>> dmesg shows:
> >>>>> -------------
> >>>>> [99048.567704] ------------[ cut here ]------------
> >>>>> [99048.568767] kernel BUG at
> >>>>> /build/buildd/linux-2.6.32/fs/ext3/balloc.c:1384!
> >>>>> [99048.568767] invalid opcode: 0000 [#1] SMP
> >>>>> [99048.568767] last sysfs file:
> >>>>> /sys/devices/pci0000:00/0000:00:00.0/device
> >>>>> [99048.596652] Modules linked in: nfs lockd nfs_acl auth_rpcgss sunrpc
> >>>>> ceph
> >>>>> crc32c libcrc32c openafs(P) fbcon tileblit font bitblit softcursor vga
> >>>>> vgastate mptctl radeon ttm drm_kms_helper drm bnx2 psmouse i5000_edac
> >>>>> edac_core agpgart serio_raw i5k_amb i2c_algo_bit shpchp dell_wmi dcdbas
> >>>>> usbhid mptsas mptscsih mptbase scsi_transport_sas
> >>>>> [99048.596652]
> >>>>> [99048.596652] Pid: 6258, comm: cosd Tainted: P
> >>>>> (2.6.32-24-generic-pae #41-Ubuntu) PowerEdge 1950
> >>>>> [99048.596652] EIP: 0060:[<c026dc8d>] EFLAGS: 00210296 CPU: 3
> >>>>> [99048.596652] EIP is at ext3_try_to_allocate_with_rsv+0x1cd/0x2b0
> >>>>> [99048.596652] EAX: 00000027 EBX: f6dd5480 ECX: fffe48f7 EDX: 00000000
> >>>>> [99048.596652] ESI: 02147fff EDI: f625e200 EBP: f5ccbc54 ESP: f5ccbc14
> >>>>> [99048.596652]  DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
> >>>>> [99048.596652] Process cosd (pid: 6258, ti=f5cca000 task=f6263300
> >>>>> task.ti=f5cca000)
> >>>>> [99048.596652] Stack:
> >>>>> [99048.596652]  00000428 f14f1bc0 c026cc88 00001000 00000007 f1a80e9c
> >>>>> f6dd5494 02147fff
> >>>>> [99048.596652] <0> f70d89c0 02140000 00000428 f625d800 00000001 00000428
> >>>>> f1058500 00000000
> >>>>> [99048.596652] <0> f5ccbcc8 c026e048 f14f1bc0 ffffffff f6dd5480 f5ccbcb0
> >>>>> f5ccbcb4 f5ccbc90
> >>>>> [99048.596652] Call Trace:
> >>>>> [99048.596652]  [<c026cc88>] ? read_block_bitmap+0x48/0x160
> >>>>> [99048.596652]  [<c026e048>] ? ext3_new_blocks+0x228/0x6c0
> >>>>> [99048.596652]  [<c024fbd7>] ? mb_cache_entry_find_first+0x67/0x80
> >>>>> [99048.596652]  [<c026e505>] ? ext3_new_block+0x25/0x30
> >>>>> [99048.596652]  [<c02809a4>] ? ext3_xattr_block_set+0x554/0x670
> >>>>> [99048.596652]  [<c027f589>] ? ext3_xattr_set_entry+0x29/0x350
> >>>>> [99048.596652]  [<c0280d8b>] ? ext3_xattr_set_handle+0x2cb/0x3e0
> >>>>> [99048.596652]  [<c0280f15>] ? ext3_xattr_set+0x75/0xc0
> >>>>> [99048.596652]  [<c0280fd6>] ? ext3_xattr_user_set+0x76/0x80
> >>>>> [99048.596652]  [<c022dd8c>] ? generic_setxattr+0x9c/0xb0
> >>>>> [99048.596652]  [<c022dcf0>] ? generic_setxattr+0x0/0xb0
> >>>>> [99048.596652]  [<c022e984>] ? __vfs_setxattr_noperm+0x44/0x160
> >>>>> [99048.596652]  [<c02fed4c>] ? cap_inode_setxattr+0x2c/0x60
> >>>>> [99048.596652]  [<c022eb31>] ? vfs_setxattr+0x91/0xa0
> >>>>> [99048.596652]  [<c022ebf8>] ? setxattr+0xb8/0x110
> >>>>> [99048.596652]  [<c021d512>] ? __link_path_walk+0x632/0xca0
> >>>>> [99048.596652]  [<c014e369>] ? enqueue_task_fair+0x39/0x80
> >>>>> [99048.596652]  [<c022a9bf>] ? mntput_no_expire+0x1f/0xe0
> >>>>> [99048.596652]  [<c022a9bf>] ? mntput_no_expire+0x1f/0xe0
> >>>>> [99048.596652]  [<c021be45>] ? path_put+0x25/0x30
> >>>>> [99048.596652]  [<c021ba8b>] ? putname+0x2b/0x40
> >>>>> [99048.596652]  [<c021ea6a>] ? user_path_at+0x4a/0x80
> >>>>> [99048.596652]  [<c0183242>] ? sys_futex+0x72/0x120
> >>>>> [99048.596652]  [<c022ee13>] ? sys_setxattr+0x83/0x90
> >>>>> [99048.596652]  [<c0109763>] ? sysenter_do_call+0x12/0x28
> >>>>> [99048.596652] Code: 83 3a ff ff ff 90 39 45 dc c7 45 0c ff ff ff ff 0f 
> >>>>> 83
> >>>>> 32 ff ff ff 8b 87 84 01 00 00 ba ba c6 5c c0 05 d0 00 00 00 e8 73 f1
> >>>>> ff<0f>
> >>>>> 0b eb fe 8b 45 ec 89 55 d4 05 cc 00 00 00 89 45 ec e8 fc 53
> >>>>> [99048.596652] EIP: [<c026dc8d>] 
> >>>>> ext3_try_to_allocate_with_rsv+0x1cd/0x2b0
> >>>>> SS:ESP 0068:f5ccbc14
> >>>>> [99049.044090] ---[ end trace 35860103963ee444 ]---
> >>>>> h1farm184#
> >>>>> --------------------
> >>>>> 
> >>>>> my ceph.conf is:
> >>>>> -------
> >>>>> [global]
> >>>>>        pid file = /var/run/ceph/$name.pid
> >>>>>        debug ms = 1
> >>>>>        keyring = /etc/ceph/keyring.bin
> >>>>> ; monitors
> >>>>> [mon]
> >>>>>        ;Directory for monitor files
> >>>>>        mon data = /x02/mon$id
> >>>>>        debug mon = 20
> >>>>>        debug paxos = 20
> >>>>>        mon lease wiggle room = 0.5
> >>>>> 
> >>>>> [mon0]
> >>>>>        host = h1farm182
> >>>>>        mon addr = xxx.xxx.xx.116:6789
> >>>>> [mon1]
> >>>>>        host = h1farm183
> >>>>>        mon addr = xxx.xxx.xx.117:6789
> >>>>> ; metadata servers
> >>>>> [mds]
> >>>>>        debug mds = 20
> >>>>>        mds log max segments = 2
> >>>>>        keyring = /etc/ceph/keyring.$name
> >>>>> [mds0]
> >>>>>        host = h1farm182
> >>>>> [mds1]
> >>>>>        host = h1farm183
> >>>>> [osd]
> >>>>>        sudo = true
> >>>>>        osd data = /x02/osd$id
> >>>>>        osd journal = /x02/osd$id/journal
> >>>>>        osd journal size = 100
> >>>>>        keyring = /etc/ceph/keyring.$name
> >>>>>        debug osd = 20
> >>>>>        debug journal = 20
> >>>>>        debug filestore = 20
> >>>>>        ;osd journal size = 100
> >>>>> [osd0]
> >>>>>        host = h1farm182
> >>>>> [osd1]
> >>>>>        host = h1farm183
> >>>>> [osd2]
> >>>>>        host = h1farm184
> >>>>> 
> >>>>> -------
> >>>>> 
> >>>>> Any idea how to improve the situation ?
> >>>>> 
> >>>>> --
> >>>>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> >>>>> the body of a message to majordomo@xxxxxxxxxxxxxxx
> >>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> >>>>> 
> >>>>> 
> >>>> 
> >>> --
> >>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> >>> the body of a message to majordomo@xxxxxxxxxxxxxxx
> >>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> >>> 
> >>> 
> >> 
> >
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux