On Mon, Apr 11, 2016 at 4:37 PM, Simon Ferber <ferber@xxxxxxxxxxxxxxxxxxxxxxxx> wrote:
> Hi,
>
> I try to setup an ceph cluster on Debian 8.4. Mainly I followed a
> tutorial at
> http://adminforge.de/raid/ceph/ceph-cluster-unter-debian-wheezy-installieren/
>
> As far as I can see, the first steps are just working fine. I have two
> nodes with four OSD on both nodes.
> This is the output of ceph -s
>
> cluster 2a028d5e-5708-4fc4-9c0d-3495c1a3ef3d
> health HEALTH_OK
> monmap e2: 2 mons at
> {ollie2=129.217.207.207:6789/0,stan2=129.217.207.206:6789/0}
> election epoch 12, quorum 0,1 stan2,ollie2
> mdsmap e10: 1/1/1 up {0=ollie2=up:active}, 1 up:standby
> osdmap e72: 8 osds: 8 up, 8 in
> flags sortbitwise
> pgmap v137: 428 pgs, 4 pools, 2396 bytes data, 20 objects
> 281 MB used, 14856 GB / 14856 GB avail
> 428 active+clean
>
> Then I tried to add cephfs following the manual at
> http://docs.ceph.com/docs/hammer/cephfs/createfs/ which seem to do it's
> magic:
> root@stan2:~# ceph fs ls
> name: cephfs, metadata pool: cephfs_metadata, data pools: [cephfs_data ]
>
> However, as soon as I try to mount the cephfs with mount.ceph
> 129.217.207.206:6789:/ /mnt/ -v -o
> name=cephfs,secretfile=/etc/ceph/client.cephfs the server which tries to
> mount crashes and has to be cold started again. To be able to use
> mount.ceph I had to install ceph-fs-common - if that does matter...
>
> Here is the kernel.log. Can you give me hints? I am pretty stuck on this
> for the last few days.
>
> Apr 11 16:25:02 stan2 kernel: [ 171.086381] Key type ceph registered
> Apr 11 16:25:02 stan2 kernel: [ 171.086649] libceph: loaded (mon/osd
> proto 15/24)
> Apr 11 16:25:02 stan2 kernel: [ 171.090582] FS-Cache: Netfs 'ceph'
> registered for caching
> Apr 11 16:25:02 stan2 kernel: [ 171.090596] ceph: loaded (mds proto 32)
> Apr 11 16:25:02 stan2 kernel: [ 171.096727] libceph: client34164 fsid
> 2a028d5e-5708-4fc4-9c0d-3495c1a3ef3d
> Apr 11 16:25:02 stan2 kernel: [ 171.133832] libceph: mon0
> 129.217.207.206:6789 session established
> Apr 11 16:25:02 stan2 kernel: [ 171.161199] ------------[ cut here
> ]------------
> Apr 11 16:25:02 stan2 kernel: [ 171.161239] kernel BUG at
> /build/linux-lqALYs/linux-3.16.7-ckt25/fs/ceph/mds_client.c:1846!
> Apr 11 16:25:02 stan2 kernel: [ 171.161294] invalid opcode: 0000 [#1] SMP
> Apr 11 16:25:02 stan2 kernel: [ 171.161328] Modules linked in: cbc ceph
> libceph xfs libcrc32c crc32c_generic binfmt_misc mptctl mptbase nfsd
> auth_rpcgss oid_registry nfs_acl nfs lockd fscache sunrpc nls_utf8
> nls_cp437 vfat fat x86_pkg_temp_thermal intel_powerclamp intel_rapl
> coretemp kvm_intel kvm crc32_pclmul cryptd iTCO_wdt iTCO_vendor_support
> efi_pstore efivars pcspkr joydev evdev ast i2c_i801 ttm drm_kms_helper
> drm lpc_ich mfd_core mei_me mei shpchp ioatdma tpm_tis wmi tpm ipmi_si
> ipmi_msghandler processor thermal_sys acpi_power_meter button acpi_pad
> fuse autofs4 ext4 crc16 mbcache jbd2 dm_mod raid1 md_mod hid_generic sg
> usbhid hid sd_mod crc_t10dif crct10dif_generic crct10dif_pclmul
> crct10dif_common crc32c_intel ahci libahci ehci_pci mpt3sas igb
> raid_class i2c_algo_bit xhci_hcd libata ehci_hcd scsi_transport_sas
> i2c_core dca usbcore ptp usb_common scsi_mod pps_core
> Apr 11 16:25:02 stan2 kernel: [ 171.162046] CPU: 0 PID: 3513 Comm:
> kworker/0:9 Not tainted 3.16.0-4-amd64 #1 Debian 3.16.7-ckt25-2
> Apr 11 16:25:02 stan2 kernel: [ 171.162104] Hardware name: Supermicro
> SYS-6028R-WTR/X10DRW-i, BIOS 1.0c 01/07/2015
> Apr 11 16:25:02 stan2 kernel: [ 171.162158] Workqueue: ceph-msgr
> con_work [libceph]
> Apr 11 16:25:02 stan2 kernel: [ 171.162194] task: ffff88103f2e8ae0 ti:
> ffff88103bfbc000 task.ti: ffff88103bfbc000
> Apr 11 16:25:02 stan2 kernel: [ 171.162243] RIP:
> 0010:[<ffffffffa0733071>] [<ffffffffa0733071>]
> __prepare_send_request+0x801/0x810 [ceph]
> Apr 11 16:25:02 stan2 kernel: [ 171.162312] RSP: 0018:ffff88103bfbfba8
> EFLAGS: 00010283
> Apr 11 16:25:02 stan2 kernel: [ 171.162347] RAX: ffff88103f88ad42 RBX:
> ffff88103f7f7400 RCX: 0000000000000000
> Apr 11 16:25:02 stan2 kernel: [ 171.162394] RDX: 00000000164c5ec6 RSI:
> 0000000000000000 RDI: ffff88103f88ad32
> Apr 11 16:25:02 stan2 kernel: [ 171.162440] RBP: ffff88103f7f95e0 R08:
> 0000000000000000 R09: 0000000000000000
> Apr 11 16:25:02 stan2 kernel: [ 171.162485] R10: 0000000000000000 R11:
> 000000000000002c R12: ffff88103f7f7c00
> Apr 11 16:25:02 stan2 kernel: [ 171.162531] R13: ffff88103f88acc0 R14:
> 0000000000000000 R15: ffff88103f88ad3a
> Apr 11 16:25:02 stan2 kernel: [ 171.162578] FS: 0000000000000000(0000)
> GS:ffff88107fc00000(0000) knlGS:0000000000000000
> Apr 11 16:25:02 stan2 kernel: [ 171.162629] CS: 0010 DS: 0000 ES: 0000
> CR0: 0000000080050033
> Apr 11 16:25:02 stan2 kernel: [ 171.162668] CR2: 00007fa73ca0a000 CR3:
> 0000000001a13000 CR4: 00000000001407f0
> Apr 11 16:25:02 stan2 kernel: [ 171.162713] Stack:
> Apr 11 16:25:02 stan2 kernel: [ 171.162730] ffff88103bfbfbd4
> ffff88103ef39540 0000000000000001 0000000000000000
> Apr 11 16:25:02 stan2 kernel: [ 171.162787] 0000000000000000
> 0000000000000000 ffff88103ef39540 0000000000000000
> Apr 11 16:25:02 stan2 kernel: [ 171.162845] 0000000000000001
> 0000000000000000 ffff88103f88ad42 ffff88103f7f7400
> Apr 11 16:25:02 stan2 kernel: [ 171.162903] Call Trace:
> Apr 11 16:25:02 stan2 kernel: [ 171.162928] [<ffffffffa073328f>] ?
> __do_request+0x20f/0x310 [ceph]
> Apr 11 16:25:02 stan2 kernel: [ 171.162974] [<ffffffffa0733400>] ?
> __wake_requests+0x70/0xb0 [ceph]
> Apr 11 16:25:02 stan2 kernel: [ 171.163021] [<ffffffffa0735750>] ?
> dispatch+0x440/0x1780 [ceph]
> Apr 11 16:25:02 stan2 kernel: [ 171.163065] [<ffffffffa06e57f7>] ?
> ceph_tcp_recvmsg+0x47/0x60 [libceph]
> Apr 11 16:25:02 stan2 kernel: [ 171.163112] [<ffffffffa06e97ce>] ?
> con_work+0x18fe/0x2c40 [libceph]
> Apr 11 16:25:02 stan2 kernel: [ 171.163159] [<ffffffff8101b975>] ?
> sched_clock+0x5/0x10
> Apr 11 16:25:02 stan2 kernel: [ 171.163197] [<ffffffff810817c2>] ?
> process_one_work+0x172/0x420
> Apr 11 16:25:02 stan2 kernel: [ 171.163238] [<ffffffff81081e53>] ?
> worker_thread+0x113/0x4f0
> Apr 11 16:25:02 stan2 kernel: [ 171.163277] [<ffffffff81081d40>] ?
> rescuer_thread+0x2d0/0x2d0
> Apr 11 16:25:02 stan2 kernel: [ 171.163318] [<ffffffff8108809d>] ?
> kthread+0xbd/0xe0
> Apr 11 16:25:02 stan2 kernel: [ 171.163355] [<ffffffff81087fe0>] ?
> kthread_create_on_node+0x180/0x180
> Apr 11 16:25:02 stan2 kernel: [ 171.163402] [<ffffffff81514958>] ?
> ret_from_fork+0x58/0x90
> Apr 11 16:25:02 stan2 kernel: [ 171.163441] [<ffffffff81087fe0>] ?
> kthread_create_on_node+0x180/0x180
> Apr 11 16:25:02 stan2 kernel: [ 171.163485] Code: 49 8b 8d a0 fc ff ff
> 4d 8b 85 a8 fc ff ff 4c 89 ea 48 c7 c6 64 89 74 a0 48 c7 c7 38 0e 75 a0
> 31 c0 e8 c4 d6 b9 e0 e9 a8 f9 ff ff <0f> 0b 0f 0b 66 66 2e 0f 1f 84 00
> 00 00 00 00 0f 1f 44 00 00 41
> Apr 11 16:25:02 stan2 kernel: [ 171.163758] RIP [<ffffffffa0733071>]
> __prepare_send_request+0x801/0x810 [ceph]
> Apr 11 16:25:02 stan2 kernel: [ 171.163812] RSP <ffff88103bfbfba8>
> Apr 11 16:25:02 stan2 kernel: [ 171.163839] ---[ end trace
> 797bb0683850e8df ]---
> Apr 11 16:25:02 stan2 kernel: [ 171.167929] BUG: unable to handle
> kernel paging request at ffffffffffffffd8
> Apr 11 16:25:02 stan2 kernel: [ 171.167985] IP: [<ffffffff8108862c>]
> kthread_data+0xc/0x20

Hi,

This is an accidental breakage caused by a backport of a patch that wasn't explicitly marked for backporting.
Your 3.16.7-ckt25-2 is one of the affected kernels. The Ubuntu kernel
team has the fix queued, and it's about to be released. You can also
grab an unreleased kernel with the fix applied from one of the links in
the ceph-devel thread [1]. See also the tracker issue [2].

[1] http://www.spinics.net/lists/ceph-devel/msg29504.html
[2] http://tracker.ceph.com/issues/15302

Thanks,

Ilya
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
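
For anyone wanting to check whether their own crash has the same signature as the log quoted above, here is a minimal Python sketch. It assumes the log lines are unwrapped, as they would be in a kern.log file rather than in a line-wrapped email. The function name `find_kernel_bug` and both regular expressions are illustrative and not taken from any Ceph or kernel tooling. It only pulls out the source location from the "kernel BUG at" line and the function and module from the following RIP line; it does not decide whether a kernel is affected.

```python
import re

# "kernel BUG at <source path>:<line>!" as printed by the kernel on a BUG().
BUG_RE = re.compile(r"kernel BUG at (\S+):(\d+)!")
# RIP line ending in "function+0xoffset/0xsize [module]".
RIP_RE = re.compile(r"RIP\b.*?(\w+)\+0x[0-9a-f]+/0x[0-9a-f]+ \[(\w+)\]")


def find_kernel_bug(log_text):
    """Return details of the first 'kernel BUG at' report, or None.

    Hypothetical helper: expects each log entry on a single line.
    """
    bug = BUG_RE.search(log_text)
    if bug is None:
        return None
    # Look for the RIP line only after the BUG line itself.
    rip = RIP_RE.search(log_text, bug.end())
    return {
        "source_file": bug.group(1),
        "source_line": int(bug.group(2)),
        "function": rip.group(1) if rip else None,
        "module": rip.group(2) if rip else None,
    }
```

Run against the log in this thread, this should report fs/ceph/mds_client.c line 1846 in __prepare_send_request of the ceph module, which can then be compared with the reports linked in [1] and [2].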